[ https://issues.apache.org/jira/browse/BEAM-10983?focusedWorklogId=502754&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-502754 ]
ASF GitHub Bot logged work on BEAM-10983:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 20/Oct/20 15:54
Start Date: 20/Oct/20 15:54
Worklog Time Spent: 10m
Work Description: davidcavazos commented on pull request #12963:
URL: https://github.com/apache/beam/pull/12963#issuecomment-712953191
> You may also want to make some mention of the Spark `DataFrame`
> [API](https://spark.apache.org/docs/latest/api/python/pyspark.sql.html#pyspark.sql.DataFrame),
> at least in the introduction. This is technically newer and in many cases
> superior to the RDD API, and where a lot of users would be coming from.
What kind of mention do you think would be useful? I'm not familiar with
Spark or Spark DataFrames. Looking at the API, it seems to offer about the
same functionality, but some method names are different and it supports SQL.
Right now BeamSQL is only available in Java, so we won't mention it here for
the time being.
Issue Time Tracking
-------------------
Worklog Id: (was: 502754)
Time Spent: 1.5h (was: 1h 20m)
> Have a getting started for Spark users
> --------------------------------------
>
> Key: BEAM-10983
> URL: https://issues.apache.org/jira/browse/BEAM-10983
> Project: Beam
> Issue Type: New Feature
> Components: website
> Reporter: David Cavazos
> Assignee: David Cavazos
> Priority: P2
> Time Spent: 1.5h
> Remaining Estimate: 0h
>
> Have a friendlier getting started experience for users who already know Spark.