[ 
https://issues.apache.org/jira/browse/SPARK-45861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17786487#comment-17786487
 ] 

Allison Wang commented on SPARK-45861:
--------------------------------------

[~panbingkun] again, thanks for working on this. Let me give you more details.

When people search on Google for example "spark create dataframe", you can see 
there are many results, one of them being the PySpark documentation - 
createDataFrame.

But there are many other ways to create a dataframe, for example from various 
data sources (CSV, JDBC, Parquet, etc), from pandas dataframe, from 
`spark.sql`, etc. 

We want to create a new documentation page under `{*}User Guides{*}` to explain 
all kinds of ways people can use to create a Spark data frame. It's different 
from the quickstart in that the user guide will provide more comprehensive 
examples.

Feel free to take a look at the results when you search "spark create 
dataframe" or even "create dataframe" to get more inspirations.

cc [~afolting] [~smilegator]

> Add user guide for dataframe creation
> -------------------------------------
>
>                 Key: SPARK-45861
>                 URL: https://issues.apache.org/jira/browse/SPARK-45861
>             Project: Spark
>          Issue Type: Sub-task
>          Components: Documentation, PySpark
>    Affects Versions: 4.0.0
>            Reporter: Allison Wang
>            Priority: Major
>         Attachments: screenshot-1.png, screenshot-2.png
>
>
> Add a simple user guide for data frame creation.
> This user guide should cover the following APIs:
>  # df.createDataFrame
>  # spark.read.format(...) (can be csv, json, parquet



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to