PJ Fanning created SPARK-15463:
----------------------------------
Summary: Support for creating a dataframe from CSV in RDD[String]
Key: SPARK-15463
URL: https://issues.apache.org/jira/browse/SPARK-15463
Project: Spark
Issue Type: Bug
Components: SQL
Affects Versions: 2.0.0
Reporter: PJ Fanning
I currently use Databrick's spark-csv lib but some features don't work with
Apache Spark 2.0.0-SNAPSHOT. I understand that with the addition of CSV support
into spark-sql directly, that spark-csv won't be modified.
I currently read some CSV data that has been pre-processed and is in
RDD[String] format.
There is sqlContext.read.json(rdd: RDD[String]) but other formats don't appear
to support the creation of DataFrames based on loading from RDD[String].
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]