[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-15 Thread falaki
Github user falaki commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-172107705 This PR was merged with https://github.com/apache/spark/pull/10766 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-15 Thread falaki
Github user falaki closed the pull request at: https://github.com/apache/spark/pull/10615 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is ena

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-14 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10615#discussion_r49826113 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParser.scala --- @@ -0,0 +1,243 @@ +/* + * Licensed to the Apache S

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-14 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-171884149 **[Test build #2384 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2384/consoleFull)** for PR 10615 at commit [`319e0ed`](https://

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-14 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-171883737 **[Test build #2384 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2384/consoleFull)** for PR 10615 at commit [`319e0ed`](https://g

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-11 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/10615#discussion_r49377643 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVInferSchema.scala --- @@ -0,0 +1,228 @@ +/* + * Licensed to the

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-07 Thread HyukjinKwon
Github user HyukjinKwon commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169855191 Cool! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature ena

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-07 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/10615#discussion_r49147677 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/csv/CSVSuite.scala --- @@ -0,0 +1,341 @@ +/* + * Licensed to the Ap

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-07 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/10615#discussion_r49147704 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/csv/CSVSuite.scala --- @@ -0,0 +1,341 @@ +/* + * Licensed to the Ap

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-07 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/10615#discussion_r49147610 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParser.scala --- @@ -0,0 +1,243 @@ +/* + * Licensed to the A

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-07 Thread mohitjaggi
Github user mohitjaggi commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169852430 this is great...thanks @falaki --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169850598 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169850601 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-07 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169850361 **[Test build #48975 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48975/consoleFull)** for PR 10615 at commit [`319e0ed`](https://g

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-07 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169825583 **[Test build #48975 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48975/consoleFull)** for PR 10615 at commit [`319e0ed`](https://gi

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169610545 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169573499 **[Test build #48905 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48905/consoleFull)** for PR 10615 at commit [`1e312a5`](https://gi

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169548304 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169548309 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169548127 **[Test build #48883 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48883/consoleFull)** for PR 10615 at commit [`0fd4bd3`](https://g

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169508841 **[Test build #48883 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48883/consoleFull)** for PR 10615 at commit [`0fd4bd3`](https://gi

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169500498 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169500341 **[Test build #48879 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48879/consoleFull)** for PR 10615 at commit [`1856ed3`](https://g

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169500494 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread falaki
Github user falaki commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169490033 @steveloughran this adds uniVocity-parsers as a dependency to SparkSQL. With assembly jar distribution, that jar will be included. If/When Spark moves away from assembly

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169486663 **[Test build #48879 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48879/consoleFull)** for PR 10615 at commit [`1856ed3`](https://gi

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169453618 **[Test build #48869 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48869/consoleFull)** for PR 10615 at commit [`e364c28`](https://g

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169453637 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169453630 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169451909 **[Test build #48869 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48869/consoleFull)** for PR 10615 at commit [`e364c28`](https://gi

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread steveloughran
Github user steveloughran commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169314710 Is this going to require the new parser JAR on the classpath everywhere, or will everything excluding CSV parsing still work without it? --- If your project is s

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169285927 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169285920 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169285302 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169285294 **[Test build #48852 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48852/consoleFull)** for PR 10615 at commit [`b31cb89`](https://g

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169285300 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-06 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169284984 **[Test build #48852 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48852/consoleFull)** for PR 10615 at commit [`b31cb89`](https://gi

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-05 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/10615#discussion_r48933011 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVRelation.scala --- @@ -0,0 +1,305 @@ +/* + * Licensed to the

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-05 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/10615#discussion_r48932909 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVInferSchema.scala --- @@ -0,0 +1,231 @@ +/* + * Licensed to

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-05 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/10615#discussion_r48932885 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVInferSchema.scala --- @@ -0,0 +1,231 @@ +/* + * Licensed to

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169253683 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169253676 **[Test build #48842 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48842/consoleFull)** for PR 10615 at commit [`da314cb`](https://g

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169253681 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10615#issuecomment-169253615 **[Test build #48842 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48842/consoleFull)** for PR 10615 at commit [`da314cb`](https://gi

[GitHub] spark pull request: [SPARK-12420][SQL] Have a built-in CSV data so...

2016-01-05 Thread falaki
GitHub user falaki opened a pull request: https://github.com/apache/spark/pull/10615 [SPARK-12420][SQL] Have a built-in CSV data source implementation CSV is the most common data format in the "small data" world. It is often the first format people want to try when they see Spark on