[jira] [Comment Edited] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-02-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127656#comment-15127656 ] Hyukjin Kwon edited comment on SPARK-13108 at 2/2/16 4:55 AM: -- Sure. It

[jira] [Issue Comment Deleted] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-02-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13108: - Comment: was deleted (was: Sure. It needs to re-write Hadoop's LineRecordReader, LineReader and

[jira] [Updated] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-02-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13108: - Description: This library uses Hadoop's

[jira] [Commented] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-02-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127656#comment-15127656 ] Hyukjin Kwon commented on SPARK-13108: -- Sure. It needs to re-write Hadoop's LineRecordReader,
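
A possible illustration of why line-oriented readers can struggle with UTF-16/32 input, as the issue title suggests: in UTF-16 a newline is two bytes, so a reader that splits on the single byte 0x0A leaves a stray byte at the start of the next record. The sketch below is plain Python, not Hadoop's LineRecordReader or LineReader; the function names are hypothetical and chosen only for this example.

```python
from typing import List

# Hypothetical helpers for illustration only; this is not Hadoop or Spark code.

def split_on_newline_byte(data: bytes) -> List[bytes]:
    """Split the way an ASCII-oriented reader might: on the single byte 0x0A."""
    return data.split(b"\n")

def split_after_decoding(data: bytes, encoding: str) -> List[str]:
    """Decode the whole buffer first, then split on the newline character."""
    return data.decode(encoding).split("\n")

text = "a,b\nc,d"
raw = text.encode("utf-16-le")  # newline is encoded as the two bytes 0x0A 0x00

# Byte-level split: the second chunk starts with the orphaned 0x00 byte,
# so it has an odd length and is no longer aligned to 2-byte code units.
naive = split_on_newline_byte(raw)

# Encoding-aware split: both records come back intact.
aware = split_after_decoding(raw, "utf-16-le")
```

Decoding the whole buffer up front is only workable for small inputs; a reader over split files would presumably need to locate record boundaries in the encoded bytes instead, which may be why the comment talks about rewriting the reader classes.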

[jira] [Commented] (SPARK-13114) java.lang.NegativeArraySizeException in CSV

2016-02-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127756#comment-15127756 ] Hyukjin Kwon commented on SPARK-13114: -- Are you working on this? Could I quickly submit a PR for

[jira] [Comment Edited] (SPARK-12997) Use cast expression to perform type cast in csv

2016-02-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15160270#comment-15160270 ] Hyukjin Kwon edited comment on SPARK-12997 at 2/24/16 6:43 AM: --- If I got

[jira] [Commented] (SPARK-12997) Use cast expression to perform type cast in csv

2016-02-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15160270#comment-15160270 ] Hyukjin Kwon commented on SPARK-12997: -- If I got this correctly, I think the issue itself is a

[jira] [Commented] (SPARK-13503) Support to specify the (writing) option for compression codec for JSON and Text

2016-02-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15168431#comment-15168431 ] Hyukjin Kwon commented on SPARK-13503: -- [~rxin] Sure. > Support to specify the (writing) option for

[jira] [Created] (SPARK-13507) Documentation for compression options for JSON and TEXT data sources

2016-02-26 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13507: Summary: Documentation for compression options for JSON and TEXT data sources Key: SPARK-13507 URL: https://issues.apache.org/jira/browse/SPARK-13507 Project: Spark

[jira] [Updated] (SPARK-13507) Documentation for compression options for JSON and TEXT data sources

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13507: - Description: Compression options are added for CSV, JSON and TEXT data sources (SPARK-12872,

[jira] [Updated] (SPARK-13507) Documentation for compression options for JSON and TEXT data sources

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13507: - Description: Compression options are added for CSV, JSON and TEXT data sources (SPARK-12872,

[jira] [Updated] (SPARK-13503) Support to specify the (writing) option for compression codec for TEXT

2016-02-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13503: - Description: CSV and JSON support to specify compression option for writing (this was done by

[jira] [Commented] (SPARK-13507) Documentation for compression options for JSON and TEXT data sources

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15168665#comment-15168665 ] Hyukjin Kwon commented on SPARK-13507: -- I will work on this > Documentation for compression options

[jira] [Commented] (SPARK-13509) Support for writing CSV with a single function call

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15168689#comment-15168689 ] Hyukjin Kwon commented on SPARK-13509: -- [~rxin] I also forgot to add the API for writing. For

[jira] [Comment Edited] (SPARK-13509) Support for writing CSV with a single function call

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15168689#comment-15168689 ] Hyukjin Kwon edited comment on SPARK-13509 at 2/26/16 9:05 AM: --- [~rxin] I

[jira] [Commented] (SPARK-13503) Support to specify the (writing) option for compression codec for TEXT

2016-02-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15168439#comment-15168439 ] Hyukjin Kwon commented on SPARK-13503: -- I completely forgot I actually did this for JSON before in

[jira] [Updated] (SPARK-13507) Documentation for compression options for JSON and TEXT data sources

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13507: - Description: Compression options are added for CSV, JSON and TEXT data sources (SPARK-12872,

[jira] [Updated] (SPARK-13507) Documentation for compression options for JSON and TEXT data sources

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13507: - Description: Compression options are added for CSV, JSON and TEXT data sources (SPARK-12872,

[jira] [Updated] (SPARK-13507) Documentation for compression options for CSV, JSON and TEXT data sources

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13507: - Summary: Documentation for compression options for CSV, JSON and TEXT data sources (was:

[jira] [Commented] (SPARK-13543) Support for specifying compression codec for Parquet/ORC via option()

2016-02-28 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171365#comment-15171365 ] Hyukjin Kwon commented on SPARK-13543: -- I will work on this as soon as the PRs I submitted are

[jira] [Created] (SPARK-13543) Support for specifying compression codec for Parquet/ORC via option()

2016-02-28 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13543: Summary: Support for specifying compression codec for Parquet/ORC via option() Key: SPARK-13543 URL: https://issues.apache.org/jira/browse/SPARK-13543 Project: Spark

[jira] [Commented] (SPARK-13543) Support for specifying compression codec for Parquet/ORC via option()

2016-02-28 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171370#comment-15171370 ] Hyukjin Kwon commented on SPARK-13543: -- I added this link (SPARK-12307) as a {{relate}} because for

[jira] [Commented] (SPARK-13174) Add API and options for csv data sources

2016-02-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15166563#comment-15166563 ] Hyukjin Kwon commented on SPARK-13174: -- [~davies] I carelessly opened (I think) the same issue and

[jira] [Updated] (SPARK-13174) Add API and options for csv data sources

2016-02-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13174: - Affects Version/s: 2.0.0 > Add API and options for csv data sources >

[jira] [Updated] (SPARK-13184) Support minPartitions parameter for JSON and CSV datasources as options

2016-02-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13184: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-12420 > Support minPartitions

[jira] [Commented] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1516#comment-1516 ] Hyukjin Kwon commented on SPARK-11691: -- I apologize that I carelessly opened the same issue and

[jira] [Resolved] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-11691. -- Resolution: Fixed Fix Version/s: 2.0.0 > Allow to specify compression codec in

[jira] [Comment Edited] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15169160#comment-15169160 ] Hyukjin Kwon edited comment on SPARK-11691 at 2/26/16 3:10 PM: --- This issue

[jira] [Comment Edited] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1516#comment-1516 ] Hyukjin Kwon edited comment on SPARK-11691 at 2/26/16 12:28 PM: I

[jira] [Comment Edited] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1516#comment-1516 ] Hyukjin Kwon edited comment on SPARK-11691 at 2/26/16 12:27 PM: I

[jira] [Comment Edited] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15169071#comment-15169071 ] Hyukjin Kwon edited comment on SPARK-11691 at 2/26/16 2:23 PM: --- I gave a

[jira] [Commented] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15169160#comment-15169160 ] Hyukjin Kwon commented on SPARK-11691: -- This issue deals with a bit more generalized compression

[jira] [Created] (SPARK-13442) Make type inference recognize boolean types

2016-02-22 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13442: Summary: Make type inference recognize boolean types Key: SPARK-13442 URL: https://issues.apache.org/jira/browse/SPARK-13442 Project: Spark Issue Type:

[jira] [Commented] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15169066#comment-15169066 ] Hyukjin Kwon commented on SPARK-11691: -- SPARK-13503 is merged and resolved. That issue addresses the

[jira] [Commented] (SPARK-13503) Support to specify the (writing) option for compression codec for TEXT

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15168898#comment-15168898 ] Hyukjin Kwon commented on SPARK-13503: -- Sorry, I will. > Support to specify the (writing) option

[jira] [Resolved] (SPARK-12119) Support compression in PySpark

2016-02-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12119. -- Resolution: Fixed Fix Version/s: 2.0.0 > Support compression in PySpark >

[jira] [Created] (SPARK-13509) Support for writing CSV with a single function call

2016-02-26 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13509: Summary: Support for writing CSV with a single function call Key: SPARK-13509 URL: https://issues.apache.org/jira/browse/SPARK-13509 Project: Spark Issue

[jira] [Created] (SPARK-13503) Support to specify the (writing) option for compression codec for JSON and Text

2016-02-25 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13503: Summary: Support to specify the (writing) option for compression codec for JSON and Text Key: SPARK-13503 URL: https://issues.apache.org/jira/browse/SPARK-13503

[jira] [Updated] (SPARK-13503) Support to specify the (writing) option for compression codec for JSON and Text

2016-02-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13503: - Description: CSV supports to specify compression option for writing (this was done [this

[jira] [Commented] (SPARK-13503) Support to specify the (writing) option for compression codec for JSON and Text

2016-02-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15168319#comment-15168319 ] Hyukjin Kwon commented on SPARK-13503: -- [~rxin] I can work on this but just want to confirm, could I

[jira] [Commented] (SPARK-12863) missing api for renaming and mapping result of operations on GroupedDataset to case classes

2016-01-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110045#comment-15110045 ] Hyukjin Kwon commented on SPARK-12863: -- I understand this might possibly be an issue although I have

[jira] [Commented] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115204#comment-15115204 ] Hyukjin Kwon commented on SPARK-12890: -- Actually, I still don't understand what the issue is here.

[jira] [Comment Edited] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115204#comment-15115204 ] Hyukjin Kwon edited comment on SPARK-12890 at 1/25/16 1:44 PM: --- Actually I

[jira] [Comment Edited] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115204#comment-15115204 ] Hyukjin Kwon edited comment on SPARK-12890 at 1/25/16 1:46 PM: --- Actually I

[jira] [Commented] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114794#comment-15114794 ] Hyukjin Kwon commented on SPARK-12890: -- In that case, it will not read all the data but only footer

[jira] [Comment Edited] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114794#comment-15114794 ] Hyukjin Kwon edited comment on SPARK-12890 at 1/25/16 5:49 AM: --- In that

[jira] [Comment Edited] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114794#comment-15114794 ] Hyukjin Kwon edited comment on SPARK-12890 at 1/25/16 5:53 AM: --- In that

[jira] [Comment Edited] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114794#comment-15114794 ] Hyukjin Kwon edited comment on SPARK-12890 at 1/25/16 5:52 AM: --- In that

[jira] [Created] (SPARK-13323) Type cast support in type inference during merging types.

2016-02-15 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13323: Summary: Type cast support in type inference during merging types. Key: SPARK-13323 URL: https://issues.apache.org/jira/browse/SPARK-13323 Project: Spark

[jira] [Commented] (SPARK-13323) Type cast support in type inference during merging types.

2016-02-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15147067#comment-15147067 ] Hyukjin Kwon commented on SPARK-13323: -- [~davies] Could you please look through this? I want to try

[jira] [Comment Edited] (SPARK-13323) Type cast support in type inference during merging types.

2016-02-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15147067#comment-15147067 ] Hyukjin Kwon edited comment on SPARK-13323 at 2/15/16 8:45 AM: --- [~davies]

[jira] [Updated] (SPARK-13323) Type cast support in type inference during merging types.

2016-02-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13323: - Description: As described in {{types.py}}, there is a todo {{TODO: type cast (such as int ->

[jira] [Commented] (SPARK-13323) Type cast support in type inference during merging types.

2016-02-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15147851#comment-15147851 ] Hyukjin Kwon commented on SPARK-13323: -- [~davies] Yes it's complicated but dealing with numeric

[jira] [Comment Edited] (SPARK-13323) Type cast support in type inference during merging types.

2016-02-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15147851#comment-15147851 ] Hyukjin Kwon edited comment on SPARK-13323 at 2/15/16 10:43 PM: [~davies]

[jira] [Commented] (SPARK-13323) Type cast support in type inference during merging types.

2016-02-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15147862#comment-15147862 ] Hyukjin Kwon commented on SPARK-13323: -- Let me add some code here to reproduce this in an hour. > Type

[jira] [Commented] (SPARK-13323) Type cast support in type inference during merging types.

2016-02-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15147929#comment-15147929 ] Hyukjin Kwon commented on SPARK-13323: -- {code} sqlCtx.createDataFrame([["a"], [1]]).show() {code}
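
As a rough sketch of what "type cast support during merging types" could mean, the plain-Python example below merges the types seen in a column and widens numeric types instead of failing, while still rejecting a mix such as a string and an integer (the shape of the reproduction above). This is not PySpark's types.py; the widening order (int to float) and the names merge_types and infer_column_type are assumptions made for illustration.

```python
# Hypothetical sketch; this is not PySpark's types.py.
# Assumed widening order, for illustration only: int can widen to float.
_WIDENING_ORDER = [int, float]

def merge_types(a: type, b: type) -> type:
    """Return a type able to hold values of both a and b, or raise TypeError."""
    if a is b:
        return a
    if a in _WIDENING_ORDER and b in _WIDENING_ORDER:
        # Pick whichever type sits later in the widening order.
        return max(a, b, key=_WIDENING_ORDER.index)
    raise TypeError("cannot merge types %s and %s" % (a.__name__, b.__name__))

def infer_column_type(values: list) -> type:
    """Fold merge_types over the types of all values in one column."""
    merged = type(values[0])
    for value in values[1:]:
        merged = merge_types(merged, type(value))
    return merged

numeric = infer_column_type([1, 2.5])  # int and float merge by widening

try:
    infer_column_type(["a", 1])  # str and int have no common numeric type
    mixed_failed = False
except TypeError:
    mixed_failed = True
```

Whether a string/integer mix should fail or fall back to a string type is a design decision the truncated thread does not settle; the sketch simply raises.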

[jira] [Commented] (SPARK-8000) SQLContext.read.load() should be able to auto-detect input data

2016-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153568#comment-15153568 ] Hyukjin Kwon commented on SPARK-8000: - Actually, I sent an email to the dev mailing list. The content was

[jira] [Commented] (SPARK-8000) SQLContext.read.load() should be able to auto-detect input data

2016-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153570#comment-15153570 ] Hyukjin Kwon commented on SPARK-8000: - And I got an email from you which was {quote} Thanks for the

[jira] [Created] (SPARK-13381) Support for loading CSV with a single function call

2016-02-18 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13381: Summary: Support for loading CSV with a single function call Key: SPARK-13381 URL: https://issues.apache.org/jira/browse/SPARK-13381 Project: Spark Issue

[jira] [Updated] (SPARK-13381) Support for loading CSV with a single function call

2016-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13381: - Description: Just like {{json()}}, {{text()}}, {{orc()}} and {{parquet()}}, it would be great if

[jira] [Commented] (SPARK-13381) Support for loading CSV with a single function call

2016-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153695#comment-15153695 ] Hyukjin Kwon commented on SPARK-13381: -- [~rxin] I would like to work on this (since it would be

[jira] [Created] (SPARK-13425) Documentation for CSV datasource options

2016-02-21 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13425: Summary: Documentation for CSV datasource options Key: SPARK-13425 URL: https://issues.apache.org/jira/browse/SPARK-13425 Project: Spark Issue Type:

[jira] [Updated] (SPARK-13425) Documentation for CSV datasource options

2016-02-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13425: - Description: As said https://github.com/apache/spark/pull/11262#discussion_r53508815, CSV

[jira] [Updated] (SPARK-13137) NullPoingException in schema inference for CSV when the first line is empty

2016-02-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13137: - Issue Type: Sub-task (was: Bug) Parent: SPARK-12420 > NullPoingException in schema

[jira] [Updated] (SPARK-13114) java.lang.NegativeArraySizeException in CSV

2016-02-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13114: - Issue Type: Sub-task (was: Bug) Parent: SPARK-12420 >

[jira] [Commented] (SPARK-13425) Documentation for CSV datasource options

2016-02-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15156433#comment-15156433 ] Hyukjin Kwon commented on SPARK-13425: -- Could I maybe try this as well (based on json documentation

[jira] [Commented] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-02-22 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15156644#comment-15156644 ] Hyukjin Kwon commented on SPARK-13108: -- [~falaki] Indeed. I will create another issue and fix them

[jira] [Created] (SPARK-13997) Use Hadoop 2.0 default value for compression in data sources

2016-03-18 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13997: Summary: Use Hadoop 2.0 default value for compression in data sources Key: SPARK-13997 URL: https://issues.apache.org/jira/browse/SPARK-13997 Project: Spark

[jira] [Updated] (SPARK-13764) Parse modes in JSON data source

2016-03-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13764: - Description: Currently, JSON data source just fails to read if some JSON documents are

[jira] [Created] (SPARK-13899) Produce InternalRow instead of external Row

2016-03-15 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13899: Summary: Produce InternalRow instead of external Row Key: SPARK-13899 URL: https://issues.apache.org/jira/browse/SPARK-13899 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-13719) Bad JSON record raises java.lang.ClassCastException

2016-03-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186276#comment-15186276 ] Hyukjin Kwon edited comment on SPARK-13719 at 3/9/16 1:34 AM: -- [~rxin]

[jira] [Commented] (SPARK-13719) Bad JSON record raises java.lang.ClassCastException

2016-03-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186276#comment-15186276 ] Hyukjin Kwon commented on SPARK-13719: -- [~rxin] Actually, shouldn't we maybe need modes such as

[jira] [Comment Edited] (SPARK-13719) Bad JSON record raises java.lang.ClassCastException

2016-03-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186276#comment-15186276 ] Hyukjin Kwon edited comment on SPARK-13719 at 3/9/16 1:33 AM: -- [~rxin]

[jira] [Created] (SPARK-13764) Parse modes in JSON data source

2016-03-08 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13764: Summary: Parse modes in JSON data source Key: SPARK-13764 URL: https://issues.apache.org/jira/browse/SPARK-13764 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-13719) Bad JSON record raises java.lang.ClassCastException

2016-03-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186554#comment-15186554 ] Hyukjin Kwon commented on SPARK-13719: -- I opened a JIRA here, SPARK-13764. Could we maybe make

[jira] [Commented] (SPARK-13764) Parse modes in JSON data source

2016-03-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186557#comment-15186557 ] Hyukjin Kwon commented on SPARK-13764: -- I will try to work on this (after looking a bit deeper). >

[jira] [Created] (SPARK-13766) Inconsistent file extensions and omitting file extensions written by CSV, TEXT and JSON data sources

2016-03-08 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13766: Summary: Inconsistent file extensions and omitting file extensions written by CSV, TEXT and JSON data sources Key: SPARK-13766 URL:

[jira] [Commented] (SPARK-13766) Inconsistent file extensions and omitting file extensions written by CSV, TEXT and JSON data sources

2016-03-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186576#comment-15186576 ] Hyukjin Kwon commented on SPARK-13766: -- I will work on this. > Inconsistent file extensions and

[jira] [Updated] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13766: - Summary: Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON

[jira] [Updated] (SPARK-13766) Inconsistent file extensions and omitting file extensions written by CSV, TEXT and JSON data sources

2016-03-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13766: - Description: Currently, the output (part-files) from CSV, TEXT and JSON data sources do not

[jira] [Updated] (SPARK-13766) Inconsistent file extensions and omitting file extensions written by CSV, TEXT and JSON data sources

2016-03-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13766: - Description: Currently, the output (part-files) from CSV, TEXT and JSON data sources do not

[jira] [Commented] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-09 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186908#comment-15186908 ] Hyukjin Kwon commented on SPARK-13766: -- Partly due to "auto detection" for data source SPARK-8000.

[jira] [Issue Comment Deleted] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-09 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13766: - Comment: was deleted (was: Partly due to "auto detection" for data source SPARK-8000. If that

[jira] [Commented] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-09 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186914#comment-15186914 ] Hyukjin Kwon commented on SPARK-13766: -- Partly due to "auto detection" for data source SPARK-8000.

[jira] [Comment Edited] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-09 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186908#comment-15186908 ] Hyukjin Kwon edited comment on SPARK-13766 at 3/9/16 10:17 AM: --- Partly due

[jira] [Comment Edited] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-09 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186890#comment-15186890 ] Hyukjin Kwon edited comment on SPARK-13766 at 3/9/16 10:02 AM: --- Are there

[jira] [Commented] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

2016-03-09 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186963#comment-15186963 ] Hyukjin Kwon commented on SPARK-11691: -- Could anybody take an action for this Jira? Compressions

[jira] [Commented] (SPARK-3308) Ability to read JSON Arrays as tables

2016-03-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198543#comment-15198543 ] Hyukjin Kwon commented on SPARK-3308: - I removed the PR link,

[jira] [Commented] (SPARK-13764) Parse modes in JSON data source

2016-03-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15196665#comment-15196665 ] Hyukjin Kwon commented on SPARK-13764: -- The issue SPARK-3308 is related with supporting each row

[jira] [Commented] (SPARK-14428) [SQL] Allow more flexibility when parsing dates and timestamps in json datasources

2016-04-06 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15229506#comment-15229506 ] Hyukjin Kwon commented on SPARK-14428: -- I can work on this if it is decided to be supported. (I am

[jira] [Created] (SPARK-14596) Remove not used SqlNewHadoopRDD

2016-04-13 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-14596: Summary: Remove not used SqlNewHadoopRDD Key: SPARK-14596 URL: https://issues.apache.org/jira/browse/SPARK-14596 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-14480) Simplify CSV parsing process with a better performance

2016-04-07 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-14480: - Description: Currently, CSV data source reads and parses CSV data bytes by bytes (not line by

[jira] [Created] (SPARK-14480) Simplify CSV parsing process with a better performance

2016-04-07 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-14480: Summary: Simplify CSV parsing process with a better performance Key: SPARK-14480 URL: https://issues.apache.org/jira/browse/SPARK-14480 Project: Spark

[jira] [Updated] (SPARK-14480) Simplify CSV parsing process with a better performance

2016-04-07 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-14480: - Description: Currently, CSV data source reads and parses CSV data bytes by bytes (not line by

[jira] [Updated] (SPARK-14480) Simplify CSV parsing process with a better performance

2016-04-07 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-14480: - Description: Currently, CSV data source reads and parses CSV data bytes by bytes (not line by

[jira] [Updated] (SPARK-14480) Simplify CSV parsing process with a better performance

2016-04-07 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-14480: - Description: Currently, CSV data source reads and parses CSV data bytes by bytes (not line by

[jira] [Commented] (SPARK-14480) Simplify CSV parsing process with a better performance

2016-04-07 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15231674#comment-15231674 ] Hyukjin Kwon commented on SPARK-14480: -- [~rxin] [~srowen] Could I maybe try to open a PR for this

[jira] [Updated] (SPARK-14480) Simplify CSV parsing process with a better performance

2016-04-07 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-14480: - Description: Currently, CSV data source reads and parses CSV data bytes by bytes (not line by

[jira] [Updated] (SPARK-14480) Simplify CSV parsing process with a better performance

2016-04-07 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-14480: - Description: Currently, CSV data source reads and parses CSV data bytes by bytes (not line by

[jira] [Updated] (SPARK-13764) Parse modes in JSON data source

2016-03-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13764: - Description: Currently, JSON data source just fails to read if some JSON documents are
