[jira] [Commented] (SPARK-12315) isnotnull operator not pushed down for JDBC datasource.

2015-12-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15055347#comment-15055347 ] Hyukjin Kwon commented on SPARK-12315: -- I will work on this. > isnotnull operator not pushed down

[jira] [Updated] (SPARK-12355) Implement unhandledFilter interface for Parquet

2015-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-12355: - Fix Version/s: (was: 1.6.0) > Implement unhandledFilter interface for Parquet >

[jira] [Updated] (SPARK-12354) Implement unhandledFilter interface

2015-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-12354: - Affects Version/s: 1.6.0 > Implement unhandledFilter interface >

[jira] [Updated] (SPARK-12354) Implement unhandledFilter interface

2015-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-12354: - Fix Version/s: (was: 1.6.0) > Implement unhandledFilter interface >

[jira] [Issue Comment Deleted] (SPARK-12354) Implement unhandledFilter interface

2015-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-12354: - Comment: was deleted (was: Currently the filters are not being tested properly. This might be

[jira] [Commented] (SPARK-12354) Implement unhandledFilter interface

2015-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15059495#comment-15059495 ] Hyukjin Kwon commented on SPARK-12354: -- Currently the filters are not being tested properly. This

[jira] [Updated] (SPARK-12355) Implement unhandledFilter interface for Parquet

2015-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-12355: - Priority: Critical (was: Major) > Implement unhandledFilter interface for Parquet >

[jira] [Created] (SPARK-12356) Implement unhandledFilter interface for ORC

2015-12-15 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-12356: Summary: Implement unhandledFilter interface for ORC Key: SPARK-12356 URL: https://issues.apache.org/jira/browse/SPARK-12356 Project: Spark Issue Type:

[jira] [Updated] (SPARK-12357) Implement unhandledFilter interface for JDBC

2015-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-12357: - Fix Version/s: (was: 1.6.0) > Implement unhandledFilter interface for JDBC >

[jira] [Updated] (SPARK-12356) Implement unhandledFilter interface for ORC

2015-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-12356: - Fix Version/s: (was: 1.6.0) > Implement unhandledFilter interface for ORC >

[jira] [Updated] (SPARK-12356) Implement unhandledFilter interface for ORC

2015-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-12356: - Affects Version/s: 1.6.0 > Implement unhandledFilter interface for ORC >

[jira] [Created] (SPARK-12355) Implement unhandledFilter interface for Parquet

2015-12-15 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-12355: Summary: Implement unhandledFilter interface for Parquet Key: SPARK-12355 URL: https://issues.apache.org/jira/browse/SPARK-12355 Project: Spark Issue Type:

[jira] [Created] (SPARK-12357) Implement unhandledFilter interface for JDBC

2015-12-15 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-12357: Summary: Implement unhandledFilter interface for JDBC Key: SPARK-12357 URL: https://issues.apache.org/jira/browse/SPARK-12357 Project: Spark Issue Type:

[jira] [Created] (SPARK-12354) Implement unhandledFilter interface

2015-12-15 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-12354: Summary: Implement unhandledFilter interface Key: SPARK-12354 URL: https://issues.apache.org/jira/browse/SPARK-12354 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-12356) Implement unhandledFilter interface for ORC

2015-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-12356: - Priority: Critical (was: Major) > Implement unhandledFilter interface for ORC >

[jira] [Commented] (SPARK-12354) Implement unhandledFilter interface

2015-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15059496#comment-15059496 ] Hyukjin Kwon commented on SPARK-12354: -- I can work on them. > Implement unhandledFilter interface >

[jira] [Closed] (SPARK-12227) Support drop multiple columns specified by Column class in DataFrame API

2015-12-10 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon closed SPARK-12227. Resolution: Won't Fix This would not be fixed

[jira] [Commented] (SPARK-12225) Support adding or replacing multiple columns at once in DataFrame API

2015-12-10 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052162#comment-15052162 ] Hyukjin Kwon commented on SPARK-12225: -- I also agree with adding this feature. I have seen several

[jira] [Commented] (SPARK-12218) Boolean logic in sql does not work "not (A and B)" is not the same as "(not A) or (not B)"

2015-12-10 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052172#comment-15052172 ] Hyukjin Kwon commented on SPARK-12218: -- Then, does this mean closing this issue? [~smilegator] >

[jira] [Created] (SPARK-12314) isnull operator not pushed down for JDBC datasource.

2015-12-13 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-12314: Summary: isnull operator not pushed down for JDBC datasource. Key: SPARK-12314 URL: https://issues.apache.org/jira/browse/SPARK-12314 Project: Spark Issue

[jira] [Comment Edited] (SPARK-12506) Push down WHERE clause arithmetic operator to JDBC layer

2015-12-28 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15072575#comment-15072575 ] Hyukjin Kwon edited comment on SPARK-12506 at 12/28/15 9:54 AM:

[jira] [Comment Edited] (SPARK-12420) Have a built-in CSV data source implementation

2015-12-28 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15072580#comment-15072580 ] Hyukjin Kwon edited comment on SPARK-12420 at 12/28/15 10:05 AM: - +1, I

[jira] [Commented] (SPARK-12420) Have a built-in CSV data source implementation

2015-12-28 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15072580#comment-15072580 ] Hyukjin Kwon commented on SPARK-12420: -- +1, I was wondering why it has been staying third party. >

[jira] [Commented] (SPARK-12506) Push down WHERE clause arithmetic operator to JDBC layer

2015-12-28 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15072575#comment-15072575 ] Hyukjin Kwon commented on SPARK-12506: -- [~huaxing] Maybe we should do this one first

[jira] [Commented] (SPARK-12356) Implement unhandledFilter interface for ORC

2015-12-22 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15067923#comment-15067923 ] Hyukjin Kwon commented on SPARK-12356: -- Actually, I am not too sure if we have to implement this for

[jira] [Commented] (SPARK-12356) Implement unhandledFilter interface for ORC

2015-12-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15070344#comment-15070344 ] Hyukjin Kwon commented on SPARK-12356: -- [~yhuai] I think this should not be fixed as ORC filters

[jira] [Comment Edited] (SPARK-11949) Query on DataFrame from cube gives wrong results

2015-11-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15026508#comment-15026508 ] Hyukjin Kwon edited comment on SPARK-11949 at 11/25/15 9:48 AM: Oops. I

[jira] [Commented] (SPARK-11949) Query on DataFrame from cube gives wrong results

2015-11-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15026508#comment-15026508 ] Hyukjin Kwon commented on SPARK-11949: -- Oops. I did the same test a bit ago. Here {code} case

[jira] [Comment Edited] (SPARK-11621) ORC filter pushdown not working properly after new unhandled filter interface.

2015-11-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15026534#comment-15026534 ] Hyukjin Kwon edited comment on SPARK-11621 at 11/25/15 9:58 AM: Could

[jira] [Commented] (SPARK-11621) ORC filter pushdown not working properly after new unhandled filter interface.

2015-11-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15026534#comment-15026534 ] Hyukjin Kwon commented on SPARK-11621: -- Could anybody check the Resolution please? > ORC filter

[jira] [Commented] (SPARK-9182) filter and groupBy on DataFrames are not passed through to jdbc source

2015-11-29 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15031266#comment-15031266 ] Hyukjin Kwon commented on SPARK-9182: - Just a thought, Currently

[jira] [Comment Edited] (SPARK-9182) filter and groupBy on DataFrames are not passed through to jdbc source

2015-11-29 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15031266#comment-15031266 ] Hyukjin Kwon edited comment on SPARK-9182 at 11/30/15 3:00 AM: --- Just a

[jira] [Commented] (SPARK-11868) wrong results returned from dataframe create from Rows without consistent schma on pyspark

2015-11-29 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15031387#comment-15031387 ] Hyukjin Kwon commented on SPARK-11868: -- Are you working on this? > wrong results returned from

[jira] [Commented] (SPARK-11844) can not read class org.apache.parquet.format.PageHeader: don't know what type: 13

2015-11-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15015497#comment-15015497 ] Hyukjin Kwon commented on SPARK-11844: -- Just a question.. Does this just happen randomly? if the

[jira] [Comment Edited] (SPARK-11844) can not read class org.apache.parquet.format.PageHeader: don't know what type: 13

2015-11-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15015497#comment-15015497 ] Hyukjin Kwon edited comment on SPARK-11844 at 11/20/15 9:45 AM: Just a

[jira] [Commented] (SPARK-11620) parquet.hadoop.ParquetOutputCommitter.commitJob() throws parquet.io.ParquetEncodingException

2015-11-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15015468#comment-15015468 ] Hyukjin Kwon commented on SPARK-11620: -- [~swethakasireddy] It uses 1.6.0rc3. Hm.. Would please you

[jira] [Comment Edited] (SPARK-11620) parquet.hadoop.ParquetOutputCommitter.commitJob() throws parquet.io.ParquetEncodingException

2015-11-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15015468#comment-15015468 ] Hyukjin Kwon edited comment on SPARK-11620 at 11/20/15 9:19 AM:

[jira] [Comment Edited] (SPARK-11620) parquet.hadoop.ParquetOutputCommitter.commitJob() throws parquet.io.ParquetEncodingException

2015-11-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15015468#comment-15015468 ] Hyukjin Kwon edited comment on SPARK-11620 at 11/20/15 9:20 AM:

[jira] [Comment Edited] (SPARK-15840) New csv reader does not "determine the input schema"

2016-06-09 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322140#comment-15322140 ] Hyukjin Kwon edited comment on SPARK-15840 at 6/9/16 8:24 AM: -- There is

[jira] [Commented] (SPARK-15840) New csv reader does not "determine the input schema"

2016-06-09 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322140#comment-15322140 ] Hyukjin Kwon commented on SPARK-15840: -- There is {{inferSchema}} option but it seems it was missed

[jira] [Commented] (SPARK-15840) New csv reader does not "determine the input schema"

2016-06-09 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322147#comment-15322147 ] Hyukjin Kwon commented on SPARK-15840: -- For custom dateFormat, here there are,

[jira] [Commented] (SPARK-13638) Support for saving with a quote mode

2016-05-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15301013#comment-15301013 ] Hyukjin Kwon commented on SPARK-13638: -- [~rxin][~jurriaanpruis] Yes, it is about that. I also agree

[jira] [Commented] (SPARK-15916) JDBC AND/OR operator push down does not respect lower OR operator precedence

2016-06-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15327106#comment-15327106 ] Hyukjin Kwon commented on SPARK-15916: -- Indeed. Do you mind if I submit a PR for this? > JDBC

[jira] [Commented] (SPARK-15918) unionAll returns wrong result when two dataframes has schema in different order

2016-06-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328851#comment-15328851 ] Hyukjin Kwon commented on SPARK-15918: -- Actually, I met this case before and was thinking it might

[jira] [Commented] (SPARK-15393) Writing empty Dataframes doesn't save any _metadata files

2016-06-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15309966#comment-15309966 ] Hyukjin Kwon commented on SPARK-15393: -- But if it does not write anything, it will lose its schema.

[jira] [Commented] (SPARK-15393) Writing empty Dataframes doesn't save any _metadata files

2016-06-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15310023#comment-15310023 ] Hyukjin Kwon commented on SPARK-15393: -- Yes but how can we read the schema back if there isn't any

[jira] [Commented] (SPARK-15393) Writing empty Dataframes doesn't save any _metadata files

2016-06-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15310033#comment-15310033 ] Hyukjin Kwon commented on SPARK-15393: -- Hive would be okay because the schemas are stored in a

[jira] [Updated] (SPARK-14480) Remove meaningless StringIteratorReader for CSV data source for better performance

2016-06-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-14480: - Summary: Remove meaningless StringIteratorReader for CSV data source for better performance

[jira] [Created] (SPARK-16103) Share a single Row for CSV data source rather than creating every time

2016-06-21 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-16103: Summary: Share a single Row for CSV data source rather than creating every time Key: SPARK-16103 URL: https://issues.apache.org/jira/browse/SPARK-16103 Project:

[jira] [Commented] (SPARK-16099) Refatoring CSV data source and improve performance

2016-06-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15341659#comment-15341659 ] Hyukjin Kwon commented on SPARK-16099: -- Basically it was splitted because the original PR was a bit

[jira] [Created] (SPARK-16101) Refactoring CSV data source to be consistent with JSON data source

2016-06-21 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-16101: Summary: Refactoring CSV data source to be consistent with JSON data source Key: SPARK-16101 URL: https://issues.apache.org/jira/browse/SPARK-16101 Project: Spark

[jira] [Created] (SPARK-16099) Refatoring CSV data source and improve performance

2016-06-21 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-16099: Summary: Refatoring CSV data source and improve performance Key: SPARK-16099 URL: https://issues.apache.org/jira/browse/SPARK-16099 Project: Spark Issue

[jira] [Updated] (SPARK-14480) Remove meaningless StringIteratorReader for CSV data source for better performance

2016-06-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-14480: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-16099 > Remove meaningless

[jira] [Created] (SPARK-16102) Use Record API from Univocity rather than current data cast API.

2016-06-21 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-16102: Summary: Use Record API from Univocity rather than current data cast API. Key: SPARK-16102 URL: https://issues.apache.org/jira/browse/SPARK-16102 Project: Spark

[jira] [Updated] (SPARK-16102) Use Record API from Univocity rather than current data cast API.

2016-06-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-16102: - Affects Version/s: 2.0.0 > Use Record API from Univocity rather than current data cast API. >

[jira] [Created] (SPARK-16104) Do not creaate CSV writer object for every flush when writing

2016-06-21 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-16104: Summary: Do not creaate CSV writer object for every flush when writing Key: SPARK-16104 URL: https://issues.apache.org/jira/browse/SPARK-16104 Project: Spark

[jira] [Commented] (SPARK-15393) Writing empty Dataframes doesn't save any _metadata files

2016-06-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15341715#comment-15341715 ] Hyukjin Kwon commented on SPARK-15393: -- This is said in the comment above. The example I ran is

[jira] [Created] (SPARK-16044) input_file_name() returns empty strings in data sources based on NewHadoopRDD.

2016-06-18 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-16044: Summary: input_file_name() returns empty strings in data sources based on NewHadoopRDD. Key: SPARK-16044 URL: https://issues.apache.org/jira/browse/SPARK-16044

[jira] [Commented] (SPARK-16168) Spark sql can not read ORC table

2016-06-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15347481#comment-15347481 ] Hyukjin Kwon commented on SPARK-16168: -- I think it would be nicer if there are some cores to

[jira] [Commented] (SPARK-16172) SQL Context's

2016-06-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15347477#comment-15347477 ] Hyukjin Kwon commented on SPARK-16172: -- Do you mind updating the title to be in more details and

[jira] [Created] (SPARK-12871) Support to specify the option for compression codec.

2016-01-17 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-12871: Summary: Support to specify the option for compression codec. Key: SPARK-12871 URL: https://issues.apache.org/jira/browse/SPARK-12871 Project: Spark Issue

[jira] [Commented] (SPARK-12872) Support to specify the option for compression codec for JSON datasource.

2016-01-17 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15104864#comment-15104864 ] Hyukjin Kwon commented on SPARK-12872: -- I will work on this. > Support to specify the option for

[jira] [Commented] (SPARK-12871) Support to specify the option for compression codec.

2016-01-17 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15104862#comment-15104862 ] Hyukjin Kwon commented on SPARK-12871: -- I will work on this. > Support to specify the option for

[jira] [Created] (SPARK-12872) Support to specify the option for compression codec for JSON datasource.

2016-01-17 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-12872: Summary: Support to specify the option for compression codec for JSON datasource. Key: SPARK-12872 URL: https://issues.apache.org/jira/browse/SPARK-12872 Project:

[jira] [Updated] (SPARK-12871) Support to specify the option for compression codec.

2016-01-17 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-12871: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-12420 > Support to specify the

[jira] [Commented] (SPARK-12901) Refector options to be correctly formed in a case class

2016-01-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15106354#comment-15106354 ] Hyukjin Kwon commented on SPARK-12901: -- Ah.. got it. > Refector options to be correctly formed in a

[jira] [Updated] (SPARK-12901) Refector options to be correctly formed in a case class

2016-01-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-12901: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-12420 > Refector options to be

[jira] [Created] (SPARK-12901) Refector options to be correctly formed in a case class

2016-01-18 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-12901: Summary: Refector options to be correctly formed in a case class Key: SPARK-12901 URL: https://issues.apache.org/jira/browse/SPARK-12901 Project: Spark

[jira] [Commented] (SPARK-12901) Refector options to be correctly formed in a case class

2016-01-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15106347#comment-15106347 ] Hyukjin Kwon commented on SPARK-12901: -- Sure. > Refector options to be correctly formed in a case

[jira] [Reopened] (SPARK-12901) Refector options to be correctly formed in a case class

2016-01-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reopened SPARK-12901: -- > Refector options to be correctly formed in a case class >

[jira] [Commented] (SPARK-12901) Refector options to be correctly formed in a case class

2016-01-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15106310#comment-15106310 ] Hyukjin Kwon commented on SPARK-12901: -- I will work on this. > Refector options to be correctly

[jira] [Resolved] (SPARK-12901) Refector options to be correctly formed in a case class

2016-01-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12901. -- Resolution: Invalid > Refector options to be correctly formed in a case class >

[jira] [Created] (SPARK-16250) Can't use escapeQuotes option in DataFrameWriter.csv()

2016-06-28 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-16250: Summary: Can't use escapeQuotes option in DataFrameWriter.csv() Key: SPARK-16250 URL: https://issues.apache.org/jira/browse/SPARK-16250 Project: Spark Issue

[jira] [Commented] (SPARK-15393) Writing empty Dataframes doesn't save any _metadata files

2016-06-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15350025#comment-15350025 ] Hyukjin Kwon commented on SPARK-15393: -- Because we need to keep the schema somewhere. This is not

[jira] [Created] (SPARK-16216) CSV data source does not write date and timestamp correctly

2016-06-25 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-16216: Summary: CSV data source does not write date and timestamp correctly Key: SPARK-16216 URL: https://issues.apache.org/jira/browse/SPARK-16216 Project: Spark

[jira] [Comment Edited] (SPARK-16188) Spark sql create a lot of small files

2016-06-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15349581#comment-15349581 ] Hyukjin Kwon edited comment on SPARK-16188 at 6/25/16 10:04 AM: Is this a

[jira] [Commented] (SPARK-16188) Spark sql create a lot of small files

2016-06-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15349581#comment-15349581 ] Hyukjin Kwon commented on SPARK-16188: -- Is this a duplicated of SPARK-10216 maybe? > Spark sql

[jira] [Commented] (SPARK-13260) count(*) does not work with CSV data source

2016-02-11 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15142427#comment-15142427 ] Hyukjin Kwon commented on SPARK-13260: -- [~falaki] Since this is a quicky fix, I will submit a PR

[jira] [Commented] (SPARK-13260) count(*) does not work with CSV data source

2016-02-11 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15142421#comment-15142421 ] Hyukjin Kwon commented on SPARK-13260: -- [~falaki] Could I work on this if you are not? > count(*)

[jira] [Commented] (SPARK-13260) count(*) does not work with CSV data source

2016-02-11 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15142420#comment-15142420 ] Hyukjin Kwon commented on SPARK-13260: -- It is 0 and it just does not work. This was because of the

[jira] [Commented] (SPARK-8000) SQLContext.read.load() should be able to auto-detect input data

2016-02-11 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15142412#comment-15142412 ] Hyukjin Kwon commented on SPARK-8000: - [~yanboliang] Are you working on this or is this already fixed?

[jira] [Commented] (SPARK-12671) Improve tests for better coverage

2016-01-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15118748#comment-15118748 ] Hyukjin Kwon commented on SPARK-12671: -- I do not want to make it complicated though. I think this

[jira] [Commented] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-27 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15118840#comment-15118840 ] Hyukjin Kwon commented on SPARK-12890: -- [~rxin] Could you confirm if this is an issue? > Spark SQL

[jira] [Comment Edited] (SPARK-12996) CSVRelation should be based on HadoopFsRelation

2016-01-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15118727#comment-15118727 ] Hyukjin Kwon edited comment on SPARK-12996 at 1/27/16 6:20 AM: --- Yes I

[jira] [Comment Edited] (SPARK-12997) Use cast expression to perform type cast in csv

2016-01-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15118745#comment-15118745 ] Hyukjin Kwon edited comment on SPARK-12997 at 1/27/16 7:00 AM: --- I would

[jira] [Comment Edited] (SPARK-12997) Use cast expression to perform type cast in csv

2016-01-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15118745#comment-15118745 ] Hyukjin Kwon edited comment on SPARK-12997 at 1/27/16 7:01 AM: --- I would

[jira] [Commented] (SPARK-12996) CSVRelation should be based on HadoopFsRelation

2016-01-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15118727#comment-15118727 ] Hyukjin Kwon commented on SPARK-12996: -- Yes I would. But does this already extend

[jira] [Commented] (SPARK-12997) Use cast expression to perform type cast in csv

2016-01-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15118745#comment-15118745 ] Hyukjin Kwon commented on SPARK-12997: -- I would like to try this; however, how about just using

[jira] [Commented] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-01-31 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15125802#comment-15125802 ] Hyukjin Kwon commented on SPARK-13108: -- I will work on this but here are several things to say. Now

[jira] [Updated] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-01-31 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13108: - Description: This library uses Hadoop's

[jira] [Created] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-01-31 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13108: Summary: Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.) Key: SPARK-13108 URL: https://issues.apache.org/jira/browse/SPARK-13108 Project:

[jira] [Commented] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-02-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131843#comment-15131843 ] Hyukjin Kwon commented on SPARK-13108: -- [~rxin] I just made an {{InputFormat}} for this. Let me push

[jira] [Created] (SPARK-13184) Support minPartitions parameter for JSON and CSV datasources as options

2016-02-03 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13184: Summary: Support minPartitions parameter for JSON and CSV datasources as options Key: SPARK-13184 URL: https://issues.apache.org/jira/browse/SPARK-13184 Project:

[jira] [Commented] (SPARK-13137) NullPoingException in schema inference for CSV when the first line is empty

2016-02-02 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127861#comment-15127861 ] Hyukjin Kwon commented on SPARK-13137: -- I will work on this. > NullPoingException in schema

[jira] [Commented] (SPARK-13114) java.lang.NegativeArraySizeException in CSV

2016-02-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127792#comment-15127792 ] Hyukjin Kwon commented on SPARK-13114: -- This is already fixed in Spark,

[jira] [Created] (SPARK-13137) NullPoingException in schema inference for CSV when the first line is empty

2016-02-01 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13137: Summary: NullPoingException in schema inference for CSV when the first line is empty Key: SPARK-13137 URL: https://issues.apache.org/jira/browse/SPARK-13137 Project:

[jira] [Commented] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-02-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127623#comment-15127623 ] Hyukjin Kwon commented on SPARK-13108: -- I think both would be great. > Encoding not working with

[jira] [Commented] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-02-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127651#comment-15127651 ] Hyukjin Kwon commented on SPARK-13108: -- Oh sorry I re-read and it looks funny. I meant I will submit

[jira] [Commented] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-02-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127657#comment-15127657 ] Hyukjin Kwon commented on SPARK-13108: -- Sure. It needs to re-write Hadoop's LineRecordReader,

[jira] [Comment Edited] (SPARK-13108) Encoding not working with non-ascii compatible encodings (UTF-16/32 etc.)

2016-02-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127656#comment-15127656 ] Hyukjin Kwon edited comment on SPARK-13108 at 2/2/16 4:56 AM: -- Sure. It

<    1   2   3   4   5   6   7   8   9   10   >