[jira] [Comment Edited] (SPARK-19638) Filter pushdown not working for struct fields

2017-02-17 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873021#comment-15873021 ] Takeshi Yamamuro edited comment on SPARK-19638 at 2/18/17 6:52 AM: ---

[jira] [Comment Edited] (SPARK-19638) Filter pushdown not working for struct fields

2017-02-17 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873021#comment-15873021 ] Takeshi Yamamuro edited comment on SPARK-19638 at 2/18/17 6:53 AM: ---

[jira] [Updated] (SPARK-19638) Filter pushdown not working for struct fields

2017-02-17 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-19638: - Issue Type: Improvement (was: Bug) > Filter pushdown not working for struct fields >

[jira] [Commented] (SPARK-19638) Filter pushdown not working for struct fields

2017-02-17 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873021#comment-15873021 ] Takeshi Yamamuro commented on SPARK-19638: -- Aha, I got you and you're right; in that case,

[jira] [Updated] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread guifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guifeng updated SPARK-19645: Description: We are trying to use Structured Streaming in product, however currently there exists a

[jira] [Assigned] (SPARK-19654) Structured Streaming API for R

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19654: Assignee: Apache Spark (was: Felix Cheung) > Structured Streaming API for R >

[jira] [Commented] (SPARK-19654) Structured Streaming API for R

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873001#comment-15873001 ] Apache Spark commented on SPARK-19654: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-19654) Structured Streaming API for R

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19654: Assignee: Felix Cheung (was: Apache Spark) > Structured Streaming API for R >

[jira] [Created] (SPARK-19654) Structured Streaming API for R

2017-02-17 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-19654: Summary: Structured Streaming API for R Key: SPARK-19654 URL: https://issues.apache.org/jira/browse/SPARK-19654 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-19637) add to_json APIs to SQL

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19637: Assignee: (was: Apache Spark) > add to_json APIs to SQL > --- > >

[jira] [Commented] (SPARK-19637) add to_json APIs to SQL

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872984#comment-15872984 ] Apache Spark commented on SPARK-19637: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19637) add to_json APIs to SQL

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19637: Assignee: Apache Spark > add to_json APIs to SQL > --- > >

[jira] [Commented] (SPARK-19617) Fix the race condition when starting and stopping a query quickly

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872977#comment-15872977 ] Apache Spark commented on SPARK-19617: -- User 'gf53520' has created a pull request for this issue:

[jira] [Updated] (SPARK-19617) Fix the race condition when starting and stopping a query quickly

2017-02-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19617: - Fix Version/s: 2.2.0 > Fix the race condition when starting and stopping a query quickly >

[jira] [Comment Edited] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread guifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872921#comment-15872921 ] guifeng edited comment on SPARK-19645 at 2/18/17 3:17 AM: -- [~zsxwing] ok, I

[jira] [Comment Edited] (SPARK-19653) `Vector` Type Should Be A First-Class Citizen In Spark SQL

2017-02-17 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872938#comment-15872938 ] Liang-Chi Hsieh edited comment on SPARK-19653 at 2/18/17 3:12 AM: --

[jira] [Commented] (SPARK-19617) Fix the race condition when starting and stopping a query quickly

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872948#comment-15872948 ] Apache Spark commented on SPARK-19617: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Commented] (SPARK-19653) `Vector` Type Should Be A First-Class Citizen In Spark SQL

2017-02-17 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872938#comment-15872938 ] Liang-Chi Hsieh commented on SPARK-19653: - Actually some Spark SQL functions like the mentioned

[jira] [Comment Edited] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread guifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872921#comment-15872921 ] guifeng edited comment on SPARK-19645 at 2/18/17 2:37 AM: -- [~zsxwing] ok, I

[jira] [Commented] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread guifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872921#comment-15872921 ] guifeng commented on SPARK-19645: - [~zsxwing] ok, I think the solution is that determining whether the

[jira] [Commented] (SPARK-19653) `Vector` Type Should Be A First-Class Citizen In Spark SQL

2017-02-17 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872901#comment-15872901 ] Kazuaki Ishizaki commented on SPARK-19653: -- cc: [~cloud_fan] > `Vector` Type Should Be A

[jira] [Updated] (SPARK-13219) Pushdown predicate propagation in SparkSQL with join

2017-02-17 Thread Evan Chan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Evan Chan updated SPARK-13219: -- Hi Gagan, That is an interesting optimization but not the same one that Venu speaks of (I worked on

[jira] [Commented] (SPARK-19653) `Vector` Type Should Be A First-Class Citizen In Spark SQL

2017-02-17 Thread Mike Dusenberry (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872847#comment-15872847 ] Mike Dusenberry commented on SPARK-19653: - cc [~sethah] > `Vector` Type Should Be A First-Class

[jira] [Assigned] (SPARK-19652) REST API does not perform user auth for individual apps

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19652: Assignee: Apache Spark > REST API does not perform user auth for individual apps >

[jira] [Assigned] (SPARK-19652) REST API does not perform user auth for individual apps

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19652: Assignee: (was: Apache Spark) > REST API does not perform user auth for individual

[jira] [Commented] (SPARK-19652) REST API does not perform user auth for individual apps

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872845#comment-15872845 ] Apache Spark commented on SPARK-19652: -- User 'vanzin' has created a pull request for this issue:

[jira] [Commented] (SPARK-19653) `Vector` Type Should Be A First-Class Citizen In Spark SQL

2017-02-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872819#comment-15872819 ] Xiao Li commented on SPARK-19653: - cc [~mengxr] [~josephkb] > `Vector` Type Should Be A First-Class

[jira] [Commented] (SPARK-13219) Pushdown predicate propagation in SparkSQL with join

2017-02-17 Thread gagan taneja (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872811#comment-15872811 ] gagan taneja commented on SPARK-13219: -- This is what we are looking for For example Table Address

[jira] [Commented] (SPARK-19653) `Vector` Type Should Be A First-Class Citizen In Spark SQL

2017-02-17 Thread Mike Dusenberry (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872810#comment-15872810 ] Mike Dusenberry commented on SPARK-19653: - cc [~mlnick], [~smilegator] > `Vector` Type Should Be

[jira] [Created] (SPARK-19653) `Vector` Type Should Be A First-Class Citizen In Spark SQL

2017-02-17 Thread Mike Dusenberry (JIRA)
Mike Dusenberry created SPARK-19653: --- Summary: `Vector` Type Should Be A First-Class Citizen In Spark SQL Key: SPARK-19653 URL: https://issues.apache.org/jira/browse/SPARK-19653 Project: Spark

[jira] [Commented] (SPARK-13219) Pushdown predicate propagation in SparkSQL with join

2017-02-17 Thread gagan taneja (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872809#comment-15872809 ] gagan taneja commented on SPARK-13219: -- Venu This is very interesting i would like to look at the

[jira] [Commented] (SPARK-19609) Broadcast joins should pushdown join constraints as Filter to the larger relation

2017-02-17 Thread gagan taneja (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872797#comment-15872797 ] gagan taneja commented on SPARK-19609: -- This can be further extended to join on the column that are

[jira] [Created] (SPARK-19652) REST API does not perform user auth for individual apps

2017-02-17 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-19652: -- Summary: REST API does not perform user auth for individual apps Key: SPARK-19652 URL: https://issues.apache.org/jira/browse/SPARK-19652 Project: Spark

[jira] [Assigned] (SPARK-19651) ParallelCollectionRDD.collect should not issue a Spark job

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19651: Assignee: Wenchen Fan (was: Apache Spark) > ParallelCollectionRDD.collect should not

[jira] [Commented] (SPARK-19651) ParallelCollectionRDD.collect should not issue a Spark job

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872769#comment-15872769 ] Apache Spark commented on SPARK-19651: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19651) ParallelCollectionRDD.collect should not issue a Spark job

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19651: Assignee: Apache Spark (was: Wenchen Fan) > ParallelCollectionRDD.collect should not

[jira] [Updated] (SPARK-19651) ParallelCollectionRDD.collect should not issue a Spark job

2017-02-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-19651: Summary: ParallelCollectionRDD.collect should not issue a Spark job (was:

[jira] [Updated] (SPARK-19651) ParallelCollectionRDD.collect should not issuse a Spark job

2017-02-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-19651: Summary: ParallelCollectionRDD.collect should not issuse a Spark job (was:

[jira] [Created] (SPARK-19651) ParallelCollectionRDD.colect should not issuse a Spark job

2017-02-17 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-19651: --- Summary: ParallelCollectionRDD.colect should not issuse a Spark job Key: SPARK-19651 URL: https://issues.apache.org/jira/browse/SPARK-19651 Project: Spark

[jira] [Assigned] (SPARK-19650) Metastore-only operations shouldn't trigger a spark job

2017-02-17 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell reassigned SPARK-19650: - Assignee: Sameer Agarwal > Metastore-only operations shouldn't trigger a spark

[jira] [Created] (SPARK-19650) Metastore-only operations shouldn't trigger a spark job

2017-02-17 Thread Sameer Agarwal (JIRA)
Sameer Agarwal created SPARK-19650: -- Summary: Metastore-only operations shouldn't trigger a spark job Key: SPARK-19650 URL: https://issues.apache.org/jira/browse/SPARK-19650 Project: Spark

[jira] [Commented] (SPARK-19525) Enable Compression of RDD Checkpoints

2017-02-17 Thread Aaditya Ramesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872728#comment-15872728 ] Aaditya Ramesh commented on SPARK-19525: Great, I will get the patch ready. > Enable Compression

[jira] [Commented] (SPARK-3877) The exit code of spark-submit is still 0 when an yarn application fails

2017-02-17 Thread Joshua Caplan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872720#comment-15872720 ] Joshua Caplan commented on SPARK-3877: -- Done, as SPARK-19649 . > The exit code of spark-submit is

[jira] [Commented] (SPARK-19525) Enable Compression of RDD Checkpoints

2017-02-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872717#comment-15872717 ] Shixiong Zhu commented on SPARK-19525: -- I see. This is RDD checkpointing. Sounds a good idea. >

[jira] [Updated] (SPARK-19525) Enable Compression of Spark Streaming Checkpoints

2017-02-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19525: - Component/s: (was: Structured Streaming) Spark Core > Enable Compression of

[jira] [Updated] (SPARK-19525) Enable Compression of RDD Checkpoints

2017-02-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19525: - Summary: Enable Compression of RDD Checkpoints (was: Enable Compression of Spark Streaming

[jira] [Updated] (SPARK-19649) Spark YARN client throws exception if job succeeds and max-completed-applications=0

2017-02-17 Thread Joshua Caplan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joshua Caplan updated SPARK-19649: -- Description: I believe the patch in SPARK-3877 created a new race condition between YARN and

[jira] [Commented] (SPARK-19525) Enable Compression of Spark Streaming Checkpoints

2017-02-17 Thread Aaditya Ramesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872698#comment-15872698 ] Aaditya Ramesh commented on SPARK-19525: We are suggesting to compress only before we write the

[jira] [Created] (SPARK-19649) Spark YARN client throws exception if job succeeds and max-completed-applications=0

2017-02-17 Thread Joshua Caplan (JIRA)
Joshua Caplan created SPARK-19649: - Summary: Spark YARN client throws exception if job succeeds and max-completed-applications=0 Key: SPARK-19649 URL: https://issues.apache.org/jira/browse/SPARK-19649

[jira] [Commented] (SPARK-19649) Spark YARN client throws exception if job succeeds and max-completed-applications=0

2017-02-17 Thread Joshua Caplan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872695#comment-15872695 ] Joshua Caplan commented on SPARK-19649: --- Hadoop encountered the same situation and fixed it in

[jira] [Commented] (SPARK-19644) Memory leak in Spark Streaming

2017-02-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872672#comment-15872672 ] Shixiong Zhu commented on SPARK-19644: -- [~deenbandhu] Could you check the GC root, please? These

[jira] [Commented] (SPARK-19525) Enable Compression of Spark Streaming Checkpoints

2017-02-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872652#comment-15872652 ] Shixiong Zhu commented on SPARK-19525: -- Hm, Spark should support compression for data in RDD. Which

[jira] [Closed] (SPARK-19640) Incorrect documentation for MLlib CountVectorizerModel for spark 1.5.2

2017-02-17 Thread Stephen Kinser (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephen Kinser closed SPARK-19640. -- Resolution: Won't Fix > Incorrect documentation for MLlib CountVectorizerModel for spark 1.5.2

[jira] [Commented] (SPARK-19637) add to_json APIs to SQL

2017-02-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872470#comment-15872470 ] Michael Armbrust commented on SPARK-19637: -- From JSON is harder because the second argument is

[jira] [Assigned] (SPARK-19517) KafkaSource fails to initialize partition offsets

2017-02-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reassigned SPARK-19517: Assignee: Roberto Agostino Vitillo > KafkaSource fails to initialize partition offsets >

[jira] [Commented] (SPARK-19525) Enable Compression of Spark Streaming Checkpoints

2017-02-17 Thread Aaditya Ramesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872441#comment-15872441 ] Aaditya Ramesh commented on SPARK-19525: [~zsxwing] Actually, we are compressing the data in the

[jira] [Resolved] (SPARK-19517) KafkaSource fails to initialize partition offsets

2017-02-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-19517. -- Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 > KafkaSource fails to

[jira] [Resolved] (SPARK-18285) approxQuantile in R support multi-column

2017-02-17 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-18285. - Resolution: Fixed Assignee: Yanbo Liang Fix Version/s: 2.2.0 > approxQuantile in

[jira] [Commented] (SPARK-19638) Filter pushdown not working for struct fields

2017-02-17 Thread Nick Dimiduk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872413#comment-15872413 ] Nick Dimiduk commented on SPARK-19638: -- Debugging. I'm looking at the match expression in

[jira] [Resolved] (SPARK-18986) ExternalAppendOnlyMap shouldn't fail when forced to spill before calling its iterator

2017-02-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-18986. Resolution: Fixed Assignee: Liang-Chi Hsieh Fix Version/s: 2.2.0 >

[jira] [Commented] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872319#comment-15872319 ] Shixiong Zhu commented on SPARK-19645: -- [~guifengl...@gmail.com] Thanks for reporting. Could you

[jira] [Assigned] (SPARK-19610) multi line support for CSV

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19610: Assignee: (was: Apache Spark) > multi line support for CSV >

[jira] [Assigned] (SPARK-19610) multi line support for CSV

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19610: Assignee: Apache Spark > multi line support for CSV > -- > >

[jira] [Commented] (SPARK-19610) multi line support for CSV

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872270#comment-15872270 ] Apache Spark commented on SPARK-19610: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Resolved] (SPARK-19500) Fail to spill the aggregated hash map when radix sort is used

2017-02-17 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-19500. Resolution: Fixed Fix Version/s: 2.2.0 2.0.3 2.1.1

[jira] [Assigned] (SPARK-19522) --executor-memory flag doesn't work in local-cluster mode

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19522: Assignee: Apache Spark (was: Andrew Or) > --executor-memory flag doesn't work in

[jira] [Commented] (SPARK-19522) --executor-memory flag doesn't work in local-cluster mode

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872109#comment-15872109 ] Apache Spark commented on SPARK-19522: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19522) --executor-memory flag doesn't work in local-cluster mode

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19522: Assignee: Andrew Or (was: Apache Spark) > --executor-memory flag doesn't work in

[jira] [Commented] (SPARK-19638) Filter pushdown not working for struct fields

2017-02-17 Thread Nick Dimiduk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15872060#comment-15872060 ] Nick Dimiduk commented on SPARK-19638: -- [~maropu] I have placed a breakpoint in the ES connector's

[jira] [Commented] (SPARK-14194) spark csv reader not working properly if CSV content contains CRLF character (newline) in the intermediate cell

2017-02-17 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871975#comment-15871975 ] Dongjoon Hyun commented on SPARK-14194: --- +1 for @srowen 's opinion. > spark csv reader not working

[jira] [Resolved] (SPARK-19593) Records read per each kinesis transaction

2017-02-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19593. --- Resolution: Invalid > Records read per each kinesis transaction >

[jira] [Resolved] (SPARK-19547) KafkaUtil throw 'No current assignment for partition' Exception

2017-02-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19547. --- Resolution: Invalid > KafkaUtil throw 'No current assignment for partition' Exception >

[jira] [Updated] (SPARK-19622) Fix a http error in a paged table when using a `Go` button to search.

2017-02-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19622: -- Fix Version/s: 2.1.1 > Fix a http error in a paged table when using a `Go` button to search. >

[jira] [Resolved] (SPARK-19647) Spark query hive is extremelly slow even the result data is small

2017-02-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19647. --- Resolution: Invalid Questions should go to u...@spark.apache.org > Spark query hive is extremelly

[jira] [Resolved] (SPARK-19622) Fix a http error in a paged table when using a `Go` button to search.

2017-02-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19622. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16953

[jira] [Assigned] (SPARK-19622) Fix a http error in a paged table when using a `Go` button to search.

2017-02-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-19622: - Assignee: StanZhai > Fix a http error in a paged table when using a `Go` button to search. >

[jira] [Created] (SPARK-19648) Unable to access column containing '.' for approxQuantile function on DataFrame

2017-02-17 Thread John Compitello (JIRA)
John Compitello created SPARK-19648: --- Summary: Unable to access column containing '.' for approxQuantile function on DataFrame Key: SPARK-19648 URL: https://issues.apache.org/jira/browse/SPARK-19648

[jira] [Commented] (SPARK-19633) FileSource read from FileSink

2017-02-17 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871876#comment-15871876 ] Liwei Lin commented on SPARK-19633: --- Hi [~marmbrus], I'd like to take this if it's ok by you. Just to

[jira] [Created] (SPARK-19647) Spark query hive is extremelly slow even the result data is small

2017-02-17 Thread wuchang (JIRA)
wuchang created SPARK-19647: --- Summary: Spark query hive is extremelly slow even the result data is small Key: SPARK-19647 URL: https://issues.apache.org/jira/browse/SPARK-19647 Project: Spark

[jira] [Commented] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread guifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871716#comment-15871716 ] guifeng commented on SPARK-19645: - So, I think current workaround is to delete(if delta file exist) and

[jira] [Commented] (SPARK-18091) Deep if expressions cause Generated SpecificUnsafeProjection code to exceed JVM code size limit

2017-02-17 Thread Jose Soltren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871702#comment-15871702 ] Jose Soltren commented on SPARK-18091: -- FWIW, anyone pulling this fix in the future will also want

[jira] [Commented] (SPARK-19646) binaryRecords replicates records in scala API

2017-02-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871678#comment-15871678 ] Sean Owen commented on SPARK-19646: --- I think it's because the array is copied elsewhere as it moves

[jira] [Commented] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871680#comment-15871680 ] Sean Owen commented on SPARK-19645: --- I'm not suggesting it works with Hadoop 2.6; I'm responding to the

[jira] [Commented] (SPARK-16920) Investigate and fix issues introduced in SPARK-15858

2017-02-17 Thread Mahmoud Rawas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871635#comment-15871635 ] Mahmoud Rawas commented on SPARK-16920: --- It seems that there is no N^2 complexity issue, and as for

[jira] [Commented] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread guifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871627#comment-15871627 ] guifeng commented on SPARK-19645: - aBut, spark mater don't set rename options that support overwrite, so

[jira] [Issue Comment Deleted] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread guifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guifeng updated SPARK-19645: Comment: was deleted (was: aBut, spark mater don't set rename options that support overwrite, so I think

[jira] [Comment Edited] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread guifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871552#comment-15871552 ] guifeng edited comment on SPARK-19645 at 2/17/17 10:36 AM: --- But, spark mater

[jira] [Commented] (SPARK-19646) binaryRecords replicates records in scala API

2017-02-17 Thread BahaaEddin AlAila (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871597#comment-15871597 ] BahaaEddin AlAila commented on SPARK-19646: --- What's puzzling though, is I looked at pyspark's

[jira] [Commented] (SPARK-19646) binaryRecords replicates records in scala API

2017-02-17 Thread BahaaEddin AlAila (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871595#comment-15871595 ] BahaaEddin AlAila commented on SPARK-19646: --- Thank you very much for the speedy fix! >

[jira] [Comment Edited] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread guifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871552#comment-15871552 ] guifeng edited comment on SPARK-19645 at 2/17/17 9:50 AM: -- But Spark master

[jira] [Commented] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread guifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871552#comment-15871552 ] guifeng commented on SPARK-19645: - But Spark master doesn't set rename options that support overwrite. >

[jira] [Commented] (SPARK-19533) Convert Java examples to use lambdas, Java 8 features

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871545#comment-15871545 ] Apache Spark commented on SPARK-19533: -- User 'srowen' has created a pull request for this issue:

[jira] [Issue Comment Deleted] (SPARK-19534) Convert Java tests to use lambdas, Java 8 features

2017-02-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19534: -- Comment: was deleted (was: User 'srowen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19533) Convert Java examples to use lambdas, Java 8 features

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19533: Assignee: Sean Owen (was: Apache Spark) > Convert Java examples to use lambdas, Java 8

[jira] [Assigned] (SPARK-19533) Convert Java examples to use lambdas, Java 8 features

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19533: Assignee: Apache Spark (was: Sean Owen) > Convert Java examples to use lambdas, Java 8

[jira] [Commented] (SPARK-19646) binaryRecords replicates records in scala API

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871543#comment-15871543 ] Apache Spark commented on SPARK-19646: -- User 'srowen' has created a pull request for this issue:

[jira] [Commented] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871542#comment-15871542 ] Sean Owen commented on SPARK-19645: --- Note that master requires Hadoop 2.6+ now. > structured streaming

[jira] [Assigned] (SPARK-19646) binaryRecords replicates records in scala API

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19646: Assignee: Sean Owen (was: Apache Spark) > binaryRecords replicates records in scala API

[jira] [Assigned] (SPARK-19646) binaryRecords replicates records in scala API

2017-02-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19646: Assignee: Apache Spark (was: Sean Owen) > binaryRecords replicates records in scala API

[jira] [Commented] (SPARK-19645) structured streaming job restart bug

2017-02-17 Thread guifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15871536#comment-15871536 ] guifeng commented on SPARK-19645: - Spark's default Hadoop version is Hadoop 2.2, whose rename method doesn't
