[jira] [Commented] (SPARK-14171) UDAF aggregates argument object inspector not parsed correctly

2016-10-27 Thread Song Jun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15614192#comment-15614192 ] Song Jun commented on SPARK-14171: -- This issue does not reproduce on the recently spark branch master

[jira] [Assigned] (SPARK-18137) RewriteDistinctAggregates UnresolvedException when a UDAF has a foldable TypeCheck

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18137: Assignee: (was: Apache Spark) > RewriteDistinctAggregates UnresolvedException when a

[jira] [Commented] (SPARK-18137) RewriteDistinctAggregates UnresolvedException when a UDAF has a foldable TypeCheck

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15614176#comment-15614176 ] Apache Spark commented on SPARK-18137: -- User 'windpiger' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18137) RewriteDistinctAggregates UnresolvedException when a UDAF has a foldable TypeCheck

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18137: Assignee: Apache Spark > RewriteDistinctAggregates UnresolvedException when a UDAF has a

[jira] [Updated] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-10-27 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Don Drake updated SPARK-16845: -- Attachment: error.txt.zip Does this generated code help in resolving this? >

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-10-27 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15614100#comment-15614100 ] Don Drake commented on SPARK-16845: --- I'm struggling to get a simple case created. I'm curious though,

[jira] [Updated] (SPARK-18055) Dataset.flatMap can't work with types from customized jar

2016-10-27 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-18055: Attachment: test-jar_2.11-1.0.jar This jar is built with file MyData.scala {code} case class

[jira] [Created] (SPARK-18149) build side decision based on cbo

2016-10-27 Thread Zhenhua Wang (JIRA)
Zhenhua Wang created SPARK-18149: Summary: build side decision based on cbo Key: SPARK-18149 URL: https://issues.apache.org/jira/browse/SPARK-18149 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-17079) broadcast decision based on cbo

2016-10-27 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-17079: - Description: We decide if broadcast join should be used based on the cardinality and size of

[jira] [Assigned] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18107: Assignee: Apache Spark > Insert overwrite statement runs much slower in spark-sql than it

[jira] [Commented] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15614009#comment-15614009 ] Apache Spark commented on SPARK-18107: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18107: Assignee: (was: Apache Spark) > Insert overwrite statement runs much slower in

[jira] [Created] (SPARK-18148) Misleading Error Message for Aggregation Without Window/GroupBy

2016-10-27 Thread Pat McDonough (JIRA)
Pat McDonough created SPARK-18148: - Summary: Misleading Error Message for Aggregation Without Window/GroupBy Key: SPARK-18148 URL: https://issues.apache.org/jira/browse/SPARK-18148 Project: Spark

[jira] [Commented] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-27 Thread J.P Feng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613944#comment-15613944 ] J.P Feng commented on SPARK-18107: -- Ok, it sounds good, thanks! I would have a try later. > Insert

[jira] [Comment Edited] (SPARK-15616) Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available.

2016-10-27 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613846#comment-15613846 ] Lianhui Wang edited comment on SPARK-15616 at 10/28/16 12:54 AM: - Yes, I

[jira] [Commented] (SPARK-15616) Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available.

2016-10-27 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613846#comment-15613846 ] Lianhui Wang commented on SPARK-15616: -- Yes, I think it can. But now the PR is based on branch 2.0,

[jira] [Commented] (SPARK-13331) AES support for over-the-wire encryption

2016-10-27 Thread Junjie Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613837#comment-15613837 ] Junjie Chen commented on SPARK-13331: - Hi [~vanzin] Do we need more review? > AES support for

[jira] [Updated] (SPARK-18121) Unable to query global temp views when hive support is enabled.

2016-10-27 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-18121: Assignee: Sunitha Kambhampati > Unable to query global temp views when hive support is enabled.

[jira] [Resolved] (SPARK-18121) Unable to query global temp views when hive support is enabled.

2016-10-27 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-18121. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15649

[jira] [Commented] (SPARK-11421) Add the ability to add a jar to the current class loader

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613802#comment-15613802 ] Apache Spark commented on SPARK-11421: -- User 'mariusvniekerk' has created a pull request for this

[jira] [Updated] (SPARK-17153) [Structured streams] readStream ignores partition columns

2016-10-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-17153: - Labels: release_notes releasenotes (was: release_notes) > [Structured streams] readStream ignores

[jira] [Comment Edited] (SPARK-18125) Spark generated code causes CompileException when groupByKey, reduceGroups and map(_._2) are used

2016-10-27 Thread Ray Qiu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613682#comment-15613682 ] Ray Qiu edited comment on SPARK-18125 at 10/27/16 11:54 PM: Try this in

[jira] [Commented] (SPARK-17153) [Structured streams] readStream ignores partition columns

2016-10-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613705#comment-15613705 ] Yin Huai commented on SPARK-17153: -- This change needs a release note because

[jira] [Issue Comment Deleted] (SPARK-18125) Spark generated code causes CompileException when groupByKey, reduceGroups and map(_._2) are used

2016-10-27 Thread Ray Qiu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Qiu updated SPARK-18125: Comment: was deleted (was: Same thing works fine in 2.0.0 -- Regards, Ray ) > Spark generated code

[jira] [Comment Edited] (SPARK-18125) Spark generated code causes CompileException when groupByKey, reduceGroups and map(_._2) are used

2016-10-27 Thread Ray Qiu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613682#comment-15613682 ] Ray Qiu edited comment on SPARK-18125 at 10/27/16 11:54 PM: Try this in

[jira] [Updated] (SPARK-17153) [Structured streams] readStream ignores partition columns

2016-10-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-17153: - Labels: release_notes (was: ) > [Structured streams] readStream ignores partition columns >

[jira] [Commented] (SPARK-18125) Spark generated code causes CompileException when groupByKey, reduceGroups and map(_._2) are used

2016-10-27 Thread Ray Qiu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613704#comment-15613704 ] Ray Qiu commented on SPARK-18125: - Same thing works fine in 2.0.0 -- Regards, Ray > Spark

[jira] [Commented] (SPARK-18125) Spark generated code causes CompileException when groupByKey, reduceGroups and map(_._2) are used

2016-10-27 Thread Ray Qiu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613682#comment-15613682 ] Ray Qiu commented on SPARK-18125: - Try this in spark-shell: case class Route(src: String, dest: String,

[jira] [Comment Edited] (SPARK-16648) LAST_VALUE(FALSE) OVER () throws IndexOutOfBoundsException

2016-10-27 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613656#comment-15613656 ] Emlyn Corrin edited comment on SPARK-16648 at 10/27/16 11:33 PM: - Since

[jira] [Commented] (SPARK-16648) LAST_VALUE(FALSE) OVER () throws IndexOutOfBoundsException

2016-10-27 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613656#comment-15613656 ] Emlyn Corrin commented on SPARK-16648: -- Since Spark 2.0.1, the following snippet fails (I believe it

[jira] [Assigned] (SPARK-18146) Avoid using Union to chain together create table and repair partition commands

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18146: Assignee: Apache Spark > Avoid using Union to chain together create table and repair

[jira] [Assigned] (SPARK-18146) Avoid using Union to chain together create table and repair partition commands

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18146: Assignee: (was: Apache Spark) > Avoid using Union to chain together create table and

[jira] [Commented] (SPARK-18146) Avoid using Union to chain together create table and repair partition commands

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613584#comment-15613584 ] Apache Spark commented on SPARK-18146: -- User 'ericl' has created a pull request for this issue:

[jira] [Commented] (SPARK-18125) Spark generated code causes CompileException when groupByKey, reduceGroups and map(_._2) are used

2016-10-27 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613580#comment-15613580 ] Herman van Hovell commented on SPARK-18125: --- I tried something like this on master and on

[jira] [Updated] (SPARK-18147) Broken Spark SQL Codegen

2016-10-27 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18147: - Target Version/s: 2.1.0 Priority: Critical (was: Minor) > Broken Spark SQL

[jira] [Commented] (SPARK-18147) Broken Spark SQL Codegen

2016-10-27 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613555#comment-15613555 ] Michael Armbrust commented on SPARK-18147: -- /cc [~cloud_fan] > Broken Spark SQL Codegen >

[jira] [Commented] (SPARK-18125) Spark generated code causes CompileException when groupByKey, reduceGroups and map(_._2) are used

2016-10-27 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613497#comment-15613497 ] Herman van Hovell commented on SPARK-18125: --- Could one of you provide a reproducible example.

[jira] [Updated] (SPARK-18125) Spark generated code causes CompileException when groupByKey, reduceGroups and map(_._2) are used

2016-10-27 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18125: -- Description: Code logic looks like this: {noformat} .groupByKey

[jira] [Commented] (SPARK-18147) Broken Spark SQL Codegen

2016-10-27 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613472#comment-15613472 ] koert kuipers commented on SPARK-18147: --- it also breaks with an option of a case class. like this:

[jira] [Updated] (SPARK-18147) Broken Spark SQL Codegen

2016-10-27 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] koert kuipers updated SPARK-18147: -- Description: this is me on purpose trying to break spark sql codegen to uncover potential

[jira] [Commented] (SPARK-11046) Pass schema from R to JVM using JSON format

2016-10-27 Thread Sammie Durugo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613459#comment-15613459 ] Sammie Durugo commented on SPARK-11046: --- I'm not sure anyone has noticed that nested schema cannot

[jira] [Assigned] (SPARK-18123) org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils.saveTable the case senstivity issue

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18123: Assignee: (was: Apache Spark) >

[jira] [Commented] (SPARK-18123) org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils.saveTable the case senstivity issue

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613453#comment-15613453 ] Apache Spark commented on SPARK-18123: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-18123) org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils.saveTable the case senstivity issue

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18123: Assignee: Apache Spark >

[jira] [Created] (SPARK-18147) Broken Spark SQL Codegen

2016-10-27 Thread koert kuipers (JIRA)
koert kuipers created SPARK-18147: - Summary: Broken Spark SQL Codegen Key: SPARK-18147 URL: https://issues.apache.org/jira/browse/SPARK-18147 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-18146) Avoid using Union to chain together create table and repair partition commands

2016-10-27 Thread Eric Liang (JIRA)
Eric Liang created SPARK-18146: -- Summary: Avoid using Union to chain together create table and repair partition commands Key: SPARK-18146 URL: https://issues.apache.org/jira/browse/SPARK-18146 Project:

[jira] [Updated] (SPARK-18145) Update documentation

2016-10-27 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Liang updated SPARK-18145: --- Issue Type: Sub-task (was: Documentation) Parent: SPARK-17861 > Update documentation >

[jira] [Updated] (SPARK-18145) Update documentation for hive partition management in 2.1

2016-10-27 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Liang updated SPARK-18145: --- Component/s: SQL > Update documentation for hive partition management in 2.1 >

[jira] [Updated] (SPARK-18145) Update documentation for hive partition management in 2.1

2016-10-27 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Liang updated SPARK-18145: --- Summary: Update documentation for hive partition management in 2.1 (was: Update documentation) >

[jira] [Created] (SPARK-18145) Update documentation

2016-10-27 Thread Eric Liang (JIRA)
Eric Liang created SPARK-18145: -- Summary: Update documentation Key: SPARK-18145 URL: https://issues.apache.org/jira/browse/SPARK-18145 Project: Spark Issue Type: Documentation

[jira] [Resolved] (SPARK-17970) Use metastore for managing filesource table partitions as well

2016-10-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-17970. -- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15515

[jira] [Commented] (SPARK-17829) Stable format for offset log

2016-10-27 Thread Tyson Condie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613253#comment-15613253 ] Tyson Condie commented on SPARK-17829: -- Thanks Code for the clarification. My background is mostly

[jira] [Issue Comment Deleted] (SPARK-17891) SQL-based three column join loses first column

2016-10-27 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-17891: Comment: was deleted (was: *Workaround:* # Disable BroadcastHashJoin by setting

[jira] [Updated] (SPARK-10561) Provide tooling for auto-generating Spark SQL reference manual

2016-10-27 Thread Ted Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated SPARK-10561: --- Labels: tool (was: ) > Provide tooling for auto-generating Spark SQL reference manual >

[jira] [Updated] (SPARK-16137) Random Forest wrapper in SparkR

2016-10-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-16137: -- Shepherd: (was: Joseph K. Bradley) > Random Forest wrapper in SparkR >

[jira] [Updated] (SPARK-18109) Log instrumentation in GMM

2016-10-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18109: -- Shepherd: Joseph K. Bradley > Log instrumentation in GMM > --

[jira] [Updated] (SPARK-18109) Log instrumentation in GMM

2016-10-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18109: -- Assignee: zhengruifeng > Log instrumentation in GMM > -- > >

[jira] [Commented] (SPARK-18125) Spark generated code causes CompileException when groupByKey, reduceGroups and map(_._2) are used

2016-10-27 Thread Ray Qiu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613099#comment-15613099 ] Ray Qiu commented on SPARK-18125: - Move to Priority Critical unless a workaround is identified. This is

[jira] [Updated] (SPARK-18125) Spark generated code causes CompileException when groupByKey, reduceGroups and map(_._2) are used

2016-10-27 Thread Ray Qiu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Qiu updated SPARK-18125: Priority: Critical (was: Major) > Spark generated code causes CompileException when groupByKey,

[jira] [Comment Edited] (SPARK-18123) org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils.saveTable the case senstivity issue

2016-10-27 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613088#comment-15613088 ] Dongjoon Hyun edited comment on SPARK-18123 at 10/27/16 8:20 PM: - Hi,

[jira] [Commented] (SPARK-18123) org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils.saveTable the case senstivity issue

2016-10-27 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613088#comment-15613088 ] Dongjoon Hyun commented on SPARK-18123: --- Hi, [~zwu@gmail.com]. There is two related things

[jira] [Updated] (SPARK-18144) StreamingQueryListener.QueryStartedEvent is not written to event log

2016-10-27 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18144: - Affects Version/s: 2.0.0 2.0.1 > StreamingQueryListener.QueryStartedEvent

[jira] [Created] (SPARK-18144) StreamingQueryListener.QueryStartedEvent is not written to event log

2016-10-27 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-18144: Summary: StreamingQueryListener.QueryStartedEvent is not written to event log Key: SPARK-18144 URL: https://issues.apache.org/jira/browse/SPARK-18144 Project: Spark

[jira] [Assigned] (SPARK-18143) History Server is broken because of the refactoring work in Structured Streaming

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18143: Assignee: (was: Apache Spark) > History Server is broken because of the refactoring

[jira] [Commented] (SPARK-18143) History Server is broken because of the refactoring work in Structured Streaming

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612998#comment-15612998 ] Apache Spark commented on SPARK-18143: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18143) History Server is broken because of the refactoring work in Structured Streaming

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18143: Assignee: Apache Spark > History Server is broken because of the refactoring work in

[jira] [Updated] (SPARK-18143) History Server is broken because of the refactoring work in Structured Streaming

2016-10-27 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18143: - Issue Type: Sub-task (was: Bug) Parent: SPARK-8360 > History Server is broken because

[jira] [Created] (SPARK-18143) History Server is broken because of the refactoring work in Structured Streaming

2016-10-27 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-18143: Summary: History Server is broken because of the refactoring work in Structured Streaming Key: SPARK-18143 URL: https://issues.apache.org/jira/browse/SPARK-18143

[jira] [Comment Edited] (SPARK-15616) Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available.

2016-10-27 Thread Franck Tago (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612937#comment-15612937 ] Franck Tago edited comment on SPARK-15616 at 10/27/16 7:35 PM: --- Hi In my

[jira] [Commented] (SPARK-15616) Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available.

2016-10-27 Thread Franck Tago (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612937#comment-15612937 ] Franck Tago commented on SPARK-15616: - Hi In my case the filter is on a partition key , so i

[jira] [Created] (SPARK-18142) Spark Master tries to launch workers 145 times within 1 minute

2016-10-27 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-18142: --- Summary: Spark Master tries to launch workers 145 times within 1 minute Key: SPARK-18142 URL: https://issues.apache.org/jira/browse/SPARK-18142 Project: Spark

[jira] [Updated] (SPARK-18142) Spark Master tries to launch workers 145 times within 1 minute

2016-10-27 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-18142: Component/s: Spark Core > Spark Master tries to launch workers 145 times within 1 minute >

[jira] [Resolved] (SPARK-17219) QuantileDiscretizer should handle NaN values gracefully

2016-10-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-17219. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15428

[jira] [Assigned] (SPARK-18141) jdbc datasource read fails when quoted columns (eg:mixed case, reserved words) in source table are used in the filter.

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18141: Assignee: (was: Apache Spark) > jdbc datasource read fails when quoted columns

[jira] [Assigned] (SPARK-18141) jdbc datasource read fails when quoted columns (eg:mixed case, reserved words) in source table are used in the filter.

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18141: Assignee: Apache Spark > jdbc datasource read fails when quoted columns (eg:mixed case,

[jira] [Commented] (SPARK-18141) jdbc datasource read fails when quoted columns (eg:mixed case, reserved words) in source table are used in the filter.

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612762#comment-15612762 ] Apache Spark commented on SPARK-18141: -- User 'sureshthalamati' has created a pull request for this

[jira] [Commented] (SPARK-18105) LZ4 failed to decompress a stream of shuffled data

2016-10-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612686#comment-15612686 ] Davies Liu commented on SPARK-18105: It turned out that the bug in LZ4 is a false alarm, so close the

[jira] [Updated] (SPARK-18105) LZ4 failed to decompress a stream of shuffled data

2016-10-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-18105: --- Priority: Major (was: Blocker) > LZ4 failed to decompress a stream of shuffled data >

[jira] [Created] (SPARK-18141) jdbc datasource read fails when quoted columns (eg:mixed case, reserved words) in source table are used in the filter.

2016-10-27 Thread Suresh Thalamati (JIRA)
Suresh Thalamati created SPARK-18141: Summary: jdbc datasource read fails when quoted columns (eg:mixed case, reserved words) in source table are used in the filter. Key: SPARK-18141 URL:

[jira] [Commented] (SPARK-17935) Add KafkaForeachWriter in external kafka-0.8.0 for structured streaming module

2016-10-27 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612663#comment-15612663 ] Cody Koeninger commented on SPARK-17935: So the main thing to point out is that Kafka producers

[jira] [Commented] (SPARK-16963) Change Source API so that sources do not need to keep unbounded state

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612609#comment-15612609 ] Apache Spark commented on SPARK-16963: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Resolved] (SPARK-17813) Maximum data per trigger

2016-10-27 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-17813. -- Resolution: Fixed Assignee: Cody Koeninger Fix Version/s: 2.1.0

[jira] [Updated] (SPARK-16078) from_utc_timestamp/to_utc_timestamp may give different result in different timezone

2016-10-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-16078: --- Fix Version/s: 1.6.3 > from_utc_timestamp/to_utc_timestamp may give different result in different >

[jira] [Updated] (SPARK-18085) Better History Server scalability for many / large applications

2016-10-27 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-18085: --- Summary: Better History Server scalability for many / large applications (was: Scalability

[jira] [Updated] (SPARK-17822) JVMObjectTracker.objMap may leak JVM objects

2016-10-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-17822: - Target Version/s: 2.0.3, 2.1.0 (was: 2.0.2, 2.1.0) > JVMObjectTracker.objMap may leak JVM objects >

[jira] [Updated] (SPARK-17823) Make JVMObjectTracker.objMap thread-safe

2016-10-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-17823: - Target Version/s: 2.0.3, 2.1.0 (was: 2.0.2, 2.1.0) > Make JVMObjectTracker.objMap thread-safe >

[jira] [Commented] (SPARK-18123) org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils.saveTable the case senstivity issue

2016-10-27 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612167#comment-15612167 ] Dongjoon Hyun commented on SPARK-18123: --- Thank you. I'll make a PR to fix this. >

[jira] [Commented] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-10-27 Thread Christian Zorneck (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612136#comment-15612136 ] Christian Zorneck commented on SPARK-18134: --- Spark was compatible in this case until Spark 1.4.

[jira] [Commented] (SPARK-18085) Scalability enhancements for the History Server

2016-10-27 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612086#comment-15612086 ] Thomas Graves commented on SPARK-18085: --- Perhaps we can clarify the title on this jira to be

[jira] [Commented] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-27 Thread J.P Feng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612047#comment-15612047 ] J.P Feng commented on SPARK-18107: -- Here is the execution logs of Hive 1.2.1, [Insert into]: 0:

[jira] [Commented] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-27 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612044#comment-15612044 ] Liang-Chi Hsieh commented on SPARK-18107: - Looks like HIVE-11940 largely improves insert

[jira] [Commented] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-27 Thread J.P Feng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612050#comment-15612050 ] J.P Feng commented on SPARK-18107: -- Here is the execution logs of Hive 2.0.1, [Insert overwrite]: 0:

[jira] [Commented] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-27 Thread J.P Feng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612043#comment-15612043 ] J.P Feng commented on SPARK-18107: -- Here is the execution logs of Hive 1.2.1, [Insert overwrite] 0:

[jira] [Commented] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-27 Thread J.P Feng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612033#comment-15612033 ] J.P Feng commented on SPARK-18107: -- Thanks for your reply. I have tested the performance between hive

[jira] [Commented] (SPARK-16857) CrossValidator and KMeans throws IllegalArgumentException

2016-10-27 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611926#comment-15611926 ] Benjamin Fradet commented on SPARK-16857: - I was wondering why a KMeansEvalutor computing the

[jira] [Updated] (SPARK-18128) Add support for publishing to PyPI

2016-10-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-18128: Description: After SPARK-1267 is done we should add support for publishing to PyPI similar to how we

[jira] [Updated] (SPARK-18137) RewriteDistinctAggregates UnresolvedException when a UDAF has a foldable TypeCheck

2016-10-27 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18137: -- Description: when run a sql with distinct(on spark github master branch), it throw

[jira] [Commented] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-10-27 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611712#comment-15611712 ] Herman van Hovell commented on SPARK-18134: --- Maps are not comparable. This makes them unusable

[jira] [Commented] (SPARK-18140) Parquet NPE / Update to 1.9

2016-10-27 Thread dori waldman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611691#comment-15611691 ] dori waldman commented on SPARK-18140: -- Is there any suggestion how to solve this issue now ? I cant

[jira] [Commented] (SPARK-18139) Dataset mapGroups with return typ Seq[Product] produces scala.ScalaReflectionException: object $line262.$read not found

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611674#comment-15611674 ] Sean Owen commented on SPARK-18139: --- I'm pretty sure this is just another instance of "case classes

  1   2   >