[jira] [Commented] (SPARK-19148) do not expose the external table concept in Catalog

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15952057#comment-15952057 ] Apache Spark commented on SPARK-19148: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-9478) Add sample weights to Random Forest

2017-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951685#comment-15951685 ] Joseph K. Bradley edited comment on SPARK-9478 at 4/1/17 2:35 AM: --

[jira] [Assigned] (SPARK-20183) Add outlierRatio option to testOutliersWithSmallWeights

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20183: Assignee: Seth Hendrickson (was: Apache Spark) > Add outlierRatio option to

[jira] [Commented] (SPARK-20183) Add outlierRatio option to testOutliersWithSmallWeights

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951865#comment-15951865 ] Apache Spark commented on SPARK-20183: -- User 'jkbradley' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20183) Add outlierRatio option to testOutliersWithSmallWeights

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20183: Assignee: Apache Spark (was: Seth Hendrickson) > Add outlierRatio option to

[jira] [Created] (SPARK-20183) Add outlierRatio option to testOutliersWithSmallWeights

2017-03-31 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-20183: - Summary: Add outlierRatio option to testOutliersWithSmallWeights Key: SPARK-20183 URL: https://issues.apache.org/jira/browse/SPARK-20183 Project: Spark

[jira] [Updated] (SPARK-19591) Add sample weights to decision trees

2017-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19591: -- Description: Add sample weights to decision trees. See [SPARK-9478] for details on

[jira] [Updated] (SPARK-19591) Add sample weights to decision trees

2017-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19591: -- Issue Type: New Feature (was: Sub-task) Parent: (was: SPARK-9478) > Add

[jira] [Updated] (SPARK-19408) cardinality estimation involving two columns of the same table

2017-03-31 Thread Ron Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ron Hu updated SPARK-19408: --- Description: In SPARK-17075, we estimate cardinality of predicate expression "column (op) literal", where

[jira] [Updated] (SPARK-20003) FPGrowthModel setMinConfidence should affect rules generation and transform

2017-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20003: -- Target Version/s: 2.2.0 > FPGrowthModel setMinConfidence should affect rules

[jira] [Updated] (SPARK-20003) FPGrowthModel setMinConfidence should affect rules generation and transform

2017-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20003: -- Shepherd: Joseph K. Bradley > FPGrowthModel setMinConfidence should affect rules

[jira] [Assigned] (SPARK-20003) FPGrowthModel setMinConfidence should affect rules generation and transform

2017-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-20003: - Assignee: yuhao yang > FPGrowthModel setMinConfidence should affect rules

[jira] [Created] (SPARK-20182) Dot in DataFrame Column title causes errors

2017-03-31 Thread Evan Zamir (JIRA)
Evan Zamir created SPARK-20182: -- Summary: Dot in DataFrame Column title causes errors Key: SPARK-20182 URL: https://issues.apache.org/jira/browse/SPARK-20182 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-20164) AnalysisException not tolerant of null query plan

2017-03-31 Thread Kunal Khamar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kunal Khamar updated SPARK-20164: - Description: The query plan in an AnalysisException may be null when an AnalysisException

[jira] [Updated] (SPARK-20164) AnalysisException not tolerant of null query plan

2017-03-31 Thread Kunal Khamar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kunal Khamar updated SPARK-20164: - Description: The query plan in an `AnalysisException` may be `null` when an `AnalysisException`

[jira] [Closed] (SPARK-20163) Kill all running tasks in a stage in case of fetch failure

2017-03-31 Thread Sital Kedia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sital Kedia closed SPARK-20163. --- Resolution: Duplicate > Kill all running tasks in a stage in case of fetch failure >

[jira] [Commented] (SPARK-20163) Kill all running tasks in a stage in case of fetch failure

2017-03-31 Thread Sital Kedia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951738#comment-15951738 ] Sital Kedia commented on SPARK-20163: - Thanks [~imranr], closing this as this is duplicate of

[jira] [Assigned] (SPARK-20181) Avoid noisy Jetty WARN log when failing to bind a port

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20181: Assignee: (was: Apache Spark) > Avoid noisy Jetty WARN log when failing to bind a

[jira] [Assigned] (SPARK-20181) Avoid noisy Jetty WARN log when failing to bind a port

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20181: Assignee: Apache Spark > Avoid noisy Jetty WARN log when failing to bind a port >

[jira] [Commented] (SPARK-20181) Avoid noisy Jetty WARN log when failing to bind a port

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951699#comment-15951699 ] Apache Spark commented on SPARK-20181: -- User 'd2r' has created a pull request for this issue:

[jira] [Commented] (SPARK-20181) Avoid noisy Jetty WARN log when failing to bind a port

2017-03-31 Thread Derek Dagit (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951686#comment-15951686 ] Derek Dagit commented on SPARK-20181: - Working on this... > Avoid noisy Jetty WARN log when failing

[jira] [Commented] (SPARK-9478) Add sample weights to Random Forest

2017-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951685#comment-15951685 ] Joseph K. Bradley commented on SPARK-9478: -- [~clamus] The current vote is to *not use* weights

[jira] [Created] (SPARK-20181) Avoid noisy Jetty WARN log when failing to bind a port

2017-03-31 Thread Derek Dagit (JIRA)
Derek Dagit created SPARK-20181: --- Summary: Avoid noisy Jetty WARN log when failing to bind a port Key: SPARK-20181 URL: https://issues.apache.org/jira/browse/SPARK-20181 Project: Spark Issue

[jira] [Updated] (SPARK-9478) Add sample weights to Random Forest

2017-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9478: - Description: Currently, this implementation of random forest does not support sample

[jira] [Updated] (SPARK-9478) Add sample weights to Random Forest

2017-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9478: - Shepherd: Joseph K. Bradley > Add sample weights to Random Forest >

[jira] [Updated] (SPARK-9478) Add sample weights to Random Forest

2017-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9478: - Shepherd: (was: Joseph K. Bradley) > Add sample weights to Random Forest >

[jira] [Commented] (SPARK-20161) Default log4j properties file should print thread-id in ConversionPattern

2017-03-31 Thread Sahil Takiar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951656#comment-15951656 ] Sahil Takiar commented on SPARK-20161: -- [~xuefuz] could you comment on

[jira] [Assigned] (SPARK-20161) Default log4j properties file should print thread-id in ConversionPattern

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20161: Assignee: (was: Apache Spark) > Default log4j properties file should print thread-id

[jira] [Commented] (SPARK-20161) Default log4j properties file should print thread-id in ConversionPattern

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951595#comment-15951595 ] Apache Spark commented on SPARK-20161: -- User 'sahilTakiar' has created a pull request for this

[jira] [Assigned] (SPARK-20161) Default log4j properties file should print thread-id in ConversionPattern

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20161: Assignee: Apache Spark > Default log4j properties file should print thread-id in

[jira] [Updated] (SPARK-20161) Default log4j properties file should print thread-id in ConversionPattern

2017-03-31 Thread Sahil Takiar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sahil Takiar updated SPARK-20161: - Summary: Default log4j properties file should print thread-id in ConversionPattern (was:

[jira] [Resolved] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20179. --- Resolution: Duplicate > Major improvements to Spark's Prefix span >

[jira] [Updated] (SPARK-20109) Need a way to convert from IndexedRowMatrix to Dense Block Matrices

2017-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20109: -- Priority: Minor (was: Major) > Need a way to convert from IndexedRowMatrix to Dense Block Matrices >

[jira] [Updated] (SPARK-20180) Unlimited max pattern length in Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cyril de Vogelaere updated SPARK-20180: --- Description: Right now, we need to use .setMaxPatternLength() method to specify is

[jira] [Updated] (SPARK-20180) Unlimited max pattern length in Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cyril de Vogelaere updated SPARK-20180: --- Description: Right now, we need to use .setMaxPatternLength() method to specify is

[jira] [Created] (SPARK-20180) Unlimited max pattern length in Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
Cyril de Vogelaere created SPARK-20180: -- Summary: Unlimited max pattern length in Prefix span Key: SPARK-20180 URL: https://issues.apache.org/jira/browse/SPARK-20180 Project: Spark

[jira] [Commented] (SPARK-20176) Spark Dataframe UDAF issue

2017-03-31 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951473#comment-15951473 ] Kazuaki Ishizaki commented on SPARK-20176: -- Could you please post the program that can reproduce

[jira] [Commented] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951438#comment-15951438 ] Cyril de Vogelaere commented on SPARK-20179: Hello Joseph, Thanks for your very helpful

[jira] [Resolved] (SPARK-20165) Resolve state encoder's deserializer in driver in FlatMapGroupsWithStateExec

2017-03-31 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-20165. --- Resolution: Fixed Issue resolved by pull request 17488

[jira] [Resolved] (SPARK-20160) Move ParquetConversions and OrcConversions Out Of HiveSessionCatalog

2017-03-31 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-20160. - Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17484

[jira] [Commented] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951269#comment-15951269 ] Joseph K. Bradley commented on SPARK-20179: --- Thanks for the thoughts & work. Sean's right that

[jira] [Resolved] (SPARK-20084) Remove internal.metrics.updatedBlockStatuses accumulator from history files

2017-03-31 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-20084. Resolution: Fixed Assignee: Ryan Blue Fix Version/s: 2.1.2

[jira] [Updated] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cyril de Vogelaere updated SPARK-20179: --- Description: The code I would like to push allows major performances improvement for

[jira] [Resolved] (SPARK-20164) AnalysisException not tolerant of null query plan

2017-03-31 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-20164. - Resolution: Fixed Assignee: Kunal Khamar Fix Version/s: 2.2.0 2.1.2 >

[jira] [Commented] (SPARK-20144) spark.read.parquet no long maintains ordering of the data

2017-03-31 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951202#comment-15951202 ] Li Jin commented on SPARK-20144: Thanks Sean! I appreciate your time and help very much. >

[jira] [Commented] (SPARK-20178) Improve Scheduler fetch failures

2017-03-31 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951194#comment-15951194 ] Imran Rashid commented on SPARK-20178: -- Thanks for writing this up Tom. The only way I see to have

[jira] [Issue Comment Deleted] (SPARK-20156) Local dependent library used for upper and lowercase conversions.

2017-03-31 Thread Serkan Taş (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Serkan Taş updated SPARK-20156: --- Comment: was deleted (was: console log before setting locale) > Local dependent library used for

[jira] [Commented] (SPARK-20163) Kill all running tasks in a stage in case of fetch failure

2017-03-31 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951136#comment-15951136 ] Imran Rashid commented on SPARK-20163: -- I think this is a duplicate of SPARK-2666, which has more

[jira] [Updated] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cyril de Vogelaere updated SPARK-20179: --- Description: The code I would like to push allows major performances improvement for

[jira] [Updated] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cyril de Vogelaere updated SPARK-20179: --- Description: The code I would like to push allows major performances improvement for

[jira] [Updated] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cyril de Vogelaere updated SPARK-20179: --- Priority: Major (was: Minor) > Major improvements to Spark's Prefix span >

[jira] [Commented] (SPARK-20144) spark.read.parquet no long maintains ordering of the data

2017-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951095#comment-15951095 ] Sean Owen commented on SPARK-20144: --- Probably best to wait for an informed opinion but I would assume

[jira] [Updated] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cyril de Vogelaere updated SPARK-20179: --- Description: The code I would like to push allows major performances improvement for

[jira] [Updated] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cyril de Vogelaere updated SPARK-20179: --- Description: The code I would like to push allows major performances improvement

[jira] [Updated] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cyril de Vogelaere updated SPARK-20179: --- Description: The code I would like to push allows major performances improvement

[jira] [Commented] (SPARK-20144) spark.read.parquet no long maintains ordering of the data

2017-03-31 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951084#comment-15951084 ] Li Jin commented on SPARK-20144: Also, I am not sure about "If the data were sorted, sorting would be

[jira] [Commented] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951077#comment-15951077 ] Cyril de Vogelaere commented on SPARK-20179: Hello Sean, I did have a look at the

[jira] [Commented] (SPARK-20144) spark.read.parquet no long maintains ordering of the data

2017-03-31 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951073#comment-15951073 ] Li Jin commented on SPARK-20144: I totally agree Correctness takes precedence. If sorting is the only

[jira] [Updated] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20179: -- Shepherd: (was: Joseph K. Bradley) Flags: (was: Important) Target

[jira] [Commented] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951051#comment-15951051 ] Cyril de Vogelaere commented on SPARK-20179: I forgot to mention, I am ready to push the code

[jira] [Commented] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951044#comment-15951044 ] Sean Owen commented on SPARK-20179: --- It's not clear what you are proposing _for Spark_. You're

[jira] [Commented] (SPARK-10678) Specialize PrefixSpan for single-item patterns

2017-03-31 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15951040#comment-15951040 ] Cyril de Vogelaere commented on SPARK-10678: I hadn't seen this issue as I created mine. I

[jira] [Created] (SPARK-20179) Major improvements to Spark's Prefix span

2017-03-31 Thread Cyril de Vogelaere (JIRA)
Cyril de Vogelaere created SPARK-20179: -- Summary: Major improvements to Spark's Prefix span Key: SPARK-20179 URL: https://issues.apache.org/jira/browse/SPARK-20179 Project: Spark Issue

[jira] [Commented] (SPARK-20144) spark.read.parquet no long maintains ordering of the data

2017-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950988#comment-15950988 ] Sean Owen commented on SPARK-20144: --- If the data were sorted, sorting would be pretty cheap, in

[jira] [Comment Edited] (SPARK-20144) spark.read.parquet no long maintains ordering of the data

2017-03-31 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950979#comment-15950979 ] Li Jin edited comment on SPARK-20144 at 3/31/17 2:14 PM: - Thanks for getting back

[jira] [Commented] (SPARK-20144) spark.read.parquet no long maintains ordering of the data

2017-03-31 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950979#comment-15950979 ] Li Jin commented on SPARK-20144: Thanks for getting back to me. Sorting in this case will just add extra

[jira] [Updated] (SPARK-20177) Document about compression way has some little detail changes.

2017-03-31 Thread guoxiaolongzte (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guoxiaolongzte updated SPARK-20177: --- Description: Document compression way little detail changes. 1.spark.eventLog.compress add

[jira] [Commented] (SPARK-20177) Document about compression way has some little detail changes.

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950939#comment-15950939 ] Apache Spark commented on SPARK-20177: -- User 'guoxiaolongzte' has created a pull request for this

[jira] [Comment Edited] (SPARK-20178) Improve Scheduler fetch failures

2017-03-31 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950917#comment-15950917 ] Thomas Graves edited comment on SPARK-20178 at 3/31/17 1:53 PM: Overall

[jira] [Commented] (SPARK-20178) Improve Scheduler fetch failures

2017-03-31 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950917#comment-15950917 ] Thomas Graves commented on SPARK-20178: --- Overall what I would like to accomplish is not throwing

[jira] [Closed] (SPARK-19443) The function to generate constraints takes too long when the query plan grows continuously

2017-03-31 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-19443. --- Resolution: Won't Fix > The function to generate constraints takes too long when the query

[jira] [Closed] (SPARK-19665) Improve constraint propagation

2017-03-31 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-19665. --- Resolution: Won't Fix > Improve constraint propagation > -- > >

[jira] [Created] (SPARK-20178) Improve Scheduler fetch failures

2017-03-31 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-20178: - Summary: Improve Scheduler fetch failures Key: SPARK-20178 URL: https://issues.apache.org/jira/browse/SPARK-20178 Project: Spark Issue Type: Epic

[jira] [Assigned] (SPARK-20177) Document about compression way has some little detail changes.

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20177: Assignee: Apache Spark > Document about compression way has some little detail changes. >

[jira] [Assigned] (SPARK-20177) Document about compression way has some little detail changes.

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20177: Assignee: (was: Apache Spark) > Document about compression way has some little detail

[jira] [Commented] (SPARK-20177) Document about compression way has some little detail changes.

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950826#comment-15950826 ] Apache Spark commented on SPARK-20177: -- User 'guoxiaolongzte' has created a pull request for this

[jira] [Created] (SPARK-20177) Document about compression way has some little detail changes.

2017-03-31 Thread guoxiaolongzte (JIRA)
guoxiaolongzte created SPARK-20177: -- Summary: Document about compression way has some little detail changes. Key: SPARK-20177 URL: https://issues.apache.org/jira/browse/SPARK-20177 Project: Spark

[jira] [Comment Edited] (SPARK-14492) Spark SQL 1.6.0 does not work with Hive version lower than 1.2.0; its not backwards compatible with earlier version

2017-03-31 Thread Sunil Rangwani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950778#comment-15950778 ] Sunil Rangwani edited comment on SPARK-14492 at 3/31/17 12:27 PM: -- My

[jira] [Commented] (SPARK-14492) Spark SQL 1.6.0 does not work with Hive version lower than 1.2.0; its not backwards compatible with earlier version

2017-03-31 Thread Sunil Rangwani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950778#comment-15950778 ] Sunil Rangwani commented on SPARK-14492: My problem exactly was a) Interacting with Hive

[jira] [Commented] (SPARK-14492) Spark SQL 1.6.0 does not work with Hive version lower than 1.2.0; its not backwards compatible with earlier version

2017-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950768#comment-15950768 ] Sean Owen commented on SPARK-14492: --- You are still describing two different things I think: a)

[jira] [Commented] (SPARK-14492) Spark SQL 1.6.0 does not work with Hive version lower than 1.2.0; its not backwards compatible with earlier version

2017-03-31 Thread Sunil Rangwani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950760#comment-15950760 ] Sunil Rangwani commented on SPARK-14492: [~sowen] Can you please explain why is it not a problem?

[jira] [Comment Edited] (SPARK-18936) Infrastructure for session local timezone support

2017-03-31 Thread Navya Krishnappa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950748#comment-15950748 ] Navya Krishnappa edited comment on SPARK-18936 at 3/31/17 11:51 AM: I

[jira] [Commented] (SPARK-18936) Infrastructure for session local timezone support

2017-03-31 Thread Navya Krishnappa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950748#comment-15950748 ] Navya Krishnappa commented on SPARK-18936: -- I think this fix helps us to set the time zone in

[jira] [Commented] (SPARK-20152) Time zone is not respected while parsing csv for timeStampFormat "MM-dd-yyyy'T'HH:mm:ss.SSSZZ"

2017-03-31 Thread Navya Krishnappa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950745#comment-15950745 ] Navya Krishnappa commented on SPARK-20152: -- [~srowen] & [~hyukjin.kwon] Thank you for your

[jira] [Created] (SPARK-20176) Spark Dataframe UDAF issue

2017-03-31 Thread Dinesh Man Amatya (JIRA)
Dinesh Man Amatya created SPARK-20176: - Summary: Spark Dataframe UDAF issue Key: SPARK-20176 URL: https://issues.apache.org/jira/browse/SPARK-20176 Project: Spark Issue Type: IT Help

[jira] [Commented] (SPARK-20173) Throw NullPointerException when HiveThriftServer2 is shutdown

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950683#comment-15950683 ] Apache Spark commented on SPARK-20173: -- User 'zuotingbing' has created a pull request for this

[jira] [Commented] (SPARK-20139) Spark UI reports partial success for completed stage while log shows all tasks are finished

2017-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950640#comment-15950640 ] Sean Owen commented on SPARK-20139: --- So is the lesson here that the driver can't keep up at this scale

[jira] [Resolved] (SPARK-14492) Spark SQL 1.6.0 does not work with Hive version lower than 1.2.0; its not backwards compatible with earlier version

2017-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14492. --- Resolution: Not A Problem > Spark SQL 1.6.0 does not work with Hive version lower than 1.2.0; its

[jira] [Issue Comment Deleted] (SPARK-19862) In SparkEnv.scala,shortShuffleMgrNames tungsten-sort can be deleted.

2017-03-31 Thread guoxiaolong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guoxiaolong updated SPARK-19862: Comment: was deleted (was: @srowen In spark2.1.0,"tungsten-sort" ->

[jira] [Updated] (SPARK-19690) Join a streaming DataFrame with a batch DataFrame may not work

2017-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19690: -- Target Version/s: 2.2.0 (was: 2.1.1, 2.2.0) > Join a streaming DataFrame with a batch DataFrame may

[jira] [Resolved] (SPARK-20167) In SqlBase.g4,some of the comments is not correct.

2017-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20167. --- Resolution: Not A Problem > In SqlBase.g4,some of the comments is not correct. >

[jira] [Commented] (SPARK-20144) spark.read.parquet no long maintains ordering of the data

2017-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950627#comment-15950627 ] Sean Owen commented on SPARK-20144: --- If you need a particular ordering, I think you need to sort. I am

[jira] [Assigned] (SPARK-20175) Exists should not be evaluated in Join operator and can be converted to ScalarSubquery if no correlated reference

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20175: Assignee: Apache Spark > Exists should not be evaluated in Join operator and can be

[jira] [Assigned] (SPARK-20175) Exists should not be evaluated in Join operator and can be converted to ScalarSubquery if no correlated reference

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20175: Assignee: (was: Apache Spark) > Exists should not be evaluated in Join operator and

[jira] [Commented] (SPARK-20175) Exists should not be evaluated in Join operator and can be converted to ScalarSubquery if no correlated reference

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950624#comment-15950624 ] Apache Spark commented on SPARK-20175: -- User 'viirya' has created a pull request for this issue:

[jira] [Created] (SPARK-20175) Exists should not be evaluated in Join operator and can be converted to ScalarSubquery if no correlated reference

2017-03-31 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-20175: --- Summary: Exists should not be evaluated in Join operator and can be converted to ScalarSubquery if no correlated reference Key: SPARK-20175 URL:

[jira] [Commented] (SPARK-20173) Throw NullPointerException when HiveThriftServer2 is shutdown

2017-03-31 Thread Xiaochen Ouyang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950619#comment-15950619 ] Xiaochen Ouyang commented on SPARK-20173: - +1 > Throw NullPointerException when

[jira] [Assigned] (SPARK-20172) Event log without read permission should be filtered out before actually reading it

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20172: Assignee: (was: Apache Spark) > Event log without read permission should be filtered

[jira] [Assigned] (SPARK-20172) Event log without read permission should be filtered out before actually reading it

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20172: Assignee: Apache Spark > Event log without read permission should be filtered out before

[jira] [Commented] (SPARK-20172) Event log without read permission should be filtered out before actually reading it

2017-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15950616#comment-15950616 ] Apache Spark commented on SPARK-20172: -- User 'jerryshao' has created a pull request for this issue:
