[jira] [Created] (SPARK-3146) Improve the flexibility of Spark Streaming Kafka API to offer user the ability to process message before storing into BM

2014-08-20 Thread Saisai Shao (JIRA)
Saisai Shao created SPARK-3146: -- Summary: Improve the flexibility of Spark Streaming Kafka API to offer user the ability to process message before storing into BM Key: SPARK-3146 URL:

[jira] [Commented] (SPARK-3146) Improve the flexibility of Spark Streaming Kafka API to offer user the ability to process message before storing into BM

2014-08-20 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103486#comment-14103486 ] Saisai Shao commented on SPARK-3146: This issue can actually solve the problem

[jira] [Created] (SPARK-3147) Implement A/B testing

2014-08-20 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-3147: Summary: Implement A/B testing Key: SPARK-3147 URL: https://issues.apache.org/jira/browse/SPARK-3147 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-2121) Not fully cached when there is enough memory in ALS

2014-08-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2121: - Summary: Not fully cached when there is enough memory in ALS (was: Not fully cached when there

[jira] [Updated] (SPARK-3146) Improve the flexibility of Spark Streaming Kafka API to offer user the ability to process message before storing into BM

2014-08-20 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-3146: --- Description: Currently Spark Streaming Kafka API stores the key and value of each message into BM

[jira] [Updated] (SPARK-3066) Support recommendAll in matrix factorization model

2014-08-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3066: - Target Version/s: 1.2.0 Support recommendAll in matrix factorization model

[jira] [Commented] (SPARK-3146) Improve the flexibility of Spark Streaming Kafka API to offer user the ability to process message before storing into BM

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103520#comment-14103520 ] Apache Spark commented on SPARK-3146: - User 'jerryshao' has created a pull request for

[jira] [Commented] (SPARK-3135) Avoid memory copy in TorrentBroadcast serialization

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103588#comment-14103588 ] Apache Spark commented on SPARK-3135: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-3144) No need to set spark.local.dir in ExecutorLauncher

2014-08-20 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103593#comment-14103593 ] Guoqiang Li commented on SPARK-3144: [~carlmartin] [PR

[jira] [Commented] (SPARK-3015) Removing broadcast in quick successions causes Akka timeout

2014-08-20 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103597#comment-14103597 ] Guoqiang Li commented on SPARK-3015: In version 1.0.1, I also found the issue.

[jira] [Resolved] (SPARK-934) spark-mllib occasionally throw java.io.IOException (java.io.IOException: Corrupt data: overrun in decompress, input offset 51381, output offset 57509)

2014-08-20 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li resolved SPARK-934. --- Resolution: Fixed spark-mllib occasionally throw java.io.IOException (java.io.IOException: Corrupt

[jira] [Resolved] (SPARK-1125) The maven build error for Spark Examples

2014-08-20 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li resolved SPARK-1125. Resolution: Fixed The maven build error for Spark Examples

[jira] [Commented] (SPARK-2312) Spark Actors do not handle unknown messages in their receive methods

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103619#comment-14103619 ] Apache Spark commented on SPARK-2312: - User 'isaias' has created a pull request for

[jira] [Commented] (SPARK-3139) Akka timeouts from ContextCleaner when cleaning shuffles

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103651#comment-14103651 ] Apache Spark commented on SPARK-3139: - User 'witgo' has created a pull request for

[jira] [Resolved] (SPARK-3054) Add tests for SparkSink

2014-08-20 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-3054. -- Resolution: Fixed Target Version/s: 1.1.0 Add tests for SparkSink

[jira] [Updated] (SPARK-3054) Add tests for SparkSink

2014-08-20 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-3054: - Target Version/s: (was: 1.1.0) Add tests for SparkSink ---

[jira] [Updated] (SPARK-3054) Add tests for SparkSink

2014-08-20 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-3054: - Fix Version/s: 1.1.0 Add tests for SparkSink --- Key:

[jira] [Created] (SPARK-3148) Update global variables of HttpBroadcast so that multiple SparkContexts can coexist

2014-08-20 Thread YanTang Zhai (JIRA)
YanTang Zhai created SPARK-3148: --- Summary: Update global variables of HttpBroadcast so that multiple SparkContexts can coexist Key: SPARK-3148 URL: https://issues.apache.org/jira/browse/SPARK-3148

[jira] [Comment Edited] (SPARK-3098) In some cases, operation zipWithIndex get a wrong results

2014-08-20 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103985#comment-14103985 ] Guoqiang Li edited comment on SPARK-3098 at 8/20/14 3:20 PM: -

[jira] [Comment Edited] (SPARK-3098) In some cases, operation zipWithIndex get a wrong results

2014-08-20 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103985#comment-14103985 ] Guoqiang Li edited comment on SPARK-3098 at 8/20/14 3:21 PM: -

[jira] [Commented] (SPARK-3129) Prevent data loss in Spark Streaming

2014-08-20 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104077#comment-14104077 ] Thomas Graves commented on SPARK-3129: -- Yes that probably means using reflection. I

[jira] [Updated] (SPARK-1442) Add Window function support

2014-08-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-1442: Fix Version/s: (was: 1.1.0) Add Window function support ---

[jira] [Created] (SPARK-3149) Connection establishment information is not enough.

2014-08-20 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-3149: - Summary: Connection establishment information is not enough. Key: SPARK-3149 URL: https://issues.apache.org/jira/browse/SPARK-3149 Project: Spark Issue

[jira] [Updated] (SPARK-2359) Supporting common statistical functions in MLlib

2014-08-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2359: - Target Version/s: (was: 1.1.0) Supporting common statistical functions in MLlib

[jira] [Resolved] (SPARK-2853) Check API consistency for old ones

2014-08-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2853. -- Resolution: Done Check API consistency for old ones --

[jira] [Commented] (SPARK-3149) Connection establishment information is not enough.

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104137#comment-14104137 ] Apache Spark commented on SPARK-3149: - User 'sarutak' has created a pull request for

[jira] [Updated] (SPARK-2964) Fix wrong option (-S, --silent), and improve spark-sql and start-thriftserver to leverage bin/util.sh

2014-08-20 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-2964: -- Priority: Major (was: Minor) Fix wrong option (-S, --silent), and improve spark-sql and

[jira] [Updated] (SPARK-2970) spark-sql script ends with IOException when EventLogging is enabled

2014-08-20 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-2970: -- Priority: Critical (was: Major) spark-sql script ends with IOException when EventLogging is

[jira] [Resolved] (SPARK-2975) SPARK_LOCAL_DIRS may cause problems when running in local mode

2014-08-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-2975. --- Resolution: Fixed Fix Version/s: 1.1.0 SPARK_LOCAL_DIRS may cause problems when running in

[jira] [Commented] (SPARK-3140) PySpark start-up throws confusing exception

2014-08-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104153#comment-14104153 ] Josh Rosen commented on SPARK-3140: --- Is it the gateway server JVM - PySpark driver

[jira] [Resolved] (SPARK-3144) No need to set spark.local.dir in ExecutorLauncher

2014-08-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-3144. --- Resolution: Fixed Fix Version/s: 1.1.0 Assignee: Josh Rosen This was fixed in my PR

[jira] [Created] (SPARK-3150) NullPointerException in Spark recovery after simultaneous fall of master and driver

2014-08-20 Thread Tatiana Borisova (JIRA)
Tatiana Borisova created SPARK-3150: --- Summary: NullPointerException in Spark recovery after simultaneous fall of master and driver Key: SPARK-3150 URL: https://issues.apache.org/jira/browse/SPARK-3150

[jira] [Commented] (SPARK-3140) PySpark start-up throws confusing exception

2014-08-20 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104267#comment-14104267 ] Andrew Or commented on SPARK-3140: -- Yes, normally it implies exactly that. What I mean is

[jira] [Commented] (SPARK-3124) Jar version conflict in the assembly package

2014-08-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104273#comment-14104273 ] Patrick Wendell commented on SPARK-3124: Hey [~chenghao] for cases like this, can

[jira] [Commented] (SPARK-3143) Documentation for TF-IDF

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104279#comment-14104279 ] Apache Spark commented on SPARK-3143: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-3150) NullPointerException in Spark recovery after simultaneous fall of master and driver

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104288#comment-14104288 ] Apache Spark commented on SPARK-3150: - User 'tanyatik' has created a pull request for

[jira] [Commented] (SPARK-2840) Improve documentation for decision tree

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104305#comment-14104305 ] Apache Spark commented on SPARK-2840: - User 'jkbradley' has created a pull request for

[jira] [Created] (SPARK-3151) DiskStore attempts to map any size BlockId without checking MappedByteBuffer limit

2014-08-20 Thread Damon Brown (JIRA)
Damon Brown created SPARK-3151: -- Summary: DiskStore attempts to map any size BlockId without checking MappedByteBuffer limit Key: SPARK-3151 URL: https://issues.apache.org/jira/browse/SPARK-3151

[jira] [Commented] (SPARK-2843) Improve documentation for ALS

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104315#comment-14104315 ] Apache Spark commented on SPARK-2843: - User 'mengxr' has created a pull request for

[jira] [Updated] (SPARK-2976) Too many ugly tabs instead of white spaces

2014-08-20 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-2976: -- Summary: Too many ugly tabs instead of white spaces (was: There are too many tabs in some

[jira] [Created] (SPARK-3152) Yarn AM cluster mode doesn't cleanup staging directory when it exits cleanly

2014-08-20 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-3152: Summary: Yarn AM cluster mode doesn't cleanup staging directory when it exits cleanly Key: SPARK-3152 URL: https://issues.apache.org/jira/browse/SPARK-3152 Project:

[jira] [Commented] (SPARK-2913) Spark's log4j.properties should always appear ahead of Hadoop's on classpath

2014-08-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104366#comment-14104366 ] Josh Rosen commented on SPARK-2913: --- I've noticed that the Spark EC2 AMI adds ephemeral

[jira] [Resolved] (SPARK-3092) Always include the thriftserver when -Phive is enabled.

2014-08-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3092. Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 2006

[jira] [Created] (SPARK-3153) shuffle will run out of space when disks have different free space

2014-08-20 Thread Davies Liu (JIRA)
Davies Liu created SPARK-3153: - Summary: shuffle will run out of space when disks have different free space Key: SPARK-3153 URL: https://issues.apache.org/jira/browse/SPARK-3153 Project: Spark

[jira] [Updated] (SPARK-3153) shuffle will run out of space when disks have different free space

2014-08-20 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-3153: -- Description: If we have several disks in SPARK_LOCAL_DIRS, and one of them is much smaller than

[jira] [Created] (SPARK-3154) Make FlumePollingInputDStream shutdown cleaner

2014-08-20 Thread Hari Shreedharan (JIRA)
Hari Shreedharan created SPARK-3154: --- Summary: Make FlumePollingInputDStream shutdown cleaner Key: SPARK-3154 URL: https://issues.apache.org/jira/browse/SPARK-3154 Project: Spark Issue

[jira] [Commented] (SPARK-3154) Make FlumePollingInputDStream shutdown cleaner

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104412#comment-14104412 ] Apache Spark commented on SPARK-3154: - User 'harishreedharan' has created a pull

[jira] [Commented] (SPARK-3152) Yarn AM cluster mode doesn't cleanup staging directory when it exits cleanly

2014-08-20 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104430#comment-14104430 ] Thomas Graves commented on SPARK-3152: -- I'm hoping we can fix this in SPARK-2933 for

[jira] [Resolved] (SPARK-3126) HiveThriftServer2Suite hangs

2014-08-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3126. - Resolution: Fixed HiveThriftServer2Suite hangs

[jira] [Resolved] (SPARK-3125) hive thriftserver test suite failure

2014-08-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3125. - Resolution: Fixed Fix Version/s: 1.1.0 hive thriftserver test suite failure

[jira] [Resolved] (SPARK-3127) Modifying Spark SQL related scripts should trigger Spark SQL test suites

2014-08-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3127. - Resolution: Fixed Fix Version/s: 1.1.0 Assignee: Cheng Lian Modifying

[jira] [Updated] (SPARK-2961) Use statistics to skip partitions when reading from in-memory columnar data

2014-08-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2961: Target Version/s: 1.2.0 (was: 1.1.0) Use statistics to skip partitions when reading from

[jira] [Commented] (SPARK-3037) Add ArrayType containing null value support to Parquet.

2014-08-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104550#comment-14104550 ] Michael Armbrust commented on SPARK-3037: - [~ueshin], thanks for investigating

[jira] [Commented] (SPARK-3036) Add MapType containing null value support to Parquet.

2014-08-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104556#comment-14104556 ] Michael Armbrust commented on SPARK-3036: - Can you explain more about what you

[jira] [Assigned] (SPARK-2554) CountDistinct and SumDistinct should do partial aggregation

2014-08-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reassigned SPARK-2554: --- Assignee: Michael Armbrust CountDistinct and SumDistinct should do partial

[jira] [Updated] (SPARK-2554) CountDistinct and SumDistinct should do partial aggregation

2014-08-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2554: Priority: Blocker (was: Major) Target Version/s: 1.1.0 CountDistinct and

[jira] [Resolved] (SPARK-3149) Connection establishment information is not enough.

2014-08-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-3149. --- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 2060

[jira] [Commented] (SPARK-2967) Several SQL unit test failed when sort-based shuffle is enabled

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104619#comment-14104619 ] Apache Spark commented on SPARK-2967: - User 'marmbrus' has created a pull request for

[jira] [Resolved] (SPARK-2849) bin/spark-submit should respect spark.driver.* for client mode

2014-08-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2849. Resolution: Fixed Issue resolved by pull request 1845

[jira] [Commented] (SPARK-3140) PySpark start-up throws confusing exception

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104697#comment-14104697 ] Apache Spark commented on SPARK-3140: - User 'andrewor14' has created a pull request

[jira] [Created] (SPARK-3155) Add support for pruning to DecisionTree

2014-08-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-3155: Summary: Add support for pruning to DecisionTree Key: SPARK-3155 URL: https://issues.apache.org/jira/browse/SPARK-3155 Project: Spark Issue Type:

[jira] [Updated] (SPARK-3155) Support DecisionTree pruning

2014-08-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3155: - Summary: Support DecisionTree pruning (was: Add support for pruning to DecisionTree)

[jira] [Created] (SPARK-3157) Avoid duplicated stats in DecisionTree extractLeftRightNodeAggregates

2014-08-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-3157: Summary: Avoid duplicated stats in DecisionTree extractLeftRightNodeAggregates Key: SPARK-3157 URL: https://issues.apache.org/jira/browse/SPARK-3157 Project:

[jira] [Created] (SPARK-3156) DecisionTree: Order categorical features adaptively

2014-08-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-3156: Summary: DecisionTree: Order categorical features adaptively Key: SPARK-3156 URL: https://issues.apache.org/jira/browse/SPARK-3156 Project: Spark

[jira] [Created] (SPARK-3158) Avoid 1 extra aggregation for DecisionTree training

2014-08-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-3158: Summary: Avoid 1 extra aggregation for DecisionTree training Key: SPARK-3158 URL: https://issues.apache.org/jira/browse/SPARK-3158 Project: Spark

[jira] [Created] (SPARK-3159) Check for reducible DecisionTree

2014-08-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-3159: Summary: Check for reducible DecisionTree Key: SPARK-3159 URL: https://issues.apache.org/jira/browse/SPARK-3159 Project: Spark Issue Type:

[jira] [Created] (SPARK-3161) Cache example-node map for DecisionTree training

2014-08-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-3161: Summary: Cache example-node map for DecisionTree training Key: SPARK-3161 URL: https://issues.apache.org/jira/browse/SPARK-3161 Project: Spark Issue

[jira] [Created] (SPARK-3160) Simplify DecisionTree data structure for training

2014-08-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-3160: Summary: Simplify DecisionTree data structure for training Key: SPARK-3160 URL: https://issues.apache.org/jira/browse/SPARK-3160 Project: Spark

[jira] [Created] (SPARK-3163) Separate continuous and categorical features in DecisionTree

2014-08-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-3163: Summary: Separate continuous and categorical features in DecisionTree Key: SPARK-3163 URL: https://issues.apache.org/jira/browse/SPARK-3163 Project: Spark

[jira] [Created] (SPARK-3162) Train DecisionTree locally when possible

2014-08-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-3162: Summary: Train DecisionTree locally when possible Key: SPARK-3162 URL: https://issues.apache.org/jira/browse/SPARK-3162 Project: Spark Issue Type:

[jira] [Created] (SPARK-3164) Store DecisionTree Split.categories as Set

2014-08-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-3164: Summary: Store DecisionTree Split.categories as Set Key: SPARK-3164 URL: https://issues.apache.org/jira/browse/SPARK-3164 Project: Spark Issue Type:

[jira] [Commented] (SPARK-2841) Documentation for feature transformations

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104709#comment-14104709 ] Apache Spark commented on SPARK-2841: - User 'dbtsai' has created a pull request for

[jira] [Created] (SPARK-3165) DecisionTree does not use sparsity in data

2014-08-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-3165: Summary: DecisionTree does not use sparsity in data Key: SPARK-3165 URL: https://issues.apache.org/jira/browse/SPARK-3165 Project: Spark Issue Type:

[jira] [Updated] (SPARK-3146) Improve the flexibility of Spark Streaming Kafka API to offer user the ability to process message before storing into BM

2014-08-20 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-3146: - Target Version/s: 1.2.0 Improve the flexibility of Spark Streaming Kafka API to offer user the

[jira] [Updated] (SPARK-3146) Improve the flexibility of Spark Streaming Kafka API to offer user the ability to process message before storing into BM

2014-08-20 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-3146: - Affects Version/s: 1.1.0 Improve the flexibility of Spark Streaming Kafka API to offer user the

[jira] [Resolved] (SPARK-2967) Several SQL unit test failed when sort-based shuffle is enabled

2014-08-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2967. - Resolution: Fixed Fix Version/s: 1.1.0 Several SQL unit test failed when

[jira] [Updated] (SPARK-3086) Use 1-indexing for decision tree nodes

2014-08-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3086: - Assignee: Joseph K. Bradley Use 1-indexing for decision tree nodes

[jira] [Commented] (SPARK-3133) Piggyback get location RPC call to fetch small blocks

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104765#comment-14104765 ] Apache Spark commented on SPARK-3133: - User 'rxin' has created a pull request for this

[jira] [Resolved] (SPARK-2846) Add configureInputJobPropertiesForStorageHandler to initialization of job conf

2014-08-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2846. - Resolution: Fixed Fix Version/s: 1.1.0 Add

[jira] [Resolved] (SPARK-2169) SparkUI.setAppName() has no effect

2014-08-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2169. Resolution: Fixed Fix Version/s: 1.1.0 SparkUI.setAppName() has no effect

[jira] [Resolved] (SPARK-2848) Shade Guava in Spark deliverables

2014-08-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2848. Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1813

[jira] [Resolved] (SPARK-2420) Dependency changes for compatibility with Hive

2014-08-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2420. Resolution: Fixed Fix Version/s: 1.2.0 Target Version/s: (was: 1.2.0) I

[jira] [Created] (SPARK-3166) Custom serialisers can't be shipped in application jars

2014-08-20 Thread Graham Dennis (JIRA)
Graham Dennis created SPARK-3166: Summary: Custom serialisers can't be shipped in application jars Key: SPARK-3166 URL: https://issues.apache.org/jira/browse/SPARK-3166 Project: Spark Issue

[jira] [Commented] (SPARK-2997) Improve documentation for dimensionality reduction

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104828#comment-14104828 ] Apache Spark commented on SPARK-2997: - User 'rezazadeh' has created a pull request for

[jira] [Created] (SPARK-3167) Port recent spark-submit changes to windows

2014-08-20 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-3167: -- Summary: Port recent spark-submit changes to windows Key: SPARK-3167 URL: https://issues.apache.org/jira/browse/SPARK-3167 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-3143) Documentation for TF-IDF

2014-08-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-3143. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 2061

[jira] [Commented] (SPARK-1537) Add integration with Yarn's Application Timeline Server

2014-08-20 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104881#comment-14104881 ] Zhan Zhang commented on SPARK-1537: --- Thanks for sharing this. Do you have concrete plan

[jira] [Resolved] (SPARK-2843) Improve documentation for ALS

2014-08-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2843. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 2064

[jira] [Commented] (SPARK-2967) Several SQL unit test failed when sort-based shuffle is enabled

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104902#comment-14104902 ] Apache Spark commented on SPARK-2967: - User 'marmbrus' has created a pull request for

[jira] [Commented] (SPARK-3034) [HIve] java.sql.Date cannot be cast to java.sql.Timestamp

2014-08-20 Thread pengyanhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104936#comment-14104936 ] pengyanhong commented on SPARK-3034: I have a source table which contains a Date type

[jira] [Created] (SPARK-3168) The ServletContextHandler of webui lacks a SessionManager

2014-08-20 Thread meiyoula (JIRA)
meiyoula created SPARK-3168: --- Summary: The ServletContextHandler of webui lacks a SessionManager Key: SPARK-3168 URL: https://issues.apache.org/jira/browse/SPARK-3168 Project: Spark Issue Type:

[jira] [Commented] (SPARK-3168) The ServletContextHandler of webui lacks a SessionManager

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104943#comment-14104943 ] Apache Spark commented on SPARK-3168: - User 'XuTingjun' has created a pull request for

[jira] [Commented] (SPARK-3033) [Hive] java.math.BigDecimal cannot be cast to org.apache.hadoop.hive.common.type.HiveDecimal

2014-08-20 Thread pengyanhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104955#comment-14104955 ] pengyanhong commented on SPARK-3033: There are expressions In the Select clause of

[jira] [Created] (SPARK-3169) make-distribution.sh failed

2014-08-20 Thread Guoqiang Li (JIRA)
Guoqiang Li created SPARK-3169: -- Summary: make-distribution.sh failed Key: SPARK-3169 URL: https://issues.apache.org/jira/browse/SPARK-3169 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-3169) make-distribution.sh failed

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105005#comment-14105005 ] Apache Spark commented on SPARK-3169: - User 'witgo' has created a pull request for

[jira] [Commented] (SPARK-3036) Add MapType containing null value support to Parquet.

2014-08-20 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105012#comment-14105012 ] Takuya Ueshin commented on SPARK-3036: -- Ah, that's right. It was my mistake. Newer

[jira] [Created] (SPARK-3170) Bug Fix in Storage UI

2014-08-20 Thread uncleGen (JIRA)
uncleGen created SPARK-3170: --- Summary: Bug Fix in Storage UI Key: SPARK-3170 URL: https://issues.apache.org/jira/browse/SPARK-3170 Project: Spark Issue Type: Bug Components: Spark Core

[jira] [Commented] (SPARK-3170) Bug Fix in Storage UI

2014-08-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105035#comment-14105035 ] Apache Spark commented on SPARK-3170: - User 'uncleGen' has created a pull request for

[jira] [Updated] (SPARK-3170) Bug Fix in Storage UI

2014-08-20 Thread uncleGen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] uncleGen updated SPARK-3170: Description: current compeleted stage only need to remove its own partitions that are no longer cached.

[jira] [Commented] (SPARK-975) Spark Replay Debugger

2014-08-20 Thread Phuoc Do (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105056#comment-14105056 ] Phuoc Do commented on SPARK-975: To make it consistent with existing stack details, I leave

  1   2   >