[jira] [Commented] (SPARK-4044) Thriftserver fails to start when JAVA_HOME points to JRE instead of JDK

2014-10-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179620#comment-14179620 ] Sean Owen commented on SPARK-4044: -- How about using {{unzip -l}} to probe the contents of

[jira] [Updated] (SPARK-4006) Spark Driver crashes whenever an Executor is registered twice

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4006: -- Description: This is a huge robustness issue for us (Taboola), in mission critical , time sensitive

[jira] [Updated] (SPARK-4006) Spark Driver crashes whenever an Executor is registered twice

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4006: -- Affects Version/s: 1.2.0 Spark Driver crashes whenever an Executor is registered twice

[jira] [Updated] (SPARK-4006) Spark Driver crashes whenever an Executor is registered twice

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4006: -- Target Version/s: 1.1.1, 1.2.0 (was: 1.2.0) Spark Driver crashes whenever an Executor is registered

[jira] [Commented] (SPARK-4006) Spark Driver crashes whenever an Executor is registered twice

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179625#comment-14179625 ] Josh Rosen commented on SPARK-4006: --- Thanks for the bug report + patch! I'd like to see

[jira] [Commented] (SPARK-2926) Add MR-style (merge-sort) SortShuffleReader for sort-based shuffle

2014-10-22 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179631#comment-14179631 ] Sandy Ryza commented on SPARK-2926: --- [~rxin] did you ever get a chance to try this out?

[jira] [Commented] (SPARK-4037) NPE in JDBC server when calling SET

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179651#comment-14179651 ] Apache Spark commented on SPARK-4037: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-3995) [PYSPARK] PySpark's sample methods do not work with NumPy 1.9

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179657#comment-14179657 ] Apache Spark commented on SPARK-3995: - User 'freeman-lab' has created a pull request

[jira] [Commented] (SPARK-3426) Sort-based shuffle compression behavior is inconsistent

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179660#comment-14179660 ] Apache Spark commented on SPARK-3426: - User 'JoshRosen' has created a pull request for

[jira] [Created] (SPARK-4045) BinaryArithmetic cannot implicitly cast StringType to DoubleType

2014-10-22 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-4045: - Summary: BinaryArithmetic cannot implicitly cast StringType to DoubleType Key: SPARK-4045 URL: https://issues.apache.org/jira/browse/SPARK-4045 Project: Spark

[jira] [Commented] (SPARK-4045) BinaryArithmetic cannot implicitly cast StringType to DoubleType

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179712#comment-14179712 ] Apache Spark commented on SPARK-4045: - User 'sarutak' has created a pull request for

[jira] [Updated] (SPARK-4045) BinaryArithmetic should not implicitly cast StringType to DoubleType

2014-10-22 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-4045: -- Summary: BinaryArithmetic should not implicitly cast StringType to DoubleType (was:

[jira] [Closed] (SPARK-3939) NPE caused by SessionState.out not set in thriftserver2

2014-10-22 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian closed SPARK-3939. - Resolution: Duplicate NPE caused by SessionState.out not set in thriftserver2

[jira] [Commented] (SPARK-3939) NPE caused by SessionState.out not set in thriftserver2

2014-10-22 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179749#comment-14179749 ] Cheng Lian commented on SPARK-3939: --- Ah, actually it's SPARK-4037 who duplicates this

[jira] [Commented] (SPARK-4002) JavaKafkaStreamSuite.testKafkaStream fails on OSX

2014-10-22 Thread Ye Xianjin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179753#comment-14179753 ] Ye Xianjin commented on SPARK-4002: --- Hi, [~rdub] what's your mac os x's hostname ? Mine

[jira] [Created] (SPARK-4046) Incorrect examples on site

2014-10-22 Thread Ian Babrou (JIRA)
Ian Babrou created SPARK-4046: - Summary: Incorrect examples on site Key: SPARK-4046 URL: https://issues.apache.org/jira/browse/SPARK-4046 Project: Spark Issue Type: Bug Components:

[jira] [Updated] (SPARK-4046) Incorrect Java example on site

2014-10-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4046: - Priority: Minor (was: Critical) Affects Version/s: 1.1.0 Summary: Incorrect

[jira] [Closed] (SPARK-4045) BinaryArithmetic should not implicitly cast StringType to DoubleType

2014-10-22 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta closed SPARK-4045. - Resolution: Won't Fix BinaryArithmetic should not implicitly cast StringType to DoubleType

[jira] [Commented] (SPARK-3815) LPAD function does not work in where predicate

2014-10-22 Thread Venkata Ramana G (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179813#comment-14179813 ] Venkata Ramana G commented on SPARK-3815: - Still the issue is not re-producible,

[jira] [Commented] (SPARK-4040) calling count() on RDD's emitted from a DStream blocks forEachRDD progress.

2014-10-22 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179882#comment-14179882 ] RJ Nowling commented on SPARK-4040: --- I don't think you can access a RDD from with an

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2014-10-22 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179890#comment-14179890 ] RJ Nowling commented on SPARK-2429: --- A 6x performance improvement is great improvement!

[jira] [Commented] (SPARK-4040) calling count() on RDD's emitted from a DStream blocks forEachRDD progress.

2014-10-22 Thread jay vyas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179898#comment-14179898 ] jay vyas commented on SPARK-4040: - Makes sense. Is it possible that RDD's themselves ,

[jira] [Updated] (SPARK-4042) append columns ids and names before broadcast

2014-10-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4042: Target Version/s: 1.1.1, 1.2.0 (was: 1.1.1) append columns ids and names before broadcast

[jira] [Commented] (SPARK-4042) append columns ids and names before broadcast

2014-10-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179939#comment-14179939 ] Yin Huai commented on SPARK-4042: - Can you also add some test results? Like the amount of

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2014-10-22 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179974#comment-14179974 ] Yu Ishikawa commented on SPARK-2429: {quote} Can you add a breakdown of the timings

[jira] [Commented] (SPARK-3987) NNLS generates incorrect result

2014-10-22 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180001#comment-14180001 ] Debasish Das commented on SPARK-3987: - I will test it but this is how I called

[jira] [Comment Edited] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-10-22 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14157605#comment-14157605 ] Guoqiang Li edited comment on SPARK-1405 at 10/22/14 3:28 PM: --

[jira] [Comment Edited] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-10-22 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14157605#comment-14157605 ] Guoqiang Li edited comment on SPARK-1405 at 10/22/14 3:30 PM: --

[jira] [Created] (SPARK-4047) Generate runtime warning for naive implementation of PageRank example

2014-10-22 Thread Varadharajan (JIRA)
Varadharajan created SPARK-4047: --- Summary: Generate runtime warning for naive implementation of PageRank example Key: SPARK-4047 URL: https://issues.apache.org/jira/browse/SPARK-4047 Project: Spark

[jira] [Updated] (SPARK-4047) Generate runtime warning for naive implementation of PageRank example

2014-10-22 Thread Varadharajan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Varadharajan updated SPARK-4047: Description: Based on SPARK-2434, we're generating runtime warnings to denote that the

[jira] [Commented] (SPARK-4047) Generate runtime warning for naive implementation of PageRank example

2014-10-22 Thread Varadharajan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180083#comment-14180083 ] Varadharajan commented on SPARK-4047: - I'm working on this issue. Generate runtime

[jira] [Comment Edited] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-10-22 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14157605#comment-14157605 ] Guoqiang Li edited comment on SPARK-1405 at 10/22/14 3:57 PM: --

[jira] [Commented] (SPARK-3359) `sbt/sbt unidoc` doesn't work with Java 8

2014-10-22 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180104#comment-14180104 ] holdenk commented on SPARK-3359: I think I've got a fix for it, I'll send a PR :)

[jira] [Resolved] (SPARK-3995) [PYSPARK] PySpark's sample methods do not work with NumPy 1.9

2014-10-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-3995. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 2889

[jira] [Commented] (SPARK-4047) Generate runtime warning for naive implementation of PageRank example

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180147#comment-14180147 ] Apache Spark commented on SPARK-4047: - User 'varadharajan' has created a pull request

[jira] [Commented] (SPARK-3655) Secondary sort

2014-10-22 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180155#comment-14180155 ] koert kuipers commented on SPARK-3655: -- i am not sure

[jira] [Comment Edited] (SPARK-3655) Secondary sort

2014-10-22 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180155#comment-14180155 ] koert kuipers edited comment on SPARK-3655 at 10/22/14 4:54 PM:

[jira] [Created] (SPARK-4048) Enhance and extend hadoop-provided profile

2014-10-22 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-4048: - Summary: Enhance and extend hadoop-provided profile Key: SPARK-4048 URL: https://issues.apache.org/jira/browse/SPARK-4048 Project: Spark Issue Type:

[jira] [Updated] (SPARK-4048) Enhance and extend hadoop-provided profile

2014-10-22 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-4048: -- Description: The hadoop-provided profile is used to not package Hadoop dependencies inside the

[jira] [Commented] (SPARK-3987) NNLS generates incorrect result

2014-10-22 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180314#comment-14180314 ] Debasish Das commented on SPARK-3987: - [~coderxiang] changing to 1e-6 to 1e-7 fixes

[jira] [Commented] (SPARK-3655) Secondary sort

2014-10-22 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180348#comment-14180348 ] koert kuipers commented on SPARK-3655: -- i went through the code. to allow a secondary

[jira] [Created] (SPARK-4049) Storage web UI fraction cached shows as 100%

2014-10-22 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-4049: - Summary: Storage web UI fraction cached shows as 100% Key: SPARK-4049 URL: https://issues.apache.org/jira/browse/SPARK-4049 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-4006) Spark Driver crashes whenever an Executor is registered twice

2014-10-22 Thread Tal Sliwowicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180512#comment-14180512 ] Tal Sliwowicz commented on SPARK-4006: -- Cool! Would be very interesting to know. For

[jira] [Comment Edited] (SPARK-4006) Spark Driver crashes whenever an Executor is registered twice

2014-10-22 Thread Tal Sliwowicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180512#comment-14180512 ] Tal Sliwowicz edited comment on SPARK-4006 at 10/22/14 8:48 PM:

[jira] [Created] (SPARK-4051) Rows in python should support conversion to dictionary

2014-10-22 Thread Chris Grier (JIRA)
Chris Grier created SPARK-4051: -- Summary: Rows in python should support conversion to dictionary Key: SPARK-4051 URL: https://issues.apache.org/jira/browse/SPARK-4051 Project: Spark Issue Type:

[jira] [Updated] (SPARK-4051) Rows in python should support conversion to dictionary

2014-10-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4051: Affects Version/s: 1.1.0 Rows in python should support conversion to dictionary

[jira] [Updated] (SPARK-4051) Rows in python should support conversion to dictionary

2014-10-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4051: Target Version/s: 1.2.0 Rows in python should support conversion to dictionary

[jira] [Updated] (SPARK-4051) Rows in python should support conversion to dictionary

2014-10-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4051: Assignee: Davies Liu Rows in python should support conversion to dictionary

[jira] [Commented] (SPARK-4051) Rows in python should support conversion to dictionary

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180599#comment-14180599 ] Apache Spark commented on SPARK-4051: - User 'davies' has created a pull request for

[jira] [Updated] (SPARK-3877) The exit code of spark-submit is still 0 when an yarn application fails

2014-10-22 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-3877: - Target Version/s: 1.1.1, 1.2.0 Affects Version/s: 1.1.0 Fix Version/s: 1.2.0

[jira] [Updated] (SPARK-3877) The exit code of spark-submit is still 0 when an yarn application fails

2014-10-22 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-3877: - Priority: Major (was: Minor) The exit code of spark-submit is still 0 when an yarn application fails

[jira] [Closed] (SPARK-3877) The exit code of spark-submit is still 0 when an yarn application fails

2014-10-22 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-3877. Resolution: Fixed Fix Version/s: 1.1.1 The exit code of spark-submit is still 0 when an yarn

[jira] [Commented] (SPARK-3877) The exit code of spark-submit is still 0 when an yarn application fails

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180628#comment-14180628 ] Apache Spark commented on SPARK-3877: - User 'zsxwing' has created a pull request for

[jira] [Resolved] (SPARK-3426) Sort-based shuffle compression behavior is inconsistent

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-3426. --- Resolution: Fixed Fix Version/s: 1.2.0 1.1.1 Fixed in 1.1.1. and 1.2.0 by

[jira] [Assigned] (SPARK-2353) ArrayIndexOutOfBoundsException in scheduler

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-2353: - Assignee: Josh Rosen ArrayIndexOutOfBoundsException in scheduler

[jira] [Resolved] (SPARK-2353) ArrayIndexOutOfBoundsException in scheduler

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-2353. --- Resolution: Fixed Fix Version/s: 1.1.0 This looks like a duplicate of SPARK-2931, which was

[jira] [Resolved] (SPARK-3709) Executors don't always report broadcast block removal properly back to the driver

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-3709. --- Resolution: Fixed Fix Version/s: 1.0.3 1.2.0 1.1.1 It

[jira] [Updated] (SPARK-4019) Repartitioning with more than 2000 partitions may drop all data when partitions are mostly empty or cause deserialization errors if at least one partition is empty

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4019: -- Description: {code} sc.makeRDD(0 until 10, 1000).repartition(2001).collect() {code} returns `Array()`.

[jira] [Commented] (SPARK-4019) Repartitioning with more than 2000 partitions may drop all data when partitions are mostly empty or cause deserialization errors if at least one partition is empty

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180737#comment-14180737 ] Josh Rosen commented on SPARK-4019: --- This also explains another occurrence of the Snappy

[jira] [Updated] (SPARK-4052) Use scala.collection.Map for pattern matching instead of using Predef.Map (it is scala.collection.immutable.Map)

2014-10-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4052: Description: {code} val sqlContext = new org.apache.spark.sql.hive.HiveContext(sc) import

[jira] [Updated] (SPARK-4052) Use scala.collection.Map for pattern matching instead of using Predef.Map (it is scala.collection.immutable.Map)

2014-10-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4052: Description: Seems ScalaReflection and InsertIntoHiveTable only take scala.collection.immutable.Map as the

[jira] [Updated] (SPARK-4052) Use scala.collection.Map for pattern matching instead of using Predef.Map (it is scala.collection.immutable.Map)

2014-10-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4052: Description: Seems ScalaReflection and InsertIntoHiveTable only take scala.collection.immutable.Map as the

[jira] [Commented] (SPARK-4052) Use scala.collection.Map for pattern matching instead of using Predef.Map (it is scala.collection.immutable.Map)

2014-10-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180749#comment-14180749 ] Yin Huai commented on SPARK-4052: - I searched our sql code base with {code} grep -r

[jira] [Commented] (SPARK-3630) Identify cause of Kryo+Snappy PARSING_ERROR

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180758#comment-14180758 ] Josh Rosen commented on SPARK-3630: --- I found another cause: *Errors in reduce phases

[jira] [Commented] (SPARK-4052) Use scala.collection.Map for pattern matching instead of using Predef.Map (it is scala.collection.immutable.Map)

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180817#comment-14180817 ] Apache Spark commented on SPARK-4052: - User 'yhuai' has created a pull request for

[jira] [Created] (SPARK-4053) Block generator throttling in NetworkReceiverSuite is flaky

2014-10-22 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-4053: Summary: Block generator throttling in NetworkReceiverSuite is flaky Key: SPARK-4053 URL: https://issues.apache.org/jira/browse/SPARK-4053 Project: Spark

[jira] [Commented] (SPARK-4053) Block generator throttling in NetworkReceiverSuite is flaky

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180832#comment-14180832 ] Apache Spark commented on SPARK-4053: - User 'tdas' has created a pull request for this

[jira] [Updated] (SPARK-1239) Don't fetch all map output statuses at each reducer during shuffles

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-1239: -- Assignee: Josh Rosen (was: Kostas Sakellis) I'm re-assigning this to me since I've been working in

[jira] [Commented] (SPARK-3988) Public API for DateType support

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180928#comment-14180928 ] Apache Spark commented on SPARK-3988: - User 'adrian-wang' has created a pull request

[jira] [Commented] (SPARK-3988) Public API for DateType support

2014-10-22 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180933#comment-14180933 ] Adrian Wang commented on SPARK-3988: have to investigate solution 3 in spark-2179

[jira] [Created] (SPARK-4054) Dead link in README

2014-10-22 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-4054: - Summary: Dead link in README Key: SPARK-4054 URL: https://issues.apache.org/jira/browse/SPARK-4054 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-4054) Dead link in README

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180946#comment-14180946 ] Apache Spark commented on SPARK-4054: - User 'sarutak' has created a pull request for

[jira] [Resolved] (SPARK-3812) Adapt maven build to publish effective pom.

2014-10-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3812. Resolution: Fixed Assignee: Prashant Sharma Fixed by:

[jira] [Commented] (SPARK-4002) JavaKafkaStreamSuite.testKafkaStream fails on OSX

2014-10-22 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180992#comment-14180992 ] Ryan Williams commented on SPARK-4002: -- [~jerryshao] cool, a couple of notes: * If

[jira] [Updated] (SPARK-4002) JavaKafkaStreamSuite.testKafkaStream fails on OSX

2014-10-22 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Williams updated SPARK-4002: - Attachment: unit-tests.log unit-tests.log file from running {{mvn clean test

[jira] [Commented] (SPARK-4002) JavaKafkaStreamSuite.testKafkaStream fails on OSX

2014-10-22 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181006#comment-14181006 ] Saisai Shao commented on SPARK-4002: Thanks a lot Ryan for your detailed description,

[jira] [Updated] (SPARK-4002) KafkaStreamSuite Kafka input stream case fails on OSX

2014-10-22 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Williams updated SPARK-4002: - Description: [~sowen] mentioned this on spark-dev

[jira] [Updated] (SPARK-4002) KafkaStreamSuite Kafka input stream case fails on OSX

2014-10-22 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Williams updated SPARK-4002: - Description: [~sowen] mentioned this on spark-dev

[jira] [Closed] (SPARK-4054) Dead link in README

2014-10-22 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta closed SPARK-4054. - Resolution: Not a Problem Dead link in README --- Key:

[jira] [Created] (SPARK-4055) Inconsistent spelling 'MLlib' and 'MLLib'

2014-10-22 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-4055: - Summary: Inconsistent spelling 'MLlib' and 'MLLib' Key: SPARK-4055 URL: https://issues.apache.org/jira/browse/SPARK-4055 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-4019) Shuffling with more than 2000 reducers may drop all data when partitions are mostly empty or cause deserialization errors if at least one partition is empty

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4019: -- Summary: Shuffling with more than 2000 reducers may drop all data when partitions are mostly empty or

[jira] [Commented] (SPARK-4055) Inconsistent spelling 'MLlib' and 'MLLib'

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181034#comment-14181034 ] Apache Spark commented on SPARK-4055: - User 'sarutak' has created a pull request for

[jira] [Created] (SPARK-4056) Upgrade snappy-java to 1.1.1.4

2014-10-22 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-4056: - Summary: Upgrade snappy-java to 1.1.1.4 Key: SPARK-4056 URL: https://issues.apache.org/jira/browse/SPARK-4056 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-3630) Identify cause of Kryo+Snappy PARSING_ERROR

2014-10-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181044#comment-14181044 ] Josh Rosen commented on SPARK-3630: --- snappy-java just published a new release (1.1.1.4)

[jira] [Created] (SPARK-4057) Use -agentlib instead of -Xdebug in sbt--launch-lib.bash for debugging

2014-10-22 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-4057: - Summary: Use -agentlib instead of -Xdebug in sbt--launch-lib.bash for debugging Key: SPARK-4057 URL: https://issues.apache.org/jira/browse/SPARK-4057 Project:

[jira] [Updated] (SPARK-4057) Use -agentlib instead of -Xdebug in sbt-launch-lib.bash for debugging

2014-10-22 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-4057: -- Summary: Use -agentlib instead of -Xdebug in sbt-launch-lib.bash for debugging (was: Use

[jira] [Commented] (SPARK-4057) Use -agentlib instead of -Xdebug in sbt-launch-lib.bash for debugging

2014-10-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181049#comment-14181049 ] Apache Spark commented on SPARK-4057: - User 'sarutak' has created a pull request for