[jira] [Closed] (SPARK-1784) Add a partitioner which partitions an RDD with each partition having specified # of keys

2014-05-29 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia closed SPARK-1784. Resolution: Invalid Fix Version/s: (was: 1.0.0) Add a partitioner which partitions an

[jira] [Updated] (SPARK-1811) Support resizable output buffer for kryo serializer

2014-05-29 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1811: - Assignee: Koert Kuipers Support resizable output buffer for kryo serializer

[jira] [Comment Edited] (SPARK-874) Have a --wait flag in ./sbin/stop-all.sh that polls until Worker's are finished

2014-05-29 Thread Archit Thakur (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012127#comment-14012127 ] Archit Thakur edited comment on SPARK-874 at 5/29/14 6:45 AM: --

[jira] [Commented] (SPARK-874) Have a --wait flag in ./sbin/stop-all.sh that polls until Worker's are finished

2014-05-29 Thread Archit Thakur (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012127#comment-14012127 ] Archit Thakur commented on SPARK-874: - I am intersted in taking it up. Have a --wait

[jira] [Created] (SPARK-1961) when data return from map is about 10 kb, reduce(_ + _) would always pending

2014-05-29 Thread zhoudi (JIRA)
zhoudi created SPARK-1961: - Summary: when data return from map is about 10 kb, reduce(_ + _) would always pending Key: SPARK-1961 URL: https://issues.apache.org/jira/browse/SPARK-1961 Project: Spark

[jira] [Updated] (SPARK-1962) Add RDD cache reference counting

2014-05-29 Thread Taeyun Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Taeyun Kim updated SPARK-1962: -- Description: It would be nice if the RDD cache() method incorporate a reference counting information.

[jira] [Updated] (SPARK-1962) Add RDD cache reference counting

2014-05-29 Thread Taeyun Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Taeyun Kim updated SPARK-1962: -- Affects Version/s: 1.0.0 Add RDD cache reference counting

[jira] [Commented] (SPARK-1963) Job aborted with NullPointerException from DAGScheduler.scala:1020

2014-05-29 Thread Kevin (Sangwoo) Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012181#comment-14012181 ] Kevin (Sangwoo) Kim commented on SPARK-1963: I guess the data is valid,

[jira] [Commented] (SPARK-1963) Job aborted with NullPointerException from DAGScheduler.scala:1020

2014-05-29 Thread Kevin (Sangwoo) Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012241#comment-14012241 ] Kevin (Sangwoo) Kim commented on SPARK-1963: I think the path

[jira] [Resolved] (SPARK-1963) Job aborted with NullPointerException from DAGScheduler.scala:1020

2014-05-29 Thread Kevin (Sangwoo) Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin (Sangwoo) Kim resolved SPARK-1963. Resolution: Invalid Job aborted with NullPointerException from

[jira] [Commented] (SPARK-1867) Spark Documentation Error causes java.lang.IllegalStateException: unread block data

2014-05-29 Thread sam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012295#comment-14012295 ] sam commented on SPARK-1867: [~srowen] Thanks, I still find it difficult to find the correct

[jira] [Commented] (SPARK-1867) Spark Documentation Error causes java.lang.IllegalStateException: unread block data

2014-05-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012309#comment-14012309 ] Sean Owen commented on SPARK-1867: -- There is no hadoop-io module. Modules are

[jira] [Commented] (SPARK-1948) Scalac crashes when building Spark in IntelliJ IDEA

2014-05-29 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012366#comment-14012366 ] Cheng Lian commented on SPARK-1948: --- Hi [~sowen], thanks for the suggestion! I tried to

[jira] [Commented] (SPARK-1867) Spark Documentation Error causes java.lang.IllegalStateException: unread block data

2014-05-29 Thread sam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012431#comment-14012431 ] sam commented on SPARK-1867: Hmm, the reason for my confusion is a very stange compile

[jira] [Commented] (SPARK-1867) Spark Documentation Error causes java.lang.IllegalStateException: unread block data

2014-05-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012434#comment-14012434 ] Sean Owen commented on SPARK-1867: -- Something else is up; those are equivalent in Java

[jira] [Updated] (SPARK-1518) Spark master doesn't compile against hadoop-common trunk

2014-05-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1518: --- Component/s: Spark Core Spark master doesn't compile against hadoop-common trunk

[jira] [Updated] (SPARK-1518) Spark master doesn't compile against hadoop-common trunk

2014-05-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1518: --- Target Version/s: 1.1.0, 1.0.1 (was: 1.0.1) Spark master doesn't compile against

[jira] [Commented] (SPARK-1961) when data return from map is about 10 kb, reduce(_ + _) would always pending

2014-05-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012471#comment-14012471 ] Patrick Wendell commented on SPARK-1961: Could you add a bit more information here

[jira] [Updated] (SPARK-1962) Add RDD cache reference counting

2014-05-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1962: --- Component/s: Spark Core Add RDD cache reference counting

[jira] [Resolved] (SPARK-1935) Explicitly add commons-codec 1.5 as a dependency

2014-05-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-1935. Resolution: Fixed Explicitly add commons-codec 1.5 as a dependency

[jira] [Updated] (SPARK-1935) Explicitly add commons-codec 1.5 as a dependency

2014-05-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1935: --- Fix Version/s: 1.0.1 1.1.0 Explicitly add commons-codec 1.5 as a

[jira] [Created] (SPARK-1964) Timestamp missing from HiveMetastore types parser

2014-05-29 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-1964: --- Summary: Timestamp missing from HiveMetastore types parser Key: SPARK-1964 URL: https://issues.apache.org/jira/browse/SPARK-1964 Project: Spark Issue

[jira] [Assigned] (SPARK-1964) Timestamp missing from HiveMetastore types parser

2014-05-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reassigned SPARK-1964: --- Assignee: Michael Armbrust Timestamp missing from HiveMetastore types parser

[jira] [Commented] (SPARK-1518) Spark master doesn't compile against hadoop-common trunk

2014-05-29 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012520#comment-14012520 ] Matei Zaharia commented on SPARK-1518: -- Sorry, I'm still not sure I understand what

[jira] [Commented] (SPARK-1964) Timestamp missing from HiveMetastore types parser

2014-05-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012543#comment-14012543 ] Michael Armbrust commented on SPARK-1964: -

[jira] [Commented] (SPARK-1518) Spark master doesn't compile against hadoop-common trunk

2014-05-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012552#comment-14012552 ] Sean Owen commented on SPARK-1518: -- Heh, I think the essence is: at least one more

[jira] [Commented] (SPARK-1952) slf4j version conflicts with pig

2014-05-29 Thread Ryan Compton (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012650#comment-14012650 ] Ryan Compton commented on SPARK-1952: - Thanks so much! Here's the fix I had to make

[jira] [Commented] (SPARK-1518) Spark master doesn't compile against hadoop-common trunk

2014-05-29 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012651#comment-14012651 ] Matei Zaharia commented on SPARK-1518: -- Okay, got it. But this only applies to you

[jira] [Commented] (SPARK-1518) Spark master doesn't compile against hadoop-common trunk

2014-05-29 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012655#comment-14012655 ] Matei Zaharia commented on SPARK-1518: -- BTW one other thing is that in 1.0, you can

[jira] [Created] (SPARK-1965) Spark UI throws NPE on trying to load the app page for non-existent app

2014-05-29 Thread Kay Ousterhout (JIRA)
Kay Ousterhout created SPARK-1965: - Summary: Spark UI throws NPE on trying to load the app page for non-existent app Key: SPARK-1965 URL: https://issues.apache.org/jira/browse/SPARK-1965 Project:

[jira] [Created] (SPARK-1966) Cannot cancel tasks running locally

2014-05-29 Thread Aaron Davidson (JIRA)
Aaron Davidson created SPARK-1966: - Summary: Cannot cancel tasks running locally Key: SPARK-1966 URL: https://issues.apache.org/jira/browse/SPARK-1966 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-1967) Using parallelize method to create RDD, wordcount app just hanging there without errors or warnings

2014-05-29 Thread Min Li (JIRA)
Min Li created SPARK-1967: - Summary: Using parallelize method to create RDD, wordcount app just hanging there without errors or warnings Key: SPARK-1967 URL: https://issues.apache.org/jira/browse/SPARK-1967

[jira] [Commented] (SPARK-1697) Driver error org.apache.spark.scheduler.TaskSetManager - Loss was due to java.io.FileNotFoundException

2014-05-29 Thread Arup Malakar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14012913#comment-14012913 ] Arup Malakar commented on SPARK-1697: - [~mridulm80] We saw this issue again. Are you

[jira] [Updated] (SPARK-1939) Refactor takeSample method in RDD to use ScaSRS

2014-05-29 Thread Doris Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doris Xin updated SPARK-1939: - Summary: Refactor takeSample method in RDD to use ScaSRS (was: Improve takeSample method in RDD)

[jira] [Updated] (SPARK-1958) Calling .collect() on a SchemaRDD should call executeCollect() on the underlying query plan.

2014-05-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-1958: Assignee: Cheng Lian Calling .collect() on a SchemaRDD should call executeCollect() on

[jira] [Updated] (SPARK-1852) SparkSQL Queries with Sorts run before the user asks them to

2014-05-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-1852: Assignee: Cheng Lian SparkSQL Queries with Sorts run before the user asks them to

[jira] [Updated] (SPARK-1947) Child of SumDistinct or Average should be widened to prevent overflows the same as Sum.

2014-05-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-1947: Assignee: Takuya Ueshin Child of SumDistinct or Average should be widened to prevent

[jira] [Updated] (SPARK-1968) SQL commands for caching tables

2014-05-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-1968: Component/s: SQL SQL commands for caching tables ---

[jira] [Commented] (SPARK-1911) Warn users that jars should be built with Java 6 for PySpark to work on YARN

2014-05-29 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14013142#comment-14013142 ] Tathagata Das commented on SPARK-1911: -- As far as I think, it is because Java 7 uses

[jira] [Comment Edited] (SPARK-1911) Warn users that jars should be built with Java 6 for PySpark to work on YARN

2014-05-29 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14013142#comment-14013142 ] Tathagata Das edited comment on SPARK-1911 at 5/30/14 12:31 AM: