[jira] [Commented] (SPARK-874) Have a --wait flag in ./sbin/stop-all.sh that polls until Worker's are finished

2014-05-28 Thread Archit Thakur (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14012127#comment-14012127 ] Archit Thakur commented on SPARK-874: - I am interested in taking it up. > Have a --wait

[jira] [Comment Edited] (SPARK-874) Have a --wait flag in ./sbin/stop-all.sh that polls until Worker's are finished

2014-05-28 Thread Archit Thakur (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14012127#comment-14012127 ] Archit Thakur edited comment on SPARK-874 at 5/29/14 6:45 AM: --

[jira] [Commented] (SPARK-1959) String "NULL" is interpreted as null value

2014-05-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14012118#comment-14012118 ] Cheng Lian commented on SPARK-1959: --- Pull request: https://github.com/apache/spark/pull/

[jira] [Updated] (SPARK-1811) Support resizable output buffer for kryo serializer

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1811: - Assignee: Koert Kuipers > Support resizable output buffer for kryo serializer > -

[jira] [Closed] (SPARK-1784) Add a partitioner which partitions an RDD with each partition having specified # of keys

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia closed SPARK-1784. Resolution: Invalid Fix Version/s: (was: 1.0.0) > Add a partitioner which partitions an

[jira] [Commented] (SPARK-1784) Add a partitioner which partitions an RDD with each partition having specified # of keys

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14012108#comment-14012108 ] Matei Zaharia commented on SPARK-1784: -- As discussed on https://github.com/apache/spa

[jira] [Updated] (SPARK-1960) EOFException when file size 0 exists when use sc.sequenceFile[K,V]("path")

2014-05-28 Thread Eunsu Yun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eunsu Yun updated SPARK-1960: - Description: java.io.EOFException is thrown when using sc.sequenceFile[K,V] if there is a file whose size is

[jira] [Created] (SPARK-1960) EOFException when 0 size file exists when use sc.sequenceFile[K,V]("path")

2014-05-28 Thread Eunsu Yun (JIRA)
Eunsu Yun created SPARK-1960: Summary: EOFException when 0 size file exists when use sc.sequenceFile[K,V]("path") Key: SPARK-1960 URL: https://issues.apache.org/jira/browse/SPARK-1960 Project: Spark

[jira] [Commented] (SPARK-1957) Pluggable disk store for BlockManager

2014-05-28 Thread Raymond Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14012081#comment-14012081 ] Raymond Liu commented on SPARK-1957: Initial pull request at : https://github.com/apac

[jira] [Resolved] (SPARK-1913) Parquet table column pruning error caused by filter pushdown

2014-05-28 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-1913. - Resolution: Fixed > Parquet table column pruning error caused by filter pushdown > --

[jira] [Updated] (SPARK-1913) Parquet table column pruning error caused by filter pushdown

2014-05-28 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-1913: Assignee: Cheng Lian > Parquet table column pruning error caused by filter pushdown > -

[jira] [Updated] (SPARK-1954) Make it easier to get Spark on YARN code to compile in IntelliJ

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1954: --- Issue Type: Improvement (was: Bug) > Make it easier to get Spark on YARN code to compile in

[jira] [Updated] (SPARK-1954) Make it easier to get Spark on YARN code to compile in IntelliJ

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1954: --- Component/s: Build > Make it easier to get Spark on YARN code to compile in IntelliJ > --

[jira] [Commented] (SPARK-1959) String "NULL" is interpreted as null value

2014-05-28 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14012058#comment-14012058 ] Michael Armbrust commented on SPARK-1959: - If all the hive tests still pass with t

[jira] [Commented] (SPARK-1952) slf4j version conflicts with pig

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14012040#comment-14012040 ] Patrick Wendell commented on SPARK-1952: So I think the issue here is simply that

[jira] [Commented] (SPARK-1959) String "NULL" is interpreted as null value

2014-05-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14012031#comment-14012031 ] Cheng Lian commented on SPARK-1959: --- The problematic line should be [this one|https://g

[jira] [Comment Edited] (SPARK-1959) String "NULL" is interpreted as null value

2014-05-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14012031#comment-14012031 ] Cheng Lian edited comment on SPARK-1959 at 5/29/14 3:54 AM: To

[jira] [Commented] (SPARK-1954) Make it easier to get Spark on YARN code to compile in IntelliJ

2014-05-28 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14012027#comment-14012027 ] Sandy Ryza commented on SPARK-1954: --- Cool. Your suggestion does appear to work. > Make

[jira] [Updated] (SPARK-1959) String "NULL" is interpreted as null value

2014-05-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-1959: -- Description: The {{HiveTableScan}} operator unwraps string "NULL" (case insensitive) into null values

[jira] [Updated] (SPARK-1901) Standalone worker update exector's state ahead of executor process exit

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1901: --- Fix Version/s: (was: 1.0.0) 1.0.1 > Standalone worker update exector's

[jira] [Created] (SPARK-1959) String "NULL" is interpreted as null value

2014-05-28 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-1959: - Summary: String "NULL" is interpreted as null value Key: SPARK-1959 URL: https://issues.apache.org/jira/browse/SPARK-1959 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-1958) Calling .collect() on a SchemaRDD should call executeCollect() on the underlying query plan.

2014-05-28 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-1958: --- Summary: Calling .collect() on a SchemaRDD should call executeCollect() on the underlying query plan. Key: SPARK-1958 URL: https://issues.apache.org/jira/browse/SPARK-1958

[jira] [Comment Edited] (SPARK-1112) When spark.akka.frameSize > 10, task results bigger than 10MiB block execution

2014-05-28 Thread Kevin (Sangwoo) Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011968#comment-14011968 ] Kevin (Sangwoo) Kim edited comment on SPARK-1112 at 5/29/14 2:50 AM: ---

[jira] [Commented] (SPARK-1112) When spark.akka.frameSize > 10, task results bigger than 10MiB block execution

2014-05-28 Thread Kevin (Sangwoo) Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011993#comment-14011993 ] Kevin (Sangwoo) Kim commented on SPARK-1112: [~matei] I've found the default o

[jira] [Commented] (SPARK-1112) When spark.akka.frameSize > 10, task results bigger than 10MiB block execution

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011978#comment-14011978 ] Matei Zaharia commented on SPARK-1112: -- I'm curious, why did you want to make the fra

[jira] [Updated] (SPARK-1112) When spark.akka.frameSize > 10, task results bigger than 10MiB block execution

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1112: - Priority: Critical (was: Blocker) > When spark.akka.frameSize > 10, task results bigger than 10M

[jira] [Commented] (SPARK-1952) slf4j version conflicts with pig

2014-05-28 Thread Ryan Compton (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011971#comment-14011971 ] Ryan Compton commented on SPARK-1952: - No luck. I modified project/SparkBuild.scala {

[jira] [Comment Edited] (SPARK-1112) When spark.akka.frameSize > 10, task results bigger than 10MiB block execution

2014-05-28 Thread Kevin (Sangwoo) Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011968#comment-14011968 ] Kevin (Sangwoo) Kim edited comment on SPARK-1112 at 5/29/14 2:01 AM: ---

[jira] [Commented] (SPARK-1112) When spark.akka.frameSize > 10, task results bigger than 10MiB block execution

2014-05-28 Thread Kevin (Sangwoo) Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011968#comment-14011968 ] Kevin (Sangwoo) Kim commented on SPARK-1112: Hi all, I'm very new to Spark a

[jira] [Commented] (SPARK-1518) Spark master doesn't compile against hadoop-common trunk

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011942#comment-14011942 ] Matei Zaharia commented on SPARK-1518: -- Sean, the model for linking to Hadoop has bee

[jira] [Updated] (SPARK-1957) Pluggable disk store for BlockManager

2014-05-28 Thread Raymond Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Liu updated SPARK-1957: --- Issue Type: Sub-task (was: New Feature) Parent: SPARK-1733 > Pluggable disk store for BlockM

[jira] [Commented] (SPARK-1518) Spark master doesn't compile against hadoop-common trunk

2014-05-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011897#comment-14011897 ] Sean Owen commented on SPARK-1518: -- "they write their app against the Spark API's in Mave

[jira] [Created] (SPARK-1957) Pluggable disk store for BlockManager

2014-05-28 Thread Raymond Liu (JIRA)
Raymond Liu created SPARK-1957: -- Summary: Pluggable disk store for BlockManager Key: SPARK-1957 URL: https://issues.apache.org/jira/browse/SPARK-1957 Project: Spark Issue Type: New Feature

[jira] [Comment Edited] (SPARK-1931) Graph.partitionBy does not reconstruct routing tables

2014-05-28 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14010340#comment-14010340 ] Ankur Dave edited comment on SPARK-1931 at 5/29/14 12:11 AM: -

[jira] [Commented] (SPARK-1518) Spark master doesn't compile against hadoop-common trunk

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011851#comment-14011851 ] Patrick Wendell commented on SPARK-1518: bq. In practice it look like one generic

[jira] [Commented] (SPARK-1954) Make it easier to get Spark on YARN code to compile in IntelliJ

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011842#comment-14011842 ] Patrick Wendell commented on SPARK-1954: I agree, I was just wondering if we have

[jira] [Commented] (SPARK-1954) Make it easier to get Spark on YARN code to compile in IntelliJ

2014-05-28 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011837#comment-14011837 ] Sandy Ryza commented on SPARK-1954: --- Trying this now. But, assuming it works, I still t

[jira] [Updated] (SPARK-1712) ParallelCollectionRDD operations hanging forever without any error messages

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1712: - Fix Version/s: 0.9.2 > ParallelCollectionRDD operations hanging forever without any error message

[jira] [Commented] (SPARK-1712) ParallelCollectionRDD operations hanging forever without any error messages

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011830#comment-14011830 ] Matei Zaharia commented on SPARK-1712: -- Merged the frame size check into 0.9.2 as wel

[jira] [Resolved] (SPARK-1950) spark on yarn can't start

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-1950. Resolution: Duplicate > spark on yarn can't start > -- > >

[jira] [Commented] (SPARK-1954) Make it easier to get Spark on YARN code to compile in IntelliJ

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011810#comment-14011810 ] Patrick Wendell commented on SPARK-1954: Have you tried running sbt/sbt gen-idea w

[jira] [Commented] (SPARK-1952) slf4j version conflicts with pig

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011808#comment-14011808 ] Patrick Wendell commented on SPARK-1952: [~rcompton] - what if you modify the spar

[jira] [Commented] (SPARK-1952) slf4j version conflicts with pig

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011806#comment-14011806 ] Patrick Wendell commented on SPARK-1952: Hm, unfortunately I don't see any obvious

[jira] [Commented] (SPARK-1952) slf4j version conflicts with pig

2014-05-28 Thread Ryan Compton (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011789#comment-14011789 ] Ryan Compton commented on SPARK-1952: - Pig depends on slf4j 1.6.1 {code} rfcompton@no

[jira] [Updated] (SPARK-1712) ParallelCollectionRDD operations hanging forever without any error messages

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1712: - Fix Version/s: 1.0.1 > ParallelCollectionRDD operations hanging forever without any error message

[jira] [Updated] (SPARK-1817) RDD zip erroneous when partitions do not divide RDD count

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1817: - Priority: Major (was: Minor) > RDD zip erroneous when partitions do not divide RDD count > -

[jira] [Updated] (SPARK-1817) RDD zip erroneous when partitions do not divide RDD count

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1817: - Priority: Minor (was: Blocker) > RDD zip erroneous when partitions do not divide RDD count > ---

[jira] [Updated] (SPARK-1712) ParallelCollectionRDD operations hanging forever without any error messages

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1712: - Priority: Major (was: Blocker) > ParallelCollectionRDD operations hanging forever without any er

[jira] [Resolved] (SPARK-1712) ParallelCollectionRDD operations hanging forever without any error messages

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1712. -- Resolution: Fixed > ParallelCollectionRDD operations hanging forever without any error messages

[jira] [Updated] (SPARK-1759) sbt/sbt package fail cause by directory

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1759: --- Fix Version/s: (was: 0.9.1) 0.9.2 > sbt/sbt package fail cause by dire

[jira] [Updated] (SPARK-1576) Passing of JAVA_OPTS to YARN on command line

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1576: --- Fix Version/s: (was: 0.9.1) 0.9.2 > Passing of JAVA_OPTS to YARN on co

[jira] [Updated] (SPARK-1849) Broken UTF-8 encoded data gets character replacements and thus can't be "fixed"

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1849: --- Fix Version/s: (was: 0.9.1) 0.9.2 > Broken UTF-8 encoded data gets cha

[jira] [Resolved] (SPARK-1916) SparkFlumeEvent with body bigger than 1020 bytes are not read properly

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-1916. Resolution: Fixed Fix Version/s: 0.9.2 1.0.1 Issue resolved by pu

[jira] [Commented] (SPARK-1956) Enable shuffle consolidation by default

2014-05-28 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011745#comment-14011745 ] Sandy Ryza commented on SPARK-1956: --- Are there JIRAs for those fixes? It would be good

[jira] [Commented] (SPARK-1956) Enable shuffle consolidation by default

2014-05-28 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011741#comment-14011741 ] Mridul Muralidharan commented on SPARK-1956: shuffle consolidation MUST NOT be

[jira] [Updated] (SPARK-1952) slf4j version conflicts with pig

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1952: --- Fix Version/s: (was: 1.0.0) > slf4j version conflicts with pig >

[jira] [Updated] (SPARK-1952) slf4j version conflicts with pig

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1952: --- Target Version/s: 1.0.1 (was: 1.0.0) > slf4j version conflicts with pig > --

[jira] [Created] (SPARK-1956) Enable shuffle consolidation by default

2014-05-28 Thread Sandy Ryza (JIRA)
Sandy Ryza created SPARK-1956: - Summary: Enable shuffle consolidation by default Key: SPARK-1956 URL: https://issues.apache.org/jira/browse/SPARK-1956 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-1952) slf4j version conflicts with pig

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011711#comment-14011711 ] Matei Zaharia commented on SPARK-1952: -- Ryan, do you know what SLF4J version Pig need

[jira] [Resolved] (SPARK-1501) Assertions in Graph.apply test are never executed

2014-05-28 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave resolved SPARK-1501. --- Resolution: Fixed Assignee: William Benton > Assertions in Graph.apply test are never executed

[jira] [Updated] (SPARK-1955) VertexRDD can incorrectly assume index sharing

2014-05-28 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave updated SPARK-1955: -- Description: Many VertexRDD operations (diff, leftJoin, innerJoin) can use a fast zip join if both ope

[jira] [Updated] (SPARK-1955) VertexRDD can incorrectly assume index sharing

2014-05-28 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave updated SPARK-1955: -- Description: Many VertexRDD operations (diff, leftJoin, innerJoin) can use a fast zip join if both ope

[jira] [Created] (SPARK-1955) VertexRDD can incorrectly assume index sharing

2014-05-28 Thread Ankur Dave (JIRA)
Ankur Dave created SPARK-1955: - Summary: VertexRDD can incorrectly assume index sharing Key: SPARK-1955 URL: https://issues.apache.org/jira/browse/SPARK-1955 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-1954) Make it easier to get Spark on YARN code to compile in IntelliJ

2014-05-28 Thread Sandy Ryza (JIRA)
Sandy Ryza created SPARK-1954: - Summary: Make it easier to get Spark on YARN code to compile in IntelliJ Key: SPARK-1954 URL: https://issues.apache.org/jira/browse/SPARK-1954 Project: Spark Issu

[jira] [Updated] (SPARK-1952) slf4j version conflicts with pig

2014-05-28 Thread Ryan Compton (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Compton updated SPARK-1952: Description: Upgrading from Spark-0.9.1 to Spark-1.0.0 causes all Pig scripts to fail when they "r

[jira] [Updated] (SPARK-1952) slf4j version conflicts with pig

2014-05-28 Thread Ryan Compton (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Compton updated SPARK-1952: Description: Upgrading from Spark-0.9.1 to Spark-1.0.0 causes all Pig scripts to fail when they "r

[jira] [Updated] (SPARK-1952) slf4j version conflicts with pig

2014-05-28 Thread Ryan Compton (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Compton updated SPARK-1952: Description: Upgrading from Spark-0.9.1 to Spark-1.0.0 causes all Pig scripts to fail when they "r

[jira] [Created] (SPARK-1953) yarn client mode Application Master memory size is same as driver memory size

2014-05-28 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-1953: Summary: yarn client mode Application Master memory size is same as driver memory size Key: SPARK-1953 URL: https://issues.apache.org/jira/browse/SPARK-1953 Project:

[jira] [Updated] (SPARK-1790) Update EC2 scripts to support r3 instance types

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1790: - Labels: Starter (was: starter) > Update EC2 scripts to support r3 instance types > -

[jira] [Commented] (SPARK-1790) Update EC2 scripts to support r3 instance types

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011548#comment-14011548 ] Matei Zaharia commented on SPARK-1790: -- Thanks Sujeet! Just post here when you have a

[jira] [Updated] (SPARK-1790) Update EC2 scripts to support r3 instance types

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1790: - Assignee: Sujeet Varakhedi > Update EC2 scripts to support r3 instance types > --

[jira] [Resolved] (SPARK-1936) Add apache header and remove author tags

2014-05-28 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1936. -- Resolution: Won't Fix We should not change these files' license headers because they're files w

[jira] [Created] (SPARK-1952) slf4j version conflicts with pig

2014-05-28 Thread Ryan Compton (JIRA)
Ryan Compton created SPARK-1952: --- Summary: slf4j version conflicts with pig Key: SPARK-1952 URL: https://issues.apache.org/jira/browse/SPARK-1952 Project: Spark Issue Type: Bug Compon

[jira] [Resolved] (SPARK-1836) REPL $outer type mismatch causes lookup() and equals() problems

2014-05-28 Thread Michael Malak (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Malak resolved SPARK-1836. -- Resolution: Duplicate > REPL $outer type mismatch causes lookup() and equals() problems > -

[jira] [Commented] (SPARK-1199) Type mismatch in Spark shell when using case class defined in shell

2014-05-28 Thread Michael Malak (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011492#comment-14011492 ] Michael Malak commented on SPARK-1199: -- See also additional test cases in https://is

[jira] [Commented] (SPARK-1836) REPL $outer type mismatch causes lookup() and equals() problems

2014-05-28 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011477#comment-14011477 ] Michael Armbrust commented on SPARK-1836: - Yeah I think it's likely they are relate

[jira] [Updated] (SPARK-1916) SparkFlumeEvent with body bigger than 1020 bytes are not read properly

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1916: --- Assignee: David Lemieux > SparkFlumeEvent with body bigger than 1020 bytes are not read prope

[jira] [Commented] (SPARK-1518) Spark master doesn't compile against hadoop-common trunk

2014-05-28 Thread Colin Patrick McCabe (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011350#comment-14011350 ] Colin Patrick McCabe commented on SPARK-1518: - bq. Re: versioning one more tim

[jira] [Commented] (SPARK-1950) spark on yarn can't start

2014-05-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011330#comment-14011330 ] Sean Owen commented on SPARK-1950: -- (Looks like you opened this twice? https://issues.ap

[jira] [Commented] (SPARK-1951) spark on yarn can't start

2014-05-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011327#comment-14011327 ] Sean Owen commented on SPARK-1951: -- Yes, you should prefix HDFS file locations with hdfs:

[jira] [Created] (SPARK-1951) spark on yarn can't start

2014-05-28 Thread Guoqiang Li (JIRA)
Guoqiang Li created SPARK-1951: -- Summary: spark on yarn can't start Key: SPARK-1951 URL: https://issues.apache.org/jira/browse/SPARK-1951 Project: Spark Issue Type: Bug Components: YA

[jira] [Created] (SPARK-1950) spark on yarn can't start

2014-05-28 Thread Guoqiang Li (JIRA)
Guoqiang Li created SPARK-1950: -- Summary: spark on yarn can't start Key: SPARK-1950 URL: https://issues.apache.org/jira/browse/SPARK-1950 Project: Spark Issue Type: Bug Components: YA

[jira] [Updated] (SPARK-1951) spark on yarn can't start

2014-05-28 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-1951: --- Description: {{HADOOP_CONF_DIR=/etc/hadoop/conf ./bin/spark-submit --archives /input/lbs/recommend/

[jira] [Created] (SPARK-1949) Servlet 2.5 vs 3.0 conflict in SBT build

2014-05-28 Thread Sean Owen (JIRA)
Sean Owen created SPARK-1949: Summary: Servlet 2.5 vs 3.0 conflict in SBT build Key: SPARK-1949 URL: https://issues.apache.org/jira/browse/SPARK-1949 Project: Spark Issue Type: Bug Comp

[jira] [Commented] (SPARK-1948) Scalac crashes when building Spark in IntelliJ IDEA

2014-05-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011134#comment-14011134 ] Sean Owen commented on SPARK-1948: -- This is likely a scalac or IntelliJ problem, indeed.

[jira] [Updated] (SPARK-1948) Scalac crashes when building Spark in IntelliJ IDEA

2014-05-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-1948: -- Attachment: scalac-crash.log > Scalac crashes when building Spark in IntelliJ IDEA > --

[jira] [Created] (SPARK-1948) Scalac crashes when building Spark in IntelliJ IDEA

2014-05-28 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-1948: - Summary: Scalac crashes when building Spark in IntelliJ IDEA Key: SPARK-1948 URL: https://issues.apache.org/jira/browse/SPARK-1948 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-1825) Windows Spark fails to work with Linux YARN

2014-05-28 Thread Taeyun Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Taeyun Kim updated SPARK-1825: -- Description: Windows Spark fails to work with Linux YARN. This is a cross-platform problem. This error

[jira] [Commented] (SPARK-1947) Child of SumDistinct or Average should be widened to prevent overflows the same as Sum.

2014-05-28 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14010945#comment-14010945 ] Takuya Ueshin commented on SPARK-1947: -- PRed: https://github.com/apache/spark/pull/90

[jira] [Created] (SPARK-1947) Child of SumDistinct or Average should be widened to prevent overflows the same as Sum.

2014-05-28 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-1947: Summary: Child of SumDistinct or Average should be widened to prevent overflows the same as Sum. Key: SPARK-1947 URL: https://issues.apache.org/jira/browse/SPARK-1947

[jira] [Commented] (SPARK-1518) Spark master doesn't compile against hadoop-common trunk

2014-05-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14010937#comment-14010937 ] Sean Owen commented on SPARK-1518: -- Re: versioning one more time, really supporting a bun

[jira] [Commented] (SPARK-1495) support leftsemijoin for sparkSQL

2014-05-28 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14010932#comment-14010932 ] Adrian Wang commented on SPARK-1495: Another PR [https://github.com/apache/spark/pull/

[jira] [Updated] (SPARK-1946) Submit stage after executors have been registered

2014-05-28 Thread Zhihui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhihui updated SPARK-1946: -- Description: Because creating TaskSetManager and registering executors are asynchronous, in most situations, ea

[jira] [Commented] (SPARK-1946) Submit stage after executors have been registered

2014-05-28 Thread Zhihui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14010877#comment-14010877 ] Zhihui commented on SPARK-1946: --- I submitted a PR for proposal-1 https://github.com/apache/spar

[jira] [Updated] (SPARK-1946) Submit stage after executors have been registered

2014-05-28 Thread Zhihui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhihui updated SPARK-1946: -- Attachment: Spark Task Scheduler Optimization Proposal.pptx > Submit stage after executors have been registered

[jira] [Commented] (SPARK-1552) GraphX performs type comparison incorrectly

2014-05-28 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14010869#comment-14010869 ] Ankur Dave commented on SPARK-1552: --- Alternatively, we could introduce type-preserving v

[jira] [Assigned] (SPARK-1552) GraphX performs type comparison incorrectly

2014-05-28 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave reassigned SPARK-1552: - Assignee: Ankur Dave > GraphX performs type comparison incorrectly >

[jira] [Created] (SPARK-1946) Submit stage after executors have been registered

2014-05-28 Thread Zhihui (JIRA)
Zhihui created SPARK-1946: - Summary: Submit stage after executors have been registered Key: SPARK-1946 URL: https://issues.apache.org/jira/browse/SPARK-1946 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-1944) Document --verbose in spark-shell -h

2014-05-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1944: --- Assignee: Andrew Ash > Document --verbose in spark-shell -h > ---