[jira] [Created] (SPARK-2640) In "local[N]", free cores of the only executor should be touched by "spark.task.cpus" for every finish/start-up of tasks.

2014-07-22 Thread woshilaiceshide (JIRA)
woshilaiceshide created SPARK-2640: -- Summary: In "local[N]", free cores of the only executor should be touched by "spark.task.cpus" for every finish/start-up of tasks. Key: SPARK-2640 URL: https://issues.apache.o

[jira] [Commented] (SPARK-2630) Input data size of CoalescedRDD is incorrect

2014-07-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071407#comment-14071407 ] Davies Liu commented on SPARK-2630: --- [~tsudukim], it make sense, I had change the title.

[jira] [Updated] (SPARK-2630) Input data size of CoalescedRDD is incorrect

2014-07-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-2630: -- Summary: Input data size of CoalescedRDD is incorrect (was: Input data size goes overflow when size is

[jira] [Commented] (SPARK-2633) support register spark listener to listener bus with Java API

2014-07-22 Thread Chengxiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071402#comment-14071402 ] Chengxiang Li commented on SPARK-2633: -- For Hive job status monitor, spark listener m

[jira] [Commented] (SPARK-2630) Input data size goes overflow when size is large then 4G in one task

2014-07-22 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071396#comment-14071396 ] Masayoshi TSUZUKI commented on SPARK-2630: -- I looked the source code. I think it

[jira] [Commented] (SPARK-2575) SVMWithSGD throwing Input Validation failed

2014-07-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071384#comment-14071384 ] Xiangrui Meng commented on SPARK-2575: -- Do you have labels with values other than 0.0

[jira] [Commented] (SPARK-2632) Importing a method of class in Spark REPL causes the REPL to pulls in unnecessary stuff.

2014-07-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071365#comment-14071365 ] Yin Huai commented on SPARK-2632: - I have created a a [REPL test|https://github.com/yhuai

[jira] [Updated] (SPARK-2639) Under execute tab in web UI, # Completed task is more than # Total tasks

2014-07-22 Thread npanj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] npanj updated SPARK-2639: - Description: Under execute tab in web UI, # Completed task is more than #Total task => active task is -ive? Is i

[jira] [Updated] (SPARK-2639) Under execute tab in web UI, # Completed task is less than # Total tasks

2014-07-22 Thread npanj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] npanj updated SPARK-2639: - Attachment: Screen Shot 2014-07-22 at 9.30.05 PM.png > Under execute tab in web UI, # Completed task is less than

[jira] [Commented] (SPARK-2638) Improve concurrency of fetching Map outputs

2014-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071351#comment-14071351 ] Apache Spark commented on SPARK-2638: - User 'javadba' has created a pull request for t

[jira] [Created] (SPARK-2639) Under execute tab in web UI, # Completed task is less than # Total tasks

2014-07-22 Thread npanj (JIRA)
npanj created SPARK-2639: Summary: Under execute tab in web UI, # Completed task is less than # Total tasks Key: SPARK-2639 URL: https://issues.apache.org/jira/browse/SPARK-2639 Project: Spark Issue

[jira] [Commented] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071350#comment-14071350 ] Yin Huai commented on SPARK-2576: - OK. I have created a [REPL test|https://github.com/yhu

[jira] [Updated] (SPARK-2632) Importing a method of class in Spark REPL causes the REPL to pulls in unnecessary stuff.

2014-07-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-2632: Affects Version/s: 1.0.1 1.0.0 > Importing a method of class in Spark REPL causes th

[jira] [Created] (SPARK-2638) Improve concurrency of fetching Map outputs

2014-07-22 Thread Stephen Boesch (JIRA)
Stephen Boesch created SPARK-2638: - Summary: Improve concurrency of fetching Map outputs Key: SPARK-2638 URL: https://issues.apache.org/jira/browse/SPARK-2638 Project: Spark Issue Type: Impro

[jira] [Commented] (SPARK-2634) MapOutputTrackerWorker.mapStatuses should be thread-safe

2014-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071317#comment-14071317 ] Apache Spark commented on SPARK-2634: - User 'zsxwing' has created a pull request for t

[jira] [Created] (SPARK-2637) PEP8 Compliance pull request #1540

2014-07-22 Thread Vincent Ohprecio (JIRA)
Vincent Ohprecio created SPARK-2637: --- Summary: PEP8 Compliance pull request #1540 Key: SPARK-2637 URL: https://issues.apache.org/jira/browse/SPARK-2637 Project: Spark Issue Type: Documentat

[jira] [Commented] (SPARK-2633) support register spark listener to listener bus with Java API

2014-07-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071312#comment-14071312 ] Reynold Xin commented on SPARK-2633: Yes we should do this. Maybe we should take this

[jira] [Commented] (SPARK-2636) no where to get job identifier while submit spark job through spark API

2014-07-22 Thread Chengxiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071311#comment-14071311 ] Chengxiang Li commented on SPARK-2636: -- cc [~rxin] [~xuefuz] > no where to get job i

[jira] [Updated] (SPARK-2636) no where to get job identifier while submit spark job through spark API

2014-07-22 Thread Chengxiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chengxiang Li updated SPARK-2636: - Description: In Hive on Spark, we want to track spark job status through Spark API, the basic id

[jira] [Updated] (SPARK-2636) no where to get job identifier while submit spark job through spark API

2014-07-22 Thread Chengxiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chengxiang Li updated SPARK-2636: - Description: In Hive on Spark, we want to track spark job status through Spark API, the basic id

[jira] [Created] (SPARK-2636) no where to get job identifier while submit spark job through spark API

2014-07-22 Thread Chengxiang Li (JIRA)
Chengxiang Li created SPARK-2636: Summary: no where to get job identifier while submit spark job through spark API Key: SPARK-2636 URL: https://issues.apache.org/jira/browse/SPARK-2636 Project: Spark

[jira] [Commented] (SPARK-2633) support register spark listener to listener bus with Java API

2014-07-22 Thread Chengxiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071310#comment-14071310 ] Chengxiang Li commented on SPARK-2633: -- cc [~rxin] [~xuefuz] > support register spar

[jira] [Commented] (SPARK-2635) Fix race condition at SchedulerBackend.isReady in standalone mode

2014-07-22 Thread Zhihui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071302#comment-14071302 ] Zhihui commented on SPARK-2635: --- PR https://github.com/apache/spark/pull/1525 > Fix race co

[jira] [Created] (SPARK-2635) Fix race condition at SchedulerBackend.isReady in standalone mode

2014-07-22 Thread Zhihui (JIRA)
Zhihui created SPARK-2635: - Summary: Fix race condition at SchedulerBackend.isReady in standalone mode Key: SPARK-2635 URL: https://issues.apache.org/jira/browse/SPARK-2635 Project: Spark Issue Type

[jira] [Commented] (SPARK-2260) Spark submit standalone-cluster mode is broken

2014-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071296#comment-14071296 ] Apache Spark commented on SPARK-2260: - User 'andrewor14' has created a pull request fo

[jira] [Created] (SPARK-2634) MapOutputTrackerWorker.mapStatuses should be thread-safe

2014-07-22 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-2634: --- Summary: MapOutputTrackerWorker.mapStatuses should be thread-safe Key: SPARK-2634 URL: https://issues.apache.org/jira/browse/SPARK-2634 Project: Spark Issue Ty

[jira] [Commented] (SPARK-975) Spark Replay Debugger

2014-07-22 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071284#comment-14071284 ] Cheng Lian commented on SPARK-975: -- Red lines indicate wide dependencies (shuffles are int

[jira] [Commented] (SPARK-975) Spark Replay Debugger

2014-07-22 Thread Phuoc Do (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071280#comment-14071280 ] Phuoc Do commented on SPARK-975: In the old diagram, there are red links 5 - 3, 9 - 4. What

[jira] [Commented] (SPARK-975) Spark Replay Debugger

2014-07-22 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071275#comment-14071275 ] Cheng Lian commented on SPARK-975: -- No it doesn't. I chose rectangles just because more te

[jira] [Commented] (SPARK-975) Spark Replay Debugger

2014-07-22 Thread Phuoc Do (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071274#comment-14071274 ] Phuoc Do commented on SPARK-975: Does the shape have any significance. I saw it was rectang

[jira] [Commented] (SPARK-975) Spark Replay Debugger

2014-07-22 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071273#comment-14071273 ] Cheng Lian commented on SPARK-975: -- [~phuocd] Yea, exactly :) > Spark Replay Debugger > -

[jira] [Updated] (SPARK-975) Spark Replay Debugger

2014-07-22 Thread Phuoc Do (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phuoc Do updated SPARK-975: --- Attachment: (was: IMG_20140722_184149.jpg) > Spark Replay Debugger > - > >

[jira] [Updated] (SPARK-975) Spark Replay Debugger

2014-07-22 Thread Phuoc Do (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phuoc Do updated SPARK-975: --- Attachment: IMG_20140722_184149.jpg > Spark Replay Debugger > - > > Key: S

[jira] [Resolved] (SPARK-2577) File upload to viewfs is broken due to mount point resolution

2014-07-22 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-2577. -- Resolution: Fixed Fix Version/s: 1.1.0 Target Version/s: 1.1.0 > File upload t

[jira] [Commented] (SPARK-975) Spark Replay Debugger

2014-07-22 Thread Phuoc Do (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071263#comment-14071263 ] Phuoc Do commented on SPARK-975: Cheng Lian, maybe something like this: !IMG_20140722_1841

[jira] [Comment Edited] (SPARK-975) Spark Replay Debugger

2014-07-22 Thread Phuoc Do (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071263#comment-14071263 ] Phuoc Do edited comment on SPARK-975 at 7/23/14 2:05 AM: - Cheng Lia

[jira] [Created] (SPARK-2633) support register spark listener to listener bus with Java API

2014-07-22 Thread Chengxiang Li (JIRA)
Chengxiang Li created SPARK-2633: Summary: support register spark listener to listener bus with Java API Key: SPARK-2633 URL: https://issues.apache.org/jira/browse/SPARK-2633 Project: Spark

[jira] [Updated] (SPARK-975) Spark Replay Debugger

2014-07-22 Thread Phuoc Do (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phuoc Do updated SPARK-975: --- Attachment: IMG_20140722_184149.jpg > Spark Replay Debugger > - > > Key: S

[jira] [Commented] (SPARK-2037) yarn client mode doesn't support spark.yarn.max.executor.failures

2014-07-22 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071261#comment-14071261 ] Thomas Graves commented on SPARK-2037: -- https://github.com/apache/spark/pull/1180 >

[jira] [Commented] (SPARK-2630) Input data size goes overflow when size is large then 4G in one task

2014-07-22 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071246#comment-14071246 ] Masayoshi TSUZUKI commented on SPARK-2630: -- The same problem occurs even when I u

[jira] [Resolved] (SPARK-2606) In some cases, pages display incorrect in spark UI

2014-07-22 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-2606. -- Resolution: Fixed Fix Version/s: 1.1.0 Target Version/s: 1.1.0 > In some cases

[jira] [Comment Edited] (SPARK-2379) stopReceive in dead loop, cause stackoverflow exception

2014-07-22 Thread sunsc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071240#comment-14071240 ] sunsc edited comment on SPARK-2379 at 7/23/14 1:27 AM: --- --- a/spark

[jira] [Comment Edited] (SPARK-2379) stopReceive in dead loop, cause stackoverflow exception

2014-07-22 Thread sunsc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071240#comment-14071240 ] sunsc edited comment on SPARK-2379 at 7/23/14 1:26 AM: --- --- a/spark

[jira] [Comment Edited] (SPARK-2379) stopReceive in dead loop, cause stackoverflow exception

2014-07-22 Thread sunsc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071240#comment-14071240 ] sunsc edited comment on SPARK-2379 at 7/23/14 1:26 AM: --- --- a/spark

[jira] [Commented] (SPARK-2379) stopReceive in dead loop, cause stackoverflow exception

2014-07-22 Thread sunsc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071238#comment-14071238 ] sunsc commented on SPARK-2379: -- Easily to reproduce this bug. java.lang.StackOverflowError

[jira] [Comment Edited] (SPARK-2379) stopReceive in dead loop, cause stackoverflow exception

2014-07-22 Thread sunsc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071240#comment-14071240 ] sunsc edited comment on SPARK-2379 at 7/23/14 1:26 AM: --- --- a/spark

[jira] [Commented] (SPARK-2379) stopReceive in dead loop, cause stackoverflow exception

2014-07-22 Thread sunsc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071240#comment-14071240 ] sunsc commented on SPARK-2379: -- --- a/spark/streaming/src/main/scala/org/apache/spark/stream

[jira] [Commented] (SPARK-975) Spark Replay Debugger

2014-07-22 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071232#comment-14071232 ] Cheng Lian commented on SPARK-975: -- Hey [~phuocd], that image actually shows exactly the s

[jira] [Resolved] (SPARK-2615) Add "==" support for HiveQl

2014-07-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2615. - Resolution: Fixed Fix Version/s: 1.0.2 1.1.0 > Add "==" support

[jira] [Commented] (SPARK-975) Spark Replay Debugger

2014-07-22 Thread Phuoc Do (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071225#comment-14071225 ] Phuoc Do commented on SPARK-975: Cheng Lian, some JS libraries that can draw flow diagrams:

[jira] [Commented] (SPARK-2614) Add the spark-examples-xxx-.jar to the Debian package created by assembly/pom.xml (e.g. -Pdeb)

2014-07-22 Thread Christian Tzolov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071220#comment-14071220 ] Christian Tzolov commented on SPARK-2614: - Fair point [~markhamstra]. I agree abou

[jira] [Commented] (SPARK-2615) Add "==" support for HiveQl

2014-07-22 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071214#comment-14071214 ] Cheng Hao commented on SPARK-2615: -- Yes, that's true. But "==" is actually used in lots o

[jira] [Commented] (SPARK-2282) PySpark crashes if too many tasks complete quickly

2014-07-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071197#comment-14071197 ] Patrick Wendell commented on SPARK-2282: Ah my b. I was confused. > PySpark crash

[jira] [Commented] (SPARK-2282) PySpark crashes if too many tasks complete quickly

2014-07-22 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071192#comment-14071192 ] Aaron Davidson commented on SPARK-2282: --- [~pwendell] That would in general be the ri

[jira] [Updated] (SPARK-2632) Importing a method of class in Spark REPL causes the REPL to pulls in unnecessary stuff.

2014-07-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-2632: Description: Master is affected by this bug. To reproduce the exception, you can start a local cluster (s

[jira] [Commented] (SPARK-2632) Importing a method of class in Spark REPL causes the REPL to pulls in unnecessary stuff.

2014-07-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071177#comment-14071177 ] Yin Huai commented on SPARK-2632: - [~prashant_] Can you take a look at this issue? Thanks:

[jira] [Created] (SPARK-2632) Importing a method of class in Spark REPL causes the REPL to pulls in unnecessary stuff.

2014-07-22 Thread Yin Huai (JIRA)
Yin Huai created SPARK-2632: --- Summary: Importing a method of class in Spark REPL causes the REPL to pulls in unnecessary stuff. Key: SPARK-2632 URL: https://issues.apache.org/jira/browse/SPARK-2632 Project:

[jira] [Updated] (SPARK-2010) Support for nested data in PySpark SQL

2014-07-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2010: Priority: Blocker (was: Critical) > Support for nested data in PySpark SQL > -

[jira] [Updated] (SPARK-1547) Add gradient boosting algorithm to MLlib

2014-07-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1547: - Target Version/s: (was: 1.1.0) > Add gradient boosting algorithm to MLlib > ---

[jira] [Updated] (SPARK-1545) Add Random Forest algorithm to MLlib

2014-07-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1545: - Target Version/s: (was: 1.1.0) > Add Random Forest algorithm to MLlib > ---

[jira] [Resolved] (SPARK-2613) CLONE - word2vec: Distributed Representation of Words

2014-07-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2613. -- Resolution: Duplicate > CLONE - word2vec: Distributed Representation of Words > ---

[jira] [Updated] (SPARK-2426) Quadratic Minimization for MLlib ALS

2014-07-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2426: - Target Version/s: (was: 1.1.0) > Quadratic Minimization for MLlib ALS > ---

[jira] [Commented] (SPARK-2282) PySpark crashes if too many tasks complete quickly

2014-07-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071039#comment-14071039 ] Patrick Wendell commented on SPARK-2282: [~carlilek] I'd actually recommend just p

[jira] [Updated] (SPARK-2630) Input data size goes overflow when size is large then 4G in one task

2014-07-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-2630: -- Attachment: overflow.tiff The input size is showed as 5.8MB, but the real input size is 4.3G. > Input

[jira] [Created] (SPARK-2631) In-memory Compression is not configured with SQLConf

2014-07-22 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-2631: --- Summary: In-memory Compression is not configured with SQLConf Key: SPARK-2631 URL: https://issues.apache.org/jira/browse/SPARK-2631 Project: Spark Issu

[jira] [Created] (SPARK-2630) Input data size goes overflow when size is large then 4G in one task

2014-07-22 Thread Davies Liu (JIRA)
Davies Liu created SPARK-2630: - Summary: Input data size goes overflow when size is large then 4G in one task Key: SPARK-2630 URL: https://issues.apache.org/jira/browse/SPARK-2630 Project: Spark

[jira] [Updated] (SPARK-1853) Show Streaming application code context (file, line number) in Spark Stages UI

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1853: - Assignee: Mubarak Seyed (was: Tathagata Das) > Show Streaming application code context (file, li

[jira] [Commented] (SPARK-2420) Change Spark build to minimize library conflicts

2014-07-22 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070966#comment-14070966 ] Marcelo Vanzin commented on SPARK-2420: --- I'm all for sanitizing dependencies, but ju

[jira] [Commented] (SPARK-1642) Upgrade FlumeInputDStream's FlumeReceiver to support FLUME-2083

2014-07-22 Thread Ted Malaska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070938#comment-14070938 ] Ted Malaska commented on SPARK-1642: Are there any changes needed here? > Upgrade Flu

[jira] [Commented] (SPARK-2629) Improve performance of DStream.updateStateByKey using IndexRDD

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070939#comment-14070939 ] Tathagata Das commented on SPARK-2629: -- Index RDD is necessary for this improvement t

[jira] [Created] (SPARK-2629) Improve performance of DStream.updateStateByKey using IndexRDD

2014-07-22 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-2629: Summary: Improve performance of DStream.updateStateByKey using IndexRDD Key: SPARK-2629 URL: https://issues.apache.org/jira/browse/SPARK-2629 Project: Spark

[jira] [Commented] (SPARK-2447) Add common solution for sending upsert actions to HBase (put, deletes, and increment)

2014-07-22 Thread Ted Malaska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070937#comment-14070937 ] Ted Malaska commented on SPARK-2447: Added JavaDoc and rename method changes This is

[jira] [Updated] (SPARK-2548) JavaRecoverableWordCount is missing

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2548: - Target Version/s: 1.1.0, 1.0.2, 0.9.3 (was: 1.1.0, 0.9.3) > JavaRecoverableWordCount is missing

[jira] [Updated] (SPARK-1730) Make receiver store data reliably to avoid data-loss on executor failures

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1730: - Target Version/s: 1.1.0 Fix Version/s: (was: 1.1.0) > Make receiver store data reliabl

[jira] [Updated] (SPARK-2438) Streaming + MLLib

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2438: - Issue Type: New Feature (was: Improvement) > Streaming + MLLib > - > >

[jira] [Updated] (SPARK-1645) Improve Spark Streaming compatibility with Flume

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1645: - Issue Type: New Feature (was: Improvement) > Improve Spark Streaming compatibility with Flume >

[jira] [Updated] (SPARK-1645) Improve Spark Streaming compatibility with Flume

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1645: - Issue Type: Improvement (was: New Feature) > Improve Spark Streaming compatibility with Flume >

[jira] [Updated] (SPARK-2438) Streaming + MLLib

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2438: - Component/s: Streaming > Streaming + MLLib > - > > Key: SPARK-243

[jira] [Updated] (SPARK-2438) Streaming + MLLib

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2438: - Target Version/s: 1.1.0 > Streaming + MLLib > - > > Key: SPARK-24

[jira] [Updated] (SPARK-1730) Make receiver store data reliably to avoid data-loss on executor failures

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1730: - Assignee: Hari Shreedharan > Make receiver store data reliably to avoid data-loss on executor fai

[jira] [Commented] (SPARK-2599) almostEquals mllib.util.TestingUtils does not behave as expected when comparing against 0.0

2014-07-22 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070929#comment-14070929 ] DB Tsai commented on SPARK-2599: I'm the original guy implementing `almostEquals` for my u

[jira] [Updated] (SPARK-2447) Add common solution for sending upsert actions to HBase (put, deletes, and increment)

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2447: - Component/s: Streaming Spark Core > Add common solution for sending upsert actio

[jira] [Updated] (SPARK-1729) Make Flume pull data from source, rather than the current push model

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1729: - Assignee: Hari Shreedharan (was: Tathagata Das) > Make Flume pull data from source, rather than

[jira] [Assigned] (SPARK-2377) Create a Python API for Spark Streaming

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das reassigned SPARK-2377: Assignee: Tathagata Das > Create a Python API for Spark Streaming > ---

[jira] [Updated] (SPARK-2377) Create a Python API for Spark Streaming

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2377: - Target Version/s: 1.1.0 > Create a Python API for Spark Streaming > -

[jira] [Updated] (SPARK-2377) Create a Python API for Spark Streaming

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2377: - Fix Version/s: (was: 1.1.0) > Create a Python API for Spark Streaming > -

[jira] [Updated] (SPARK-2447) Add common solution for sending upsert actions to HBase (put, deletes, and increment)

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2447: - Target Version/s: 1.1.0 > Add common solution for sending upsert actions to HBase (put, deletes,

[jira] [Updated] (SPARK-944) Give example of writing to HBase from Spark Streaming

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-944: Target Version/s: 1.1.0 > Give example of writing to HBase from Spark Streaming > --

[jira] [Commented] (SPARK-2379) stopReceive in dead loop, cause stackoverflow exception

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070915#comment-14070915 ] Tathagata Das commented on SPARK-2379: -- Any information on this? If we have no way to

[jira] [Updated] (SPARK-2464) Twitter Receiver does not stop correctly when streamingContext.stop is called

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2464: - Fix Version/s: (was: 1.0.2) (was: 1.1.0) > Twitter Receiver does not s

[jira] [Updated] (SPARK-2345) ForEachDStream should have an option of running the foreachfunc on Spark

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2345: - Issue Type: Wish (was: Bug) > ForEachDStream should have an option of running the foreachfunc on

[jira] [Updated] (SPARK-1854) Add a version of StreamingContext.fileStream that take hadoop conf object

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1854: - Fix Version/s: (was: 1.1.0) > Add a version of StreamingContext.fileStream that take hadoop c

[jira] [Updated] (SPARK-2464) Twitter Receiver does not stop correctly when streamingContext.stop is called

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2464: - Target Version/s: 1.1.0, 1.0.2 > Twitter Receiver does not stop correctly when streamingContext.s

[jira] [Updated] (SPARK-1853) Show Streaming application code context (file, line number) in Spark Stages UI

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1853: - Target Version/s: 1.1.0 > Show Streaming application code context (file, line number) in Spark St

[jira] [Commented] (SPARK-2614) Add the spark-examples-xxx-.jar to the Debian package created by assembly/pom.xml (e.g. -Pdeb)

2014-07-22 Thread Mark Hamstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070908#comment-14070908 ] Mark Hamstra commented on SPARK-2614: - It's also common for installers/admins to not w

[jira] [Updated] (SPARK-1642) Upgrade FlumeInputDStream's FlumeReceiver to support FLUME-2083

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1642: - Target Version/s: 1.1.0 > Upgrade FlumeInputDStream's FlumeReceiver to support FLUME-2083 > -

[jira] [Updated] (SPARK-1645) Improve Spark Streaming compatibility with Flume

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1645: - Target Version/s: 1.1.0 > Improve Spark Streaming compatibility with Flume >

[jira] [Updated] (SPARK-1645) Improve Spark Streaming compatibility with Flume

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1645: - Issue Type: Improvement (was: Bug) > Improve Spark Streaming compatibility with Flume >

[jira] [Updated] (SPARK-1642) Upgrade FlumeInputDStream's FlumeReceiver to support FLUME-2083

2014-07-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1642: - Fix Version/s: (was: 1.1.0) > Upgrade FlumeInputDStream's FlumeReceiver to support FLUME-2083

[jira] [Updated] (SPARK-2628) Mesos backend throwing unable to find LoginModule

2014-07-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2628: --- Assignee: Tim Chen > Mesos backend throwing unable to find LoginModule > ---

  1   2   >