[jira] [Resolved] (SPARK-5052) com.google.common.base.Optional binary has a wrong method signatures

2015-01-26 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5052. Resolution: Fixed Fix Version/s: 1.3.0 > com.google.common.base.Optional binary has a

[jira] [Updated] (SPARK-5052) com.google.common.base.Optional binary has a wrong method signatures

2015-01-26 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5052: --- Assignee: Elmer Garduno > com.google.common.base.Optional binary has a wrong method signatures

[jira] [Commented] (SPARK-5261) In some cases ,The value of word's vector representation is too big

2015-01-26 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292823#comment-14292823 ] Guoqiang Li commented on SPARK-5261: [~lewuathe] {code} normalize_text() { awk '{pri

[jira] [Commented] (SPARK-5341) Support maven coordinates in spark-shell and spark-submit

2015-01-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292826#comment-14292826 ] Apache Spark commented on SPARK-5341: - User 'brkyvz' has created a pull request for th

[jira] [Resolved] (SPARK-5119) java.lang.ArrayIndexOutOfBoundsException on trying to train decision tree model

2015-01-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5119. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3975 [https://githu

[jira] [Created] (SPARK-5418) Output directory for shuffle should consider left space of each directory set in conf

2015-01-26 Thread ding (JIRA)
ding created SPARK-5418: --- Summary: Output directory for shuffle should consider left space of each directory set in conf Key: SPARK-5418 URL: https://issues.apache.org/jira/browse/SPARK-5418 Project: Spark

[jira] [Updated] (SPARK-5119) java.lang.ArrayIndexOutOfBoundsException on trying to train decision tree model

2015-01-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5119: - Target Version/s: 1.3.0 > java.lang.ArrayIndexOutOfBoundsException on trying to train decision tre

[jira] [Updated] (SPARK-5119) java.lang.ArrayIndexOutOfBoundsException on trying to train decision tree model

2015-01-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5119: - Assignee: Kai Sasaki > java.lang.ArrayIndexOutOfBoundsException on trying to train decision tree

[jira] [Commented] (SPARK-4846) When the vocabulary size is large, Word2Vec may yield "OutOfMemoryError: Requested array size exceeds VM limit"

2015-01-26 Thread Joseph Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292853#comment-14292853 ] Joseph Tang commented on SPARK-4846: Sorry about the procrastination. I'm still workin

[jira] [Commented] (SPARK-4846) When the vocabulary size is large, Word2Vec may yield "OutOfMemoryError: Requested array size exceeds VM limit"

2015-01-26 Thread Joseph Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292855#comment-14292855 ] Joseph Tang commented on SPARK-4846: Sorry about the procrastination. I'm still workin

[jira] [Created] (SPARK-5419) Fix the logic in Vectors.sqdist

2015-01-26 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5419: Summary: Fix the logic in Vectors.sqdist Key: SPARK-5419 URL: https://issues.apache.org/jira/browse/SPARK-5419 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5388) Provide a stable application submission gateway

2015-01-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292866#comment-14292866 ] Apache Spark commented on SPARK-5388: - User 'andrewor14' has created a pull request fo

[jira] [Commented] (SPARK-5388) Provide a stable application submission gateway

2015-01-26 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292874#comment-14292874 ] Andrew Or commented on SPARK-5388: -- Hi Dale, thank you for your comments. Yes, in the des

[jira] [Commented] (SPARK-4846) When the vocabulary size is large, Word2Vec may yield "OutOfMemoryError: Requested array size exceeds VM limit"

2015-01-26 Thread Joseph Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292886#comment-14292886 ] Joseph Tang commented on SPARK-4846: Hi Xiangrui, here is a problem. PR #3693 that ad

[jira] [Updated] (SPARK-5420) Cross-langauge load/store functions for creating and saving DataFrames

2015-01-26 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5420: --- Summary: Cross-langauge load/store functions for creating and saving DataFrames (was: Create

[jira] [Created] (SPARK-5420) Create cross-langauge load/store functions for creating and saving DataFrames

2015-01-26 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-5420: -- Summary: Create cross-langauge load/store functions for creating and saving DataFrames Key: SPARK-5420 URL: https://issues.apache.org/jira/browse/SPARK-5420 Proje

[jira] [Issue Comment Deleted] (SPARK-4846) When the vocabulary size is large, Word2Vec may yield "OutOfMemoryError: Requested array size exceeds VM limit"

2015-01-26 Thread Joseph Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph Tang updated SPARK-4846: --- Comment: was deleted (was: Sorry about the procrastination. I'm still working on this. Regarding your

[jira] [Comment Edited] (SPARK-4846) When the vocabulary size is large, Word2Vec may yield "OutOfMemoryError: Requested array size exceeds VM limit"

2015-01-26 Thread Joseph Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292853#comment-14292853 ] Joseph Tang edited comment on SPARK-4846 at 1/27/15 2:44 AM: -

[jira] [Created] (SPARK-5421) SparkSql throw OOM at shuffle

2015-01-26 Thread Hong Shen (JIRA)
Hong Shen created SPARK-5421: Summary: SparkSql throw OOM at shuffle Key: SPARK-5421 URL: https://issues.apache.org/jira/browse/SPARK-5421 Project: Spark Issue Type: Bug Components: SQL

[jira] [Comment Edited] (SPARK-4846) When the vocabulary size is large, Word2Vec may yield "OutOfMemoryError: Requested array size exceeds VM limit"

2015-01-26 Thread Joseph Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292853#comment-14292853 ] Joseph Tang edited comment on SPARK-4846 at 1/27/15 2:46 AM: -

[jira] [Updated] (SPARK-5421) SparkSql throw OOM at shuffle

2015-01-26 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-5421: - Description: ExternalAppendOnlyMap if only for the spark job that aggregator isDefined, but sparkSQL's s

[jira] [Updated] (SPARK-5421) SparkSql throw OOM at shuffle

2015-01-26 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-5421: - Description: ExternalAppendOnlyMap if only for the spark job that aggregator isDefined, but sparkSQL's s

[jira] [Commented] (SPARK-5206) Accumulators are not re-registered during recovering from checkpoint

2015-01-26 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292896#comment-14292896 ] Saisai Shao commented on SPARK-5206: IMHO I think this is a general problem in Spark S

[jira] [Commented] (SPARK-5419) Fix the logic in Vectors.sqdist

2015-01-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292898#comment-14292898 ] Apache Spark commented on SPARK-5419: - User 'viirya' has created a pull request for th

[jira] [Commented] (SPARK-5395) Large number of Python workers causing resource depletion

2015-01-26 Thread Sven Krasser (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292913#comment-14292913 ] Sven Krasser commented on SPARK-5395: - Some additional findings from my side: I've man

[jira] [Created] (SPARK-5422) Support sending to Graphite via UDP

2015-01-26 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-5422: Summary: Support sending to Graphite via UDP Key: SPARK-5422 URL: https://issues.apache.org/jira/browse/SPARK-5422 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5422) Support sending to Graphite via UDP

2015-01-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292925#comment-14292925 ] Apache Spark commented on SPARK-5422: - User 'ryan-williams' has created a pull request

[jira] [Commented] (SPARK-4846) When the vocabulary size is large, Word2Vec may yield "OutOfMemoryError: Requested array size exceeds VM limit"

2015-01-26 Thread Joseph Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292926#comment-14292926 ] Joseph Tang commented on SPARK-4846: I've added some code at https://github.com/jinnt

[jira] [Updated] (SPARK-4979) Add streaming logistic regression

2015-01-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4979: - Assignee: Jeremy Freeman > Add streaming logistic regression > - >

[jira] [Comment Edited] (SPARK-4846) When the vocabulary size is large, Word2Vec may yield "OutOfMemoryError: Requested array size exceeds VM limit"

2015-01-26 Thread Joseph Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292926#comment-14292926 ] Joseph Tang edited comment on SPARK-4846 at 1/27/15 3:42 AM: -

[jira] [Resolved] (SPARK-3726) RandomForest: Support for bootstrap options

2015-01-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-3726. -- Resolution: Fixed Fix Version/s: (was: 1.2.0) 1.3.0 Issue resolved

[jira] [Updated] (SPARK-3726) RandomForest: Support for bootstrap options

2015-01-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3726: - Target Version/s: 1.3.0 > RandomForest: Support for bootstrap options > --

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-26 Thread Luca Morandini (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14292998#comment-14292998 ] Luca Morandini commented on SPARK-1405: --- Indeed, I have a couple students whose assi

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-26 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14293001#comment-14293001 ] Joseph K. Bradley commented on SPARK-1405: -- It has not yet been merged into Spark

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-01-26 Thread Aniket Bhatnagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14293019#comment-14293019 ] Aniket Bhatnagar commented on SPARK-2243: - I am also interested in having this fix

[jira] [Commented] (SPARK-5267) Add a streaming module to ingest Apache Camel Messages from a configured endpoints

2015-01-26 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14293043#comment-14293043 ] Tathagata Das commented on SPARK-5267: -- Hey this is a great initiative! However, we a

[jira] [Commented] (SPARK-4964) Exactly-once semantics for Kafka

2015-01-26 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14293129#comment-14293129 ] Tathagata Das commented on SPARK-4964: -- I am renaming this JIRA to "Native Kafka Supp

[jira] [Updated] (SPARK-4964) Exactly-once + WAL-free Kafka Support in Spark Streaming

2015-01-26 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-4964: - Summary: Exactly-once + WAL-free Kafka Support in Spark Streaming (was: Exactly-once semantics fo

[jira] [Comment Edited] (SPARK-4964) Exactly-once + WAL-free Kafka Support in Spark Streaming

2015-01-26 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14293129#comment-14293129 ] Tathagata Das edited comment on SPARK-4964 at 1/27/15 7:47 AM: -

[jira] [Updated] (SPARK-4964) Exactly-once + WAL-free Kafka Support in Spark Streaming

2015-01-26 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-4964: - Description: for background, see http://apache-spark-developers-list.1001551.n3.nabble.com/Which-

[jira] [Updated] (SPARK-4964) Exactly-once + WAL-free Kafka Support in Spark Streaming

2015-01-26 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-4964: - Description: There are two issues with the current Kafka support - Use of Write Ahead Logs in Sp

[jira] [Comment Edited] (SPARK-4964) Exactly-once + WAL-free Kafka Support in Spark Streaming

2015-01-26 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14293129#comment-14293129 ] Tathagata Das edited comment on SPARK-4964 at 1/27/15 7:53 AM: -

[jira] [Commented] (SPARK-4964) Exactly-once + WAL-free Kafka Support in Spark Streaming

2015-01-26 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14293140#comment-14293140 ] Tathagata Das commented on SPARK-4964: -- [~dibbhatt][~jerryshao][~hshreedharan][~c...@
