[jira] [Updated] (SPARK-4359) Empty classifier in avro-mapred is misinterpreted in SBT

2014-11-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4359: - Description: In the parent pom, avro.mapred.classifier is set to hadoop2 for Yarn but not otherwise set.

[jira] [Commented] (SPARK-4341) Spark need to set num-executors automatically

2014-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207793#comment-14207793 ] Sean Owen commented on SPARK-4341: -- The problem is that the number of executors is then

[jira] [Commented] (SPARK-4359) Empty classifier in avro-mapred is misinterpreted in SBT

2014-11-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207794#comment-14207794 ] Andrew Or commented on SPARK-4359: -- Ok, I reverted commit

[jira] [Commented] (SPARK-2426) Quadratic Minimization for MLlib ALS

2014-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207801#comment-14207801 ] Apache Spark commented on SPARK-2426: - User 'debasish83' has created a pull request

[jira] [Resolved] (SPARK-4353) Delete the val that never used in Catalog

2014-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4353. -- Resolution: Not a Problem Delete the val that never used in Catalog

[jira] [Commented] (SPARK-4341) Spark need to set num-executors automatically

2014-11-12 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207816#comment-14207816 ] Hong Shen commented on SPARK-4341: -- After the first action computed, we can set set

[jira] [Comment Edited] (SPARK-4341) Spark need to set num-executors automatically

2014-11-12 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207816#comment-14207816 ] Hong Shen edited comment on SPARK-4341 at 11/12/14 8:40 AM:

[jira] [Commented] (SPARK-3206) Error in PageRank values

2014-11-12 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207822#comment-14207822 ] Ankur Dave commented on SPARK-3206: --- I just tested this with the standalone version of

[jira] [Resolved] (SPARK-3206) Error in PageRank values

2014-11-12 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave resolved SPARK-3206. --- Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Ankur Dave Error in PageRank values

[jira] [Created] (SPARK-4360) task only execute on one node when spark on yarn

2014-11-12 Thread seekerak (JIRA)
seekerak created SPARK-4360: --- Summary: task only execute on one node when spark on yarn Key: SPARK-4360 URL: https://issues.apache.org/jira/browse/SPARK-4360 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-2468) Netty-based block server / client module

2014-11-12 Thread zzc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207826#comment-14207826 ] zzc commented on SPARK-2468: Hi, Aaron Davidson, I am sure that I ran my last test with the

[jira] [Commented] (SPARK-2468) Netty-based block server / client module

2014-11-12 Thread zzc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207839#comment-14207839 ] zzc commented on SPARK-2468: Hi, Aaron Davidson, can you describe your test, including the

[jira] [Commented] (SPARK-4251) Add Restricted Boltzmann machine(RBM) algorithm to MLlib

2014-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207855#comment-14207855 ] Apache Spark commented on SPARK-4251: - User 'witgo' has created a pull request for

[jira] [Created] (SPARK-4361) SparkContext HadoopRDD is not clear about how to use a Hadoop Configuration

2014-11-12 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-4361: --- Summary: SparkContext HadoopRDD is not clear about how to use a Hadoop Configuration Key: SPARK-4361 URL: https://issues.apache.org/jira/browse/SPARK-4361 Project:

[jira] [Resolved] (SPARK-4355) OnlineSummarizer doesn't merge mean correctly

2014-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4355. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3220

[jira] [Reopened] (SPARK-4355) OnlineSummarizer doesn't merge mean correctly

2014-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reopened SPARK-4355: -- Reopen this issue because we haven't fixed branch-1.1 and branch-1.0 yet. OnlineSummarizer

[jira] [Commented] (SPARK-4361) SparkContext HadoopRDD is not clear about how to use a Hadoop Configuration

2014-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207882#comment-14207882 ] Apache Spark commented on SPARK-4361: - User 'zsxwing' has created a pull request for

[jira] [Commented] (SPARK-4341) Spark need to set num-executors automatically

2014-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207885#comment-14207885 ] Sean Owen commented on SPARK-4341: -- So I think some of this is already done by Spark. For

[jira] [Commented] (SPARK-4360) task only execute on one node when spark on yarn

2014-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207894#comment-14207894 ] Sean Owen commented on SPARK-4360: -- I don't think there's enough info here; this maybe

[jira] [Commented] (SPARK-4341) Spark need to set num-executors automatically

2014-11-12 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207899#comment-14207899 ] Hong Shen commented on SPARK-4341: -- My main point is when running spark (especially spark

[jira] [Commented] (SPARK-4038) Outlier Detection Algorithm for MLlib

2014-11-12 Thread Ashutosh Trivedi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207925#comment-14207925 ] Ashutosh Trivedi commented on SPARK-4038: - The questions raised are valid and we

[jira] [Comment Edited] (SPARK-4038) Outlier Detection Algorithm for MLlib

2014-11-12 Thread Ashutosh Trivedi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207925#comment-14207925 ] Ashutosh Trivedi edited comment on SPARK-4038 at 11/12/14 10:53 AM:

[jira] [Created] (SPARK-4362) Make prediction probability available in Naive Baye's Model

2014-11-12 Thread Jatinpreet Singh (JIRA)
Jatinpreet Singh created SPARK-4362: --- Summary: Make prediction probability available in Naive Baye's Model Key: SPARK-4362 URL: https://issues.apache.org/jira/browse/SPARK-4362 Project: Spark

[jira] [Created] (SPARK-4363) The Broadcast example is out of date

2014-11-12 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-4363: --- Summary: The Broadcast example is out of date Key: SPARK-4363 URL: https://issues.apache.org/jira/browse/SPARK-4363 Project: Spark Issue Type: Documentation

[jira] [Commented] (SPARK-4363) The Broadcast example is out of date

2014-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207942#comment-14207942 ] Apache Spark commented on SPARK-4363: - User 'zsxwing' has created a pull request for

[jira] [Created] (SPARK-4364) Some variable types in org.apache.spark.streaming.JavaAPISuite are wrong

2014-11-12 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-4364: --- Summary: Some variable types in org.apache.spark.streaming.JavaAPISuite are wrong Key: SPARK-4364 URL: https://issues.apache.org/jira/browse/SPARK-4364 Project: Spark

[jira] [Commented] (SPARK-4364) Some variable types in org.apache.spark.streaming.JavaAPISuite are wrong

2014-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207981#comment-14207981 ] Sean Owen commented on SPARK-4364: -- This is already covered in SPARK-4297 Some variable

[jira] [Updated] (SPARK-4362) Make prediction probability available in NaiveBayesModel

2014-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4362: - Summary: Make prediction probability available in NaiveBayesModel (was: Make prediction probability

[jira] [Commented] (SPARK-4364) Some variable types in org.apache.spark.streaming.JavaAPISuite are wrong

2014-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207995#comment-14207995 ] Apache Spark commented on SPARK-4364: - User 'zsxwing' has created a pull request for

[jira] [Commented] (SPARK-2867) saveAsHadoopFile() in PairRDDFunction.scala should allow use other OutputCommiter class

2014-11-12 Thread Romi Kuntsman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207997#comment-14207997 ] Romi Kuntsman commented on SPARK-2867: -- In the latest code, it seems to be resolved

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-11-12 Thread Cristian Opris (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208028#comment-14208028 ] Cristian Opris commented on SPARK-3633: --- FWIW I get this as well, with a very

[jira] [Created] (SPARK-4365) Remove unnecessary filter call on records returned from parquet library

2014-11-12 Thread Yash Datta (JIRA)
Yash Datta created SPARK-4365: - Summary: Remove unnecessary filter call on records returned from parquet library Key: SPARK-4365 URL: https://issues.apache.org/jira/browse/SPARK-4365 Project: Spark

[jira] [Commented] (SPARK-4365) Remove unnecessary filter call on records returned from parquet library

2014-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208068#comment-14208068 ] Apache Spark commented on SPARK-4365: - User 'saucam' has created a pull request for

[jira] [Commented] (SPARK-4320) JavaPairRDD should supply a saveAsNewHadoopDataset which takes a Job object

2014-11-12 Thread Corey J. Nolet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208084#comment-14208084 ] Corey J. Nolet commented on SPARK-4320: --- Since this is a simple change, I wanted to

[jira] [Created] (SPARK-4366) Aggregation Optimization

2014-11-12 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-4366: Summary: Aggregation Optimization Key: SPARK-4366 URL: https://issues.apache.org/jira/browse/SPARK-4366 Project: Spark Issue Type: Improvement Components:

[jira] [Created] (SPARK-4367) Process the distinct value before shuffling for aggregation

2014-11-12 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-4367: Summary: Process the distinct value before shuffling for aggregation Key: SPARK-4367 URL: https://issues.apache.org/jira/browse/SPARK-4367 Project: Spark Issue

[jira] [Updated] (SPARK-4233) Simplify the Aggregation Function implementation

2014-11-12 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-4233: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-4366 Simplify the Aggregation Function

[jira] [Updated] (SPARK-4367) Process the distinct value before shuffling for aggregation

2014-11-12 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-4367: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-4366 Process the distinct value before

[jira] [Comment Edited] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-11-12 Thread Cristian Opris (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208028#comment-14208028 ] Cristian Opris edited comment on SPARK-3633 at 11/12/14 3:20 PM:

[jira] [Updated] (SPARK-3056) Sort-based Aggregation

2014-11-12 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-3056: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-4366 Sort-based Aggregation

[jira] [Comment Edited] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-11-12 Thread Cristian Opris (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208028#comment-14208028 ] Cristian Opris edited comment on SPARK-3633 at 11/12/14 3:36 PM:

[jira] [Commented] (SPARK-1014) MultilogisticRegressionWithSGD

2014-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208226#comment-14208226 ] Sean Owen commented on SPARK-1014: -- I'm curious if this is still active -- where was the

[jira] [Resolved] (SPARK-1245) Can't read EMR HBase cluster from properly built Cloudera Spark Cluster.

2014-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1245. -- Resolution: Not a Problem I'm guessing this is now either obsolete, or, a case of matching HBase /

[jira] [Commented] (SPARK-1867) Spark Documentation Error causes java.lang.IllegalStateException: unread block data

2014-11-12 Thread Anson Abraham (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208258#comment-14208258 ] Anson Abraham commented on SPARK-1867: -- I'm running 1.1 (standalone) w/o yarn on CDH

[jira] [Created] (SPARK-4368) Ceph integration?

2014-11-12 Thread Serge Smertin (JIRA)
Serge Smertin created SPARK-4368: Summary: Ceph integration? Key: SPARK-4368 URL: https://issues.apache.org/jira/browse/SPARK-4368 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-11-12 Thread Cristian Opris (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208318#comment-14208318 ] Cristian Opris commented on SPARK-3633: --- This looks like a memory leak in

[jira] [Comment Edited] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-11-12 Thread Cristian Opris (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208318#comment-14208318 ] Cristian Opris edited comment on SPARK-3633 at 11/12/14 5:48 PM:

[jira] [Created] (SPARK-4369) TreeModel.predict does not work with RDD

2014-11-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-4369: - Summary: TreeModel.predict does not work with RDD Key: SPARK-4369 URL: https://issues.apache.org/jira/browse/SPARK-4369 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-1014) MultilogisticRegressionWithSGD

2014-11-12 Thread Kun Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208352#comment-14208352 ] Kun Yang commented on SPARK-1014: - I am not sure if you can find the pr on the repository.

[jira] [Created] (SPARK-4370) Limit cores used by Netty transfer service based on executor size

2014-11-12 Thread Aaron Davidson (JIRA)
Aaron Davidson created SPARK-4370: - Summary: Limit cores used by Netty transfer service based on executor size Key: SPARK-4370 URL: https://issues.apache.org/jira/browse/SPARK-4370 Project: Spark

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-11-12 Thread Cristian Opris (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208378#comment-14208378 ] Cristian Opris commented on SPARK-3633: --- At first sight (haven't tested this) the

[jira] [Commented] (SPARK-4369) TreeModel.predict does not work with RDD

2014-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208377#comment-14208377 ] Apache Spark commented on SPARK-4369: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-4370) Limit cores used by Netty transfer service based on executor size

2014-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208391#comment-14208391 ] Apache Spark commented on SPARK-4370: - User 'aarondav' has created a pull request for

[jira] [Resolved] (SPARK-3530) Pipeline and Parameters

2014-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-3530. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3099

[jira] [Closed] (SPARK-3315) Support hyperparameter tuning

2014-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-3315. Resolution: Fixed Fix Version/s: 1.2.0 CrossValidator and ParamGridBuilder were included in

[jira] [Commented] (SPARK-3717) DecisionTree, RandomForest: Partition by feature

2014-11-12 Thread Manish Amde (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208472#comment-14208472 ] Manish Amde commented on SPARK-3717: [~bbnsumanth] Look forward to your details of

[jira] [Comment Edited] (SPARK-3717) DecisionTree, RandomForest: Partition by feature

2014-11-12 Thread Manish Amde (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208472#comment-14208472 ] Manish Amde edited comment on SPARK-3717 at 11/12/14 7:02 PM: --

[jira] [Commented] (SPARK-4371) Spark crashes with JBoss Logging 3.6.1

2014-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208525#comment-14208525 ] Sean Owen commented on SPARK-4371: -- SLF4J is pretty backwards compatible. The right thing

[jira] [Reopened] (SPARK-3039) Spark assembly for new hadoop API (hadoop 2) contains avro-mapred for hadoop 1 API

2014-11-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or reopened SPARK-3039: -- Spark assembly for new hadoop API (hadoop 2) contains avro-mapred for hadoop 1 API

[jira] [Commented] (SPARK-4267) Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later

2014-11-12 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208595#comment-14208595 ] Kousuke Saruta commented on SPARK-4267: --- Hi [~ozawa], On my YARN-2.5.1(JDK 1.7.0_60)

[jira] [Comment Edited] (SPARK-4267) Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later

2014-11-12 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208595#comment-14208595 ] Kousuke Saruta edited comment on SPARK-4267 at 11/12/14 8:07 PM:

[jira] [Resolved] (SPARK-3660) Initial RDD for updateStateByKey transformation

2014-11-12 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-3660. -- Resolution: Fixed Fix Version/s: 1.3.0 Initial RDD for updateStateByKey transformation

[jira] [Updated] (SPARK-3660) Initial RDD for updateStateByKey transformation

2014-11-12 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-3660: - Priority: Major (was: Minor) Initial RDD for updateStateByKey transformation

[jira] [Commented] (SPARK-4372) Make LR and SVM's default parameters consistent in Scala and Python

2014-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208597#comment-14208597 ] Apache Spark commented on SPARK-4372: - User 'mengxr' has created a pull request for

[jira] [Resolved] (SPARK-3666) Extract interfaces for EdgeRDD and VertexRDD

2014-11-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-3666. Resolution: Fixed Fix Version/s: 1.2.0 Extract interfaces for EdgeRDD and VertexRDD

[jira] [Commented] (SPARK-3630) Identify cause of Kryo+Snappy PARSING_ERROR

2014-11-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208766#comment-14208766 ] Josh Rosen commented on SPARK-3630: --- Hi [~rdub], Thanks for the detailed logs. Do you

[jira] [Commented] (SPARK-2996) Standalone and Yarn have different settings for adding the user classpath first

2014-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208767#comment-14208767 ] Apache Spark commented on SPARK-2996: - User 'vanzin' has created a pull request for

[jira] [Resolved] (SPARK-4369) TreeModel.predict does not work with RDD

2014-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4369. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3230

[jira] [Updated] (SPARK-4369) TreeModel.predict does not work with RDD

2014-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4369: - Assignee: Davies Liu TreeModel.predict does not work with RDD

[jira] [Resolved] (SPARK-3667) Deprecate Graph#unpersistVertices and document how to correctly unpersist graphs

2014-11-12 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave resolved SPARK-3667. --- Resolution: Won't Fix Target Version/s: (was: 1.2.0) Deprecate Graph#unpersistVertices

[jira] [Commented] (SPARK-3630) Identify cause of Kryo+Snappy PARSING_ERROR

2014-11-12 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208857#comment-14208857 ] Ryan Williams commented on SPARK-3630: -- I ran a few more instances of this job,

[jira] [Commented] (SPARK-3630) Identify cause of Kryo+Snappy PARSING_ERROR

2014-11-12 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208863#comment-14208863 ] Ryan Williams commented on SPARK-3630: -- [~joshrosen] I do have access to the logs,

[jira] [Commented] (SPARK-3630) Identify cause of Kryo+Snappy PARSING_ERROR

2014-11-12 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208876#comment-14208876 ] Ryan Williams commented on SPARK-3630: -- [~joshrosen] can you see [this dropbox

[jira] [Created] (SPARK-4373) MLlib unit tests failed maven test

2014-11-12 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4373: Summary: MLlib unit tests failed maven test Key: SPARK-4373 URL: https://issues.apache.org/jira/browse/SPARK-4373 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-3665) Java API for GraphX

2014-11-12 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave updated SPARK-3665: -- Description: The Java API will wrap the Scala API in a similar manner as JavaRDD. Components will

[jira] [Commented] (SPARK-3665) Java API for GraphX

2014-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208916#comment-14208916 ] Apache Spark commented on SPARK-3665: - User 'ankurdave' has created a pull request for

[jira] [Created] (SPARK-4374) LibraryClientSuite has been flaky

2014-11-12 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-4374: -- Summary: LibraryClientSuite has been flaky Key: SPARK-4374 URL: https://issues.apache.org/jira/browse/SPARK-4374 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-2672) Support compression in wholeFile()

2014-11-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-2672. --- Resolution: Fixed Fix Version/s: 1.3.0 1.2.0 Issue resolved by pull request

[jira] [Deleted] (SPARK-4374) LibraryClientSuite has been flaky

2014-11-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin deleted SPARK-4374: --- LibraryClientSuite has been flaky - Key: SPARK-4374

[jira] [Commented] (SPARK-4326) unidoc is broken on master

2014-11-12 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14208998#comment-14208998 ] Marcelo Vanzin commented on SPARK-4326: --- So, this is really weird. Unidoc is run by

[jira] [Created] (SPARK-4375) assembly built with Maven is missing most of repl classes

2014-11-12 Thread Sandy Ryza (JIRA)
Sandy Ryza created SPARK-4375: - Summary: assembly built with Maven is missing most of repl classes Key: SPARK-4375 URL: https://issues.apache.org/jira/browse/SPARK-4375 Project: Spark Issue

[jira] [Updated] (SPARK-4375) assembly built with Maven is missing most of repl classes

2014-11-12 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-4375: -- Description: In particular, the ones in the split scala-2.10/scala-2.11 directories aren't being added

[jira] [Commented] (SPARK-4326) unidoc is broken on master

2014-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209058#comment-14209058 ] Xiangrui Meng commented on SPARK-4326: -- [~vanzin] Thanks for looking into this issue!

[jira] [Updated] (SPARK-4326) unidoc is broken on master

2014-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4326: - Priority: Critical (was: Major) unidoc is broken on master --

[jira] [Closed] (SPARK-4179) Streaming Linear Regression example has type mismatch

2014-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-4179. Resolution: Not a Problem Assignee: Xiangrui Meng (was: Anant Daksh Asthana) I'm closing

[jira] [Created] (SPARK-4376) Put external modules behind build profiles

2014-11-12 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-4376: -- Summary: Put external modules behind build profiles Key: SPARK-4376 URL: https://issues.apache.org/jira/browse/SPARK-4376 Project: Spark Issue Type:

[jira] [Commented] (SPARK-3325) Add a parameter to the method print in class DStream.

2014-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209153#comment-14209153 ] Apache Spark commented on SPARK-3325: - User 'watermen' has created a pull request for

[jira] [Closed] (SPARK-4364) Some variable types in org.apache.spark.streaming.JavaAPISuite are wrong

2014-11-12 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu closed SPARK-4364. --- Resolution: Duplicate Sorry. Didn't notice SPARK-4297 Some variable types in

[jira] [Resolved] (SPARK-4373) MLlib unit tests failed maven test

2014-11-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4373. --- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3235

[jira] [Resolved] (SPARK-4370) Limit cores used by Netty transfer service based on executor size

2014-11-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-4370. Resolution: Fixed Fix Version/s: 1.2.0 Limit cores used by Netty transfer service based on

[jira] [Updated] (SPARK-4375) Assembly built with Maven is missing most of repl classes

2014-11-12 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-4375: -- Summary: Assembly built with Maven is missing most of repl classes (was: assembly built with Maven is

[jira] [Commented] (SPARK-4326) unidoc is broken on master

2014-11-12 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209170#comment-14209170 ] Marcelo Vanzin commented on SPARK-4326: --- Hmm, but core/pom.xml defines an explicit

[jira] [Commented] (SPARK-4267) Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later

2014-11-12 Thread Matthew Daniel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209183#comment-14209183 ] Matthew Daniel commented on SPARK-4267: --- Apologies, I don't know if we want log

[jira] [Commented] (SPARK-750) LocalSparkContext should be included in Spark JAR

2014-11-12 Thread Nathan M (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209167#comment-14209167 ] Nathan M commented on SPARK-750: +1 This shouldnt be hard, in maven its a plugin to add in

[jira] [Assigned] (SPARK-4294) The same function should have the same realization.

2014-11-12 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das reassigned SPARK-4294: Assignee: Tathagata Das The same function should have the same realization.

[jira] [Commented] (SPARK-4267) Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later

2014-11-12 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209239#comment-14209239 ] Kousuke Saruta commented on SPARK-4267: --- Hi [~bugzi...@mdaniel.scdi.com]. The NPE is

[jira] [Commented] (SPARK-2620) case class cannot be used as key for reduce

2014-11-12 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209255#comment-14209255 ] Derrick Burns commented on SPARK-2620: -- I also hit the bug when running Spark 1.1.0

[jira] [Comment Edited] (SPARK-4375) Assembly built with Maven is missing most of repl classes

2014-11-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209305#comment-14209305 ] Patrick Wendell edited comment on SPARK-4375 at 11/13/14 5:37 AM:

[jira] [Commented] (SPARK-4375) Assembly built with Maven is missing most of repl classes

2014-11-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209305#comment-14209305 ] Patrick Wendell commented on SPARK-4375: Hey Sandy, What about the following

[jira] [Commented] (SPARK-4375) Assembly built with Maven is missing most of repl classes

2014-11-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209314#comment-14209314 ] Patrick Wendell commented on SPARK-4375: One thing we could add onto that to make

  1   2   >