[jira] [Commented] (SPARK-1739) Close PR's after period of inactivity

2014-11-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14204486#comment-14204486 ] Sean Owen commented on SPARK-1739: -- Sounds great. Even better if old PRs with no review

[jira] [Commented] (SPARK-4288) Add Sparse Autoencoder algorithm to MLlib

2014-11-10 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14204538#comment-14204538 ] Kai Sasaki commented on SPARK-4288: --- Can I take charge of this ticket? I have a starter

[jira] [Created] (SPARK-4314) Exception throws when finding new files like intermediate result(_COPYING_ file) through hdfs interface

2014-11-10 Thread maji2014 (JIRA)
maji2014 created SPARK-4314: --- Summary: Exception throws when finding new files like intermediate result(_COPYING_ file) through hdfs interface Key: SPARK-4314 URL: https://issues.apache.org/jira/browse/SPARK-4314

[jira] [Created] (SPARK-4315) PySpark pickling of pyspark.sql.Row objects is extremely inefficient

2014-11-10 Thread Adam Davison (JIRA)
Adam Davison created SPARK-4315: --- Summary: PySpark pickling of pyspark.sql.Row objects is extremely inefficient Key: SPARK-4315 URL: https://issues.apache.org/jira/browse/SPARK-4315 Project: Spark

[jira] [Commented] (SPARK-1227) Diagnostics for Classification&Regression

2014-11-10 Thread Martin Jaggi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14204713#comment-14204713 ] Martin Jaggi commented on SPARK-1227: - actually this is still relevant, as looking at

[jira] [Comment Edited] (SPARK-1227) Diagnostics for Classification&Regression

2014-11-10 Thread Martin Jaggi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14204713#comment-14204713 ] Martin Jaggi edited comment on SPARK-1227 at 11/10/14 12:33 PM:

[jira] [Created] (SPARK-4316) Utils.isBindCollision misjudges at Non-English environment

2014-11-10 Thread YanTang Zhai (JIRA)
YanTang Zhai created SPARK-4316: --- Summary: Utils.isBindCollision misjudges at Non-English environment Key: SPARK-4316 URL: https://issues.apache.org/jira/browse/SPARK-4316 Project: Spark Issue

[jira] [Resolved] (SPARK-4316) Utils.isBindCollision misjudges at Non-English environment

2014-11-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4316. -- Resolution: Duplicate Another duplicate of SPARK-4169. It would be good to get

[jira] [Commented] (SPARK-1227) Diagnostics for Classification&Regression

2014-11-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14204762#comment-14204762 ] Sean Owen commented on SPARK-1227: -- OK you're interested in detecting overfitting, for

[jira] [Comment Edited] (SPARK-2691) Allow Spark on Mesos to be launched with Docker

2014-11-10 Thread Chris Heller (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14204764#comment-14204764 ] Chris Heller edited comment on SPARK-2691 at 11/10/14 1:28 PM:

[jira] [Commented] (SPARK-2691) Allow Spark on Mesos to be launched with Docker

2014-11-10 Thread Chris Heller (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14204764#comment-14204764 ] Chris Heller commented on SPARK-2691: - Just an update. I've been working on this patch

[jira] [Created] (SPARK-4317) Error querying Avro files imported by Sqoop: org.apache.spark.sql.catalyst.errors.package$TreeNodeException: Unresolved attributes

2014-11-10 Thread Hendy Irawan (JIRA)
Hendy Irawan created SPARK-4317: --- Summary: Error querying Avro files imported by Sqoop: org.apache.spark.sql.catalyst.errors.package$TreeNodeException: Unresolved attributes Key: SPARK-4317 URL:

[jira] [Created] (SPARK-4318) Fix empty sum distinct.

2014-11-10 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-4318: Summary: Fix empty sum distinct. Key: SPARK-4318 URL: https://issues.apache.org/jira/browse/SPARK-4318 Project: Spark Issue Type: Bug Components:

[jira] [Created] (SPARK-4319) Enable an ignored test null count.

2014-11-10 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-4319: Summary: Enable an ignored test null count. Key: SPARK-4319 URL: https://issues.apache.org/jira/browse/SPARK-4319 Project: Spark Issue Type: Test

[jira] [Commented] (SPARK-4319) Enable an ignored test null count.

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14204905#comment-14204905 ] Apache Spark commented on SPARK-4319: - User 'ueshin' has created a pull request for

[jira] [Commented] (SPARK-2691) Allow Spark on Mesos to be launched with Docker

2014-11-10 Thread Timothy St. Clair (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14204956#comment-14204956 ] Timothy St. Clair commented on SPARK-2691: -- [~ChrisHeller], [~tarnfeld] - I'm

[jira] [Created] (SPARK-4320) JavaPairRDD should supply a saveAsNewHadoopDataset which takes a Job object

2014-11-10 Thread Corey J. Nolet (JIRA)
Corey J. Nolet created SPARK-4320: - Summary: JavaPairRDD should supply a saveAsNewHadoopDataset which takes a Job object Key: SPARK-4320 URL: https://issues.apache.org/jira/browse/SPARK-4320

[jira] [Updated] (SPARK-4320) JavaPairRDD should supply a saveAsNewHadoopDataset which takes a Job object

2014-11-10 Thread Corey J. Nolet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Corey J. Nolet updated SPARK-4320: -- Description: I am outputting data to Accumulo using a custom OutputFormat. I have tried using

[jira] [Closed] (SPARK-1682) Add gradient descent w/o sampling and RDA L1 updater

2014-11-10 Thread Dong Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Wang closed SPARK-1682. Resolution: Later revisit later Add gradient descent w/o sampling and RDA L1 updater

[jira] [Created] (SPARK-4321) Make Kryo serialization work for closures

2014-11-10 Thread Jeff Hammerbacher (JIRA)
Jeff Hammerbacher created SPARK-4321: Summary: Make Kryo serialization work for closures Key: SPARK-4321 URL: https://issues.apache.org/jira/browse/SPARK-4321 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4321) Make Kryo serialization work for closures

2014-11-10 Thread Jeff Hammerbacher (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205067#comment-14205067 ] Jeff Hammerbacher commented on SPARK-4321: -- The [option to serialize closures

[jira] [Commented] (SPARK-1882) Support dynamic memory sharing in Mesos

2014-11-10 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205068#comment-14205068 ] Andrew Ash commented on SPARK-1882: --- Another major issue is to dynamically scale

[jira] [Commented] (SPARK-4206) BlockManager warnings in local mode: Block $blockId already exists on this machine; not re-adding it

2014-11-10 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205081#comment-14205081 ] Imran Rashid commented on SPARK-4206: - actually, it looks like this was kind of fixed

[jira] [Commented] (SPARK-1882) Support dynamic memory sharing in Mesos

2014-11-10 Thread Timothy St. Clair (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205084#comment-14205084 ] Timothy St. Clair commented on SPARK-1882: -- [~tnachen] ^ FYI. Support dynamic

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2014-11-10 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205131#comment-14205131 ] Shivaram Venkataraman commented on SPARK-3821: -- Regarding reducing init time,

[jira] [Updated] (SPARK-4322) Analysis incorrectly rejects accessing grouping fields

2014-11-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4322: Component/s: SQL Analysis incorrectly rejects accessing grouping fields

[jira] [Created] (SPARK-4322) Analysis incorrectly rejects accessing grouping fields

2014-11-10 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-4322: --- Summary: Analysis incorrectly rejects accessing grouping fields Key: SPARK-4322 URL: https://issues.apache.org/jira/browse/SPARK-4322 Project: Spark

[jira] [Updated] (SPARK-4322) Analysis incorrectly rejects accessing grouping fields

2014-11-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4322: Target Version/s: 1.2.0 Analysis incorrectly rejects accessing grouping fields

[jira] [Created] (SPARK-4323) Utils#fetchFile method should close lock file certainly

2014-11-10 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-4323: - Summary: Utils#fetchFile method should close lock file certainly Key: SPARK-4323 URL: https://issues.apache.org/jira/browse/SPARK-4323 Project: Spark

[jira] [Commented] (SPARK-4323) Utils#fetchFile method should close lock file certainly

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205199#comment-14205199 ] Apache Spark commented on SPARK-4323: - User 'sarutak' has created a pull request for

[jira] [Closed] (SPARK-4169) [Core] Locale dependent code

2014-11-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4169. Resolution: Fixed Fix Version/s: 1.1.1 Target Version/s: 1.1.1, 1.2.0 (was: 1.2.0)

[jira] [Resolved] (SPARK-2548) JavaRecoverableWordCount is missing

2014-11-10 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-2548. -- Resolution: Fixed Fix Version/s: 1.0.3 1.2.0 1.1.1

[jira] [Resolved] (SPARK-4312) bash can't `die`

2014-11-10 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4312. Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Kousuke Saruta bash can't

[jira] [Resolved] (SPARK-4230) Doc for spark.default.parallelism is incorrect

2014-11-10 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4230. Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Sandy Ryza Doc for

[jira] [Commented] (SPARK-2652) Turning default configurations for PySpark

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205299#comment-14205299 ] Apache Spark commented on SPARK-2652: - User 'mengxr' has created a pull request for

[jira] [Created] (SPARK-4324) Support numpy/scipy in all Python API of MLlib

2014-11-10 Thread Davies Liu (JIRA)
Davies Liu created SPARK-4324: - Summary: Support numpy/scipy in all Python API of MLlib Key: SPARK-4324 URL: https://issues.apache.org/jira/browse/SPARK-4324 Project: Spark Issue Type:

[jira] [Updated] (SPARK-4324) Support numpy/scipy in all Python API of MLlib

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4324: - Assignee: Davies Liu Support numpy/scipy in all Python API of MLlib

[jira] [Closed] (SPARK-3377) Metrics can be accidentally aggregated against our intention

2014-11-10 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta closed SPARK-3377. - Resolution: Fixed Fix Version/s: 1.2.0 Target Version/s: 1.2.0 Metrics can be

[jira] [Commented] (SPARK-4290) Provide an equivalent functionality of distributed cache as MR does

2014-11-10 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205346#comment-14205346 ] Sandy Ryza commented on SPARK-4290: --- SparkFiles.get needs to be called, but it will only

[jira] [Resolved] (SPARK-1297) Upgrade HBase dependency to 0.98.0

2014-11-10 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-1297. Resolution: Fixed Fix Version/s: 1.2.0 Upgrade HBase dependency to 0.98.0

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2014-11-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205358#comment-14205358 ] Nicholas Chammas commented on SPARK-3821: - Here's the [benchmark of the launch

[jira] [Created] (SPARK-4325) Improve spark-ec2 cluster launch times

2014-11-10 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-4325: --- Summary: Improve spark-ec2 cluster launch times Key: SPARK-4325 URL: https://issues.apache.org/jira/browse/SPARK-4325 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2014-11-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205358#comment-14205358 ] Nicholas Chammas edited comment on SPARK-3821 at 11/10/14 9:34 PM:

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2014-11-10 Thread Dan Osipov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205367#comment-14205367 ] Dan Osipov commented on SPARK-3821: --- [~nchammas] Excellent work, I look forward to

[jira] [Commented] (SPARK-2548) JavaRecoverableWordCount is missing

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205378#comment-14205378 ] Apache Spark commented on SPARK-2548: - User 'tdas' has created a pull request for this

[jira] [Created] (SPARK-4326) unidoc is broken on master

2014-11-10 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4326: Summary: unidoc is broken on master Key: SPARK-4326 URL: https://issues.apache.org/jira/browse/SPARK-4326 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-4327) Python API for RDD.randomSplit()

2014-11-10 Thread Davies Liu (JIRA)
Davies Liu created SPARK-4327: - Summary: Python API for RDD.randomSplit() Key: SPARK-4327 URL: https://issues.apache.org/jira/browse/SPARK-4327 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-2269) Clean up and add unit tests for resourceOffers in MesosSchedulerBackend

2014-11-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2269: - Affects Version/s: 1.1.0 Clean up and add unit tests for resourceOffers in MesosSchedulerBackend

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2014-11-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205401#comment-14205401 ] Nicholas Chammas commented on SPARK-3821: - Thanks for taking a look [~danospv].

[jira] [Commented] (SPARK-4324) Support numpy/scipy in all Python API of MLlib

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205406#comment-14205406 ] Apache Spark commented on SPARK-4324: - User 'davies' has created a pull request for

[jira] [Updated] (SPARK-3638) Commons HTTP client dependency conflict in extras/kinesis-asl module

2014-11-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-3638: --- Description: Followed instructions as mentioned @

[jira] [Updated] (SPARK-4325) Improve spark-ec2 cluster launch times

2014-11-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-4325: Description: There are several optimizations we know we can make to [{{setup.sh}} |

[jira] [Updated] (SPARK-2548) JavaRecoverableWordCount is missing

2014-11-10 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2548: - Fix Version/s: (was: 1.0.3) JavaRecoverableWordCount is missing

[jira] [Commented] (SPARK-3990) kryo.KryoException caused by ALS.trainImplicit in pyspark

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205431#comment-14205431 ] Apache Spark commented on SPARK-3990: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-3495) Block replication fails continuously when the replication target node is dead

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205438#comment-14205438 ] Apache Spark commented on SPARK-3495: - User 'tdas' has created a pull request for this

[jira] [Commented] (SPARK-3496) Block replication can by mistake choose driver BlockManager as a peer for replication

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205439#comment-14205439 ] Apache Spark commented on SPARK-3496: - User 'tdas' has created a pull request for this

[jira] [Updated] (SPARK-4047) Generate runtime warning for naive implementation examples for algorithms implemented in MLlib/graphx

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4047: - Assignee: Varadharajan Generate runtime warning for naive implementation examples for algorithms

[jira] [Resolved] (SPARK-4047) Generate runtime warning for naive implementation examples for algorithms implemented in MLlib/graphx

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4047. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 2894

[jira] [Created] (SPARK-4328) Python serialization updates make Python ML API more brittle to types

2014-11-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-4328: Summary: Python serialization updates make Python ML API more brittle to types Key: SPARK-4328 URL: https://issues.apache.org/jira/browse/SPARK-4328 Project:

[jira] [Commented] (SPARK-4328) Python serialization updates make Python ML API more brittle to types

2014-11-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205458#comment-14205458 ] Joseph K. Bradley commented on SPARK-4328: -- [~atalwalkar] Thanks for pointing

[jira] [Commented] (SPARK-4328) Python serialization updates make Python ML API more brittle to types

2014-11-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205457#comment-14205457 ] Joseph K. Bradley commented on SPARK-4328: -- both related to Python API SerDe

[jira] [Updated] (SPARK-2309) Generalize the binary logistic regression into multinomial logistic regression

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2309: - Target Version/s: 1.3.0 (was: 1.2.0) Generalize the binary logistic regression into multinomial

[jira] [Updated] (SPARK-3181) Add Robust Regression Algorithm with Huber Estimator

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3181: - Target Version/s: 1.3.0 (was: 1.2.0) Add Robust Regression Algorithm with Huber Estimator

[jira] [Updated] (SPARK-3218) K-Means clusterer can fail on degenerate data

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3218: - Target Version/s: 1.3.0 (was: 1.2.0) K-Means clusterer can fail on degenerate data

[jira] [Updated] (SPARK-3181) Add Robust Regression Algorithm with Huber Estimator

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3181: - Assignee: Fan Jiang Add Robust Regression Algorithm with Huber Estimator

[jira] [Updated] (SPARK-2199) Distributed probabilistic latent semantic analysis in MLlib

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2199: - Target Version/s: 1.3.0 (was: 1.2.0) Distributed probabilistic latent semantic analysis in

[jira] [Commented] (SPARK-4314) Exception throws when finding new files like intermediate result(_COPYING_ file) through hdfs interface

2014-11-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205477#comment-14205477 ] Sean Owen commented on SPARK-4314: -- To clarify, the suffix is _COPYING_ right? Yeah

[jira] [Updated] (SPARK-1486) Support multi-model training in MLlib

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1486: - Target Version/s: 1.3.0 (was: 1.2.0) Support multi-model training in MLlib

[jira] [Updated] (SPARK-2199) Distributed probabilistic latent semantic analysis in MLlib

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2199: - Assignee: Valeriy Avanesov Distributed probabilistic latent semantic analysis in MLlib

[jira] [Updated] (SPARK-1473) Feature selection for high dimensional datasets

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1473: - Target Version/s: 1.3.0 (was: 1.2.0) Feature selection for high dimensional datasets

[jira] [Updated] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1405: - Target Version/s: 1.3.0 (was: 1.2.0) parallel Latent Dirichlet Allocation (LDA) atop of spark

[jira] [Updated] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1405: - Priority: Critical (was: Major) parallel Latent Dirichlet Allocation (LDA) atop of spark in

[jira] [Resolved] (SPARK-4328) Python serialization updates make Python ML API more brittle to types

2014-11-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4328. -- Resolution: Duplicate This is covered in the PR for SPARK-4324. Python serialization updates

[jira] [Commented] (SPARK-2703) Make Tachyon related unit tests execute without deploying a Tachyon system locally.

2014-11-10 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205486#comment-14205486 ] Patrick Wendell commented on SPARK-2703: FYI I had to revert this patch because it

[jira] [Updated] (SPARK-3496) Block replication can by mistake choose driver BlockManager as a peer for replication

2014-11-10 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-3496: - Target Version/s: 1.1.1, 1.2.0 (was: 1.2.0) Block replication can by mistake choose driver

[jira] [Updated] (SPARK-3495) Block replication fails continuously when the replication target node is dead

2014-11-10 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-3495: - Target Version/s: 1.1.1, 1.2.0 (was: 1.2.0) Block replication fails continuously when the

[jira] [Commented] (SPARK-3461) Support external groupByKey using repartitionAndSortWithinPartitions

2014-11-10 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205520#comment-14205520 ] Patrick Wendell commented on SPARK-3461: I think [~sandyr] wanted to take a crack

[jira] [Resolved] (SPARK-4319) Enable an ignored test null count.

2014-11-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4319. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3185

[jira] [Commented] (SPARK-4327) Python API for RDD.randomSplit()

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205603#comment-14205603 ] Apache Spark commented on SPARK-4327: - User 'davies' has created a pull request for

[jira] [Created] (SPARK-4329) Add indexing feature for HistoryPage

2014-11-10 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-4329: - Summary: Add indexing feature for HistoryPage Key: SPARK-4329 URL: https://issues.apache.org/jira/browse/SPARK-4329 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4329) Add indexing feature for HistoryPage

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205625#comment-14205625 ] Apache Spark commented on SPARK-4329: - User 'sarutak' has created a pull request for

[jira] [Commented] (SPARK-3398) Have spark-ec2 intelligently wait for specific cluster states

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205626#comment-14205626 ] Apache Spark commented on SPARK-3398: - User 'nchammas' has created a pull request for

[jira] [Commented] (SPARK-4325) Improve spark-ec2 cluster launch times

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205627#comment-14205627 ] Apache Spark commented on SPARK-4325: - User 'nchammas' has created a pull request for

[jira] [Commented] (SPARK-3717) DecisionTree, RandomForest: Partition by feature

2014-11-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205637#comment-14205637 ] Joseph K. Bradley commented on SPARK-3717: -- [~codedeft] Are you asking me or

[jira] [Created] (SPARK-4330) Link to proper URL for YARN overview

2014-11-10 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-4330: - Summary: Link to proper URL for YARN overview Key: SPARK-4330 URL: https://issues.apache.org/jira/browse/SPARK-4330 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4330) Link to proper URL for YARN overview

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205680#comment-14205680 ] Apache Spark commented on SPARK-4330: - User 'sarutak' has created a pull request for

[jira] [Resolved] (SPARK-4202) DSL support for Scala UDF

2014-11-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4202. - Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Cheng Lian DSL support

[jira] [Resolved] (SPARK-4308) SQL operation state is not properly set when exception is thrown

2014-11-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4308. - Resolution: Fixed Fix Version/s: 1.1.1 1.2.0 Issue resolved by

[jira] [Commented] (SPARK-3461) Support external groupByKey using repartitionAndSortWithinPartitions

2014-11-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205731#comment-14205731 ] Apache Spark commented on SPARK-3461: - User 'sryza' has created a pull request for

[jira] [Updated] (SPARK-2205) Unnecessary exchange operators in a join on multiple tables with the same join key.

2014-11-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2205: Target Version/s: 1.3.0 (was: 1.2.0) Unnecessary exchange operators in a join on multiple

[jira] [Resolved] (SPARK-4250) Create constant null value for Hive Inspectors

2014-11-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4250. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3114

[jira] [Resolved] (SPARK-3954) Optimization to FileInputDStream

2014-11-10 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-3954. -- Resolution: Fixed Fix Version/s: 1.2.0 Optimization to FileInputDStream

[jira] [Commented] (SPARK-4314) Exception throws when finding new files like intermediate result(_COPYING_ file) through hdfs interface

2014-11-10 Thread maji2014 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205774#comment-14205774 ] maji2014 commented on SPARK-4314: - The actual behavior is the intermediate file _COPYING_

[jira] [Comment Edited] (SPARK-4314) Exception throws when the upload intermediate file(_COPYING_ file) is read through hdfs interface

2014-11-10 Thread maji2014 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205774#comment-14205774 ] maji2014 edited comment on SPARK-4314 at 11/11/14 1:48 AM: --- The

[jira] [Updated] (SPARK-4274) NEP in printing the details of query plan

2014-11-10 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-4274: - Summary: NEP in printing the details of query plan (was: Hive comparison test framework doesn't print

[jira] [Updated] (SPARK-4274) NEP in printing the details of query plan

2014-11-10 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-4274: - Description: NEP in printing the details of query plan, if the query is not valid. this will great

[jira] [Updated] (SPARK-4274) NPE in printing the details of query plan

2014-11-10 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-4274: - Summary: NPE in printing the details of query plan (was: NEP in printing the details of query plan)

[jira] [Updated] (SPARK-4274) NPE in printing the details of query plan

2014-11-10 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-4274: - Description: NPE in printing the details of query plan, if the query is not valid. This will be great

[jira] [Created] (SPARK-4331) Scalastyle doesn't work for the sources under hive's v0.12.0 and v0.13.1

2014-11-10 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-4331: - Summary: Scalastyle doesn't work for the sources under hive's v0.12.0 and v0.13.1 Key: SPARK-4331 URL: https://issues.apache.org/jira/browse/SPARK-4331 Project:

[jira] [Commented] (SPARK-4331) Scalastyle doesn't work for the sources under hive's v0.12.0 and v0.13.1

2014-11-10 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205795#comment-14205795 ] Patrick Wendell commented on SPARK-4331: This is going to be exacerbated by the
