[jira] [Updated] (SPARK-3410) The priority of shutdownhook for ApplicationMaster should not be integer literal

2014-09-05 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-3410: -- Summary: The priority of shutdownhook for ApplicationMaster should not be integer literal

[jira] [Created] (SPARK-3412) Add Missing Types for Row API

2014-09-05 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-3412: Summary: Add Missing Types for Row API Key: SPARK-3412 URL: https://issues.apache.org/jira/browse/SPARK-3412 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-2491) When an OOM is thrown,the executor does not stop properly.

2014-09-05 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122579#comment-14122579 ] Guoqiang Li commented on SPARK-2491: Executor running multiple tasks at the same

[jira] [Created] (SPARK-3413) Spark Blocked due to Executor lost in FIFO MODE

2014-09-05 Thread Patrick Liu (JIRA)
Patrick Liu created SPARK-3413: -- Summary: Spark Blocked due to Executor lost in FIFO MODE Key: SPARK-3413 URL: https://issues.apache.org/jira/browse/SPARK-3413 Project: Spark Issue Type: Bug

[jira] [Comment Edited] (SPARK-3369) Java mapPartitions Iterator-Iterable is inconsistent with Scala's Iterator-Iterator

2014-09-05 Thread Lukas Nalezenec (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122611#comment-14122611 ] Lukas Nalezenec edited comment on SPARK-3369 at 9/5/14 8:44 AM:

[jira] [Commented] (SPARK-3369) Java mapPartitions Iterator-Iterable is inconsistent with Scala's Iterator-Iterator

2014-09-05 Thread Lukas Nalezenec (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122611#comment-14122611 ] Lukas Nalezenec commented on SPARK-3369: Hi, it looks like a serious issue to me.

[jira] [Commented] (SPARK-3403) NaiveBayes crashes with blas/lapack native libraries for breeze (netlib-java)

2014-09-05 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122699#comment-14122699 ] Alexander Ulanov commented on SPARK-3403: - I managed to compile OpenBLAS with

[jira] [Comment Edited] (SPARK-2491) When an OOM is thrown,the executor does not stop properly.

2014-09-05 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122579#comment-14122579 ] Guoqiang Li edited comment on SPARK-2491 at 9/5/14 9:00 AM:

[jira] [Created] (SPARK-3414) Case insensitivity breaks when unresolved relation contains attributes with upper case letter in their names

2014-09-05 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-3414: - Summary: Case insensitivity breaks when unresolved relation contains attributes with upper case letter in their names Key: SPARK-3414 URL:

[jira] [Updated] (SPARK-3414) Case insensitivity breaks when unresolved relation contains attributes with upper case letter in their names

2014-09-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-3414: -- Description: Paste the following snippet to {{spark-shell}} (need Hive support) to reproduce this

[jira] [Comment Edited] (SPARK-3369) Java mapPartitions Iterator-Iterable is inconsistent with Scala's Iterator-Iterator

2014-09-05 Thread Lukas Nalezenec (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122611#comment-14122611 ] Lukas Nalezenec edited comment on SPARK-3369 at 9/5/14 9:17 AM:

[jira] [Updated] (SPARK-3414) Case insensitivity breaks when unresolved relation contains attributes with uppercase letters in their names

2014-09-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-3414: -- Summary: Case insensitivity breaks when unresolved relation contains attributes with uppercase letters

[jira] [Updated] (SPARK-3414) Case insensitivity breaks when unresolved relation contains attributes with upper case letter in their names

2014-09-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-3414: -- Description: Paste the following snippet to {{spark-shell}} (need Hive support) to reproduce this

[jira] [Commented] (SPARK-3412) Add Missing Types for Row API

2014-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122729#comment-14122729 ] Apache Spark commented on SPARK-3412: - User 'chenghao-intel' has created a pull

[jira] [Comment Edited] (SPARK-3403) NaiveBayes crashes with blas/lapack native libraries for breeze (netlib-java)

2014-09-05 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122699#comment-14122699 ] Alexander Ulanov edited comment on SPARK-3403 at 9/5/14 9:53 AM:

[jira] [Comment Edited] (SPARK-3369) Java mapPartitions Iterator-Iterable is inconsistent with Scala's Iterator-Iterator

2014-09-05 Thread Lukas Nalezenec (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122611#comment-14122611 ] Lukas Nalezenec edited comment on SPARK-3369 at 9/5/14 11:59 AM:

[jira] [Created] (SPARK-3415) Using sys.stderr in pyspark results in error

2014-09-05 Thread Ward Viaene (JIRA)
Ward Viaene created SPARK-3415: -- Summary: Using sys.stderr in pyspark results in error Key: SPARK-3415 URL: https://issues.apache.org/jira/browse/SPARK-3415 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-3369) Java mapPartitions Iterator-Iterable is inconsistent with Scala's Iterator-Iterator

2014-09-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122921#comment-14122921 ] Sean Owen commented on SPARK-3369: -- The API change is unlikely to happen. Making a bunch

[jira] [Updated] (SPARK-3415) Using sys.stderr in pyspark results in error

2014-09-05 Thread Ward Viaene (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ward Viaene updated SPARK-3415: --- Component/s: PySpark Using sys.stderr in pyspark results in error

[jira] [Updated] (SPARK-3415) Using sys.stderr in pyspark results in error

2014-09-05 Thread Ward Viaene (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ward Viaene updated SPARK-3415: --- Labels: python (was: ) Using sys.stderr in pyspark results in error

[jira] [Updated] (SPARK-3415) Using sys.stderr in pyspark results in error

2014-09-05 Thread Ward Viaene (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ward Viaene updated SPARK-3415: --- Affects Version/s: 1.1.0 1.0.2 Using sys.stderr in pyspark results in error

[jira] [Commented] (SPARK-2430) Standarized Clustering Algorithm API and Framework

2014-09-05 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123005#comment-14123005 ] Yu Ishikawa commented on SPARK-2430: Hi [~rnowling], The community had suggested

[jira] [Updated] (SPARK-3377) Don't mix metrics from different applications otherwise we cannot distinguish

2014-09-05 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-3377: -- Summary: Don't mix metrics from different applications otherwise we cannot distinguish (was:

[jira] [Resolved] (SPARK-3260) Yarn - pass acls along with executor launch

2014-09-05 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-3260. -- Resolution: Fixed Fix Version/s: 1.2.0 Target Version/s: 1.2.0 (was: 1.1.1,

[jira] [Resolved] (SPARK-3375) spark on yarn container allocation issues

2014-09-05 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-3375. -- Resolution: Fixed Fix Version/s: 1.2.0 spark on yarn container allocation issues

[jira] [Commented] (SPARK-2966) Add an approximation algorithm for hierarchical clustering to MLlib

2014-09-05 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123029#comment-14123029 ] Yu Ishikawa commented on SPARK-2966: Hi [~rnowling], {quote} Based on my reading of

[jira] [Created] (SPARK-3416) Add matrix operations for large data set

2014-09-05 Thread Yu Ishikawa (JIRA)
Yu Ishikawa created SPARK-3416: -- Summary: Add matrix operations for large data set Key: SPARK-3416 URL: https://issues.apache.org/jira/browse/SPARK-3416 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-3417) Use of old-style classes in pyspark

2014-09-05 Thread Matthew Rocklin (JIRA)
Matthew Rocklin created SPARK-3417: -- Summary: Use of old-style classes in pyspark Key: SPARK-3417 URL: https://issues.apache.org/jira/browse/SPARK-3417 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-3417) Use of old-style classes in pyspark

2014-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123059#comment-14123059 ] Apache Spark commented on SPARK-3417: - User 'mrocklin' has created a pull request for

[jira] [Commented] (SPARK-2966) Add an approximation algorithm for hierarchical clustering to MLlib

2014-09-05 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123077#comment-14123077 ] RJ Nowling commented on SPARK-2966: --- Wonderful! If I can help or when you're ready for

[jira] [Updated] (SPARK-3369) Java mapPartitions Iterator-Iterable is inconsistent with Scala's Iterator-Iterator

2014-09-05 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3369: --- Priority: Critical (was: Major) Target Version/s: 1.2.0 Java mapPartitions

[jira] [Updated] (SPARK-3174) Under YARN, add and remove executors based on load

2014-09-05 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3174: --- Assignee: Andrew Or Under YARN, add and remove executors based on load

[jira] [Commented] (SPARK-3174) Under YARN, add and remove executors based on load

2014-09-05 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123137#comment-14123137 ] Patrick Wendell commented on SPARK-3174: We should come up with a design doc for

[jira] [Commented] (SPARK-3399) Test for PySpark should ignore HADOOP_CONF_DIR and YARN_CONF_DIR

2014-09-05 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123143#comment-14123143 ] Davies Liu commented on SPARK-3399: --- Could you give an example to show the problem?

[jira] [Commented] (SPARK-3399) Test for PySpark should ignore HADOOP_CONF_DIR and YARN_CONF_DIR

2014-09-05 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123167#comment-14123167 ] Kousuke Saruta commented on SPARK-3399: --- Some test for pyspark, for instance rdd.py,

[jira] [Created] (SPARK-3418) Additional BLAS and Local Sparse Matrix support

2014-09-05 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-3418: -- Summary: Additional BLAS and Local Sparse Matrix support Key: SPARK-3418 URL: https://issues.apache.org/jira/browse/SPARK-3418 Project: Spark Issue Type: New

[jira] [Updated] (SPARK-3418) [MLlib] Additional BLAS and Local Sparse Matrix support

2014-09-05 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-3418: --- Summary: [MLlib] Additional BLAS and Local Sparse Matrix support (was: Additional BLAS and Local

[jira] [Commented] (SPARK-3399) Test for PySpark should ignore HADOOP_CONF_DIR and YARN_CONF_DIR

2014-09-05 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123195#comment-14123195 ] Davies Liu commented on SPARK-3399: --- Thanks for the explanation. I still cannot reproduce

[jira] [Updated] (SPARK-3414) Case insensitivity breaks when unresolved relation contains attributes with uppercase letters in their names

2014-09-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-3414: -- Description: Paste the following snippet to {{spark-shell}} (need Hive support) to reproduce this

[jira] [Commented] (SPARK-3399) Test for PySpark should ignore HADOOP_CONF_DIR and YARN_CONF_DIR

2014-09-05 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123200#comment-14123200 ] Kousuke Saruta commented on SPARK-3399: --- Ah, did you set fs.defaultFs to like

[jira] [Commented] (SPARK-3369) Java mapPartitions Iterator-Iterable is inconsistent with Scala's Iterator-Iterator

2014-09-05 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123203#comment-14123203 ] Nicholas Chammas commented on SPARK-3369: - {quote} The API change is unlikely to

[jira] [Commented] (SPARK-1823) ExternalAppendOnlyMap can still OOM if one key is very large

2014-09-05 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123213#comment-14123213 ] Andrew Ash commented on SPARK-1823: --- // This was not fixed in Spark 1.1 and should be

[jira] [Commented] (SPARK-3399) Test for PySpark should ignore HADOOP_CONF_DIR and YARN_CONF_DIR

2014-09-05 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123214#comment-14123214 ] Davies Liu commented on SPARK-3399: --- Given fs.defaultFs as hdfs://, saveAsTextFile()

[jira] [Updated] (SPARK-1823) ExternalAppendOnlyMap can still OOM if one key is very large

2014-09-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-1823: - Target Version/s: 1.2.0 Affects Version/s: (was: 1.0.0) 1.1.0

[jira] [Commented] (SPARK-1823) ExternalAppendOnlyMap can still OOM if one key is very large

2014-09-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123235#comment-14123235 ] Andrew Or commented on SPARK-1823: -- Thanks Andrew, I have updated the versions.

[jira] [Commented] (SPARK-3174) Under YARN, add and remove executors based on load

2014-09-05 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123238#comment-14123238 ] Sandy Ryza commented on SPARK-3174: --- I've been putting a little bit of thought into this

[jira] [Commented] (SPARK-3399) Test for PySpark should ignore HADOOP_CONF_DIR and YARN_CONF_DIR

2014-09-05 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123237#comment-14123237 ] Kousuke Saruta commented on SPARK-3399: --- Yes I meant like what you mentioned. Test

[jira] [Commented] (SPARK-2892) Socket Receiver does not stop when streaming context is stopped

2014-09-05 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123253#comment-14123253 ] Tathagata Das commented on SPARK-2892: -- I intended it to be ERROR to catch such

[jira] [Resolved] (SPARK-3399) Test for PySpark should ignore HADOOP_CONF_DIR and YARN_CONF_DIR

2014-09-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-3399. --- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 2270

[jira] [Commented] (SPARK-2491) When an OOM is thrown,the executor does not stop properly.

2014-09-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123326#comment-14123326 ] Josh Rosen commented on SPARK-2491: --- Ah, I see. It looks like we don't want to display

[jira] [Created] (SPARK-3419) Scheduler shouldn't delay running a task when executors don't reside at any of its preferred locations

2014-09-05 Thread Sandy Ryza (JIRA)
Sandy Ryza created SPARK-3419: - Summary: Scheduler shouldn't delay running a task when executors don't reside at any of its preferred locations Key: SPARK-3419 URL: https://issues.apache.org/jira/browse/SPARK-3419

[jira] [Created] (SPARK-3420) Using Sphinx to generate API docs for PySpark

2014-09-05 Thread Davies Liu (JIRA)
Davies Liu created SPARK-3420: - Summary: Using Sphinx to generate API docs for PySpark Key: SPARK-3420 URL: https://issues.apache.org/jira/browse/SPARK-3420 Project: Spark Issue Type:

[jira] [Updated] (SPARK-3160) Simplify DecisionTree data structure for training

2014-09-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3160: - Description: Improvement: code clarity Currently, we maintain a tree structure, a flat

[jira] [Commented] (SPARK-3280) Made sort-based shuffle the default implementation

2014-09-05 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123390#comment-14123390 ] Andrew Ash commented on SPARK-3280: --- [~joshrosen] do you have a theory for the cause of

[jira] [Created] (SPARK-3421) StructField.toString should quote the name field to allow arbitrary character as struct field name

2014-09-05 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-3421: - Summary: StructField.toString should quote the name field to allow arbitrary character as struct field name Key: SPARK-3421 URL: https://issues.apache.org/jira/browse/SPARK-3421

[jira] [Updated] (SPARK-2714) DAGScheduler should log jobid when runJob finishes

2014-09-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-2714: -- Summary: DAGScheduler should log jobid when runJob finishes (was: DAGScheduler logs jobid when runJob

[jira] [Updated] (SPARK-2714) DAGScheduler should log jobid when runJob finishes

2014-09-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-2714: -- Description: When DAGScheduler concurrently runs multiple jobs, SparkContext only logs Job finished and

[jira] [Updated] (SPARK-2491) When an OOM is thrown,the executor does not stop properly.

2014-09-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2491: - Affects Version/s: 1.1.0 When an OOM is thrown,the executor does not stop properly.

[jira] [Commented] (SPARK-2491) When an OOM is thrown,the executor does not stop properly.

2014-09-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123447#comment-14123447 ] Andrew Or commented on SPARK-2491: -- [~witgo] This doesn't seem to be specific to YARN.

[jira] [Updated] (SPARK-3174) Under YARN, add and remove executors based on load

2014-09-05 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-3174: -- Attachment: SPARK-3174design.pdf Under YARN, add and remove executors based on load

[jira] [Commented] (SPARK-3174) Under YARN, add and remove executors based on load

2014-09-05 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123504#comment-14123504 ] Sandy Ryza commented on SPARK-3174: --- Posted a high-level design doc. Under YARN, add

[jira] [Commented] (SPARK-3414) Case insensitivity breaks when unresolved relation contains attributes with uppercase letters in their names

2014-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123538#comment-14123538 ] Apache Spark commented on SPARK-3414: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-3421) StructField.toString should quote the name field to allow arbitrary character as struct field name

2014-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123537#comment-14123537 ] Apache Spark commented on SPARK-3421: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-2537) Workaround Timezone specific Hive tests

2014-09-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123556#comment-14123556 ] Cheng Lian commented on SPARK-2537: --- PR

[jira] [Resolved] (SPARK-2537) Workaround Timezone specific Hive tests

2014-09-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-2537. --- Resolution: Fixed Fix Version/s: 1.1.0 Target Version/s: 1.1.0 Workaround Timezone

[jira] [Commented] (SPARK-2099) Report TaskMetrics for running tasks

2014-09-05 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123584#comment-14123584 ] Andrew Ash commented on SPARK-2099: --- I just gave this a runthrough and most of the

[jira] [Commented] (SPARK-2099) Report TaskMetrics for running tasks

2014-09-05 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123587#comment-14123587 ] Sandy Ryza commented on SPARK-2099: --- Yeah, unfortunately I haven't had the chance to add

[jira] [Commented] (SPARK-3418) [MLlib] Additional BLAS and Local Sparse Matrix support

2014-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123634#comment-14123634 ] Apache Spark commented on SPARK-3418: - User 'brkyvz' has created a pull request for

[jira] [Updated] (SPARK-3211) .take() is OOM-prone when there are empty partitions

2014-09-05 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-3211: - Target Version/s: 1.1.1, 1.2.0 .take() is OOM-prone when there are empty partitions

[jira] [Commented] (SPARK-3416) Add matrix operations for large data set

2014-09-05 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123695#comment-14123695 ] Yu Ishikawa commented on SPARK-3416: We discuss about this issue on the thread.

[jira] [Resolved] (SPARK-3082) yarn.Client.logClusterResourceDetails throws NPE if requested queue doesn't exist

2014-09-05 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza resolved SPARK-3082. --- Resolution: Fixed Fix Version/s: 1.1.0 yarn.Client.logClusterResourceDetails throws NPE if

[jira] [Created] (SPARK-3423) Implement BETWEEN support for regular SQL parser

2014-09-05 Thread William Benton (JIRA)
William Benton created SPARK-3423: - Summary: Implement BETWEEN support for regular SQL parser Key: SPARK-3423 URL: https://issues.apache.org/jira/browse/SPARK-3423 Project: Spark Issue Type:

[jira] [Commented] (SPARK-3215) Add remote interface for SparkContext

2014-09-05 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123749#comment-14123749 ] Marcelo Vanzin commented on SPARK-3215: --- I updated the prototype to include a Java

[jira] [Commented] (SPARK-3423) Implement BETWEEN support for regular SQL parser

2014-09-05 Thread William Benton (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123814#comment-14123814 ] William Benton commented on SPARK-3423: --- (PR is here:

[jira] [Commented] (SPARK-1239) Don't fetch all map output statuses at each reducer during shuffles

2014-09-05 Thread Kostas Sakellis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123883#comment-14123883 ] Kostas Sakellis commented on SPARK-1239: [~pwendell] I'd like to take a crack at

[jira] [Commented] (SPARK-3423) Implement BETWEEN support for regular SQL parser

2014-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14124238#comment-14124238 ] Apache Spark commented on SPARK-3423: - User 'willb' has created a pull request for

[jira] [Commented] (SPARK-2334) Attribute Error calling PipelinedRDD.id() in pyspark

2014-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14124239#comment-14124239 ] Apache Spark commented on SPARK-2334: - User 'davies' has created a pull request for

[jira] [Updated] (SPARK-3211) .take() is OOM-prone when there are empty partitions

2014-09-05 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-3211: - Assignee: Andrew Ash .take() is OOM-prone when there are empty partitions

[jira] [Resolved] (SPARK-3211) .take() is OOM-prone when there are empty partitions

2014-09-05 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-3211. -- Resolution: Fixed Fix Version/s: 1.2.0 1.1.1 .take() is OOM-prone

[jira] [Created] (SPARK-3424) KMeans Plus Plus is too slow

2014-09-05 Thread Derrick Burns (JIRA)
Derrick Burns created SPARK-3424: Summary: KMeans Plus Plus is too slow Key: SPARK-3424 URL: https://issues.apache.org/jira/browse/SPARK-3424 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-3411) Improve load-balancing of concurrently-submitted drivers across workers

2014-09-05 Thread WangTaoTheTonic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] WangTaoTheTonic updated SPARK-3411: --- Summary: Improve load-balancing of concurrently-submitted drivers across workers (was:

[jira] [Updated] (SPARK-3411) Improve load-balancing of concurrently-submitted drivers across workers

2014-09-05 Thread WangTaoTheTonic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] WangTaoTheTonic updated SPARK-3411: --- Description: If the waiting driver array is too big, the drivers in it will be dispatched to

[jira] [Commented] (SPARK-3361) Expand PEP 8 checks to include EC2 script and Python examples

2014-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14124293#comment-14124293 ] Apache Spark commented on SPARK-3361: - User 'nchammas' has created a pull request for