[jira] [Commented] (SPARK-3172) Distinguish between shuffle spill on the map and reduce side

2014-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144411#comment-14144411 ] Apache Spark commented on SPARK-3172: - User 'sryza' has created a pull request for

[jira] [Commented] (SPARK-3577) Add task metric to report spill time

2014-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144412#comment-14144412 ] Apache Spark commented on SPARK-3577: - User 'sryza' has created a pull request for

[jira] [Commented] (SPARK-3481) HiveComparisonTest throws exception of org.apache.hadoop.hive.ql.metadata.HiveException: Database does not exist: default

2014-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144421#comment-14144421 ] Apache Spark commented on SPARK-3481: - User 'scwf' has created a pull request for this

[jira] [Assigned] (SPARK-3541) Improve ALS internal storage

2014-09-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-3541: Assignee: Xiangrui Meng Improve ALS internal storage

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-09-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144490#comment-14144490 ] Patrick Wendell commented on SPARK-3633: [~nravi] if you are trying to debug this,

[jira] [Commented] (SPARK-3368) Spark cannot be used with Avro and Parquet

2014-09-23 Thread Chris Gore (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144493#comment-14144493 ] Chris Gore commented on SPARK-3368: --- Thanks for filing this; I'm glad I'm not the only

[jira] [Commented] (SPARK-3582) Spark SQL having issue with existing Hive UDFs which take Map as a parameter

2014-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144510#comment-14144510 ] Apache Spark commented on SPARK-3582: - User 'adrian-wang' has created a pull request

[jira] [Commented] (SPARK-3582) Spark SQL having issue with existing Hive UDFs which take Map as a parameter

2014-09-23 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144508#comment-14144508 ] Adrian Wang commented on SPARK-3582: Hi Saurabh, I have created a PR(

[jira] [Commented] (SPARK-3371) Spark SQL: Renaming a function expression with group by gives error

2014-09-23 Thread Ravindra Pesala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144514#comment-14144514 ] Ravindra Pesala commented on SPARK-3371: It seems like an issue. By default SQL

[jira] [Created] (SPARK-3656) IllegalArgumentException when I using sort-based shuffle

2014-09-23 Thread yangping wu (JIRA)
yangping wu created SPARK-3656: -- Summary: IllegalArgumentException when I using sort-based shuffle Key: SPARK-3656 URL: https://issues.apache.org/jira/browse/SPARK-3656 Project: Spark Issue

[jira] [Updated] (SPARK-3656) IllegalArgumentException when I using sort-based shuffle

2014-09-23 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yangping wu updated SPARK-3656: --- Description: The code works fine in hash-based shuffle.

[jira] [Updated] (SPARK-3656) IllegalArgumentException when I using sort-based shuffle

2014-09-23 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yangping wu updated SPARK-3656: --- Description: The code works fine in hash-based shuffle. {code}

[jira] [Commented] (SPARK-3656) IllegalArgumentException when I using sort-based shuffle

2014-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144588#comment-14144588 ] Sean Owen commented on SPARK-3656: -- Duplicate of

[jira] [Commented] (SPARK-3356) Document when RDD elements' ordering within partitions is nondeterministic

2014-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144755#comment-14144755 ] Apache Spark commented on SPARK-3356: - User 'srowen' has created a pull request for

[jira] [Resolved] (SPARK-2668) Add variable of yarn log directory for reference from the log4j configuration

2014-09-23 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-2668. -- Resolution: Fixed Fix Version/s: 1.2.0 Add variable of yarn log directory for reference

[jira] [Commented] (SPARK-3644) REST API for Spark application info (jobs / stages / tasks / storage info)

2014-09-23 Thread Trevor McKay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144943#comment-14144943 ] Trevor McKay commented on SPARK-3644: - Anecdotal notes from a consumer :) I recently

[jira] [Commented] (SPARK-3445) Deprecate and later remove YARN alpha support

2014-09-23 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144946#comment-14144946 ] Thomas Graves commented on SPARK-3445: -- Any other comments or objections to this?

[jira] [Created] (SPARK-3657) yarn alpha YarnRMClientImpl throws NPE appMasterRequest.setTrackingUrl starting spark-shell

2014-09-23 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-3657: Summary: yarn alpha YarnRMClientImpl throws NPE appMasterRequest.setTrackingUrl starting spark-shell Key: SPARK-3657 URL: https://issues.apache.org/jira/browse/SPARK-3657

[jira] [Resolved] (SPARK-3477) Clean up code in Yarn Client / ClientBase

2014-09-23 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-3477. -- Resolution: Fixed Fix Version/s: 1.2.0 Clean up code in Yarn Client / ClientBase

[jira] [Resolved] (SPARK-3304) ApplicationMaster's Finish status is wrong when uncaught exception is thrown from ReporterThread

2014-09-23 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-3304. -- Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Kousuke Saruta

[jira] [Created] (SPARK-3658) Take thrift server as a daemon

2014-09-23 Thread WangTaoTheTonic (JIRA)
WangTaoTheTonic created SPARK-3658: -- Summary: Take thrift server as a daemon Key: SPARK-3658 URL: https://issues.apache.org/jira/browse/SPARK-3658 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-3658) Take thrift server as a daemon

2014-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145017#comment-14145017 ] Apache Spark commented on SPARK-3658: - User 'WangTaoTheTonic' has created a pull

[jira] [Commented] (SPARK-2870) Thorough schema inference directly on RDDs of Python dictionaries

2014-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145192#comment-14145192 ] Michael Armbrust commented on SPARK-2870: - Yeah, though we might want to benchmark

[jira] [Resolved] (SPARK-3598) cast to timestamp should be the same as hive

2014-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3598. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 2458

[jira] [Updated] (SPARK-3582) Spark SQL having issue with existing Hive UDFs which take Map as a parameter

2014-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3582: Assignee: Adrian Wang Spark SQL having issue with existing Hive UDFs which take Map as a

[jira] [Resolved] (SPARK-3536) SELECT on empty parquet table throws exception

2014-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3536. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 2456

[jira] [Created] (SPARK-3659) Set EC2 version to 1.1.0 in master branch

2014-09-23 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-3659: Summary: Set EC2 version to 1.1.0 in master branch Key: SPARK-3659 URL: https://issues.apache.org/jira/browse/SPARK-3659 Project: Spark

[jira] [Commented] (SPARK-3659) Set EC2 version to 1.1.0 in master branch

2014-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145239#comment-14145239 ] Apache Spark commented on SPARK-3659: - User 'shivaram' has created a pull request for

[jira] [Updated] (SPARK-2745) Add Java friendly methods to Duration class

2014-09-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2745: - Assignee: Sean Owen (was: Sean Owen) Add Java friendly methods to Duration class

[jira] [Updated] (SPARK-2745) Add Java friendly methods to Duration class

2014-09-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2745: - Assignee: Sean Owen (was: Tathagata Das) Add Java friendly methods to Duration class

[jira] [Resolved] (SPARK-2745) Add Java friendly methods to Duration class

2014-09-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-2745. -- Resolution: Fixed Fix Version/s: 1.2.0 Add Java friendly methods to Duration class

[jira] [Commented] (SPARK-3371) Spark SQL: Renaming a function expression with group by gives error

2014-09-23 Thread Ravindra Pesala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145258#comment-14145258 ] Ravindra Pesala commented on SPARK-3371: https://github.com/apache/spark/pull/2511

[jira] [Commented] (SPARK-3371) Spark SQL: Renaming a function expression with group by gives error

2014-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145262#comment-14145262 ] Apache Spark commented on SPARK-3371: - User 'ravipesala' has created a pull request

[jira] [Updated] (SPARK-3389) Add converter class to make reading Parquet files easy with PySpark

2014-09-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-3389: - Target Version/s: 1.2.0 Add converter class to make reading Parquet files easy with PySpark

[jira] [Updated] (SPARK-3389) Add converter class to make reading Parquet files easy with PySpark

2014-09-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-3389: - Assignee: Uri Laserson Add converter class to make reading Parquet files easy with PySpark

[jira] [Resolved] (SPARK-3268) DoubleType should support modulus

2014-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3268. - Resolution: Fixed Fix Version/s: 1.2.0 DoubleType should support modulus

[jira] [Updated] (SPARK-3268) DoubleType should support modulus

2014-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3268: Assignee: Venkata Ramana G DoubleType should support modulus

[jira] [Resolved] (SPARK-3646) Copy SQL options from the spark context

2014-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3646. - Resolution: Fixed Fix Version/s: 1.2.0 Copy SQL options from the spark context

[jira] [Comment Edited] (SPARK-3129) Prevent data loss in Spark Streaming

2014-09-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145324#comment-14145324 ] Matei Zaharia edited comment on SPARK-3129 at 9/23/14 7:53 PM:

[jira] [Commented] (SPARK-3129) Prevent data loss in Spark Streaming

2014-09-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145324#comment-14145324 ] Matei Zaharia commented on SPARK-3129: -- Is that 100 MB/s per node or in total? That

[jira] [Created] (SPARK-3660) Initial RDD for updateStateByKey transformation

2014-09-23 Thread Soumitra Kumar (JIRA)
Soumitra Kumar created SPARK-3660: - Summary: Initial RDD for updateStateByKey transformation Key: SPARK-3660 URL: https://issues.apache.org/jira/browse/SPARK-3660 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-3647) Shaded Guava patch causes access issues with package private classes

2014-09-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3647. Resolution: Fixed Fixed by Marcelo in this patch:

[jira] [Resolved] (SPARK-3612) Executor shouldn't quit if heartbeat message fails to reach the driver

2014-09-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3612. Resolution: Fixed Fix Version/s: 1.2.0 1.1.1 Assignee:

[jira] [Commented] (SPARK-3129) Prevent data loss in Spark Streaming

2014-09-23 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145411#comment-14145411 ] Hari Shreedharan commented on SPARK-3129: - It is per node, single threaded.

[jira] [Closed] (SPARK-3653) SPARK_{DRIVER|EXECUTOR}_MEMORY is ignored in cluster mode

2014-09-23 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-3653. Resolution: Fixed Fix Version/s: 1.2.0 1.1.1 SPARK_{DRIVER|EXECUTOR}_MEMORY is

[jira] [Created] (SPARK-3661) spark.driver.memory is ignored in cluster mode

2014-09-23 Thread Andrew Or (JIRA)
Andrew Or created SPARK-3661: Summary: spark.driver.memory is ignored in cluster mode Key: SPARK-3661 URL: https://issues.apache.org/jira/browse/SPARK-3661 Project: Spark Issue Type: Bug

[jira] [Closed] (SPARK-1853) Show Streaming application code context (file, line number) in Spark Stages UI

2014-09-23 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-1853. Resolution: Fixed Fix Version/s: 1.1.1 Target Version/s: 1.1.1, 1.2.0 (was: 1.2.0) Show

[jira] [Commented] (SPARK-3129) Prevent data loss in Spark Streaming

2014-09-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145537#comment-14145537 ] Matei Zaharia commented on SPARK-3129: -- Alright, in that case, this sounds pretty

[jira] [Commented] (SPARK-3129) Prevent data loss in Spark Streaming

2014-09-23 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145541#comment-14145541 ] Hari Shreedharan commented on SPARK-3129: - Sure. Thanks Matei! Prevent data loss

[jira] [Updated] (SPARK-3573) Dataset

2014-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3573: - Description: This JIRA is for discussion of ML dataset, essentially a SchemaRDD with

[jira] [Updated] (SPARK-3573) Dataset

2014-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3573: - Description: This JIRA is for discussion of ML dataset, essentially a SchemaRDD with

[jira] [Created] (SPARK-3662) Importing pandas breaks included pi.py example

2014-09-23 Thread Evan Samanas (JIRA)
Evan Samanas created SPARK-3662: --- Summary: Importing pandas breaks included pi.py example Key: SPARK-3662 URL: https://issues.apache.org/jira/browse/SPARK-3662 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-3663) Document SPARK_LOG_DIR and SPARK_PID_DIR

2014-09-23 Thread Andrew Ash (JIRA)
Andrew Ash created SPARK-3663: - Summary: Document SPARK_LOG_DIR and SPARK_PID_DIR Key: SPARK-3663 URL: https://issues.apache.org/jira/browse/SPARK-3663 Project: Spark Issue Type: Documentation

[jira] [Created] (SPARK-3664) Graduate GraphX from alpha to stable

2014-09-23 Thread Ankur Dave (JIRA)
Ankur Dave created SPARK-3664: - Summary: Graduate GraphX from alpha to stable Key: SPARK-3664 URL: https://issues.apache.org/jira/browse/SPARK-3664 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-3665) Java API for GraphX

2014-09-23 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave updated SPARK-3665: -- Component/s: GraphX Java API for GraphX --- Key: SPARK-3665

[jira] [Created] (SPARK-3665) Java API for GraphX

2014-09-23 Thread Ankur Dave (JIRA)
Ankur Dave created SPARK-3665: - Summary: Java API for GraphX Key: SPARK-3665 URL: https://issues.apache.org/jira/browse/SPARK-3665 Project: Spark Issue Type: Improvement Reporter:

[jira] [Created] (SPARK-3666) Extract interfaces for EdgeRDD and VertexRDD

2014-09-23 Thread Ankur Dave (JIRA)
Ankur Dave created SPARK-3666: - Summary: Extract interfaces for EdgeRDD and VertexRDD Key: SPARK-3666 URL: https://issues.apache.org/jira/browse/SPARK-3666 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-3667) Deprecate Graph#unpersistVertices and document how to correctly unpersist graphs

2014-09-23 Thread Ankur Dave (JIRA)
Ankur Dave created SPARK-3667: - Summary: Deprecate Graph#unpersistVertices and document how to correctly unpersist graphs Key: SPARK-3667 URL: https://issues.apache.org/jira/browse/SPARK-3667 Project:

[jira] [Updated] (SPARK-3665) Java API for GraphX

2014-09-23 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave updated SPARK-3665: -- Description: The Java API will wrap the Scala API in a similar manner as JavaRDD. Components will

[jira] [Updated] (SPARK-3664) Graduate GraphX from alpha to stable

2014-09-23 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave updated SPARK-3664: -- Description: The GraphX API is officially marked as alpha but has been moving toward stability. This

[jira] [Updated] (SPARK-3665) Java API for GraphX

2014-09-23 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave updated SPARK-3665: -- Description: The Java API will wrap the Scala API in a similar manner as JavaRDD. Components will

[jira] [Resolved] (SPARK-2142) Give better indicator of how GC cuts into task time

2014-09-23 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza resolved SPARK-2142. --- Resolution: Not a Problem I ran some tests that indicated that only stop-the-world GC time gets

[jira] [Commented] (SPARK-3610) History server log name should not be based on user input

2014-09-23 Thread SK (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145663#comment-14145663 ] SK commented on SPARK-3610: --- I made the following change to EventLoggingListener.scala, L57 and

[jira] [Created] (SPARK-3668) Support for arbitrary key types in IndexedRDD

2014-09-23 Thread Ankur Dave (JIRA)
Ankur Dave created SPARK-3668: - Summary: Support for arbitrary key types in IndexedRDD Key: SPARK-3668 URL: https://issues.apache.org/jira/browse/SPARK-3668 Project: Spark Issue Type:

[jira] [Updated] (SPARK-3668) Support for arbitrary key types in IndexedRDD

2014-09-23 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave updated SPARK-3668: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-2365 Support for arbitrary key types in

[jira] [Created] (SPARK-3669) Extract IndexedRDD interface

2014-09-23 Thread Ankur Dave (JIRA)
Ankur Dave created SPARK-3669: - Summary: Extract IndexedRDD interface Key: SPARK-3669 URL: https://issues.apache.org/jira/browse/SPARK-3669 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-3670) Add log-structured updates with merge

2014-09-23 Thread Ankur Dave (JIRA)
Ankur Dave created SPARK-3670: - Summary: Add log-structured updates with merge Key: SPARK-3670 URL: https://issues.apache.org/jira/browse/SPARK-3670 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-09-23 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145676#comment-14145676 ] Nishkam Ravi commented on SPARK-3633: - [~pwendell] I see a large number of open files

[jira] [Created] (SPARK-3671) Batch multiput updates within partitions

2014-09-23 Thread Ankur Dave (JIRA)
Ankur Dave created SPARK-3671: - Summary: Batch multiput updates within partitions Key: SPARK-3671 URL: https://issues.apache.org/jira/browse/SPARK-3671 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-3672) Add a non-updatable implementation for read performance

2014-09-23 Thread Ankur Dave (JIRA)
Ankur Dave created SPARK-3672: - Summary: Add a non-updatable implementation for read performance Key: SPARK-3672 URL: https://issues.apache.org/jira/browse/SPARK-3672 Project: Spark Issue Type:

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-09-23 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145686#comment-14145686 ] Andrew Ash commented on SPARK-3633: --- Hi [~nravi] you can also check the limits on an

[jira] [Created] (SPARK-3673) Move IndexedRDD from a pull request into a separate repository

2014-09-23 Thread Ankur Dave (JIRA)
Ankur Dave created SPARK-3673: - Summary: Move IndexedRDD from a pull request into a separate repository Key: SPARK-3673 URL: https://issues.apache.org/jira/browse/SPARK-3673 Project: Spark

[jira] [Commented] (SPARK-2365) Add IndexedRDD, an efficient updatable key-value store

2014-09-23 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145698#comment-14145698 ] Ankur Dave commented on SPARK-2365: --- [~imranr] Thanks for the comments and

[jira] [Updated] (SPARK-3665) Java API for GraphX

2014-09-23 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave updated SPARK-3665: -- Component/s: Java API Java API for GraphX --- Key: SPARK-3665

[jira] [Commented] (SPARK-3645) Make caching using SQL commands eager by default, with the option of being lazy

2014-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145709#comment-14145709 ] Apache Spark commented on SPARK-3645: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-09-23 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145713#comment-14145713 ] Nishkam Ravi commented on SPARK-3633: - Looks ok (still waiting on the run to

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-09-23 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145749#comment-14145749 ] Nishkam Ravi commented on SPARK-3633: - Unsurprisingly, the run goes through. It would

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-09-23 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145754#comment-14145754 ] Andrew Ash commented on SPARK-3633: --- You're getting exceptions but they're not fatal to

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-09-23 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145763#comment-14145763 ] Nishkam Ravi commented on SPARK-3633: - With higher values of ulimit and timeout

[jira] [Commented] (SPARK-3468) WebUI Timeline-View feature

2014-09-23 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145829#comment-14145829 ] Sandy Ryza commented on SPARK-3468: --- This looks like a really cool addition. WebUI

[jira] [Created] (SPARK-3674) Add support for launching YARN clusters in spark-ec2

2014-09-23 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-3674: Summary: Add support for launching YARN clusters in spark-ec2 Key: SPARK-3674 URL: https://issues.apache.org/jira/browse/SPARK-3674 Project: Spark

[jira] [Commented] (SPARK-3032) Potential bug when running sort-based shuffle with sorting using TimSort

2014-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145887#comment-14145887 ] Apache Spark commented on SPARK-3032: - User 'jerryshao' has created a pull request for

[jira] [Updated] (SPARK-3032) Potential bug when running sort-based shuffle with sorting using TimSort

2014-09-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3032: --- Priority: Blocker (was: Critical) Potential bug when running sort-based shuffle with

[jira] [Commented] (SPARK-3642) Better document the nuances of shared variables

2014-09-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145918#comment-14145918 ] Josh Rosen commented on SPARK-3642: --- I've linked this JIRA to a couple of related