[jira] [Updated] (SPARK-2664) Deal with `--conf` options in spark-submit that relate to flags

2014-07-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2664: --- Description: If someone sets a spark conf that relates to an existing flag `--master`, we sh

[jira] [Created] (SPARK-2664) Deal with `--conf` options in spark-submit that relate to flags

2014-07-23 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-2664: -- Summary: Deal with `--conf` options in spark-submit that relate to flags Key: SPARK-2664 URL: https://issues.apache.org/jira/browse/SPARK-2664 Project: Spark

[jira] [Updated] (SPARK-2310) Support arbitrary options on the command line with spark-submit

2014-07-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2310: --- Assignee: Sandy Ryza > Support arbitrary options on the command line with spark-submit >

[jira] [Resolved] (SPARK-2310) Support arbitrary options on the command line with spark-submit

2014-07-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2310. Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1253 [https://

[jira] [Commented] (SPARK-2456) Scheduler refactoring

2014-07-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072860#comment-14072860 ] Reynold Xin commented on SPARK-2456: One related PR: https://github.com/apache/spark/p

[jira] [Updated] (SPARK-2456) Scheduler refactoring

2014-07-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2456: --- Description: This is an umbrella ticket to track scheduler refactoring. We want to clearly define se

[jira] [Updated] (SPARK-2661) Unpersist last RDD in bagel iteration

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2661: - Fix Version/s: 1.1.0 > Unpersist last RDD in bagel iteration > --

[jira] [Resolved] (SPARK-2658) HiveQL: 1 = true should evaluate to true

2014-07-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-2658. Resolution: Fixed Fix Version/s: 1.1.0 1.0.1 > HiveQL: 1 = true should ev

[jira] [Updated] (SPARK-2661) Unpersist last RDD in bagel iteration

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2661: - Assignee: Adrian Wang > Unpersist last RDD in bagel iteration > -

[jira] [Updated] (SPARK-2661) Unpersist last RDD in bagel iteration

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2661: - Affects Version/s: (was: 1.0.1) > Unpersist last RDD in bagel iteration > ---

[jira] [Updated] (SPARK-2661) Unpersist last RDD in bagel iteration

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2661: - Affects Version/s: (was: 1.0.0) > Unpersist last RDD in bagel iteration > ---

[jira] [Resolved] (SPARK-2662) Fix NPE for JsonProtocol

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-2662. -- Resolution: Fixed Fix Version/s: 1.1.0 > Fix NPE for JsonProtocol >

[jira] [Updated] (SPARK-2661) Unpersist last RDD in bagel iteration

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2661: - Affects Version/s: 1.0.0 > Unpersist last RDD in bagel iteration > --

[jira] [Assigned] (SPARK-1726) Tasks that fail to serialize remain in active stages forever.

2014-07-23 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout reassigned SPARK-1726: - Assignee: Kay Ousterhout (was: Andrew Or) > Tasks that fail to serialize remain in activ

[jira] [Commented] (SPARK-2663) Support the GroupingSet/ROLLUP/CUBE

2014-07-23 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072841#comment-14072841 ] Cheng Hao commented on SPARK-2663: -- https://github.com/apache/spark/pull/1567 > Support

[jira] [Issue Comment Deleted] (SPARK-2663) Support the GroupingSet/ROLLUP/CUBE

2014-07-23 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-2663: - Comment: was deleted (was: https://github.com/apache/spark/pull/1567) > Support the GroupingSet/ROLLUP/C

[jira] [Commented] (SPARK-2663) Support the GroupingSet/ROLLUP/CUBE

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072832#comment-14072832 ] Apache Spark commented on SPARK-2663: - User 'chenghao-intel' has created a pull reques

[jira] [Created] (SPARK-2663) Support the GroupingSet/ROLLUP/CUBE

2014-07-23 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-2663: Summary: Support the GroupingSet/ROLLUP/CUBE Key: SPARK-2663 URL: https://issues.apache.org/jira/browse/SPARK-2663 Project: Spark Issue Type: New Feature C

[jira] [Commented] (SPARK-1726) Tasks that fail to serialize remain in active stages forever.

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072808#comment-14072808 ] Apache Spark commented on SPARK-1726: - User 'kayousterhout' has created a pull request

[jira] [Commented] (SPARK-2567) Resubmitted stage sometimes remains as active stage in the web UI

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072809#comment-14072809 ] Apache Spark commented on SPARK-2567: - User 'kayousterhout' has created a pull request

[jira] [Commented] (SPARK-2633) support register spark listener to listener bus with Java API

2014-07-23 Thread Chengxiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072807#comment-14072807 ] Chengxiang Li commented on SPARK-2633: -- Thanks [~vanzin], that's a very good point. S

[jira] [Commented] (SPARK-2484) Build should not run hive compatibility tests by default.

2014-07-23 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072804#comment-14072804 ] Guoqiang Li commented on SPARK-2484: spark-examples is dependent on spark-hive therefo

[jira] [Comment Edited] (SPARK-2484) Build should not run hive compatibility tests by default.

2014-07-23 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072804#comment-14072804 ] Guoqiang Li edited comment on SPARK-2484 at 7/24/14 5:03 AM: -

[jira] [Commented] (SPARK-2420) Change Spark build to minimize library conflicts

2014-07-23 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072778#comment-14072778 ] Xuefu Zhang commented on SPARK-2420: Is shading guava in Spark build a reasonable comp

[jira] [Commented] (SPARK-1981) Add AWS Kinesis streaming support

2014-07-23 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072761#comment-14072761 ] Chris Fregly commented on SPARK-1981: - in addition to the ec2 scripts, can someone ver

[jira] [Commented] (SPARK-2492) KafkaReceiver minor changes to align with Kafka 0.8

2014-07-23 Thread Tobias Pfeiffer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072759#comment-14072759 ] Tobias Pfeiffer commented on SPARK-2492: Hi, thanks for updating the pull request

[jira] [Comment Edited] (SPARK-2492) KafkaReceiver minor changes to align with Kafka 0.8

2014-07-23 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072749#comment-14072749 ] Saisai Shao edited comment on SPARK-2492 at 7/24/14 3:27 AM: -

[jira] [Commented] (SPARK-2492) KafkaReceiver minor changes to align with Kafka 0.8

2014-07-23 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072749#comment-14072749 ] Saisai Shao commented on SPARK-2492: Hi Tobias, I've updated the code, mind taking a

[jira] [Commented] (SPARK-2568) RangePartitioner should go through the data only once

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072729#comment-14072729 ] Apache Spark commented on SPARK-2568: - User 'mengxr' has created a pull request for th

[jira] [Created] (SPARK-2662) Fix NPE for JsonProtocol

2014-07-23 Thread Guoqiang Li (JIRA)
Guoqiang Li created SPARK-2662: -- Summary: Fix NPE for JsonProtocol Key: SPARK-2662 URL: https://issues.apache.org/jira/browse/SPARK-2662 Project: Spark Issue Type: Bug Components: Spar

[jira] [Commented] (SPARK-2662) Fix NPE for JsonProtocol

2014-07-23 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072689#comment-14072689 ] Guoqiang Li commented on SPARK-2662: PR: https://github.com/apache/spark/pull/1511 >

[jira] [Created] (SPARK-2661) Unpersist last RDD in bagel iteration

2014-07-23 Thread Adrian Wang (JIRA)
Adrian Wang created SPARK-2661: -- Summary: Unpersist last RDD in bagel iteration Key: SPARK-2661 URL: https://issues.apache.org/jira/browse/SPARK-2661 Project: Spark Issue Type: Improvement A

[jira] [Commented] (SPARK-2660) Enable pretty-printing SchemaRDD Rows

2014-07-23 Thread Larry Xiao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072687#comment-14072687 ] Larry Xiao commented on SPARK-2660: --- I think this one is suitable for newbie like me, th

[jira] [Updated] (SPARK-2648) Randomize order of executors when fetching shuffle blocks

2014-07-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2648: --- Summary: Randomize order of executors when fetching shuffle blocks (was: through shuffling b

[jira] [Commented] (SPARK-2010) Support for nested data in PySpark SQL

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072681#comment-14072681 ] Apache Spark commented on SPARK-2010: - User 'davies' has created a pull request for th

[jira] [Created] (SPARK-2660) Enable pretty-printing SchemaRDD Rows

2014-07-23 Thread Aaron Davidson (JIRA)
Aaron Davidson created SPARK-2660: - Summary: Enable pretty-printing SchemaRDD Rows Key: SPARK-2660 URL: https://issues.apache.org/jira/browse/SPARK-2660 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-2484) Build should not run hive compatibility tests by default.

2014-07-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072668#comment-14072668 ] Patrick Wendell commented on SPARK-2484: A workaround has been merged here: https:

[jira] [Commented] (SPARK-2458) Make failed application log visible on History Server

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072665#comment-14072665 ] Apache Spark commented on SPARK-2458: - User 'tsudukim' has created a pull request for

[jira] [Updated] (SPARK-2316) StorageStatusListener should avoid O(blocks) operations

2014-07-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2316: --- Priority: Critical (was: Major) > StorageStatusListener should avoid O(blocks) operations >

[jira] [Updated] (SPARK-2316) StorageStatusListener should avoid O(blocks) operations

2014-07-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2316: --- Target Version/s: 1.1.0 (was: 1.0.2) > StorageStatusListener should avoid O(blocks) operatio

[jira] [Commented] (SPARK-2659) HiveQL: Division operator should always perform fractional division

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072655#comment-14072655 ] Apache Spark commented on SPARK-2659: - User 'marmbrus' has created a pull request for

[jira] [Created] (SPARK-2659) HiveQL: Division operator should always perform fractional division

2014-07-23 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-2659: --- Summary: HiveQL: Division operator should always perform fractional division Key: SPARK-2659 URL: https://issues.apache.org/jira/browse/SPARK-2659 Project: Spar

[jira] [Commented] (SPARK-2658) HiveQL: 1 = true should evaluate to true

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072629#comment-14072629 ] Apache Spark commented on SPARK-2658: - User 'marmbrus' has created a pull request for

[jira] [Created] (SPARK-2658) HiveQL: 1 = true should evaluate to true

2014-07-23 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-2658: --- Summary: HiveQL: 1 = true should evaluate to true Key: SPARK-2658 URL: https://issues.apache.org/jira/browse/SPARK-2658 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-2574) Avoid allocating new ArrayBuffer in groupByKey's mergeCombiner

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2574: - Priority: Trivial (was: Major) > Avoid allocating new ArrayBuffer in groupByKey's mergeCombiner

[jira] [Resolved] (SPARK-2549) Functions defined inside of other functions trigger failures

2014-07-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2549. Resolution: Fixed Issue resolved by pull request 1510 [https://github.com/apache/spark/pull

[jira] [Commented] (SPARK-2574) Avoid allocating new ArrayBuffer in groupByKey's mergeCombiner

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072605#comment-14072605 ] Matei Zaharia commented on SPARK-2574: -- I implemented this as part of https://github.

[jira] [Commented] (SPARK-2657) Use more compact data structures than ArrayBuffer in groupBy and cogroup

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072608#comment-14072608 ] Apache Spark commented on SPARK-2657: - User 'mateiz' has created a pull request for th

[jira] [Assigned] (SPARK-2574) Avoid allocating new ArrayBuffer in groupByKey's mergeCombiner

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia reassigned SPARK-2574: Assignee: Matei Zaharia > Avoid allocating new ArrayBuffer in groupByKey's mergeCombiner >

[jira] [Created] (SPARK-2657) Use more compact data structures than ArrayBuffer in groupBy and cogroup

2014-07-23 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-2657: Summary: Use more compact data structures than ArrayBuffer in groupBy and cogroup Key: SPARK-2657 URL: https://issues.apache.org/jira/browse/SPARK-2657 Project: Spark

[jira] [Commented] (SPARK-2656) Python version without support for exact sample size

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072585#comment-14072585 ] Apache Spark commented on SPARK-2656: - User 'dorx' has created a pull request for this

[jira] [Commented] (SPARK-2633) support register spark listener to listener bus with Java API

2014-07-23 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072579#comment-14072579 ] Marcelo Vanzin commented on SPARK-2633: --- So, being able to register listeners is pro

[jira] [Created] (SPARK-2656) Python version without support for exact sample size

2014-07-23 Thread Doris Xin (JIRA)
Doris Xin created SPARK-2656: Summary: Python version without support for exact sample size Key: SPARK-2656 URL: https://issues.apache.org/jira/browse/SPARK-2656 Project: Spark Issue Type: Sub-ta

[jira] [Resolved] (SPARK-2102) Caching with GENERIC column type causes query execution to slow down significantly

2014-07-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2102. - Resolution: Fixed Fix Version/s: 1.1.0 > Caching with GENERIC column type causes q

[jira] [Resolved] (SPARK-2569) Customized UDFs in hive not running with Spark SQL

2014-07-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2569. - Resolution: Fixed Fix Version/s: 1.1.0 > Customized UDFs in hive not running with

[jira] [Resolved] (SPARK-2226) HAVING should be able to contain aggregate expressions that don't appear in the aggregation list.

2014-07-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2226. - Resolution: Fixed Fix Version/s: 1.1.0 > HAVING should be able to contain aggregat

[jira] [Reopened] (SPARK-1726) Tasks that fail to serialize remain in active stages forever.

2014-07-23 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout reopened SPARK-1726: --- > Tasks that fail to serialize remain in active stages forever. > ---

[jira] [Updated] (SPARK-1726) Tasks that fail to serialize remain in active stages forever.

2014-07-23 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-1726: -- Affects Version/s: 1.0.1 Fix Version/s: (was: 1.1.0) [~marmbrus] Sorry I spoke too

[jira] [Resolved] (SPARK-1726) Tasks that fail to serialize remain in active stages forever.

2014-07-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-1726. - Resolution: Fixed Fix Version/s: 1.1.0 [~kayousterhout] reports this is fixed in m

[jira] [Resolved] (SPARK-2588) Add some more DSLs.

2014-07-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2588. - Resolution: Fixed Fix Version/s: 1.1.0 Assignee: Takuya Ueshin > Add some

[jira] [Updated] (SPARK-2648) through shuffling blocksByAddress avoid much reducers to fetch data from a executor at a time

2014-07-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2648: --- Target Version/s: 1.1.0 Assignee: Lianhui Wang > through shuffling blocksByAddress avoid

[jira] [Updated] (SPARK-2648) through shuffling blocksByAddress avoid much reducers to fetch data from a executor at a time

2014-07-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2648: --- Priority: Critical (was: Major) > through shuffling blocksByAddress avoid much reducers to fetch dat

[jira] [Commented] (SPARK-2420) Change Spark build to minimize library conflicts

2014-07-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072334#comment-14072334 ] Sean Owen commented on SPARK-2420: -- Marcelo pointed out offline that Guava 17 is *not* ne

[jira] [Created] (SPARK-2655) Change the default logging level to WARN

2014-07-23 Thread Davies Liu (JIRA)
Davies Liu created SPARK-2655: - Summary: Change the default logging level to WARN Key: SPARK-2655 URL: https://issues.apache.org/jira/browse/SPARK-2655 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-2654) Leveled logging in PySpark

2014-07-23 Thread Davies Liu (JIRA)
Davies Liu created SPARK-2654: - Summary: Leveled logging in PySpark Key: SPARK-2654 URL: https://issues.apache.org/jira/browse/SPARK-2654 Project: Spark Issue Type: Improvement Report

[jira] [Created] (SPARK-2653) Heap size should be the sum of driver.memory and executor.memory in local mode

2014-07-23 Thread Davies Liu (JIRA)
Davies Liu created SPARK-2653: - Summary: Heap size should be the sum of driver.memory and executor.memory in local mode Key: SPARK-2653 URL: https://issues.apache.org/jira/browse/SPARK-2653 Project: Spark

[jira] [Created] (SPARK-2652) Turning default configurations for PySpark

2014-07-23 Thread Davies Liu (JIRA)
Davies Liu created SPARK-2652: - Summary: Turning default configurations for PySpark Key: SPARK-2652 URL: https://issues.apache.org/jira/browse/SPARK-2652 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-2420) Change Spark build to minimize library conflicts

2014-07-23 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072259#comment-14072259 ] Marcelo Vanzin commented on SPARK-2420: --- Hi Sean, I agree in part about the brokenn

[jira] [Commented] (SPARK-2569) Customized UDFs in hive not running with Spark SQL

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072258#comment-14072258 ] Apache Spark commented on SPARK-2569: - User 'marmbrus' has created a pull request for

[jira] [Commented] (SPARK-1630) PythonRDDs don't handle nulls gracefully

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072235#comment-14072235 ] Apache Spark commented on SPARK-1630: - User 'davies' has created a pull request for th

[jira] [Updated] (SPARK-2649) EC2: Ganglia-httpd broken on hvm based machines like r3.4xlarge

2014-07-23 Thread npanj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] npanj updated SPARK-2649: - Description: On EC2 httpd daemon doesn't start (so ganglia is not accessble) on Hvm machines like r3.4xlarge( d

[jira] [Updated] (SPARK-2649) EC2: Ganglia-httpd broken on hvm based machines like r3.4xlarge

2014-07-23 Thread npanj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] npanj updated SPARK-2649: - Description: On EC2 httpd daemon doesn't start (so ganglia is not accessble) on Hvm machines like r3.4xlarge( d

[jira] [Updated] (SPARK-2649) EC2: Ganglia-httpd broken on hvm based machines like r3.4xlarge

2014-07-23 Thread npanj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] npanj updated SPARK-2649: - Description: On EC2 httpd daemon doesn't start (so ganglia is not accessble) on Hvm machines like r3.4xlarge( d

[jira] [Updated] (SPARK-1362) Web UI should provide page of showing statistics and stage list for a given job

2014-07-23 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Masayoshi TSUZUKI updated SPARK-1362: - Component/s: Web UI > Web UI should provide page of showing statistics and stage list for

[jira] [Commented] (SPARK-2642) Add jobId in web UI

2014-07-23 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072171#comment-14072171 ] Masayoshi TSUZUKI commented on SPARK-2642: -- Is this the same as [SPARK-1362] ? >

[jira] [Commented] (SPARK-2651) Add maven scalastyle plugin

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072164#comment-14072164 ] Apache Spark commented on SPARK-2651: - User 'rahulsinghaliitd' has created a pull requ

[jira] [Commented] (SPARK-2567) Resubmitted stage sometimes remains as active stage in the web UI

2014-07-23 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072161#comment-14072161 ] Masayoshi TSUZUKI commented on SPARK-2567: -- I noticed now but the cause of [SPARK

[jira] [Commented] (SPARK-2651) Add maven scalastyle plugin

2014-07-23 Thread Rahul Singhal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072156#comment-14072156 ] Rahul Singhal commented on SPARK-2651: -- PR: https://github.com/apache/spark/pull/1550

[jira] [Commented] (SPARK-2575) SVMWithSGD throwing Input Validation failed

2014-07-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072140#comment-14072140 ] Xiangrui Meng commented on SPARK-2575: -- loadLibSVMFile converts labels to binary by d

[jira] [Created] (SPARK-2651) Add maven scalastyle plugin

2014-07-23 Thread Rahul Singhal (JIRA)
Rahul Singhal created SPARK-2651: Summary: Add maven scalastyle plugin Key: SPARK-2651 URL: https://issues.apache.org/jira/browse/SPARK-2651 Project: Spark Issue Type: Improvement C

[jira] [Updated] (SPARK-2569) Customized UDFs in hive not running with Spark SQL

2014-07-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2569: Priority: Critical (was: Major) Target Version/s: 1.1.0 > Customized UDFs in h

[jira] [Assigned] (SPARK-2569) Customized UDFs in hive not running with Spark SQL

2014-07-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reassigned SPARK-2569: --- Assignee: Michael Armbrust > Customized UDFs in hive not running with Spark SQL > ---

[jira] [Commented] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-23 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072076#comment-14072076 ] Yin Huai commented on SPARK-2576: - [~prashant_] I have also created a [REPL test|https://

[jira] [Updated] (SPARK-2650) Wrong initial sizes for in-memory column buffers

2014-07-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2650: Target Version/s: 1.1.0 > Wrong initial sizes for in-memory column buffers > --

[jira] [Updated] (SPARK-1630) PythonRDDs don't handle nulls gracefully

2014-07-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-1630: Assignee: Davies Liu > PythonRDDs don't handle nulls gracefully > -

[jira] [Resolved] (SPARK-2561) Repartitioning a SchemaRDD breaks resolution

2014-07-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2561. - Resolution: Fixed Fix Version/s: 1.0.2 1.1.0 > Repartitioning a

[jira] [Created] (SPARK-2650) Wrong initial sizes for in-memory column buffers

2014-07-23 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-2650: --- Summary: Wrong initial sizes for in-memory column buffers Key: SPARK-2650 URL: https://issues.apache.org/jira/browse/SPARK-2650 Project: Spark Issue Ty

[jira] [Resolved] (SPARK-2277) Make TaskScheduler track whether there's host on a rack

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-2277. -- Resolution: Fixed > Make TaskScheduler track whether there's host on a rack > -

[jira] [Updated] (SPARK-2277) Make TaskScheduler track whether there's host on a rack

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2277: - Fix Version/s: 1.1.0 > Make TaskScheduler track whether there's host on a rack >

[jira] [Updated] (SPARK-2277) Make TaskScheduler track whether there's host on a rack

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2277: - Assignee: Rui Li > Make TaskScheduler track whether there's host on a rack >

[jira] [Updated] (SPARK-2640) In "local[N]", free cores of the only executor should be touched by "spark.task.cpus" for every finish/start-up of tasks.

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2640: - Assignee: woshilaiceshide > In "local[N]", free cores of the only executor should be touched by

[jira] [Updated] (SPARK-2640) In "local[N]", free cores of the only executor should be touched by "spark.task.cpus" for every finish/start-up of tasks.

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2640: - Priority: Minor (was: Major) > In "local[N]", free cores of the only executor should be touched

[jira] [Resolved] (SPARK-2640) In "local[N]", free cores of the only executor should be touched by "spark.task.cpus" for every finish/start-up of tasks.

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-2640. -- Resolution: Fixed Fix Version/s: 1.1.0 > In "local[N]", free cores of the only executor

[jira] [Updated] (SPARK-2632) Importing a method of class in Spark REPL causes the REPL to pulls in unnecessary stuff.

2014-07-23 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-2632: Priority: Major (was: Blocker) > Importing a method of class in Spark REPL causes the REPL to pulls in >

[jira] [Commented] (SPARK-2632) Importing a method of class in Spark REPL causes the REPL to pulls in unnecessary stuff.

2014-07-23 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072044#comment-14072044 ] Yin Huai commented on SPARK-2632: - Seems the exception triggered by importing a method of

[jira] [Updated] (SPARK-2609) Log thread ID when spilling ExternalAppendOnlyMap

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2609: - Assignee: Andrew Or > Log thread ID when spilling ExternalAppendOnlyMap > ---

[jira] [Resolved] (SPARK-2609) Log thread ID when spilling ExternalAppendOnlyMap

2014-07-23 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-2609. -- Resolution: Fixed > Log thread ID when spilling ExternalAppendOnlyMap > ---

[jira] [Created] (SPARK-2649) EC2: Ganglia-httpd broken on hvm based machines like r3.4xlarge

2014-07-23 Thread npanj (JIRA)
npanj created SPARK-2649: Summary: EC2: Ganglia-httpd broken on hvm based machines like r3.4xlarge Key: SPARK-2649 URL: https://issues.apache.org/jira/browse/SPARK-2649 Project: Spark Issue Type: Bu

[jira] [Commented] (SPARK-2648) through shuffling blocksByAddress avoid much reducers to fetch data from a executor at a time

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071996#comment-14071996 ] Apache Spark commented on SPARK-2648: - User 'lianhuiwang' has created a pull request f

[jira] [Created] (SPARK-2648) through shuffling blocksByAddress avoid much reducers to fetch data from a executor at a time

2014-07-23 Thread Lianhui Wang (JIRA)
Lianhui Wang created SPARK-2648: --- Summary: through shuffling blocksByAddress avoid much reducers to fetch data from a executor at a time Key: SPARK-2648 URL: https://issues.apache.org/jira/browse/SPARK-2648

  1   2   >