[jira] [Resolved] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-08-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2576. - Resolution: Fixed Fix Version/s: 1.1.0 slave node throws NoClassDefFoundError

[jira] [Updated] (SPARK-2738) Remove redundant imports in BlockManagerSuite

2014-08-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2738: --- Assignee: Sandy Ryza Remove redundant imports in BlockManagerSuite

[jira] [Resolved] (SPARK-2738) Remove redundant imports in BlockManagerSuite

2014-08-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2738. Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1642

[jira] [Created] (SPARK-2786) Python correlations

2014-08-01 Thread Doris Xin (JIRA)
Doris Xin created SPARK-2786: Summary: Python correlations Key: SPARK-2786 URL: https://issues.apache.org/jira/browse/SPARK-2786 Project: Spark Issue Type: Sub-task Reporter: Doris

[jira] [Resolved] (SPARK-2648) Randomize order of executors when fetching shuffle blocks

2014-08-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2648. Resolution: Not a Problem It turns out we already randomized these, just in a different

[jira] [Commented] (SPARK-2786) Python correlations

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14081978#comment-14081978 ] Apache Spark commented on SPARK-2786: - User 'dorx' has created a pull request for this

[jira] [Resolved] (SPARK-2670) FetchFailedException should be thrown when local fetch has failed

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-2670. -- Resolution: Fixed Fix Version/s: 1.1.0 FetchFailedException should be thrown when

[jira] [Updated] (SPARK-2670) FetchFailedException should be thrown when local fetch has failed

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2670: - Assignee: Kousuke Saruta FetchFailedException should be thrown when local fetch has failed

[jira] [Updated] (SPARK-2670) FetchFailedException should be thrown when local fetch has failed

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2670: - Priority: Major (was: Critical) FetchFailedException should be thrown when local fetch has

[jira] [Resolved] (SPARK-983) Support external sorting for RDD#sortByKey()

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-983. - Resolution: Fixed Fix Version/s: 1.1.0 Support external sorting for RDD#sortByKey()

[jira] [Created] (SPARK-2787) Make sort-based shuffle write files directly when there is no sorting / aggregation and # of partitions is small

2014-08-01 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-2787: Summary: Make sort-based shuffle write files directly when there is no sorting / aggregation and # of partitions is small Key: SPARK-2787 URL:

[jira] [Resolved] (SPARK-2134) Report metrics before application finishes

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-2134. -- Resolution: Fixed Fix Version/s: 1.1.0 Report metrics before application finishes

[jira] [Resolved] (SPARK-2557) createTaskScheduler should be consistent between local and local-n-failures

2014-08-01 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron Davidson resolved SPARK-2557. --- Resolution: Fixed Fix Version/s: 1.1.0 createTaskScheduler should be consistent

[jira] [Updated] (SPARK-2750) Add Https support for Web UI

2014-08-01 Thread WangTaoTheTonic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] WangTaoTheTonic updated SPARK-2750: --- Description: Now I try to add https support for web ui using Jetty ssl integration. Below is

[jira] [Commented] (SPARK-2780) Create a StreamingContext.setLocalProperty for setting local property of jobs launched by streaming

2014-08-01 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14082118#comment-14082118 ] Tathagata Das commented on SPARK-2780: -- Yeah, this isn't very intuitive. Two possible

[jira] [Commented] (SPARK-2750) Add Https support for Web UI

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14082119#comment-14082119 ] Apache Spark commented on SPARK-2750: - User 'WangTaoTheTonic' has created a pull

[jira] [Commented] (SPARK-2678) `Spark-submit` overrides user application options

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14082123#comment-14082123 ] Apache Spark commented on SPARK-2678: - User 'liancheng' has created a pull request for

[jira] [Updated] (SPARK-2103) Java + Kafka + Spark Streaming NoSuchMethodError in java.lang.Object.init

2014-08-01 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2103: - Fix Version/s: 1.1.0 Java + Kafka + Spark Streaming NoSuchMethodError in java.lang.Object.init

[jira] [Updated] (SPARK-2103) Java + Kafka + Spark Streaming NoSuchMethodError in java.lang.Object.init

2014-08-01 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2103: - Assignee: Saisai Shao Java + Kafka + Spark Streaming NoSuchMethodError in java.lang.Object.init

[jira] [Resolved] (SPARK-2103) Java + Kafka + Spark Streaming NoSuchMethodError in java.lang.Object.init

2014-08-01 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-2103. -- Resolution: Fixed Java + Kafka + Spark Streaming NoSuchMethodError in java.lang.Object.init

[jira] [Updated] (SPARK-2768) Add product, user recommend method to MatrixFactorizationModel

2014-08-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2768: - Assignee: Sean Owen Add product, user recommend method to MatrixFactorizationModel

[jira] [Resolved] (SPARK-2768) Add product, user recommend method to MatrixFactorizationModel

2014-08-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2768. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1687

[jira] [Updated] (SPARK-2768) Add product, user recommend method to MatrixFactorizationModel

2014-08-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2768: - Target Version/s: 1.1.0 Add product, user recommend method to MatrixFactorizationModel

[jira] [Resolved] (SPARK-1997) Update breeze to version 0.8.1

2014-08-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1997. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 940

[jira] [Updated] (SPARK-1486) Support multi-model training in MLlib

2014-08-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1486: - Target Version/s: 1.2.0 (was: 1.1.0) Support multi-model training in MLlib

[jira] [Updated] (SPARK-1856) Standardize MLlib interfaces

2014-08-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1856: - Target Version/s: 1.2.0 (was: 1.1.0) Standardize MLlib interfaces

[jira] [Created] (SPARK-2788) Add location filtering to Twitter streams

2014-08-01 Thread Shawn Brunsting (JIRA)
Shawn Brunsting created SPARK-2788: -- Summary: Add location filtering to Twitter streams Key: SPARK-2788 URL: https://issues.apache.org/jira/browse/SPARK-2788 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-2788) Add location filtering to Twitter streams

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14082346#comment-14082346 ] Apache Spark commented on SPARK-2788: - User 'sjbrunst' has created a pull request for

[jira] [Updated] (SPARK-2510) word2vec: Distributed Representation of Words

2014-08-01 Thread Liquan Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liquan Pei updated SPARK-2510: -- Description: We would like to add parallel implementation of word2vec to MLlib. word2vec finds

[jira] [Created] (SPARK-2789) Apply names to RDD to becoming SchemaRDD

2014-08-01 Thread Davies Liu (JIRA)
Davies Liu created SPARK-2789: - Summary: Apply names to RDD to becoming SchemaRDD Key: SPARK-2789 URL: https://issues.apache.org/jira/browse/SPARK-2789 Project: Spark Issue Type: New Feature

[jira] [Resolved] (SPARK-2099) Report TaskMetrics for running tasks

2014-08-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2099. Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1056

[jira] [Resolved] (SPARK-2212) Hash Outer Joins

2014-08-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2212. - Resolution: Fixed Fix Version/s: 1.1.0 Hash Outer Joins

[jira] [Updated] (SPARK-2718) YARN does not handle spark configs with quotes or backslashes

2014-08-01 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-2718: -- Ran into this while working on some other stuff, so I'll work on a fix. YARN does not handle

[jira] [Resolved] (SPARK-2729) Forgot to match Timestamp type in ColumnBuilder

2014-08-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2729. - Resolution: Fixed Fix Version/s: 1.1.0 Forgot to match Timestamp type in

[jira] [Resolved] (SPARK-2767) SparkSQL CLI doens't output error message if query failed.

2014-08-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2767. - Resolution: Fixed Fix Version/s: 1.1.0 SparkSQL CLI doens't output error message

[jira] [Resolved] (SPARK-2735) Remove deprecation in jekyll for pygment in _config.yml

2014-08-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2735. Resolution: Won't Fix Since this will break compatibility for Jekyll 1.x, I'm proposing we

[jira] [Resolved] (SPARK-695) Exponential recursion in getPreferredLocations

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-695. - Resolution: Fixed Fix Version/s: 1.1.0 Exponential recursion in getPreferredLocations

[jira] [Resolved] (SPARK-2490) StackOverflowError when RDD dependencies are too long

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-2490. -- Resolution: Fixed Fix Version/s: 1.1.0 StackOverflowError when RDD dependencies are

[jira] [Updated] (SPARK-2420) Dependency changes for compatibility with Hive

2014-08-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2420: --- Summary: Dependency changes for compatibility with Hive (was: Change Spark build to

[jira] [Created] (SPARK-2790) PySpark zip() doesn't work properly if RDDs have different serializers

2014-08-01 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-2790: - Summary: PySpark zip() doesn't work properly if RDDs have different serializers Key: SPARK-2790 URL: https://issues.apache.org/jira/browse/SPARK-2790 Project: Spark

[jira] [Updated] (SPARK-2789) Apply names to RDD to becoming SchemaRDD

2014-08-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2789: Component/s: SQL Apply names to RDD to becoming SchemaRDD

[jira] [Commented] (SPARK-2532) Fix issues with consolidated shuffle

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14082878#comment-14082878 ] Matei Zaharia commented on SPARK-2532: -- I'm going to create a few sub-tasks for the

[jira] [Created] (SPARK-2791) Fix committing, reverting and state tracking in shuffle file consolidation

2014-08-01 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-2791: Summary: Fix committing, reverting and state tracking in shuffle file consolidation Key: SPARK-2791 URL: https://issues.apache.org/jira/browse/SPARK-2791 Project:

[jira] [Created] (SPARK-2792) Fix reading too much or too little data from each stream in ExternalMap / Sorter

2014-08-01 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-2792: Summary: Fix reading too much or too little data from each stream in ExternalMap / Sorter Key: SPARK-2792 URL: https://issues.apache.org/jira/browse/SPARK-2792

[jira] [Created] (SPARK-2793) Correctly lock directory creation in DiskBlockManager.getFile

2014-08-01 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-2793: Summary: Correctly lock directory creation in DiskBlockManager.getFile Key: SPARK-2793 URL: https://issues.apache.org/jira/browse/SPARK-2793 Project: Spark

[jira] [Created] (SPARK-2795) Improve DiskBlockObjectWriter API

2014-08-01 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-2795: Summary: Improve DiskBlockObjectWriter API Key: SPARK-2795 URL: https://issues.apache.org/jira/browse/SPARK-2795 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-2794) Use Java 7 isSymlink when available

2014-08-01 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-2794: Summary: Use Java 7 isSymlink when available Key: SPARK-2794 URL: https://issues.apache.org/jira/browse/SPARK-2794 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-2017) web ui stage page becomes unresponsive when the number of tasks is large

2014-08-01 Thread Carlos Fuertes (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14082909#comment-14082909 ] Carlos Fuertes commented on SPARK-2017: --- I have been digging in on why the bad

[jira] [Resolved] (SPARK-1612) Potential resource leaks in Utils.copyStream and Utils.offsetBytes

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1612. -- Resolution: Fixed Fix Version/s: 1.1.0 Potential resource leaks in Utils.copyStream

[jira] [Updated] (SPARK-2379) stopReceive in dead loop, cause stackoverflow exception

2014-08-01 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2379: - Fix Version/s: 1.0.3 1.1.0 stopReceive in dead loop, cause stackoverflow

[jira] [Resolved] (SPARK-2379) stopReceive in dead loop, cause stackoverflow exception

2014-08-01 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-2379. -- Resolution: Fixed stopReceive in dead loop, cause stackoverflow exception

[jira] [Closed] (SPARK-1730) Make receiver store data reliably to avoid data-loss on executor failures

2014-08-01 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das closed SPARK-1730. Resolution: Fixed Make receiver store data reliably to avoid data-loss on executor failures

[jira] [Commented] (SPARK-1730) Make receiver store data reliably to avoid data-loss on executor failures

2014-08-01 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14082929#comment-14082929 ] Tathagata Das commented on SPARK-1730: -- [~hshreedharan] I am closing this JIRA based

[jira] [Updated] (SPARK-1645) Improve Spark Streaming compatibility with Flume

2014-08-01 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1645: - Fix Version/s: 1.1.0 Improve Spark Streaming compatibility with Flume

[jira] [Resolved] (SPARK-1645) Improve Spark Streaming compatibility with Flume

2014-08-01 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-1645. -- Resolution: Fixed Improve Spark Streaming compatibility with Flume

[jira] [Updated] (SPARK-2745) Add Java friendly methods to Duration class

2014-08-01 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2745: - Target Version/s: 1.2.0 (was: 1.1.0) Add Java friendly methods to Duration class

[jira] [Updated] (SPARK-2791) Fix committing, reverting and state tracking in shuffle file consolidation

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2791: - Assignee: Mridul Muralidharan Fix committing, reverting and state tracking in shuffle file

[jira] [Updated] (SPARK-2532) Fix issues with consolidated shuffle

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2532: - Fix Version/s: (was: 1.1.0) Fix issues with consolidated shuffle

[jira] [Resolved] (SPARK-2791) Fix committing, reverting and state tracking in shuffle file consolidation

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-2791. -- Resolution: Fixed Fix committing, reverting and state tracking in shuffle file consolidation

[jira] [Resolved] (SPARK-2684) Update ExternalAppendOnlyMap to take an iterator as input

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-2684. -- Resolution: Fixed Fix Version/s: 1.1.0 Update ExternalAppendOnlyMap to take an

[jira] [Created] (SPARK-2796) DecisionTree bug with ordered categorical features

2014-08-01 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-2796: Summary: DecisionTree bug with ordered categorical features Key: SPARK-2796 URL: https://issues.apache.org/jira/browse/SPARK-2796 Project: Spark

[jira] [Updated] (SPARK-2797) SchemaRDDs don't support unpersist()

2014-08-01 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-2797: Description: Looks like something simple got missed in the Java layer? {code} from

[jira] [Commented] (SPARK-2796) DecisionTree bug with ordered categorical features

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083018#comment-14083018 ] Apache Spark commented on SPARK-2796: - User 'jkbradley' has created a pull request for

[jira] [Commented] (SPARK-2012) PySpark StatCounter with numpy arrays

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083150#comment-14083150 ] Apache Spark commented on SPARK-2012: - User 'freeman-lab' has created a pull request

[jira] [Created] (SPARK-2798) Jenkins build failing due to missing scalatest in flume-sink module

2014-08-01 Thread Sean Owen (JIRA)
Sean Owen created SPARK-2798: Summary: Jenkins build failing due to missing scalatest in flume-sink module Key: SPARK-2798 URL: https://issues.apache.org/jira/browse/SPARK-2798 Project: Spark

[jira] [Commented] (SPARK-2798) Jenkins build failing due to missing scalatest in flume-sink module

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083215#comment-14083215 ] Apache Spark commented on SPARK-2798: - User 'srowen' has created a pull request for

[jira] [Commented] (SPARK-2478) Add Python APIs for decision tree

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083223#comment-14083223 ] Apache Spark commented on SPARK-2478: - User 'jkbradley' has created a pull request for

[jira] [Created] (SPARK-2799) Simplify some Scala operations for clarity, performance

2014-08-01 Thread Sean Owen (JIRA)
Sean Owen created SPARK-2799: Summary: Simplify some Scala operations for clarity, performance Key: SPARK-2799 URL: https://issues.apache.org/jira/browse/SPARK-2799 Project: Spark Issue Type:

[jira] [Updated] (SPARK-2799) Simplify some Scala operations for clarity, performance

2014-08-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-2799: - Description: For fun, here's a last minor suggestion for consideration before the 1.1 window closes.

[jira] [Commented] (SPARK-2799) Simplify some Scala operations for clarity, performance

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083227#comment-14083227 ] Apache Spark commented on SPARK-2799: - User 'srowen' has created a pull request for

[jira] [Created] (SPARK-2800) Add scalastyle-output.xml to .rat-excludes file

2014-08-01 Thread Guoqiang Li (JIRA)
Guoqiang Li created SPARK-2800: -- Summary: Add scalastyle-output.xml to .rat-excludes file Key: SPARK-2800 URL: https://issues.apache.org/jira/browse/SPARK-2800 Project: Spark Issue Type: Test

[jira] [Updated] (SPARK-2800) Add scalastyle-output.xml to .rat-excludes file.

2014-08-01 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-2800: --- Summary: Add scalastyle-output.xml to .rat-excludes file. (was: Add scalastyle-output.xml to

[jira] [Commented] (SPARK-2800) Exclude scalastyle-output.xml Apache RAT checks

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083261#comment-14083261 ] Apache Spark commented on SPARK-2800: - User 'witgo' has created a pull request for

[jira] [Updated] (SPARK-2800) Exclude scalastyle-output.xml Apache RAT checks

2014-08-01 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-2800: --- Summary: Exclude scalastyle-output.xml Apache RAT checks (was: Add scalastyle-output.xml to

[jira] [Commented] (SPARK-2016) rdd in-memory storage UI becomes unresponsive when the number of RDD partitions is large

2014-08-01 Thread Carlos Fuertes (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083263#comment-14083263 ] Carlos Fuertes commented on SPARK-2016: --- The real problem with the unresponsiveness

[jira] [Updated] (SPARK-2792) Fix reading too much or too little data from each stream in ExternalMap / Sorter

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2792: - Assignee: (was: Matei Zaharia) Fix reading too much or too little data from each stream in

[jira] [Reopened] (SPARK-2212) Hash Outer Joins

2014-08-01 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai reopened SPARK-2212: - I am reopening it because of https://github.com/apache/spark/pull/1721. Hash Outer Joins

[jira] [Updated] (SPARK-2212) Hash Outer Joins

2014-08-01 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-2212: Priority: Blocker (was: Minor) Hash Outer Joins Key: SPARK-2212

[jira] [Resolved] (SPARK-2010) Support for nested data in PySpark SQL

2014-08-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2010. - Resolution: Fixed Fix Version/s: 1.1.0 Support for nested data in PySpark SQL

[jira] [Resolved] (SPARK-2212) Hash Outer Joins

2014-08-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2212. - Resolution: Fixed Hash Outer Joins Key: SPARK-2212

[jira] [Updated] (SPARK-2116) Load spark-defaults.conf from directory specified by SPARK_CONF_DIR

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2116: - Assignee: Albert Chu Load spark-defaults.conf from directory specified by SPARK_CONF_DIR

[jira] [Resolved] (SPARK-2116) Load spark-defaults.conf from directory specified by SPARK_CONF_DIR

2014-08-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-2116. -- Resolution: Fixed Fix Version/s: 1.1.0 Load spark-defaults.conf from directory

[jira] [Created] (SPARK-2801) Generalize RandomRDD Generator output to generic type

2014-08-01 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-2801: -- Summary: Generalize RandomRDD Generator output to generic type Key: SPARK-2801 URL: https://issues.apache.org/jira/browse/SPARK-2801 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-2784) Make language configurable using SQLConf instead of hql/sql functions

2014-08-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reassigned SPARK-2784: --- Assignee: Michael Armbrust Make language configurable using SQLConf instead of

[jira] [Updated] (SPARK-2189) Method for removing temp tables created by registerAsTable

2014-08-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2189: Target Version/s: 1.2.0 (was: 1.1.0) Method for removing temp tables created by

[jira] [Updated] (SPARK-2801) Generalize RandomRDD Generator output to generic type

2014-08-01 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-2801: --- Description: The RandomRDDGenerators only output RDD[Double]. The DistributionGenerator will be

[jira] [Resolved] (SPARK-2800) Exclude scalastyle-output.xml Apache RAT checks

2014-08-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2800. Resolution: Fixed Issue resolved by pull request 1729

[jira] [Commented] (SPARK-1580) [MLlib] ALS: Estimate communication and computation costs given a partitioner

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083305#comment-14083305 ] Apache Spark commented on SPARK-1580: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-2801) Generalize RandomRDD Generator output to generic type

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083311#comment-14083311 ] Apache Spark commented on SPARK-2801: - User 'brkyvz' has created a pull request for

[jira] [Resolved] (SPARK-2438) Streaming + MLLib

2014-08-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2438. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1361

[jira] [Commented] (SPARK-2515) Hypothesis testing

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083330#comment-14083330 ] Apache Spark commented on SPARK-2515: - User 'dorx' has created a pull request for this

[jira] [Resolved] (SPARK-2550) Support regularization and intercept in pyspark's linear methods

2014-08-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2550. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1624

[jira] [Commented] (SPARK-2454) Separate driver spark home from executor spark home

2014-08-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083379#comment-14083379 ] Apache Spark commented on SPARK-2454: - User 'andrewor14' has created a pull request