[jira] [Commented] (SPARK-7127) Broadcast spark.ml tree ensemble models for predict

2015-05-04 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526322#comment-14526322 ] Bryan Cutler commented on SPARK-7127: - I have a couple questions [~josephkb] to make

[jira] [Updated] (SPARK-7322) Add DataFrame DSL for window function support

2015-05-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7322: --- Description: class Column { ... def over(): WindowFunctionSpec ... } // By default frame =

[jira] [Updated] (SPARK-7322) Add DataFrame DSL for window function support

2015-05-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7322: --- Description: Here's a proposal for supporting window functions in the DataFrame DSL: 1. Add an over

[jira] [Updated] (SPARK-7320) Add rollup and cube support to DataFrame DSL

2015-05-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7320: --- Description: We should add two functions to GroupedData in order to support rollup and cube for the

[jira] [Assigned] (SPARK-6940) PySpark CrossValidator

2015-05-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-6940: Assignee: Xiangrui Meng PySpark CrossValidator

[jira] [Resolved] (SPARK-5563) LDA with online variational inference

2015-05-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-5563. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4419

[jira] [Created] (SPARK-7333) PySpark BinaryClassificationEvaluator

2015-05-04 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7333: Summary: PySpark BinaryClassificationEvaluator Key: SPARK-7333 URL: https://issues.apache.org/jira/browse/SPARK-7333 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-7322) Add DataFrame DSL for window function support

2015-05-04 Thread Olivier Girardot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526338#comment-14526338 ] Olivier Girardot commented on SPARK-7322: - Great ! Add DataFrame DSL for window

[jira] [Updated] (SPARK-7333) PySpark BinaryClassificationEvaluator

2015-05-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7333: - Issue Type: New Feature (was: Bug) PySpark BinaryClassificationEvaluator

[jira] [Assigned] (SPARK-7333) PySpark BinaryClassificationEvaluator

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7333: --- Assignee: Apache Spark (was: Xiangrui Meng) PySpark BinaryClassificationEvaluator

[jira] [Commented] (SPARK-7333) PySpark BinaryClassificationEvaluator

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526398#comment-14526398 ] Apache Spark commented on SPARK-7333: - User 'mengxr' has created a pull request for

[jira] [Assigned] (SPARK-7333) PySpark BinaryClassificationEvaluator

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7333: --- Assignee: Xiangrui Meng (was: Apache Spark) PySpark BinaryClassificationEvaluator

[jira] [Updated] (SPARK-5956) Transformer/Estimator should be copyable.

2015-05-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5956: - Priority: Blocker (was: Major) Transformer/Estimator should be copyable.

[jira] [Created] (SPARK-7334) Implement RandomProjection for Dimensionality Reduction

2015-05-04 Thread Sebastian Alfers (JIRA)
Sebastian Alfers created SPARK-7334: --- Summary: Implement RandomProjection for Dimensionality Reduction Key: SPARK-7334 URL: https://issues.apache.org/jira/browse/SPARK-7334 Project: Spark

[jira] [Updated] (SPARK-7334) Implement RandomProjection for Dimensionality Reduction

2015-05-04 Thread Sebastian Alfers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Alfers updated SPARK-7334: Description: Implement RandomProjection (RP) for dimensionality reduction RP is a popular

[jira] [Resolved] (SPARK-5100) Spark Thrift server monitor page

2015-05-04 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-5100. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5730

[jira] [Created] (SPARK-7335) Submitting a query to Thrift Server occurs error: java.lang.IllegalStateException: unread block data

2015-05-04 Thread meiyoula (JIRA)
meiyoula created SPARK-7335: --- Summary: Submitting a query to Thrift Server occurs error: java.lang.IllegalStateException: unread block data Key: SPARK-7335 URL: https://issues.apache.org/jira/browse/SPARK-7335

[jira] [Commented] (SPARK-7335) Submitting a query to Thrift Server occurs error: java.lang.IllegalStateException: unread block data

2015-05-04 Thread meiyoula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526439#comment-14526439 ] meiyoula commented on SPARK-7335: - It appears frequently. Below is my query command:

[jira] [Updated] (SPARK-7286) Precedence of operator not behaving properly

2015-05-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7286: --- Priority: Critical (was: Major) Precedence of operator not behaving properly

[jira] [Updated] (SPARK-7236) AkkaUtils askWithReply sleeps indefinitely when a timeout exception is thrown

2015-05-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7236: --- Priority: Critical (was: Trivial) AkkaUtils askWithReply sleeps indefinitely when a timeout

[jira] [Created] (SPARK-7336) Sometimes the status of finished job show on JobHistory UI will be active, and never update.

2015-05-04 Thread ShaoChuan (JIRA)
ShaoChuan created SPARK-7336: Summary: Sometimes the status of finished job show on JobHistory UI will be active, and never update. Key: SPARK-7336 URL: https://issues.apache.org/jira/browse/SPARK-7336

[jira] [Commented] (SPARK-1437) Jenkins should build with Java 6

2015-05-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526474#comment-14526474 ] Patrick Wendell commented on SPARK-1437: Hey [~shaneknapp] we had punted on this

[jira] [Updated] (SPARK-1437) Jenkins should build with Java 6

2015-05-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1437: --- Assignee: shane knapp Jenkins should build with Java 6

[jira] [Updated] (SPARK-7336) Sometimes the status of finished job show on JobHistory UI will be active, and never update.

2015-05-04 Thread ShaoChuan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ShaoChuan updated SPARK-7336: - Description: When I run a SparkPi job, the status of the job on JobHistory UI was 'active'. After the

[jira] [Created] (SPARK-7337) FPGrowth algo throwing OutOfMemoryError

2015-05-04 Thread Amit Gupta (JIRA)
Amit Gupta created SPARK-7337: - Summary: FPGrowth algo throwing OutOfMemoryError Key: SPARK-7337 URL: https://issues.apache.org/jira/browse/SPARK-7337 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-7337) FPGrowth algo throwing OutOfMemoryError

2015-05-04 Thread Amit Gupta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amit Gupta updated SPARK-7337: -- Attachment: FPGrowthBug.png FPGrowth algo throwing OutOfMemoryError

[jira] [Updated] (SPARK-7337) FPGrowth algo throwing OutOfMemoryError

2015-05-04 Thread Amit Gupta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amit Gupta updated SPARK-7337: -- Description: When running FPGrowth algo with huge data in GBs and with numPartitions=500 then after

[jira] [Created] (SPARK-7338) Survival Modelling - Cox proportional hazards

2015-05-04 Thread Amit Gupta (JIRA)
Amit Gupta created SPARK-7338: - Summary: Survival Modelling - Cox proportional hazards Key: SPARK-7338 URL: https://issues.apache.org/jira/browse/SPARK-7338 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-7336) Sometimes the status of finished job show on JobHistory UI will be active, and never update.

2015-05-04 Thread ShaoChuan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526498#comment-14526498 ] ShaoChuan commented on SPARK-7336: -- There is a timer in 'FsHistoryProvider', and it will

[jira] [Assigned] (SPARK-7336) Sometimes the status of finished job show on JobHistory UI will be active, and never update.

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7336: --- Assignee: Apache Spark Sometimes the status of finished job show on JobHistory UI will be

[jira] [Commented] (SPARK-7336) Sometimes the status of finished job show on JobHistory UI will be active, and never update.

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526500#comment-14526500 ] Apache Spark commented on SPARK-7336: - User 'ArcherShao' has created a pull request

[jira] [Assigned] (SPARK-7336) Sometimes the status of finished job show on JobHistory UI will be active, and never update.

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7336: --- Assignee: (was: Apache Spark) Sometimes the status of finished job show on JobHistory

[jira] [Commented] (SPARK-7338) Survival Modelling - Cox proportional hazards

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526529#comment-14526529 ] Sean Owen commented on SPARK-7338: -- [~amit.gupta.niit-tech] Please review

[jira] [Resolved] (SPARK-7337) FPGrowth algo throwing OutOfMemoryError

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7337. -- Resolution: Invalid There's no info here on your input, your settings, your cluster, or the source of

[jira] [Commented] (SPARK-7337) FPGrowth algo throwing OutOfMemoryError

2015-05-04 Thread Amit Gupta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526536#comment-14526536 ] Amit Gupta commented on SPARK-7337: --- I am running it in local mode. It should spill over

[jira] [Reopened] (SPARK-7337) FPGrowth algo throwing OutOfMemoryError

2015-05-04 Thread Amit Gupta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amit Gupta reopened SPARK-7337: --- I am running it in local mode and using Java API. It should spill over hard-disk. I can clearly see that

[jira] [Commented] (SPARK-7338) Survival Modelling - Cox proportional hazards

2015-05-04 Thread Amit Gupta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526544#comment-14526544 ] Amit Gupta commented on SPARK-7338: --- Be widely known: Implemented by well known open

[jira] [Updated] (SPARK-7334) Implement RandomProjection for Dimensionality Reduction

2015-05-04 Thread Sebastian Alfers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Alfers updated SPARK-7334: Target Version/s: (was: 1.3.1) Implement RandomProjection for Dimensionality Reduction

[jira] [Comment Edited] (SPARK-7337) FPGrowth algo throwing OutOfMemoryError

2015-05-04 Thread Amit Gupta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526539#comment-14526539 ] Amit Gupta edited comment on SPARK-7337 at 5/4/15 11:36 AM: I

[jira] [Updated] (SPARK-4922) Support dynamic allocation for coarse-grained Mesos

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4922: - Target Version/s: (was: 1.3.0) Support dynamic allocation for coarse-grained Mesos

[jira] [Updated] (SPARK-4763) All-pairs shortest paths algorithm

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4763: - Target Version/s: (was: 1.3.0) All-pairs shortest paths algorithm

[jira] [Updated] (SPARK-4991) Worker should reconnect to Master when Master actor restart

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4991: - Target Version/s: (was: 1.3.0) Worker should reconnect to Master when Master actor restart

[jira] [Updated] (SPARK-3682) Add helpful warnings to the UI

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3682: - Target Version/s: (was: 1.3.0) Add helpful warnings to the UI

[jira] [Updated] (SPARK-3000) drop old blocks to disk in parallel when memory is not large enough for caching new blocks

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3000: - Target Version/s: (was: 1.3.0) drop old blocks to disk in parallel when memory is not large enough for

[jira] [Updated] (SPARK-5197) Support external shuffle service in fine-grained mode on mesos cluster

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5197: - Target Version/s: (was: 1.3.0) Support external shuffle service in fine-grained mode on mesos cluster

[jira] [Updated] (SPARK-4751) Support dynamic allocation for standalone mode

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4751: - Target Version/s: (was: 1.3.0) Support dynamic allocation for standalone mode

[jira] [Updated] (SPARK-3650) Triangle Count handles reverse edges incorrectly

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3650: - Target Version/s: (was: 1.3.0) Triangle Count handles reverse edges incorrectly

[jira] [Updated] (SPARK-5610) Generate Java docs without package private classes and methods

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5610: - Target Version/s: (was: 1.3.0) Generate Java docs without package private classes and methods

[jira] [Updated] (SPARK-5980) Add GradientBoostedTrees Python examples to ML guide

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5980: - Target Version/s: (was: 1.3.0) Add GradientBoostedTrees Python examples to ML guide

[jira] [Updated] (SPARK-3963) Support getting task-scoped properties from TaskContext

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3963: - Target Version/s: (was: 1.3.0) Support getting task-scoped properties from TaskContext

[jira] [Updated] (SPARK-5077) Map output statuses can still exceed spark.akka.frameSize

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5077: - Target Version/s: (was: 1.2.1, 1.3.0) Map output statuses can still exceed spark.akka.frameSize

[jira] [Updated] (SPARK-3373) Filtering operations should optionally rebuild routing tables

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3373: - Target Version/s: (was: 1.2.1, 1.3.0) Filtering operations should optionally rebuild routing tables

[jira] [Updated] (SPARK-6509) MDLP discretizer

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6509: - Target Version/s: (was: 1.3.0) MDLP discretizer Key: SPARK-6509

[jira] [Updated] (SPARK-5362) Gradient and Optimizer to support generic output (instead of label) and data batches

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5362: - Target Version/s: (was: 1.3.0) Gradient and Optimizer to support generic output (instead of label) and

[jira] [Updated] (SPARK-4653) DAGScheduler refactoring and cleanup

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4653: - Target Version/s: (was: 1.3.0) DAGScheduler refactoring and cleanup

[jira] [Updated] (SPARK-4555) Add forward compatibility tests to JsonProtocol

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4555: - Target Version/s: (was: 1.3.0) Add forward compatibility tests to JsonProtocol

[jira] [Updated] (SPARK-5098) Number of running tasks become negative after tasks lost

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5098: - Target Version/s: (was: 1.2.1, 1.3.0) Number of running tasks become negative after tasks lost

[jira] [Updated] (SPARK-5748) Improve Vectors.sqdist implementation

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5748: - Target Version/s: (was: 1.3.0) Improve Vectors.sqdist implementation

[jira] [Updated] (SPARK-5091) Hooks for PySpark tasks

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5091: - Target Version/s: (was: 1.3.0) Hooks for PySpark tasks --- Key:

[jira] [Updated] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4352: - Target Version/s: (was: 1.3.0) Incorporate locality preferences in dynamic allocation requests

[jira] [Updated] (SPARK-5043) Implement updated Receiver API

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5043: - Target Version/s: (was: 1.3.0) Implement updated Receiver API

[jira] [Resolved] (SPARK-6771) Table alias in Spark SQL

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6771. -- Resolution: Invalid Table alias in Spark SQL Key:

[jira] [Updated] (SPARK-5160) Python module in jars

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5160: - Target Version/s: (was: 1.3.0) Python module in jars - Key:

[jira] [Updated] (SPARK-4898) Replace cloudpickle with Dill

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4898: - Target Version/s: (was: 1.3.0) Replace cloudpickle with Dill

[jira] [Updated] (SPARK-4696) Yarn: spark.driver.extra* variables not applied consistently to yarn client mode AM

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4696: - Target Version/s: (was: 1.3.0) Yarn: spark.driver.extra* variables not applied consistently to yarn

[jira] [Updated] (SPARK-4223) Support * (meaning all users) as part of the acls

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4223: - Target Version/s: (was: 1.3.0) Support * (meaning all users) as part of the acls

[jira] [Updated] (SPARK-5177) Add environment variables in dev/run-tests to enable Hive 0.12 and Scala 2.11 Jenkins builder

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5177: - Target Version/s: (was: 1.2.1, 1.3.0) Add environment variables in dev/run-tests to enable Hive 0.12

[jira] [Updated] (SPARK-5016) GaussianMixtureEM should distribute matrix inverse for large numFeatures, k

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5016: - Target Version/s: (was: 1.3.0) GaussianMixtureEM should distribute matrix inverse for large

[jira] [Updated] (SPARK-4654) Clean up DAGScheduler's getMissingParentStages() and stageDependsOn() methods

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4654: - Target Version/s: (was: 1.3.0) Clean up DAGScheduler's getMissingParentStages() and stageDependsOn()

[jira] [Updated] (SPARK-5243) Spark will hang if (driver memory + executor memory) exceeds limit on a 1-worker cluster

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5243: - Target Version/s: (was: 1.3.0) Spark will hang if (driver memory + executor memory) exceeds limit on a

[jira] [Updated] (SPARK-6359) Expose IMain binding as part of ILoop Developer API

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6359: - Target Version/s: (was: 1.3.1) Expose IMain binding as part of ILoop Developer API

[jira] [Commented] (SPARK-6360) For Spark 1.1 and 1.2, after any RDD transformations, calling saveAsParquetFile over a SchemaRDD with decimal or UDT column throws

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526554#comment-14526554 ] Sean Owen commented on SPARK-6360: -- [~lian cheng] Given the target version did you

[jira] [Updated] (SPARK-6139) Allow pre-populate sliding window with initial data

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6139: - Target Version/s: (was: 1.3.1) Allow pre-populate sliding window with initial data

[jira] [Resolved] (SPARK-5054) Missing DataType DecimalType in the API Doc

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5054. -- Resolution: Not A Problem Target Version/s: (was: 1.2.1) I believe this is, at least as of

[jira] [Created] (SPARK-7339) PySpark shuffle spill memory sometimes are not correct

2015-05-04 Thread Weizhong (JIRA)
Weizhong created SPARK-7339: --- Summary: PySpark shuffle spill memory sometimes are not correct Key: SPARK-7339 URL: https://issues.apache.org/jira/browse/SPARK-7339 Project: Spark Issue Type:

[jira] [Created] (SPARK-7340) Use latest parquet release 1.6.0 in spark

2015-05-04 Thread Yash Datta (JIRA)
Yash Datta created SPARK-7340: - Summary: Use latest parquet release 1.6.0 in spark Key: SPARK-7340 URL: https://issues.apache.org/jira/browse/SPARK-7340 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-7339) PySpark shuffle spill memory sometimes are not correct

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7339: --- Assignee: (was: Apache Spark) PySpark shuffle spill memory sometimes are not correct

[jira] [Assigned] (SPARK-7339) PySpark shuffle spill memory sometimes are not correct

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7339: --- Assignee: Apache Spark PySpark shuffle spill memory sometimes are not correct

[jira] [Commented] (SPARK-7339) PySpark shuffle spill memory sometimes are not correct

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526580#comment-14526580 ] Apache Spark commented on SPARK-7339: - User 'Sephiroth-Lin' has created a pull request

[jira] [Assigned] (SPARK-7340) Use latest parquet release 1.6.0 in spark

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7340: --- Assignee: Apache Spark Use latest parquet release 1.6.0 in spark

[jira] [Commented] (SPARK-7340) Use latest parquet release 1.6.0 in spark

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526583#comment-14526583 ] Apache Spark commented on SPARK-7340: - User 'saucam' has created a pull request for

[jira] [Resolved] (SPARK-7340) Use latest parquet release 1.6.0 in spark

2015-05-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7340. -- Resolution: Duplicate Fix Version/s: (was: 1.4.0) [~saucam] Please don't set Fix Version.

[jira] [Commented] (SPARK-1867) Spark Documentation Error causes java.lang.IllegalStateException: unread block data

2015-05-04 Thread Marc Reichman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526627#comment-14526627 ] Marc Reichman commented on SPARK-1867: -- I'm running into the same problem, and have

[jira] [Assigned] (SPARK-6566) Update Spark to use the latest version of Parquet libraries

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6566: --- Assignee: (was: Apache Spark) Update Spark to use the latest version of Parquet

[jira] [Commented] (SPARK-6566) Update Spark to use the latest version of Parquet libraries

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14526736#comment-14526736 ] Apache Spark commented on SPARK-6566: - User 'saucam' has created a pull request for

[jira] [Assigned] (SPARK-6566) Update Spark to use the latest version of Parquet libraries

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6566: --- Assignee: Apache Spark Update Spark to use the latest version of Parquet libraries

[jira] [Assigned] (SPARK-6267) Python API for IsotonicRegression

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6267: --- Assignee: Apache Spark (was: Yanbo Liang) Python API for IsotonicRegression

[jira] [Assigned] (SPARK-6267) Python API for IsotonicRegression

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6267: --- Assignee: Yanbo Liang (was: Apache Spark) Python API for IsotonicRegression

[jira] [Commented] (SPARK-1867) Spark Documentation Error causes java.lang.IllegalStateException: unread block data

2015-05-04 Thread Marc Reichman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14526767#comment-14526767 ] Marc Reichman commented on SPARK-1867: -- Leaving a follow-up: I was able to get this

[jira] [Commented] (SPARK-7275) Make LogicalRelation public

2015-05-04 Thread Glenn Weidner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14526799#comment-14526799 ] Glenn Weidner commented on SPARK-7275: -- Santiago M. Mola - can you provide example of

[jira] [Commented] (SPARK-1437) Jenkins should build with Java 6

2015-05-04 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14526899#comment-14526899 ] shane knapp commented on SPARK-1437: shouldn't be too much work... i can get a 1.6

[jira] [Commented] (SPARK-7013) Add unit test for spark.ml StandardScaler

2015-05-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14526907#comment-14526907 ] Joseph K. Bradley commented on SPARK-7013: -- Great, please go ahead! Also, I

[jira] [Commented] (SPARK-7127) Broadcast spark.ml tree ensemble models for predict

2015-05-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14526943#comment-14526943 ] Joseph K. Bradley commented on SPARK-7127: -- [~bryanc] 1. Broadcasting ensures

[jira] [Updated] (SPARK-7127) Broadcast spark.ml tree ensemble models for predict

2015-05-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7127: - Labels: (was: starter) Broadcast spark.ml tree ensemble models for predict

[jira] [Created] (SPARK-7341) Fix the flaky test: org.apache.spark.streaming.InputStreamsSuite.socket input stream

2015-05-04 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-7341: --- Summary: Fix the flaky test: org.apache.spark.streaming.InputStreamsSuite.socket input stream Key: SPARK-7341 URL: https://issues.apache.org/jira/browse/SPARK-7341

[jira] [Updated] (SPARK-7341) Fix the flaky test: org.apache.spark.streaming.InputStreamsSuite.socket input stream

2015-05-04 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-7341: Priority: Minor (was: Major) Fix the flaky test:

[jira] [Assigned] (SPARK-7341) Fix the flaky test: org.apache.spark.streaming.InputStreamsSuite.socket input stream

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7341: --- Assignee: Apache Spark Fix the flaky test:

[jira] [Commented] (SPARK-7341) Fix the flaky test: org.apache.spark.streaming.InputStreamsSuite.socket input stream

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14526962#comment-14526962 ] Apache Spark commented on SPARK-7341: - User 'zsxwing' has created a pull request for

[jira] [Assigned] (SPARK-7341) Fix the flaky test: org.apache.spark.streaming.InputStreamsSuite.socket input stream

2015-05-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7341: --- Assignee: (was: Apache Spark) Fix the flaky test:
