[jira] [Resolved] (SPARK-8074) Parquet should throw AnalysisException during setup for data type/name related failures

2015-06-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-8074. Resolution: Fixed Fix Version/s: 1.5.0 1.4.1 Parquet should throw

[jira] [Commented] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-06-03 Thread Nicolas PHUNG (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571609#comment-14571609 ] Nicolas PHUNG commented on SPARK-7122: -- Platon, I have tried with a 10 seconds batch

[jira] [Created] (SPARK-8087) PipelineModel.copy didn't copy the stages

2015-06-03 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-8087: Summary: PipelineModel.copy didn't copy the stages Key: SPARK-8087 URL: https://issues.apache.org/jira/browse/SPARK-8087 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-8085) Pass in user-specified schema in read.df

2015-06-03 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-8085: Summary: Pass in user-specified schema in read.df Key: SPARK-8085 URL: https://issues.apache.org/jira/browse/SPARK-8085 Project: Spark Issue

[jira] [Commented] (SPARK-8062) NullPointerException in SparkHadoopUtil.getFileSystemThreadStatistics

2015-06-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571546#comment-14571546 ] Josh Rosen commented on SPARK-8062: --- While working to try to reproduce this bug, I

[jira] [Commented] (SPARK-7857) IDF w/ minDocFreq on SparseVectors results in literal zeros

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571613#comment-14571613 ] Joseph K. Bradley commented on SPARK-7857: -- Do you mean the idf minimum document

[jira] [Updated] (SPARK-8086) [INVALID] InputOutputMetricsSuite should not call side-effecting getFSBytesWrittenOnThreadCallback to detect whether we're running on Hadoop 2.5+

2015-06-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-8086: -- Summary: [INVALID] InputOutputMetricsSuite should not call side-effecting

[jira] [Resolved] (SPARK-8086) InputOutputMetricsSuite should not call side-effecting getFSBytesWrittenOnThreadCallback to detect whether we're running on Hadoop 2.5+

2015-06-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-8086. --- Resolution: Invalid I'm closing this as Invalid for now, since it turns out that I wasn't paying

[jira] [Closed] (SPARK-8012) ArrayIndexOutOfBoundsException in SerializationDebugger

2015-06-03 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das closed SPARK-8012. Resolution: Duplicate ArrayIndexOutOfBoundsException in SerializationDebugger

[jira] [Created] (SPARK-8086) InputOutputMetricsSuite should not call side-effecting getFSBytesWrittenOnThreadCallback to detect whether we're running on Hadoop 2.5+

2015-06-03 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-8086: - Summary: InputOutputMetricsSuite should not call side-effecting getFSBytesWrittenOnThreadCallback to detect whether we're running on Hadoop 2.5+ Key: SPARK-8086 URL:

[jira] [Commented] (SPARK-8062) NullPointerException in SparkHadoopUtil.getFileSystemThreadStatistics

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571620#comment-14571620 ] Apache Spark commented on SPARK-8062: - User 'JoshRosen' has created a pull request for

[jira] [Assigned] (SPARK-8062) NullPointerException in SparkHadoopUtil.getFileSystemThreadStatistics

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8062: --- Assignee: (was: Apache Spark) NullPointerException in

[jira] [Assigned] (SPARK-8085) Pass in user-specified schema in read.df

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8085: --- Assignee: Apache Spark Pass in user-specified schema in read.df

[jira] [Commented] (SPARK-8085) Pass in user-specified schema in read.df

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571622#comment-14571622 ] Apache Spark commented on SPARK-8085: - User 'shivaram' has created a pull request for

[jira] [Commented] (SPARK-8062) NullPointerException in SparkHadoopUtil.getFileSystemThreadStatistics

2015-06-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571566#comment-14571566 ] Josh Rosen commented on SPARK-8062: --- Alright, I've filed

[jira] [Updated] (SPARK-8063) Spark master URL conflict between MASTER env variable and --master command line option

2015-06-03 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-8063: - Assignee: Sun Rui Spark master URL conflict between MASTER env variable and

[jira] [Created] (SPARK-8084) SparkR install script should fail with error if any packages required are not found

2015-06-03 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-8084: Summary: SparkR install script should fail with error if any packages required are not found Key: SPARK-8084 URL: https://issues.apache.org/jira/browse/SPARK-8084

[jira] [Commented] (SPARK-7857) IDF w/ minDocFreq on SparseVectors results in literal zeros

2015-06-03 Thread Karl Higley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571641#comment-14571641 ] Karl Higley commented on SPARK-7857: That does seem like the intent of the test. I

[jira] [Reopened] (SPARK-8052) Hive on Spark: CAST string AS BIGINT produces wrong value

2015-06-03 Thread Andrey Kurochkin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrey Kurochkin reopened SPARK-8052: - spark-sql bug is not a hive bug, can you please open a jira for spark project? Hive on

[jira] [Commented] (SPARK-8069) Add support for cutoff to RandomForestClassifier

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571549#comment-14571549 ] Joseph K. Bradley commented on SPARK-8069: -- For others who look at this JIRA, the

[jira] [Assigned] (SPARK-8062) NullPointerException in SparkHadoopUtil.getFileSystemThreadStatistics

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8062: --- Assignee: Apache Spark NullPointerException in

[jira] [Assigned] (SPARK-8085) Pass in user-specified schema in read.df

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8085: --- Assignee: (was: Apache Spark) Pass in user-specified schema in read.df

[jira] [Assigned] (SPARK-6777) Implement backwards-compatibility rules in Parquet schema converters

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6777: --- Assignee: (was: Apache Spark) Implement backwards-compatibility rules in Parquet schema

[jira] [Resolved] (SPARK-8063) Spark master URL conflict between MASTER env variable and --master command line option

2015-06-03 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-8063. -- Resolution: Fixed Fix Version/s: 1.4.1 1.5.0 Issue

[jira] [Commented] (SPARK-8078) Spark MLlib Decision Trees Improvement

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571544#comment-14571544 ] Joseph K. Bradley commented on SPARK-8078: -- [~yang] I don't have a problem

[jira] [Commented] (SPARK-8084) SparkR install script should fail with error if any packages required are not found

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571705#comment-14571705 ] Apache Spark commented on SPARK-8084: - User 'shivaram' has created a pull request for

[jira] [Assigned] (SPARK-8084) SparkR install script should fail with error if any packages required are not found

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8084: --- Assignee: Apache Spark SparkR install script should fail with error if any packages

[jira] [Updated] (SPARK-8083) Fix return to drivers link in Mesos driver page

2015-06-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8083: - Target Version/s: 1.4.1, 1.5.0 Fix return to drivers link in Mesos driver page

[jira] [Updated] (SPARK-8083) Fix return to drivers link in Mesos driver page

2015-06-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8083: - Affects Version/s: 1.4.0 Fix return to drivers link in Mesos driver page

[jira] [Commented] (SPARK-8089) Operations on RDD cause empty collections

2015-06-03 Thread Malte Buecken (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571774#comment-14571774 ] Malte Buecken commented on SPARK-8089: -- mea culpa. there was a cassandra connector

[jira] [Assigned] (SPARK-8087) PipelineModel.copy didn't copy the stages

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8087: --- Assignee: Xiangrui Meng (was: Apache Spark) PipelineModel.copy didn't copy the stages

[jira] [Commented] (SPARK-8087) PipelineModel.copy didn't copy the stages

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571670#comment-14571670 ] Apache Spark commented on SPARK-8087: - User 'mengxr' has created a pull request for

[jira] [Assigned] (SPARK-8087) PipelineModel.copy didn't copy the stages

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8087: --- Assignee: Apache Spark (was: Xiangrui Meng) PipelineModel.copy didn't copy the stages

[jira] [Created] (SPARK-8088) ExecutionAllocationManager spamming INFO logs about Lowering target number of executors

2015-06-03 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-8088: Summary: ExecutionAllocationManager spamming INFO logs about Lowering target number of executors Key: SPARK-8088 URL: https://issues.apache.org/jira/browse/SPARK-8088

[jira] [Closed] (SPARK-8089) Operations on RDD cause empty collections

2015-06-03 Thread Malte Buecken (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Malte Buecken closed SPARK-8089. Resolution: Invalid Operations on RDD cause empty collections

[jira] [Updated] (SPARK-8088) ExecutionAllocationManager spamming INFO logs about Lowering target number of executors

2015-06-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8088: - Assignee: Ryan Williams ExecutionAllocationManager spamming INFO logs about Lowering target number of

[jira] [Created] (SPARK-8091) SerializableDebugger does not handle classes with writeObject method

2015-06-03 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-8091: Summary: SerializableDebugger does not handle classes with writeObject method Key: SPARK-8091 URL: https://issues.apache.org/jira/browse/SPARK-8091 Project: Spark

[jira] [Updated] (SPARK-8083) Fix return to drivers link in Mesos driver page

2015-06-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8083: - Assignee: Timothy Chen Fix return to drivers link in Mesos driver page

[jira] [Commented] (SPARK-7857) IDF w/ minDocFreq on SparseVectors results in literal zeros

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571779#comment-14571779 ] Joseph K. Bradley commented on SPARK-7857: -- Sure, please do! IDF w/ minDocFreq

[jira] [Updated] (SPARK-7739) Improve ChiSqSelector example code in the user guide

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7739: - Labels: starter (was: ) Improve ChiSqSelector example code in the user guide

[jira] [Created] (SPARK-8089) Operations on RDD cause empty collections

2015-06-03 Thread Malte Buecken (JIRA)
Malte Buecken created SPARK-8089: Summary: Operations on RDD cause empty collections Key: SPARK-8089 URL: https://issues.apache.org/jira/browse/SPARK-8089 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-8084) SparkR install script should fail with error if any packages required are not found

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8084: --- Assignee: (was: Apache Spark) SparkR install script should fail with error if any

[jira] [Commented] (SPARK-8089) Operations on RDD cause empty collections

2015-06-03 Thread Malte Buecken (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571718#comment-14571718 ] Malte Buecken commented on SPARK-8089: -- That's what I thought, but I did not assembly

[jira] [Closed] (SPARK-8083) Fix return to drivers link in Mesos driver page

2015-06-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-8083. Resolution: Fixed Fix Version/s: 1.5.0 1.4.1 Fix return to drivers link in Mesos

[jira] [Updated] (SPARK-8083) Fix return to drivers link in Mesos driver page

2015-06-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8083: - Component/s: Web UI Mesos Fix return to drivers link in Mesos driver page

[jira] [Assigned] (SPARK-7180) SerializationDebugger fails with ArrayOutOfBoundsException

2015-06-03 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das reassigned SPARK-7180: Assignee: Tathagata Das SerializationDebugger fails with ArrayOutOfBoundsException

[jira] [Resolved] (SPARK-8051) StringIndexerModel (and other models) shouldn't complain if the input column is missing.

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-8051. -- Resolution: Fixed Fix Version/s: 1.4.1 1.5.0 Issue resolved

[jira] [Commented] (SPARK-7536) Audit MLlib Python API for 1.4

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571782#comment-14571782 ] Joseph K. Bradley commented on SPARK-7536: -- Ping! We need to wrap this up ASAP.

[jira] [Commented] (SPARK-8089) Operations on RDD cause empty collections

2015-06-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571700#comment-14571700 ] Sean Owen commented on SPARK-8089: -- Surely this is some problem specific to your env or

[jira] [Assigned] (SPARK-8090) SerializationDebugger does not handle classes with writeReplace correctly

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8090: --- Assignee: Apache Spark (was: Tathagata Das) SerializationDebugger does not handle classes

[jira] [Commented] (SPARK-8090) SerializationDebugger does not handle classes with writeReplace correctly

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571723#comment-14571723 ] Apache Spark commented on SPARK-8090: - User 'tdas' has created a pull request for this

[jira] [Assigned] (SPARK-8090) SerializationDebugger does not handle classes with writeReplace correctly

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8090: --- Assignee: Tathagata Das (was: Apache Spark) SerializationDebugger does not handle classes

[jira] [Closed] (SPARK-8059) Reduce latency between executor requests and RM heartbeat

2015-06-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-8059. Resolution: Fixed Fix Version/s: 1.5.0 Assignee: Marcelo Vanzin Target

[jira] [Commented] (SPARK-6071) ALS doc example fails randomly in PythonAccumulatorParam

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571805#comment-14571805 ] Joseph K. Bradley commented on SPARK-6071: -- I'm going to close this since I

[jira] [Updated] (SPARK-7989) Fix flaky tests in ExternalShuffleServiceSuite and SparkListenerWithClusterSuite

2015-06-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7989: - Priority: Major (was: Minor) Fix flaky tests in ExternalShuffleServiceSuite and

[jira] [Resolved] (SPARK-7980) Support SQLContext.range(end)

2015-06-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7980. Resolution: Fixed Fix Version/s: 1.5.0 1.4.1 I'm having trouble with

[jira] [Updated] (SPARK-7989) Fix flaky tests in ExternalShuffleServiceSuite and SparkListenerWithClusterSuite

2015-06-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7989: - Assignee: Shixiong Zhu Fix flaky tests in ExternalShuffleServiceSuite and

[jira] [Commented] (SPARK-8088) ExecutionAllocationManager spamming INFO logs about Lowering target number of executors

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571714#comment-14571714 ] Apache Spark commented on SPARK-8088: - User 'ryan-williams' has created a pull request

[jira] [Assigned] (SPARK-8088) ExecutionAllocationManager spamming INFO logs about Lowering target number of executors

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8088: --- Assignee: Apache Spark ExecutionAllocationManager spamming INFO logs about Lowering target

[jira] [Updated] (SPARK-8059) Reduce latency between executor requests and RM heartbeat

2015-06-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8059: - Affects Version/s: (was: 1.4.0) 1.5.0 Reduce latency between executor

[jira] [Updated] (SPARK-7583) User guide update for RegexTokenizer

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7583: - Assignee: (was: Augustin Borsu) User guide update for RegexTokenizer

[jira] [Commented] (SPARK-7583) User guide update for RegexTokenizer

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571786#comment-14571786 ] Joseph K. Bradley commented on SPARK-7583: -- I'll remove you from Assignee, but

[jira] [Updated] (SPARK-7013) Add unit test for spark.ml StandardScaler

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7013: - Labels: starter (was: ) Add unit test for spark.ml StandardScaler

[jira] [Created] (SPARK-8093) Failure to save empty json object as parquet

2015-06-03 Thread Harish Butani (JIRA)
Harish Butani created SPARK-8093: Summary: Failure to save empty json object as parquet Key: SPARK-8093 URL: https://issues.apache.org/jira/browse/SPARK-8093 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-8093) Failure to save empty json object as parquet

2015-06-03 Thread Harish Butani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Harish Butani updated SPARK-8093: - Attachment: t1.json Failure to save empty json object as parquet

[jira] [Assigned] (SPARK-8092) OneVsRest doesn't allow flexibility in label/ feature column renaming

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8092: --- Assignee: Apache Spark (was: Ram Sriharsha) OneVsRest doesn't allow flexibility in label/

[jira] [Commented] (SPARK-8092) OneVsRest doesn't allow flexibility in label/ feature column renaming

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571930#comment-14571930 ] Apache Spark commented on SPARK-8092: - User 'harsha2010' has created a pull request

[jira] [Assigned] (SPARK-8092) OneVsRest doesn't allow flexibility in label/ feature column renaming

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8092: --- Assignee: Ram Sriharsha (was: Apache Spark) OneVsRest doesn't allow flexibility in label/

[jira] [Resolved] (SPARK-8054) Java compatibility fixes for MLlib 1.4

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-8054. -- Resolution: Fixed Fix Version/s: 1.4.1 1.5.0 Issue resolved

[jira] [Commented] (SPARK-7180) SerializationDebugger fails with ArrayOutOfBoundsException

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571722#comment-14571722 ] Apache Spark commented on SPARK-7180: - User 'tdas' has created a pull request for this

[jira] [Assigned] (SPARK-7180) SerializationDebugger fails with ArrayOutOfBoundsException

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7180: --- Assignee: (was: Apache Spark) SerializationDebugger fails with

[jira] [Assigned] (SPARK-7180) SerializationDebugger fails with ArrayOutOfBoundsException

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7180: --- Assignee: Apache Spark SerializationDebugger fails with ArrayOutOfBoundsException

[jira] [Closed] (SPARK-7989) Fix flaky tests in ExternalShuffleServiceSuite and SparkListenerWithClusterSuite

2015-06-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7989. Resolution: Fixed Fix Version/s: 1.5.0 1.4.1 Target Version/s: 1.4.1,

[jira] [Commented] (SPARK-8018) KMeans should accept initial cluster centers as param

2015-06-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571757#comment-14571757 ] Xiangrui Meng commented on SPARK-8018: -- Sounds good to me. I was thinking about make

[jira] [Updated] (SPARK-6164) CrossValidatorModel should keep stats from fitting

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6164: - Assignee: Leah McGuire CrossValidatorModel should keep stats from fitting

[jira] [Resolved] (SPARK-6164) CrossValidatorModel should keep stats from fitting

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-6164. -- Resolution: Fixed Fix Version/s: 1.5.0 CrossValidatorModel should keep stats

[jira] [Commented] (SPARK-7653) ML Pipeline and meta-algs should take random seed param

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571798#comment-14571798 ] Joseph K. Bradley commented on SPARK-7653: -- Update: We should not really test for

[jira] [Commented] (SPARK-7857) IDF w/ minDocFreq on SparseVectors results in literal zeros

2015-06-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571662#comment-14571662 ] Joseph K. Bradley commented on SPARK-7857: -- I looked again, and I agree the test

[jira] [Commented] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-06-03 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571675#comment-14571675 ] Cody Koeninger commented on SPARK-7122: --- Try doing something very straightforward

[jira] [Created] (SPARK-8090) SerializationDebugger does not handle classes with writeReplace correctly

2015-06-03 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-8090: Summary: SerializationDebugger does not handle classes with writeReplace correctly Key: SPARK-8090 URL: https://issues.apache.org/jira/browse/SPARK-8090 Project:

[jira] [Assigned] (SPARK-8088) ExecutionAllocationManager spamming INFO logs about Lowering target number of executors

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8088: --- Assignee: (was: Apache Spark) ExecutionAllocationManager spamming INFO logs about

[jira] [Commented] (SPARK-7857) IDF w/ minDocFreq on SparseVectors results in literal zeros

2015-06-03 Thread Karl Higley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571771#comment-14571771 ] Karl Higley commented on SPARK-7857: Ah, okay, that makes more sense. Seems like a

[jira] [Created] (SPARK-8077) Optimisation of TreeNode for large number of children

2015-06-03 Thread Mick Davies (JIRA)
Mick Davies created SPARK-8077: -- Summary: Optimisation of TreeNode for large number of children Key: SPARK-8077 URL: https://issues.apache.org/jira/browse/SPARK-8077 Project: Spark Issue Type:

[jira] [Commented] (SPARK-8018) KMeans should accept initial cluster centers as param

2015-06-03 Thread Meethu Mathew (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14570620#comment-14570620 ] Meethu Mathew commented on SPARK-8018: -- [~josephkb] For initialization using an

[jira] [Resolved] (SPARK-8040) Remove Debian specific loopback address setting code

2015-06-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8040. -- Resolution: Not A Problem This is an issue with your local environment's DNS then. I don't even know if

[jira] [Commented] (SPARK-7993) Improve DataFrame.show() output

2015-06-03 Thread Akhil Thatipamula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14570704#comment-14570704 ] Akhil Thatipamula commented on SPARK-7993: -- [~rxin] I have come up with 2

[jira] [Created] (SPARK-8074) Parquet should throw AnalysisException during setup for data type/name related failures

2015-06-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-8074: -- Summary: Parquet should throw AnalysisException during setup for data type/name related failures Key: SPARK-8074 URL: https://issues.apache.org/jira/browse/SPARK-8074

[jira] [Commented] (SPARK-7541) Check model save/load for MLlib 1.4

2015-06-03 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14570527#comment-14570527 ] yuhao yang commented on SPARK-7541: --- I find no more issues. Check model save/load for

[jira] [Resolved] (SPARK-8073) Directory traversal vulnerability

2015-06-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8073. -- Resolution: Fixed Fix Version/s: 1.4.0 This is already fixed. Directory traversal

[jira] [Created] (SPARK-8071) Run PySpark dataframe.rollup/cube test failed

2015-06-03 Thread Weizhong (JIRA)
Weizhong created SPARK-8071: --- Summary: Run PySpark dataframe.rollup/cube test failed Key: SPARK-8071 URL: https://issues.apache.org/jira/browse/SPARK-8071 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-8071) Run PySpark dataframe.rollup/cube test failed

2015-06-03 Thread Weizhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weizhong updated SPARK-8071: Description: I run test for Spark, and failed on PySpark, details are: File

[jira] [Comment Edited] (SPARK-8008) JDBC data source can overload the external database system due to high concurrency

2015-06-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14570450#comment-14570450 ] Reynold Xin edited comment on SPARK-8008 at 6/3/15 8:08 AM:

[jira] [Updated] (SPARK-8071) Run PySpark dataframe.rollup/cube test failed

2015-06-03 Thread Weizhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weizhong updated SPARK-8071: Environment: OS: SUSE 11 SP3; JDK: 1.8.0_40; Python: 2.6.8; Hadoop: 2.7.0; Spark: master branch was:

[jira] [Commented] (SPARK-8008) JDBC data source can overload the external database system due to high concurrency

2015-06-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14570450#comment-14570450 ] Reynold Xin commented on SPARK-8008: [~rtreffer] can you submit a patch to the jdbc

[jira] [Updated] (SPARK-7983) Add require for one-based indices in loadLibSVMFile

2015-06-03 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-7983: -- Priority: Minor (was: Trivial) Add require for one-based indices in loadLibSVMFile

[jira] [Updated] (SPARK-8071) Run PySpark dataframe.rollup/cube test failed

2015-06-03 Thread Weizhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weizhong updated SPARK-8071: Description: I run test for Spark, and failed on PySpark, details are: File

[jira] [Commented] (SPARK-7988) Mechanism to control receiver scheduling

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14570438#comment-14570438 ] Apache Spark commented on SPARK-7988: - User 'nishkamravi2' has created a pull request

[jira] [Assigned] (SPARK-7988) Mechanism to control receiver scheduling

2015-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7988: --- Assignee: Apache Spark Mechanism to control receiver scheduling

[jira] [Updated] (SPARK-5463) Fix Parquet filter push-down

2015-06-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5463: --- Priority: Blocker (was: Critical) Fix Parquet filter push-down

[jira] [Created] (SPARK-8070) Improve createDataFrame in Python

2015-06-03 Thread Davies Liu (JIRA)
Davies Liu created SPARK-8070: - Summary: Improve createDataFrame in Python Key: SPARK-8070 URL: https://issues.apache.org/jira/browse/SPARK-8070 Project: Spark Issue Type: Improvement

  1   2   3   >