[jira] [Updated] (SPARK-10771) Implement the shuffle encryption with AES-CTR crypto using JCE key provider.

2015-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10771: -- Priority: Minor (was: Major) > Implement the shuffle encryption with AES-CTR crypto using JCE key

[jira] [Resolved] (SPARK-10769) Fix o.a.s.streaming.CheckpointSuite.maintains rate controller

2015-09-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-10769. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 > Fix

[jira] [Updated] (SPARK-10769) Fix o.a.s.streaming.CheckpointSuite.maintains rate controller

2015-09-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-10769: -- Assignee: Shixiong Zhu > Fix o.a.s.streaming.CheckpointSuite.maintains rate controller >

[jira] [Comment Edited] (SPARK-9798) CrossValidatorModel Documentation Improvements

2015-09-23 Thread rerngvit yanggratoke (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904075#comment-14904075 ] rerngvit yanggratoke edited comment on SPARK-9798 at 9/23/15 8:15 AM: --

[jira] [Commented] (SPARK-2737) ClassCastExceptions when collect()ing JavaRDDs' underlying Scala RDDs

2015-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904141#comment-14904141 ] Sean Owen commented on SPARK-2737: -- [~glenn.stryc...@gmail.com] you can use JIRA to link issues if you're

[jira] [Resolved] (SPARK-10224) BlockGenerator may lost data in the last block

2015-09-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-10224. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 > BlockGenerator may

[jira] [Comment Edited] (SPARK-10741) Hive Query Having/OrderBy against Parquet table is not working

2015-09-23 Thread Ian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905327#comment-14905327 ] Ian edited comment on SPARK-10741 at 9/23/15 9:36 PM: -- Yes, going through all rules

[jira] [Commented] (SPARK-10741) Hive Query Having/OrderBy against Parquet table is not working

2015-09-23 Thread Ian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905327#comment-14905327 ] Ian commented on SPARK-10741: - Yes, going through all rules when resolve Sort on Aggregate is a correct

[jira] [Created] (SPARK-10784) Flaky Streaming ML test umbrella

2015-09-23 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10784: - Summary: Flaky Streaming ML test umbrella Key: SPARK-10784 URL: https://issues.apache.org/jira/browse/SPARK-10784 Project: Spark Issue Type:

[jira] [Updated] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10086: -- Description: Here's a report on investigating test failures in StreamingKMeans in

[jira] [Updated] (SPARK-10765) use new aggregate interface for hive UDAF

2015-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-10765: - Target Version/s: 1.6.0 > use new aggregate interface for hive UDAF >

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2015-09-23 Thread Narine Kokhlikyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904913#comment-14904913 ] Narine Kokhlikyan commented on SPARK-9325: -- Hi everyone, how far are are you with this feature.

[jira] [Assigned] (SPARK-10763) Update Java MLLIB/ML tests to use simplified dataframe construction

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10763: Assignee: Apache Spark > Update Java MLLIB/ML tests to use simplified dataframe

[jira] [Updated] (SPARK-10728) Failed to set Jenkins Identity header on email.

2015-09-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10728: -- Affects Version/s: (was: 1.6.0) > Failed to set Jenkins Identity header on email. >

[jira] [Commented] (SPARK-10728) Failed to set Jenkins Identity header on email.

2015-09-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904979#comment-14904979 ] Xiangrui Meng commented on SPARK-10728: --- This is still an issue, though not high-priority. We

[jira] [Created] (SPARK-10780) Set initialModel in KMeans in Pipelines API

2015-09-23 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10780: - Summary: Set initialModel in KMeans in Pipelines API Key: SPARK-10780 URL: https://issues.apache.org/jira/browse/SPARK-10780 Project: Spark Issue

[jira] [Commented] (SPARK-10474) TungstenAggregation cannot acquire memory for pointer array after switching to sort-based

2015-09-23 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905279#comment-14905279 ] Andrew Or commented on SPARK-10474: --- Re-opening this because I found the real cause for this issue

[jira] [Assigned] (SPARK-10622) Race condition between scheduler and YARN executor status update

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10622: Assignee: (was: Apache Spark) > Race condition between scheduler and YARN executor

[jira] [Assigned] (SPARK-10622) Race condition between scheduler and YARN executor status update

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10622: Assignee: Apache Spark > Race condition between scheduler and YARN executor status update

[jira] [Commented] (SPARK-10622) Race condition between scheduler and YARN executor status update

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905285#comment-14905285 ] Apache Spark commented on SPARK-10622: -- User 'vanzin' has created a pull request for this issue:

[jira] [Updated] (SPARK-10767) Make pyspark shared params codegen more consistent

2015-09-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-10767: Description: Namely "." shows up in some places in the template when using the param docstring and not in

[jira] [Commented] (SPARK-10474) TungstenAggregation cannot acquire memory for pointer array after switching to sort-based

2015-09-23 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905299#comment-14905299 ] Andrew Or commented on SPARK-10474: --- Alright, I think this should fix it for real:

[jira] [Commented] (SPARK-8616) SQLContext doesn't handle tricky column names when loading from JDBC

2015-09-23 Thread Rick Hillegas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905409#comment-14905409 ] Rick Hillegas commented on SPARK-8616: -- The following email thread may be useful for understanding

[jira] [Updated] (SPARK-10668) Use WeightedLeastSquares in LinearRegression with L2 regularization if the number of features is small

2015-09-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10668: -- Shepherd: Xiangrui Meng > Use WeightedLeastSquares in LinearRegression with L2 regularization

[jira] [Resolved] (SPARK-10733) TungstenAggregation cannot acquire page after switching to sort-based

2015-09-23 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-10733. --- Resolution: Duplicate Looks like this is a duplicate of SPARK-10474 after all. I'm closing this...

[jira] [Reopened] (SPARK-10474) TungstenAggregation cannot acquire memory for pointer array after switching to sort-based

2015-09-23 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or reopened SPARK-10474: --- > TungstenAggregation cannot acquire memory for pointer array after switching > to sort-based >

[jira] [Commented] (SPARK-10474) TungstenAggregation cannot acquire memory for pointer array after switching to sort-based

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905291#comment-14905291 ] Apache Spark commented on SPARK-10474: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Commented] (SPARK-10767) Make pyspark shared params codegen more consistent

2015-09-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905303#comment-14905303 ] holdenk commented on SPARK-10767: - Updated the description, sorry about that. This comes from

[jira] [Commented] (SPARK-10767) Make pyspark shared params codegen more consistent

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905322#comment-14905322 ] Joseph K. Bradley commented on SPARK-10767: --- Oh, yeah, that is annoying. + 1 > Make pyspark

[jira] [Commented] (SPARK-10767) Make pyspark shared params codegen more consistent

2015-09-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905329#comment-14905329 ] holdenk commented on SPARK-10767: - My plan was to wait for that PR to go in and then do this as a quick

[jira] [Commented] (SPARK-4885) Enable fetched blocks to exceed 2 GB

2015-09-23 Thread Sai Nishanth Parepally (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905338#comment-14905338 ] Sai Nishanth Parepally commented on SPARK-4885: --- I am using spark 1.4.1 and facing the same

[jira] [Created] (SPARK-10783) Do track the pointer array in UnsafeInMemorySorter

2015-09-23 Thread Andrew Or (JIRA)
Andrew Or created SPARK-10783: - Summary: Do track the pointer array in UnsafeInMemorySorter Key: SPARK-10783 URL: https://issues.apache.org/jira/browse/SPARK-10783 Project: Spark Issue Type: Bug

[jira] [Comment Edited] (SPARK-10741) Hive Query Having/OrderBy against Parquet table is not working

2015-09-23 Thread Ian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905327#comment-14905327 ] Ian edited comment on SPARK-10741 at 9/23/15 9:45 PM: -- Yes, going through all rules

[jira] [Resolved] (SPARK-9715) Store numFeatures in all ML PredictionModel types

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-9715. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8675

[jira] [Resolved] (SPARK-10686) Add quantileCol to AFTSurvivalRegression

2015-09-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10686. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8836

[jira] [Updated] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10086: -- Description: Here's a report on investigating test failures in StreamingKMeans in

[jira] [Updated] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10086: -- Description: Here's a report on investigating test failures in StreamingKMeans in

[jira] [Created] (SPARK-10781) Allow certain number of failed tasks and allow job to succeed

2015-09-23 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-10781: - Summary: Allow certain number of failed tasks and allow job to succeed Key: SPARK-10781 URL: https://issues.apache.org/jira/browse/SPARK-10781 Project: Spark

[jira] [Commented] (SPARK-10741) Hive Query Having/OrderBy against Parquet table is not working

2015-09-23 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905166#comment-14905166 ] Yin Huai commented on SPARK-10741: -- The second options sounds better. > Hive Query Having/OrderBy

[jira] [Commented] (SPARK-10782) Duplicate examples for drop_duplicates and DropDuplicates

2015-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905180#comment-14905180 ] Sean Owen commented on SPARK-10782: --- Looks like you're right, feel free to make a PR with the correct

[jira] [Commented] (SPARK-10767) Make pyspark shared params codegen more consistent

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905196#comment-14905196 ] Joseph K. Bradley commented on SPARK-10767: --- What issues specifically? > Make pyspark shared

[jira] [Commented] (SPARK-10413) Model should support prediction on single instance

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905230#comment-14905230 ] Joseph K. Bradley commented on SPARK-10413: --- For API, I think my main question is whether

[jira] [Created] (SPARK-10782) Duplicate examples for drop_duplicates and DropDuplicates

2015-09-23 Thread Asoka Diggs (JIRA)
Asoka Diggs created SPARK-10782: --- Summary: Duplicate examples for drop_duplicates and DropDuplicates Key: SPARK-10782 URL: https://issues.apache.org/jira/browse/SPARK-10782 Project: Spark

[jira] [Comment Edited] (SPARK-9836) Provide R-like summary statistics for ordinary least squares via normal equation solver

2015-09-23 Thread Mohamed Baddar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904471#comment-14904471 ] Mohamed Baddar edited comment on SPARK-9836 at 9/23/15 8:39 PM: Thanks a

[jira] [Updated] (SPARK-10787) Reset ObjectOutputStream more often to prevent OOME

2015-09-23 Thread Ted Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated SPARK-10787: --- Description: In the thread, Spark ClosureCleaner or java serializer OOM when trying to grow

[jira] [Commented] (SPARK-10413) Model should support prediction on single instance

2015-09-23 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904276#comment-14904276 ] Yanbo Liang commented on SPARK-10413: - [~mengxr] I think to support prediction on single instance

[jira] [Assigned] (SPARK-10786) SparkSQLCLIDriver should take the whole statement to generate the CommandProcessor

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10786: Assignee: (was: Apache Spark) > SparkSQLCLIDriver should take the whole statement to

[jira] [Commented] (SPARK-10786) SparkSQLCLIDriver should take the whole statement to generate the CommandProcessor

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905692#comment-14905692 ] Apache Spark commented on SPARK-10786: -- User 'SaintBacchus' has created a pull request for this

[jira] [Assigned] (SPARK-10786) SparkSQLCLIDriver should take the whole statement to generate the CommandProcessor

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10786: Assignee: Apache Spark > SparkSQLCLIDriver should take the whole statement to generate

[jira] [Closed] (SPARK-10474) TungstenAggregation cannot acquire memory for pointer array after switching to sort-based

2015-09-23 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-10474. - Resolution: Fixed > TungstenAggregation cannot acquire memory for pointer array after switching > to

[jira] [Resolved] (SPARK-10692) Failed batches are never reported through the StreamingListener interface

2015-09-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-10692. --- Resolution: Fixed Fix Version/s: 1.6.0 1.5.1 > Failed batches are

[jira] [Commented] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2015-09-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905729#comment-14905729 ] Tathagata Das commented on SPARK-10086: --- You could use a maintain a counter for the number of

[jira] [Commented] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2015-09-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905731#comment-14905731 ] Tathagata Das commented on SPARK-10086: --- Actually never mind, its already in eventually. The

[jira] [Created] (SPARK-10787) Reset ObjectOutputStream more often to prevent OOME

2015-09-23 Thread Ted Yu (JIRA)
Ted Yu created SPARK-10787: -- Summary: Reset ObjectOutputStream more often to prevent OOME Key: SPARK-10787 URL: https://issues.apache.org/jira/browse/SPARK-10787 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-10787) Reset ObjectOutputStream more often to prevent OOME

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10787: Assignee: (was: Apache Spark) > Reset ObjectOutputStream more often to prevent OOME >

[jira] [Commented] (SPARK-10787) Reset ObjectOutputStream more often to prevent OOME

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905769#comment-14905769 ] Apache Spark commented on SPARK-10787: -- User 'tedyu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10787) Reset ObjectOutputStream more often to prevent OOME

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10787: Assignee: Apache Spark > Reset ObjectOutputStream more often to prevent OOME >

[jira] [Updated] (SPARK-9798) CrossValidatorModel Documentation Improvements

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9798: - Shepherd: Joseph K. Bradley Target Version/s: 1.6.0 > CrossValidatorModel

[jira] [Resolved] (SPARK-10731) The head() implementation of dataframe is very slow

2015-09-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-10731. - Resolution: Fixed Assignee: Reynold Xin Fix Version/s: 1.5.1

[jira] [Commented] (SPARK-9798) CrossValidatorModel Documentation Improvements

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905573#comment-14905573 ] Joseph K. Bradley commented on SPARK-9798: -- This is a very small task, so I don't think it should

[jira] [Assigned] (SPARK-10741) Hive Query Having/OrderBy against Parquet table is not working

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10741: Assignee: (was: Apache Spark) > Hive Query Having/OrderBy against Parquet table is

[jira] [Commented] (SPARK-10741) Hive Query Having/OrderBy against Parquet table is not working

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905479#comment-14905479 ] Apache Spark commented on SPARK-10741: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10741) Hive Query Having/OrderBy against Parquet table is not working

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10741: Assignee: Apache Spark > Hive Query Having/OrderBy against Parquet table is not working

[jira] [Resolved] (SPARK-10699) Support checkpointInterval can be disabled

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-10699. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8820

[jira] [Created] (SPARK-10785) Scale QuantileDiscretizer using distributed binning

2015-09-23 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10785: - Summary: Scale QuantileDiscretizer using distributed binning Key: SPARK-10785 URL: https://issues.apache.org/jira/browse/SPARK-10785 Project: Spark

[jira] [Updated] (SPARK-5890) Add QuantileDiscretizer

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5890: - Description: A `QuantileDiscretizer` takes a column with continuous features and outputs

[jira] [Updated] (SPARK-5890) Add QuantileDiscretizer

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5890: - Summary: Add QuantileDiscretizer (was: Add FeatureDiscretizer) > Add QuantileDiscretizer

[jira] [Updated] (SPARK-9841) Params.clear needs to be public

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9841: - Shepherd: Joseph K. Bradley Assignee: holdenk > Params.clear needs to be public >

[jira] [Updated] (SPARK-8115) Remove TestData

2015-09-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8115: --- Target Version/s: 1.6.0 (was: 1.6.0, 1.5.1) > Remove TestData > --- > >

[jira] [Updated] (SPARK-10538) java.lang.NegativeArraySizeException during join

2015-09-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10538: Target Version/s: 1.5.2 (was: 1.5.1) > java.lang.NegativeArraySizeException during join >

[jira] [Commented] (SPARK-10692) Failed batches are never reported through the StreamingListener interface

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905653#comment-14905653 ] Apache Spark commented on SPARK-10692: -- User 'tdas' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10724) SQL's floor() returns DOUBLE

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10724: Assignee: Apache Spark > SQL's floor() returns DOUBLE > > >

[jira] [Assigned] (SPARK-10724) SQL's floor() returns DOUBLE

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10724: Assignee: (was: Apache Spark) > SQL's floor() returns DOUBLE >

[jira] [Commented] (SPARK-10724) SQL's floor() returns DOUBLE

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905658#comment-14905658 ] Apache Spark commented on SPARK-10724: -- User 'navis' has created a pull request for this issue:

[jira] [Resolved] (SPARK-6028) Provide an alternative RPC implementation based on the network transport module

2015-09-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-6028. Resolution: Fixed Fix Version/s: 1.6.0 > Provide an alternative RPC implementation based on

[jira] [Updated] (SPARK-10692) Failed batches are never reported through the StreamingListener interface

2015-09-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10692: Priority: Critical (was: Blocker) > Failed batches are never reported through the

[jira] [Updated] (SPARK-10043) Add window functions into SparkR

2015-09-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10043: Target Version/s: 1.6.0 (was: 1.5.1, 1.6.0) > Add window functions into SparkR >

[jira] [Updated] (SPARK-10741) Hive Query Having/OrderBy against Parquet table is not working

2015-09-23 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10741: - Assignee: Wenchen Fan > Hive Query Having/OrderBy against Parquet table is not working >

[jira] [Created] (SPARK-10786) SparkSQLCLIDriver should take the whole statement to generate the CommandProcessor

2015-09-23 Thread SaintBacchus (JIRA)
SaintBacchus created SPARK-10786: Summary: SparkSQLCLIDriver should take the whole statement to generate the CommandProcessor Key: SPARK-10786 URL: https://issues.apache.org/jira/browse/SPARK-10786

[jira] [Assigned] (SPARK-10709) When loading a json dataset as a data frame, if the input path is wrong, the error message is very confusing

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10709: Assignee: (was: Apache Spark) > When loading a json dataset as a data frame, if the

[jira] [Commented] (SPARK-10709) When loading a json dataset as a data frame, if the input path is wrong, the error message is very confusing

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905803#comment-14905803 ] Apache Spark commented on SPARK-10709: -- User 'navis' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10709) When loading a json dataset as a data frame, if the input path is wrong, the error message is very confusing

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10709: Assignee: Apache Spark > When loading a json dataset as a data frame, if the input path

[jira] [Commented] (SPARK-10770) SparkPlan.executeCollect/executeTake should return InternalRow rather than external Row

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905817#comment-14905817 ] Apache Spark commented on SPARK-10770: -- User 'rxin' has created a pull request for this issue:

[jira] [Created] (SPARK-10788) Decision Tree duplicates bins for unordered categorical features

2015-09-23 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10788: - Summary: Decision Tree duplicates bins for unordered categorical features Key: SPARK-10788 URL: https://issues.apache.org/jira/browse/SPARK-10788 Project:

<    1   2