[jira] [Commented] (SPARK-10668) Use WeightedLeastSquares in LinearRegression with L2 regularization if the number of features is small

2015-09-23 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904298#comment-14904298 ] Yanbo Liang commented on SPARK-10668: - [~mengxr] If [~lewuathe] is busy with other issues, I can take

[jira] [Commented] (SPARK-9798) CrossValidatorModel Documentation Improvements

2015-09-23 Thread rerngvit yanggratoke (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904334#comment-14904334 ] rerngvit yanggratoke commented on SPARK-9798: - [~fliang] Could you please allow the build,

[jira] [Updated] (SPARK-10774) Put different event log to different directory according to different conditions

2015-09-23 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yangping wu updated SPARK-10774: Summary: Put different event log to different directory according to different conditions (was:

[jira] [Commented] (SPARK-10602) Univariate statistics as UDAFs: single-pass continuous stats

2015-09-23 Thread Sabyasachi Nayak (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904437#comment-14904437 ] Sabyasachi Nayak commented on SPARK-10602: -- Thanks Seth. I wanted to implement skewness and

[jira] [Comment Edited] (SPARK-10413) Model should support prediction on single instance

2015-09-23 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904276#comment-14904276 ] Yanbo Liang edited comment on SPARK-10413 at 9/23/15 10:23 AM: --- [~mengxr]

[jira] [Commented] (SPARK-10668) Use WeightedLeastSquares in LinearRegression with L2 regularization if the number of features is small

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904390#comment-14904390 ] Apache Spark commented on SPARK-10668: -- User 'Lewuathe' has created a pull request for this issue:

[jira] [Commented] (SPARK-9710) RPackageUtilsSuite fails if R is not installed

2015-09-23 Thread Pete Robbins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904485#comment-14904485 ] Pete Robbins commented on SPARK-9710: - The Fix Version for this says 1.5.0 but the PR is not in the

[jira] [Resolved] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7122. -- Resolution: Cannot Reproduce Not sure whether this is 'fixed' or 'can't reproduce now' but the de facto

[jira] [Resolved] (SPARK-6161) sqlCtx.parquetFile(dataFilePath) throws NPE when using s3, but OK when using local filesystem

2015-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6161. -- Resolution: Not A Problem I think this is maybe a question for user@ first, but also, appears to be an

[jira] [Commented] (SPARK-9836) Provide R-like summary statistics for ordinary least squares via normal equation solver

2015-09-23 Thread Mohamed Baddar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904471#comment-14904471 ] Mohamed Baddar commented on SPARK-9836: --- Thanks a lot , i will try one of the starter tasks , but

[jira] [Assigned] (SPARK-10413) Model should support prediction on single instance

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10413: Assignee: Apache Spark > Model should support prediction on single instance >

[jira] [Assigned] (SPARK-10413) Model should support prediction on single instance

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10413: Assignee: (was: Apache Spark) > Model should support prediction on single instance >

[jira] [Commented] (SPARK-10413) Model should support prediction on single instance

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904291#comment-14904291 ] Apache Spark commented on SPARK-10413: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Updated] (SPARK-10773) Repartition operation failing on RDD with "argument type mismatch" error

2015-09-23 Thread Da Fox (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Da Fox updated SPARK-10773: --- Description: Hello, Erorr occures in following Spark application: {code} object RunSpark { def

[jira] [Resolved] (SPARK-10768) How to access columns with "." dot in their name in Spark SQL

2015-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10768. --- Resolution: Invalid Would you mind asking your question on u...@spark.apache.org? JIRA is for

[jira] [Commented] (SPARK-5992) Locality Sensitive Hashing (LSH) for MLlib

2015-09-23 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904500#comment-14904500 ] Özgür Demir commented on SPARK-5992: hi, we just open sourced an lsh topk implementation for spark.

[jira] [Updated] (SPARK-10773) Repartition operation failing on RDD with "argument type mismatch" error

2015-09-23 Thread Da Fox (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Da Fox updated SPARK-10773: --- Description: Hello, Erorr occures in following Spark application: {code} object RunSpark { def

[jira] [Assigned] (SPARK-10668) Use WeightedLeastSquares in LinearRegression with L2 regularization if the number of features is small

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10668: Assignee: Apache Spark (was: Kai Sasaki) > Use WeightedLeastSquares in LinearRegression

[jira] [Assigned] (SPARK-10668) Use WeightedLeastSquares in LinearRegression with L2 regularization if the number of features is small

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10668: Assignee: Kai Sasaki (was: Apache Spark) > Use WeightedLeastSquares in LinearRegression

[jira] [Commented] (SPARK-10668) Use WeightedLeastSquares in LinearRegression with L2 regularization if the number of features is small

2015-09-23 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904395#comment-14904395 ] Kai Sasaki commented on SPARK-10668: So sorry for being late for submitting patch and thank you for

[jira] [Updated] (SPARK-9928) LogicalLocalTable in ExistingRDD.scala is not referenced by any code else

2015-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9928: - Priority: Trivial (was: Minor) Issue Type: Improvement (was: Question) I think you're right, but

[jira] [Comment Edited] (SPARK-10668) Use WeightedLeastSquares in LinearRegression with L2 regularization if the number of features is small

2015-09-23 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904298#comment-14904298 ] Yanbo Liang edited comment on SPARK-10668 at 9/23/15 10:15 AM: --- [~mengxr] I

[jira] [Updated] (SPARK-10774) Put different event log to different directory according to different conditions

2015-09-23 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yangping wu updated SPARK-10774: Description: Right now, Spark logging all event logs(inprogress or finished) into the some

[jira] [Resolved] (SPARK-4311) ContainerLauncher setting up executor -- invalid Xms settings (-Xms0m -Xmx0m)

2015-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4311. -- Resolution: Cannot Reproduce > ContainerLauncher setting up executor -- invalid Xms settings (-Xms0m

[jira] [Commented] (SPARK-9798) CrossValidatorModel Documentation Improvements

2015-09-23 Thread Mohamed Baddar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904467#comment-14904467 ] Mohamed Baddar commented on SPARK-9798: --- Hello rerngvit I am also new to contribution , can we work

[jira] [Created] (SPARK-10775) add search keywords in history page ui

2015-09-23 Thread Lianhui Wang (JIRA)
Lianhui Wang created SPARK-10775: Summary: add search keywords in history page ui Key: SPARK-10775 URL: https://issues.apache.org/jira/browse/SPARK-10775 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4489) JavaPairRDD.collectAsMap from checkpoint RDD may fail with ClassCastException

2015-09-23 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904572#comment-14904572 ] Glenn Strycker commented on SPARK-4489: --- My ticket SPARK-10762 may have just been a user error, but

[jira] [Issue Comment Deleted] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-09-23 Thread Nicolas PHUNG (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicolas PHUNG updated SPARK-7122: - Comment: was deleted (was: Sorry for the delay. I will test as soon as Cloudera distribution has

[jira] [Commented] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-09-23 Thread Nicolas PHUNG (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904515#comment-14904515 ] Nicolas PHUNG commented on SPARK-7122: -- Sorry for the delay. I will test as soon as Cloudera

[jira] [Assigned] (SPARK-10775) add search keywords in history page ui

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10775: Assignee: Apache Spark > add search keywords in history page ui >

[jira] [Closed] (SPARK-10762) GenericRowWithSchema exception in casting ArrayBuffer to HashSet in DataFrame to RDD from Hive table

2015-09-23 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glenn Strycker closed SPARK-10762. -- This probably isn't completely fixed, but should be a new ticket for casting ArrayBuffers

[jira] [Commented] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-09-23 Thread Nicolas PHUNG (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904513#comment-14904513 ] Nicolas PHUNG commented on SPARK-7122: -- Sorry for the delay. I will test as soon as Cloudera

[jira] [Resolved] (SPARK-10762) GenericRowWithSchema exception in casting ArrayBuffer to HashSet in DataFrame to RDD from Hive table

2015-09-23 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glenn Strycker resolved SPARK-10762. Resolution: Not A Problem Instead of

[jira] [Commented] (SPARK-10775) add search keywords in history page ui

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904563#comment-14904563 ] Apache Spark commented on SPARK-10775: -- User 'lianhuiwang' has created a pull request for this

[jira] [Assigned] (SPARK-10775) add search keywords in history page ui

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10775: Assignee: (was: Apache Spark) > add search keywords in history page ui >

[jira] [Commented] (SPARK-10762) GenericRowWithSchema exception in casting ArrayBuffer to HashSet in DataFrame to RDD from Hive table

2015-09-23 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904567#comment-14904567 ] Glenn Strycker commented on SPARK-10762: Please see the accepted solution to

[jira] [Updated] (SPARK-9710) RPackageUtilsSuite fails if R is not installed

2015-09-23 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-9710: - Fix Version/s: (was: 1.5.0) 1.5.1 > RPackageUtilsSuite

[jira] [Commented] (SPARK-9710) RPackageUtilsSuite fails if R is not installed

2015-09-23 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904580#comment-14904580 ] Shivaram Venkataraman commented on SPARK-9710: -- Thanks for the catch. I cherry picked this to

[jira] [Commented] (SPARK-1040) Collect as Map throws a casting exception when run on a JavaPairRDD object

2015-09-23 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904571#comment-14904571 ] Glenn Strycker commented on SPARK-1040: --- My ticket SPARK-10762 may have just been a user error, but

[jira] [Updated] (SPARK-10728) Failed to set Jenkins Identity header on email.

2015-09-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10728: -- Assignee: shane knapp (was: Josh Rosen) > Failed to set Jenkins Identity header on email. >

[jira] [Commented] (SPARK-10733) TungstenAggregation cannot acquire page after switching to sort-based

2015-09-23 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904778#comment-14904778 ] Cheng Hao commented on SPARK-10733: --- [~jameszhouyi] Can you please patch the

[jira] [Updated] (SPARK-10644) Applications wait even if free executors are available

2015-09-23 Thread Balagopal Nair (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balagopal Nair updated SPARK-10644: --- Priority: Minor (was: Major) > Applications wait even if free executors are available >

[jira] [Created] (SPARK-10776) Pass location of SparkR source files from R process to JVM

2015-09-23 Thread Hossein Falaki (JIRA)
Hossein Falaki created SPARK-10776: -- Summary: Pass location of SparkR source files from R process to JVM Key: SPARK-10776 URL: https://issues.apache.org/jira/browse/SPARK-10776 Project: Spark

[jira] [Commented] (SPARK-10644) Applications wait even if free executors are available

2015-09-23 Thread Balagopal Nair (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904831#comment-14904831 ] Balagopal Nair commented on SPARK-10644: I'm overallocating hardware here. Each machine has 4

[jira] [Commented] (SPARK-10733) TungstenAggregation cannot acquire page after switching to sort-based

2015-09-23 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904805#comment-14904805 ] Yin Huai commented on SPARK-10733: -- btw, how many parallel tasks can be executed in an executor? >

[jira] [Updated] (SPARK-10763) Update Java MLLIB/ML tests to use simplified dataframe construction

2015-09-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10763: -- Assignee: holdenk > Update Java MLLIB/ML tests to use simplified dataframe construction >

[jira] [Resolved] (SPARK-10763) Update Java MLLIB/ML tests to use simplified dataframe construction

2015-09-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10763. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8886

[jira] [Commented] (SPARK-10659) DataFrames and SparkSQL saveAsParquetFile does not preserve REQUIRED (not nullable) flag in schema

2015-09-23 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905856#comment-14905856 ] Cheng Lian commented on SPARK-10659: This behavior had once been a hacky way to workaround

[jira] [Updated] (SPARK-10763) Update Java MLLIB/ML tests to use simplified dataframe construction

2015-09-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10763: -- Affects Version/s: 1.6.0 Target Version/s: 1.6.0 > Update Java MLLIB/ML tests to use

[jira] [Comment Edited] (SPARK-10659) DataFrames and SparkSQL saveAsParquetFile does not preserve REQUIRED (not nullable) flag in schema

2015-09-23 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905856#comment-14905856 ] Cheng Lian edited comment on SPARK-10659 at 9/24/15 5:51 AM: - This behavior

[jira] [Resolved] (SPARK-10494) Multiple Python UDFs together with aggregation or sort merge join may cause OOM (failed to acquire memory)

2015-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-10494. -- Resolution: Fixed Assignee: Reynold Xin Fix Version/s: 1.5.1 >

[jira] [Updated] (SPARK-10297) When save data to a data source table, we should bound the size of a saved file

2015-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-10297: - Target Version/s: 1.6.0 (was: 1.6.0, 1.5.1) > When save data to a data source table, we

[jira] [Commented] (SPARK-10733) TungstenAggregation cannot acquire page after switching to sort-based

2015-09-23 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904964#comment-14904964 ] Yin Huai commented on SPARK-10733: -- [~jameszhouyi] Another two places for logging are

[jira] [Created] (SPARK-10779) Set initialModel for KMeans model in PySpark (spark.mllib)

2015-09-23 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10779: - Summary: Set initialModel for KMeans model in PySpark (spark.mllib) Key: SPARK-10779 URL: https://issues.apache.org/jira/browse/SPARK-10779 Project: Spark

[jira] [Commented] (SPARK-10659) DataFrames and SparkSQL saveAsParquetFile does not preserve REQUIRED (not nullable) flag in schema

2015-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904909#comment-14904909 ] Michael Armbrust commented on SPARK-10659: -- /cc [~lian cheng] > DataFrames and SparkSQL

[jira] [Commented] (SPARK-10728) Failed to set Jenkins Identity header on email.

2015-09-23 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904906#comment-14904906 ] shane knapp commented on SPARK-10728: - let's close this... the stack trace doesn't affect the build

[jira] [Assigned] (SPARK-10763) Update Java MLLIB/ML tests to use simplified dataframe construction

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10763: Assignee: (was: Apache Spark) > Update Java MLLIB/ML tests to use simplified

[jira] [Commented] (SPARK-10763) Update Java MLLIB/ML tests to use simplified dataframe construction

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904978#comment-14904978 ] Apache Spark commented on SPARK-10763: -- User 'holdenk' has created a pull request for this issue:

[jira] [Updated] (SPARK-10728) Failed to set Jenkins Identity header on email.

2015-09-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10728: -- Labels: (was: flaky-test) > Failed to set Jenkins Identity header on email. >

[jira] [Updated] (SPARK-10728) Failed to set Jenkins Identity header on email.

2015-09-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10728: -- Target Version/s: (was: 1.6.0) > Failed to set Jenkins Identity header on email. >

[jira] [Commented] (SPARK-10728) Failed to set Jenkins Identity header on email.

2015-09-23 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904986#comment-14904986 ] shane knapp commented on SPARK-10728: - well, the spark project doesn't need to fix it. if you want

[jira] [Commented] (SPARK-10733) TungstenAggregation cannot acquire page after switching to sort-based

2015-09-23 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905083#comment-14905083 ] Yin Huai commented on SPARK-10733: -- Can you attach your query plan? > TungstenAggregation cannot

[jira] [Updated] (SPARK-10728) Failed to set Jenkins Identity header on email.

2015-09-23 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shane knapp updated SPARK-10728: Priority: Trivial (was: Major) > Failed to set Jenkins Identity header on email. >

[jira] [Updated] (SPARK-10294) When Parquet writer's close method throws an exception, we will call close again and trigger a NPE

2015-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-10294: - Target Version/s: 1.6.0 (was: 1.5.1) > When Parquet writer's close method throws an

[jira] [Created] (SPARK-10777) order by fails when column is aliased and projection includes windowed aggregate

2015-09-23 Thread N Campbell (JIRA)
N Campbell created SPARK-10777: -- Summary: order by fails when column is aliased and projection includes windowed aggregate Key: SPARK-10777 URL: https://issues.apache.org/jira/browse/SPARK-10777

[jira] [Created] (SPARK-10778) Implement toString for AssociationRules.Rule

2015-09-23 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10778: - Summary: Implement toString for AssociationRules.Rule Key: SPARK-10778 URL: https://issues.apache.org/jira/browse/SPARK-10778 Project: Spark Issue Type:

[jira] [Updated] (SPARK-10778) Implement toString for AssociationRules.Rule

2015-09-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10778: -- Description: pretty print for association rules, e.g. {code} {a, b, c} => {d}: 0.8 {code}

[jira] [Commented] (SPARK-10741) Hive Query Having/OrderBy against Parquet table is not working

2015-09-23 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905069#comment-14905069 ] Wenchen Fan commented on SPARK-10741: - This bug is caused by a conflict between 2 tricky part in our

[jira] [Comment Edited] (SPARK-10733) TungstenAggregation cannot acquire page after switching to sort-based

2015-09-23 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904964#comment-14904964 ] Yin Huai edited comment on SPARK-10733 at 9/23/15 7:23 PM: --- [~jameszhouyi]

[jira] [Commented] (SPARK-10727) Dataframe count is zero after 'except' operation

2015-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904900#comment-14904900 ] Michael Armbrust commented on SPARK-10727: -- Thanks for reporting. This should be fixed in

[jira] [Updated] (SPARK-10765) use new aggregate interface for hive UDAF

2015-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-10765: - Assignee: Wenchen Fan > use new aggregate interface for hive UDAF >

[jira] [Updated] (SPARK-10659) DataFrames and SparkSQL saveAsParquetFile does not preserve REQUIRED (not nullable) flag in schema

2015-09-23 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10659: --- Description: DataFrames currently automatically promotes all Parquet schema fields to optional when

[jira] [Updated] (SPARK-10448) Parquet schema merging should NOT merge UDT

2015-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-10448: - Target Version/s: 1.6.0 (was: 1.6.0, 1.5.1) > Parquet schema merging should NOT merge

[jira] [Resolved] (SPARK-10403) UnsafeRowSerializer can't work with UnsafeShuffleManager (tungsten-sort)

2015-09-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-10403. -- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved

[jira] [Updated] (SPARK-10659) DataFrames and SparkSQL saveAsParquetFile does not preserve REQUIRED (not nullable) flag in schema

2015-09-23 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10659: --- Description: DataFrames currently automatically promotes all Parquet schema fields to optional when

[jira] [Updated] (SPARK-10780) Set initialModel in KMeans in Pipelines API

2015-09-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10780: -- Description: This is for the Scala version. After this is merged, create a JIRA for

[jira] [Updated] (SPARK-10494) Multiple Python UDFs together with aggregation or sort merge join may cause OOM (failed to acquire memory)

2015-09-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10494: Fix Version/s: 1.6.0 > Multiple Python UDFs together with aggregation or sort merge join may cause

[jira] [Assigned] (SPARK-10770) SparkPlan.executeCollect/executeTake should return InternalRow rather than external Row

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10770: Assignee: Reynold Xin (was: Apache Spark) > SparkPlan.executeCollect/executeTake should

[jira] [Commented] (SPARK-10770) SparkPlan.executeCollect/executeTake should return InternalRow rather than external Row

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904046#comment-14904046 ] Apache Spark commented on SPARK-10770: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10770) SparkPlan.executeCollect/executeTake should return InternalRow rather than external Row

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10770: Assignee: Apache Spark (was: Reynold Xin) > SparkPlan.executeCollect/executeTake should

[jira] [Resolved] (SPARK-10742) Add the ability to embed HTML relative links in job descriptions

2015-09-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-10742. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 > Add the ability to

[jira] [Resolved] (SPARK-10652) Set meaningful job descriptions for streaming related jobs

2015-09-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-10652. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 > Set meaningful job

[jira] [Created] (SPARK-10770) SparkPlan.executeCollect/executeTake should return InternalRow rather than external Row

2015-09-23 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-10770: --- Summary: SparkPlan.executeCollect/executeTake should return InternalRow rather than external Row Key: SPARK-10770 URL: https://issues.apache.org/jira/browse/SPARK-10770

[jira] [Assigned] (SPARK-10772) NullPointerException when transform function in DStream returns NULL

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10772: Assignee: (was: Apache Spark) > NullPointerException when transform function in

[jira] [Commented] (SPARK-10772) NullPointerException when transform function in DStream returns NULL

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904080#comment-14904080 ] Apache Spark commented on SPARK-10772: -- User 'jhu-chang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10772) NullPointerException when transform function in DStream returns NULL

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10772: Assignee: Apache Spark > NullPointerException when transform function in DStream returns

[jira] [Created] (SPARK-10771) Implement the shuffle encryption with AES-CTR crypto using JCE key provider.

2015-09-23 Thread Ferdinand Xu (JIRA)
Ferdinand Xu created SPARK-10771: Summary: Implement the shuffle encryption with AES-CTR crypto using JCE key provider. Key: SPARK-10771 URL: https://issues.apache.org/jira/browse/SPARK-10771

[jira] [Assigned] (SPARK-10720) Add a java wrapper to create dataframe from a local list of Java Beans.

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10720: Assignee: (was: Apache Spark) > Add a java wrapper to create dataframe from a local

[jira] [Assigned] (SPARK-10720) Add a java wrapper to create dataframe from a local list of Java Beans.

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10720: Assignee: Apache Spark > Add a java wrapper to create dataframe from a local list of Java

[jira] [Commented] (SPARK-10720) Add a java wrapper to create dataframe from a local list of Java Beans.

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904059#comment-14904059 ] Apache Spark commented on SPARK-10720: -- User 'holdenk' has created a pull request for this issue:

[jira] [Commented] (SPARK-10771) Implement the shuffle encryption with AES-CTR crypto using JCE key provider.

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904073#comment-14904073 ] Apache Spark commented on SPARK-10771: -- User 'winningsix' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10771) Implement the shuffle encryption with AES-CTR crypto using JCE key provider.

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10771: Assignee: Apache Spark > Implement the shuffle encryption with AES-CTR crypto using JCE

[jira] [Assigned] (SPARK-10771) Implement the shuffle encryption with AES-CTR crypto using JCE key provider.

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10771: Assignee: (was: Apache Spark) > Implement the shuffle encryption with AES-CTR crypto

[jira] [Commented] (SPARK-9798) CrossValidatorModel Documentation Improvements

2015-09-23 Thread rerngvit yanggratoke (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904075#comment-14904075 ] rerngvit yanggratoke commented on SPARK-9798: - Ok, may I be assigned to this task then? I am

[jira] [Updated] (SPARK-10721) Log warning when file deletion fails

2015-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10721: -- Assignee: Ted Yu Issue Type: Improvement (was: Bug) > Log warning when file deletion fails >

[jira] [Resolved] (SPARK-10721) Log warning when file deletion fails

2015-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10721. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8843

[jira] [Commented] (SPARK-9798) CrossValidatorModel Documentation Improvements

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904208#comment-14904208 ] Apache Spark commented on SPARK-9798: - User 'rerngvit' has created a pull request for this issue:

[jira] [Assigned] (SPARK-9798) CrossValidatorModel Documentation Improvements

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9798: --- Assignee: Apache Spark > CrossValidatorModel Documentation Improvements >

[jira] [Closed] (SPARK-10695) spark.mesos.constraints documentation uses "=" to separate value instead ":" as parser and mesos expects.

2015-09-23 Thread Akash Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akash Mishra closed SPARK-10695. Change is successfully merged in master and 1.5 Branch. > spark.mesos.constraints documentation uses

[jira] [Assigned] (SPARK-9798) CrossValidatorModel Documentation Improvements

2015-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9798: --- Assignee: (was: Apache Spark) > CrossValidatorModel Documentation Improvements >

  1   2   >