[jira] [Updated] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11332: -- Assignee: Nakul Jindal > WeightedLeastSquares should use ml features generic Instance class instead of

[jira] [Assigned] (SPARK-11369) SparkR glm should support setting standardize

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11369: Assignee: Apache Spark > SparkR glm should support setting standardize >

[jira] [Commented] (SPARK-11369) SparkR glm should support setting standardize

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978256#comment-14978256 ] Apache Spark commented on SPARK-11369: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Created] (SPARK-11373) Add metrics to the History Server and providers

2015-10-28 Thread Steve Loughran (JIRA)
Steve Loughran created SPARK-11373: -- Summary: Add metrics to the History Server and providers Key: SPARK-11373 URL: https://issues.apache.org/jira/browse/SPARK-11373 Project: Spark Issue

[jira] [Updated] (SPARK-11372) custom UDAF with StringType throws java.lang.ClassCastException

2015-10-28 Thread Pravin Gadakh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pravin Gadakh updated SPARK-11372: -- Description: Consider following custom UDAF which uses StringType column as intermediate

[jira] [Updated] (SPARK-11372) custom UDAF with StringType throws java.lang.ClassCastException

2015-10-28 Thread Pravin Gadakh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pravin Gadakh updated SPARK-11372: -- Description: Consider following custom UDAF which uses StringType column as intermediate

[jira] [Commented] (SPARK-11368) Spark scan all partitions when using Python UDF and filter over partitioned column is given

2015-10-28 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978412#comment-14978412 ] Maciej Bryński commented on SPARK-11368: Problem exists only when using Pyspark. When I did the

[jira] [Updated] (SPARK-11368) Spark shouldn't scan all partitions when using Python UDF and filter over partitioned column is given

2015-10-28 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-11368: --- Summary: Spark shouldn't scan all partitions when using Python UDF and filter over

[jira] [Resolved] (SPARK-11313) Implement cogroup

2015-10-28 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-11313. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9324

[jira] [Updated] (SPARK-11313) Implement cogroup

2015-10-28 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-11313: - Assignee: Wenchen Fan > Implement cogroup > - > > Key:

[jira] [Updated] (SPARK-11372) custom UDAF with StringType throws java.lang.ClassCastException

2015-10-28 Thread Pravin Gadakh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pravin Gadakh updated SPARK-11372: -- Description: Consider following custom UDAF which uses one StringType column as intermediate

[jira] [Commented] (SPARK-11373) Add metrics to the History Server and providers

2015-10-28 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978322#comment-14978322 ] Steve Loughran commented on SPARK-11373: # This has tangible benefit for the SPARK-1537 YARN ATS

[jira] [Created] (SPARK-11375) History Server "no histories" message to be dynamically generated by ApplicationHistoryProviders

2015-10-28 Thread Steve Loughran (JIRA)
Steve Loughran created SPARK-11375: -- Summary: History Server "no histories" message to be dynamically generated by ApplicationHistoryProviders Key: SPARK-11375 URL:

[jira] [Commented] (SPARK-11375) History Server "no histories" message to be dynamically generated by ApplicationHistoryProviders

2015-10-28 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978329#comment-14978329 ] Steve Loughran commented on SPARK-11375: This could be implemented with a new method on

[jira] [Comment Edited] (SPARK-11368) Spark scan all partitions when using Python UDF and filter over partitioned column is given

2015-10-28 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978412#comment-14978412 ] Maciej Bryński edited comment on SPARK-11368 at 10/28/15 1:27 PM: --

[jira] [Updated] (SPARK-11317) YARN HBase token code shouldn't swallow invocation target exceptions

2015-10-28 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-11317: --- Affects Version/s: 1.5.1 > YARN HBase token code shouldn't swallow invocation target

[jira] [Updated] (SPARK-11317) YARN HBase token code shouldn't swallow invocation target exceptions

2015-10-28 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-11317: --- Description: As with SPARK-11265; the HBase token retrieval code of SPARK-6918 1. swallows

[jira] [Created] (SPARK-11374) skip.header.line.count is ignored in HiveContext

2015-10-28 Thread Daniel Haviv (JIRA)
Daniel Haviv created SPARK-11374: Summary: skip.header.line.count is ignored in HiveContext Key: SPARK-11374 URL: https://issues.apache.org/jira/browse/SPARK-11374 Project: Spark Issue Type:

[jira] [Commented] (SPARK-11303) sample (without replacement) + filter returns wrong results in DataFrame

2015-10-28 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978406#comment-14978406 ] Michael Armbrust commented on SPARK-11303: -- I picked it into branch-1.5, but I'm not sure if it

[jira] [Commented] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results

2015-10-28 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978528#comment-14978528 ] Saif Addin Ellafi commented on SPARK-11330: --- Hello Cheng Hao, and thank you very much for

[jira] [Created] (SPARK-11376) Invalid generated Java code in GenerateColumnAccessor

2015-10-28 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-11376: -- Summary: Invalid generated Java code in GenerateColumnAccessor Key: SPARK-11376 URL: https://issues.apache.org/jira/browse/SPARK-11376 Project: Spark Issue

[jira] [Updated] (SPARK-9836) Provide R-like summary statistics for ordinary least squares via normal equation solver

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9836: - Assignee: Yanbo Liang > Provide R-like summary statistics for ordinary least squares via normal

[jira] [Commented] (SPARK-3655) Support sorting of values in addition to keys (i.e. secondary sort)

2015-10-28 Thread swetha k (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978607#comment-14978607 ] swetha k commented on SPARK-3655: - [~koert] The final output for this RDD is RDD[(String, List[(Long,

[jira] [Created] (SPARK-11377) withNewChildren should not convert StructType to Seq

2015-10-28 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-11377: Summary: withNewChildren should not convert StructType to Seq Key: SPARK-11377 URL: https://issues.apache.org/jira/browse/SPARK-11377 Project: Spark

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results

2015-10-28 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Attachment: bug_reproduce.zip > Filter operation on StringType after groupBy PERSISTED

[jira] [Assigned] (SPARK-11378) StreamingContext.awaitTerminationOrTimeout does not return

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11378: Assignee: (was: Apache Spark) > StreamingContext.awaitTerminationOrTimeout does not

[jira] [Created] (SPARK-11379) ExpressionEncoder can't handle top level primitive type correctly

2015-10-28 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-11379: --- Summary: ExpressionEncoder can't handle top level primitive type correctly Key: SPARK-11379 URL: https://issues.apache.org/jira/browse/SPARK-11379 Project: Spark

[jira] [Assigned] (SPARK-11371) Make "mean" an alias for "avg" operator

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11371: Assignee: Apache Spark > Make "mean" an alias for "avg" operator >

[jira] [Assigned] (SPARK-11377) withNewChildren should not convert StructType to Seq

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11377: Assignee: Michael Armbrust (was: Apache Spark) > withNewChildren should not convert

[jira] [Commented] (SPARK-11377) withNewChildren should not convert StructType to Seq

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978505#comment-14978505 ] Apache Spark commented on SPARK-11377: -- User 'marmbrus' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11377) withNewChildren should not convert StructType to Seq

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11377: Assignee: Apache Spark (was: Michael Armbrust) > withNewChildren should not convert

[jira] [Comment Edited] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results

2015-10-28 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978528#comment-14978528 ] Saif Addin Ellafi edited comment on SPARK-11330 at 10/28/15 2:46 PM: -

[jira] [Assigned] (SPARK-11371) Make "mean" an alias for "avg" operator

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11371: Assignee: (was: Apache Spark) > Make "mean" an alias for "avg" operator >

[jira] [Commented] (SPARK-11371) Make "mean" an alias for "avg" operator

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978482#comment-14978482 ] Apache Spark commented on SPARK-11371: -- User 'ted-yu' has created a pull request for this issue:

[jira] [Updated] (SPARK-11376) Invalid generated Java code in GenerateColumnAccessor

2015-10-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-11376: --- Priority: Major (was: Minor) > Invalid generated Java code in GenerateColumnAccessor >

[jira] [Commented] (SPARK-11337) Make example code in user guide testable

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978603#comment-14978603 ] Xiangrui Meng commented on SPARK-11337: --- How about per markdown file? I don't want to create too

[jira] [Updated] (SPARK-11376) Invalid generated Java code in GenerateColumnAccessor

2015-10-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-11376: --- Description: There are two {{mutableRow}} fields in the generated code within

[jira] [Commented] (SPARK-11378) StreamingContext.awaitTerminationOrTimeout does not return

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978541#comment-14978541 ] Apache Spark commented on SPARK-11378: -- User 'manygrams' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11378) StreamingContext.awaitTerminationOrTimeout does not return

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11378: Assignee: Apache Spark > StreamingContext.awaitTerminationOrTimeout does not return >

[jira] [Assigned] (SPARK-11376) Invalid generated Java code in GenerateColumnAccessor

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11376: Assignee: Apache Spark (was: Cheng Lian) > Invalid generated Java code in

[jira] [Assigned] (SPARK-11376) Invalid generated Java code in GenerateColumnAccessor

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11376: Assignee: Cheng Lian (was: Apache Spark) > Invalid generated Java code in

[jira] [Commented] (SPARK-9836) Provide R-like summary statistics for ordinary least squares via normal equation solver

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978600#comment-14978600 ] Xiangrui Meng commented on SPARK-9836: -- [~yanboliang] Note that the feature freeze deadline for 1.6

[jira] [Resolved] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-11332. - Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9325

[jira] [Commented] (SPARK-11246) [1.5] Table cache for Parquet broken in 1.5

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977940#comment-14977940 ] Apache Spark commented on SPARK-11246: -- User 'xwu0226' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11246) [1.5] Table cache for Parquet broken in 1.5

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11246: Assignee: (was: Apache Spark) > [1.5] Table cache for Parquet broken in 1.5 >

[jira] [Assigned] (SPARK-11246) [1.5] Table cache for Parquet broken in 1.5

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11246: Assignee: Apache Spark > [1.5] Table cache for Parquet broken in 1.5 >

[jira] [Commented] (SPARK-11103) Filter applied on Merged Parquet shema with new column fail with (java.lang.IllegalArgumentException: Column [column_name] was not found in schema!)

2015-10-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977961#comment-14977961 ] Cheng Lian commented on SPARK-11103: Quoted from my reply on the user list: For 1: This one is

[jira] [Assigned] (SPARK-11103) Filter applied on Merged Parquet shema with new column fail with (java.lang.IllegalArgumentException: Column [column_name] was not found in schema!)

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11103: Assignee: Hyukjin Kwon (was: Apache Spark) > Filter applied on Merged Parquet shema with

[jira] [Commented] (SPARK-11103) Filter applied on Merged Parquet shema with new column fail with (java.lang.IllegalArgumentException: Column [column_name] was not found in schema!)

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977976#comment-14977976 ] Apache Spark commented on SPARK-11103: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-11103) Filter applied on Merged Parquet shema with new column fail with (java.lang.IllegalArgumentException: Column [column_name] was not found in schema!)

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11103: Assignee: Apache Spark (was: Hyukjin Kwon) > Filter applied on Merged Parquet shema with

[jira] [Commented] (SPARK-11337) Make example code in user guide testable

2015-10-28 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977811#comment-14977811 ] Xusen Yin commented on SPARK-11337: --- I want to add new sub-tasks. How to assign the work amounts? I

[jira] [Resolved] (SPARK-11302) Multivariate Gaussian Model with Covariance matrix returns incorrect answer in some cases

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11302. --- Resolution: Fixed Fix Version/s: 1.5.2 1.3.2

[jira] [Updated] (SPARK-11302) Multivariate Gaussian Model with Covariance matrix returns incorrect answer in some cases

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11302: -- Priority: Critical (was: Minor) > Multivariate Gaussian Model with Covariance matrix

[jira] [Commented] (SPARK-11364) HadoopFsRelation doesn't reload the hadoop configuration for each execution

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977847#comment-14977847 ] Apache Spark commented on SPARK-11364: -- User 'chenghao-intel' has created a pull request for this

[jira] [Assigned] (SPARK-11364) HadoopFsRelation doesn't reload the hadoop configuration for each execution

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11364: Assignee: (was: Apache Spark) > HadoopFsRelation doesn't reload the hadoop

[jira] [Updated] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-11332: Assignee: (was: DB Tsai) > WeightedLeastSquares should use ml features generic Instance class instead

[jira] [Commented] (SPARK-11045) Contributing Receiver based Low Level Kafka Consumer from Spark-Packages to Apache Spark Project

2015-10-28 Thread Dibyendu Bhattacharya (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977874#comment-14977874 ] Dibyendu Bhattacharya commented on SPARK-11045: --- hi [~tdas] , let me know what is your

[jira] [Updated] (SPARK-11302) Multivariate Gaussian Model with Covariance matrix returns incorrect answer in some cases

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11302: -- Assignee: Sean Owen Affects Version/s: 1.6.0 1.3.1

[jira] [Commented] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977868#comment-14977868 ] DB Tsai commented on SPARK-11332: - [~srowen] Do you know how to assign to new users in JIRA? I tried to

[jira] [Commented] (SPARK-10517) Console "Output" field is empty when using DataFrameWriter.json

2015-10-28 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977890#comment-14977890 ] Maciej Bryński commented on SPARK-10517: No. I think that output field is always empty. >

[jira] [Commented] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977893#comment-14977893 ] DB Tsai commented on SPARK-11332: - Thanks. Please help me to add his as contributor. >

[jira] [Commented] (SPARK-11358) Deprecate `runs` in k-means

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977818#comment-14977818 ] Apache Spark commented on SPARK-11358: -- User 'mengxr' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11358) Deprecate `runs` in k-means

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11358: Assignee: Xiangrui Meng (was: Apache Spark) > Deprecate `runs` in k-means >

[jira] [Assigned] (SPARK-11358) Deprecate `runs` in k-means

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11358: Assignee: Apache Spark (was: Xiangrui Meng) > Deprecate `runs` in k-means >

[jira] [Commented] (SPARK-11313) Implement cogroup

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977846#comment-14977846 ] Apache Spark commented on SPARK-11313: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11364) HadoopFsRelation doesn't reload the hadoop configuration for each execution

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11364: Assignee: Apache Spark > HadoopFsRelation doesn't reload the hadoop configuration for

[jira] [Assigned] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11332: Assignee: DB Tsai (was: Apache Spark) > WeightedLeastSquares should use ml features

[jira] [Assigned] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11332: Assignee: Apache Spark (was: DB Tsai) > WeightedLeastSquares should use ml features

[jira] [Commented] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977864#comment-14977864 ] Apache Spark commented on SPARK-11332: -- User 'nakul02' has created a pull request for this issue:

[jira] [Commented] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977891#comment-14977891 ] Sean Owen commented on SPARK-11332: --- They need to be a Contributor in JIRA. I can add this person. If

[jira] [Created] (SPARK-11365) consolidate aggregates for summary statistics in weighted least squares

2015-10-28 Thread holdenk (JIRA)
holdenk created SPARK-11365: --- Summary: consolidate aggregates for summary statistics in weighted least squares Key: SPARK-11365 URL: https://issues.apache.org/jira/browse/SPARK-11365 Project: Spark

[jira] [Updated] (SPARK-11103) Filter applied on Merged Parquet shema with new column fail with (java.lang.IllegalArgumentException: Column [column_name] was not found in schema!)

2015-10-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-11103: --- Assignee: Hyukjin Kwon > Filter applied on Merged Parquet shema with new column fail with >

[jira] [Commented] (SPARK-2089) With YARN, preferredNodeLocalityData isn't honored

2015-10-28 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977969#comment-14977969 ] Saisai Shao commented on SPARK-2089: Hi [~pwendell], [~mridulm80], [~sandyr] and [~lianhuiwang], I'm

[jira] [Created] (SPARK-11369) SparkR glm should support setting standardize

2015-10-28 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-11369: --- Summary: SparkR glm should support setting standardize Key: SPARK-11369 URL: https://issues.apache.org/jira/browse/SPARK-11369 Project: Spark Issue Type:

[jira] [Created] (SPARK-11370) fix a bug in GroupedIterator and create unit test for it

2015-10-28 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-11370: --- Summary: fix a bug in GroupedIterator and create unit test for it Key: SPARK-11370 URL: https://issues.apache.org/jira/browse/SPARK-11370 Project: Spark Issue

[jira] [Commented] (SPARK-11370) fix a bug in GroupedIterator and create unit test for it

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978125#comment-14978125 ] Apache Spark commented on SPARK-11370: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11370) fix a bug in GroupedIterator and create unit test for it

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11370: Assignee: (was: Apache Spark) > fix a bug in GroupedIterator and create unit test for

[jira] [Assigned] (SPARK-11370) fix a bug in GroupedIterator and create unit test for it

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11370: Assignee: Apache Spark > fix a bug in GroupedIterator and create unit test for it >

[jira] [Updated] (SPARK-11349) Support transform string label for RFormula

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11349: -- Shepherd: Xiangrui Meng Target Version/s: 1.6.0 > Support transform string label

[jira] [Updated] (SPARK-11349) Support transform string label for RFormula

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11349: -- Assignee: Yanbo Liang > Support transform string label for RFormula >

[jira] [Commented] (SPARK-11337) Make example code in user guide testable

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978614#comment-14978614 ] Xiangrui Meng commented on SPARK-11337: --- [~yinxusen] I created one sub-task (SPARK-11380) as a

[jira] [Assigned] (SPARK-11379) ExpressionEncoder can't handle top level primitive type correctly

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11379: Assignee: (was: Apache Spark) > ExpressionEncoder can't handle top level primitive

[jira] [Commented] (SPARK-11379) ExpressionEncoder can't handle top level primitive type correctly

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978616#comment-14978616 ] Apache Spark commented on SPARK-11379: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Updated] (SPARK-11369) SparkR glm should support setting standardize

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11369: -- Assignee: Yanbo Liang > SparkR glm should support setting standardize >

[jira] [Updated] (SPARK-11369) SparkR glm should support setting standardize

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11369: -- Target Version/s: 1.6.0 > SparkR glm should support setting standardize >

[jira] [Assigned] (SPARK-11379) ExpressionEncoder can't handle top level primitive type correctly

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11379: Assignee: Apache Spark > ExpressionEncoder can't handle top level primitive type

[jira] [Commented] (SPARK-9836) Provide R-like summary statistics for ordinary least squares via normal equation solver

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978615#comment-14978615 ] Xiangrui Meng commented on SPARK-9836: -- Sorry for late response! There are more starter tasks coming

[jira] [Commented] (SPARK-11346) Spark EventLog for completed applications

2015-10-28 Thread Milan Brna (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978671#comment-14978671 ] Milan Brna commented on SPARK-11346: Marcelo, thank you, you're right with the

[jira] [Resolved] (SPARK-11377) withNewChildren should not convert StructType to Seq

2015-10-28 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-11377. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9334

[jira] [Updated] (SPARK-10561) Provide tooling for auto-generating Spark SQL reference manual

2015-10-28 Thread Ted Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated SPARK-10561: --- Description: Here is the discussion thread: http://search-hadoop.com/m/q3RTtcD20F1o62xE Richard Hillegas

[jira] [Commented] (SPARK-3655) Support sorting of values in addition to keys (i.e. secondary sort)

2015-10-28 Thread swetha k (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978610#comment-14978610 ] swetha k commented on SPARK-3655: - [~koert] If I don't put the list as a materialized view in memory,

[jira] [Created] (SPARK-11380) Replace example code in mllib-frequent-pattern-mining.md using include_example

2015-10-28 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-11380: - Summary: Replace example code in mllib-frequent-pattern-mining.md using include_example Key: SPARK-11380 URL: https://issues.apache.org/jira/browse/SPARK-11380

[jira] [Resolved] (SPARK-11369) SparkR glm should support setting standardize

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11369. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9331

[jira] [Resolved] (SPARK-11367) Python LinearRegression should support setting solver

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11367. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9328

[jira] [Updated] (SPARK-11367) Python LinearRegression should support setting solver

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11367: -- Assignee: Yanbo Liang > Python LinearRegression should support setting solver >

[jira] [Updated] (SPARK-11367) Python LinearRegression should support setting solver

2015-10-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11367: -- Target Version/s: 1.6.0 > Python LinearRegression should support setting solver >

[jira] [Commented] (SPARK-11346) Spark EventLog for completed applications

2015-10-28 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978763#comment-14978763 ] Marcelo Vanzin commented on SPARK-11346: Sorry, I don't really understand your question. But

[jira] [Commented] (SPARK-11337) Make example code in user guide testable

2015-10-28 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978799#comment-14978799 ] Xusen Yin commented on SPARK-11337: --- I'll create subtasks later. One more thing, we need to

[jira] [Comment Edited] (SPARK-11337) Make example code in user guide testable

2015-10-28 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978799#comment-14978799 ] Xusen Yin edited comment on SPARK-11337 at 10/28/15 5:24 PM: - I'll create

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results

2015-10-28 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Attachment: bug_reproduce_50k.zip With this data size, you will surely reproduce the
