[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095588#comment-15095588 ] Sun Rui commented on SPARK-6817: Attached the first draft design doc, please review and give comments >

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095590#comment-15095590 ] Sun Rui commented on SPARK-6817: [~mpollock], this PR will support row-based UDF. UDF operating on columns

[jira] [Updated] (SPARK-12373) Type coercion rule of dividing two decimal values may choose an intermediate precision that does not have enough number of digits at the left of decimal point

2016-01-12 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-12373: - Target Version/s: 2.0.0 (was: 1.6.1, 2.0.0) > Type coercion rule of dividing two decimal values may

[jira] [Updated] (SPARK-10538) java.lang.NegativeArraySizeException during join

2016-01-12 Thread mayxine (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] mayxine updated SPARK-10538: Attachment: java.lang.NegativeArraySizeException.png > java.lang.NegativeArraySizeException during join >

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095600#comment-15095600 ] Sun Rui commented on SPARK-6817: [~shivaram] I first focus on the row-based UDF functionality. For

[jira] [Updated] (SPARK-12558) AnalysisException when multiple functions applied in GROUP BY clause

2016-01-12 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-12558: - Assignee: Dilip Biswal > AnalysisException when multiple functions applied in GROUP BY clause >

[jira] [Assigned] (SPARK-12796) initial prototype: projection/filter/range

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12796: Assignee: Apache Spark (was: Davies Liu) > initial prototype: projection/filter/range >

[jira] [Commented] (SPARK-12796) initial prototype: projection/filter/range

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095710#comment-15095710 ] Apache Spark commented on SPARK-12796: -- User 'davies' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12796) initial prototype: projection/filter/range

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12796: Assignee: Davies Liu (was: Apache Spark) > initial prototype: projection/filter/range >

[jira] [Comment Edited] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095721#comment-15095721 ] Reynold Xin edited comment on SPARK-6817 at 1/13/16 6:57 AM: - [~sunrui] Why

[jira] [Comment Edited] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095721#comment-15095721 ] Reynold Xin edited comment on SPARK-6817 at 1/13/16 6:58 AM: - [~sunrui] Why

[jira] [Created] (SPARK-12792) Refactor RRDD to support R UDF

2016-01-12 Thread Sun Rui (JIRA)
Sun Rui created SPARK-12792: --- Summary: Refactor RRDD to support R UDF Key: SPARK-12792 URL: https://issues.apache.org/jira/browse/SPARK-12792 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095594#comment-15095594 ] Sun Rui commented on SPARK-6817: [~piccolbo] I am not sure If I understand your meaning. This is to

[jira] [Created] (SPARK-12797) Aggregation without grouping keys

2016-01-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12797: -- Summary: Aggregation without grouping keys Key: SPARK-12797 URL: https://issues.apache.org/jira/browse/SPARK-12797 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-12796) initial prototype: projection/filter/range

2016-01-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12796: -- Summary: initial prototype: projection/filter/range Key: SPARK-12796 URL: https://issues.apache.org/jira/browse/SPARK-12796 Project: Spark Issue Type: New

[jira] [Comment Edited] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095734#comment-15095734 ] Jeff Zhang edited comment on SPARK-6817 at 1/13/16 7:09 AM: +1 on block based

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Weiqiang Zhuang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095756#comment-15095756 ] Weiqiang Zhuang commented on SPARK-6817: We did see both apply use cases. But the

[jira] [Updated] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sun Rui updated SPARK-6817: --- Attachment: SparkR UDF Design Documentation v1.pdf > DataFrame UDFs in R > --- > >

[jira] [Resolved] (SPARK-12558) AnalysisException when multiple functions applied in GROUP BY clause

2016-01-12 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-12558. -- Resolution: Fixed Fix Version/s: 1.6.1 2.0.0 Issue resolved by pull request

[jira] [Created] (SPARK-12798) Broadcast hash join

2016-01-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12798: -- Summary: Broadcast hash join Key: SPARK-12798 URL: https://issues.apache.org/jira/browse/SPARK-12798 Project: Spark Issue Type: New Feature

[jira] [Assigned] (SPARK-12728) Integrate SQL generation feature with native view

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12728: Assignee: (was: Apache Spark) > Integrate SQL generation feature with native view >

[jira] [Resolved] (SPARK-12785) Implement columnar in memory representation

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12785. - Resolution: Fixed Assignee: Nong Li Fix Version/s: 2.0.0 > Implement columnar in

[jira] [Created] (SPARK-12790) Remove HistoryServer old multiple files format

2016-01-12 Thread Andrew Or (JIRA)
Andrew Or created SPARK-12790: - Summary: Remove HistoryServer old multiple files format Key: SPARK-12790 URL: https://issues.apache.org/jira/browse/SPARK-12790 Project: Spark Issue Type:

[jira] [Commented] (SPARK-12791) Simplify CaseWhen by breaking "branches" into "conditions" and "values"

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095537#comment-15095537 ] Apache Spark commented on SPARK-12791: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12791) Simplify CaseWhen by breaking "branches" into "conditions" and "values"

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12791: Assignee: Apache Spark (was: Reynold Xin) > Simplify CaseWhen by breaking "branches"

[jira] [Commented] (SPARK-12172) Consider removing SparkR internal RDD APIs

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095605#comment-15095605 ] Sun Rui commented on SPARK-12172: - As Spark is migrating from RDD API to Dataset API, after Dataset API

[jira] [Comment Edited] (SPARK-12172) Consider removing SparkR internal RDD APIs

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095605#comment-15095605 ] Sun Rui edited comment on SPARK-12172 at 1/13/16 4:50 AM: -- As Spark is migrating

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095734#comment-15095734 ] Jeff Zhang commented on SPARK-6817: --- +1 on block based API, UDF would usually call other R packages and

[jira] [Created] (SPARK-12800) Subtle bug on Spark Yarn Client under Kerberos Security Mode

2016-01-12 Thread Chester (JIRA)
Chester created SPARK-12800: --- Summary: Subtle bug on Spark Yarn Client under Kerberos Security Mode Key: SPARK-12800 URL: https://issues.apache.org/jira/browse/SPARK-12800 Project: Spark Issue

[jira] [Assigned] (SPARK-12771) Improve code generation for CaseWhen

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12771: Assignee: (was: Apache Spark) > Improve code generation for CaseWhen >

[jira] [Assigned] (SPARK-12771) Improve code generation for CaseWhen

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12771: Assignee: Apache Spark > Improve code generation for CaseWhen >

[jira] [Commented] (SPARK-12771) Improve code generation for CaseWhen

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095790#comment-15095790 ] Apache Spark commented on SPARK-12771: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-12635) More efficient (column batch) serialization for Python/R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095494#comment-15095494 ] Sun Rui commented on SPARK-12635: - [~dselivanov] PySpark uses pickle and CloudPickle on python side and

[jira] [Comment Edited] (SPARK-12635) More efficient (column batch) serialization for Python/R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095494#comment-15095494 ] Sun Rui edited comment on SPARK-12635 at 1/13/16 2:35 AM: -- [~dselivanov] PySpark

[jira] [Resolved] (SPARK-12788) Simplify BooleanEquality by using casts

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12788. - Resolution: Fixed Fix Version/s: 2.0.0 > Simplify BooleanEquality by using casts >

[jira] [Created] (SPARK-12793) Support R UDF Evaluation

2016-01-12 Thread Sun Rui (JIRA)
Sun Rui created SPARK-12793: --- Summary: Support R UDF Evaluation Key: SPARK-12793 URL: https://issues.apache.org/jira/browse/SPARK-12793 Project: Spark Issue Type: Sub-task Components:

[jira] [Created] (SPARK-12794) Support Defining and Registration of R UDF

2016-01-12 Thread Sun Rui (JIRA)
Sun Rui created SPARK-12794: --- Summary: Support Defining and Registration of R UDF Key: SPARK-12794 URL: https://issues.apache.org/jira/browse/SPARK-12794 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-12692) Scala style: check no white space before comma and colon

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095714#comment-15095714 ] Apache Spark commented on SPARK-12692: -- User 'sarutak' has created a pull request for this issue:

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095747#comment-15095747 ] Reynold Xin commented on SPARK-6817: Please take a look at the original design doc for this:

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095745#comment-15095745 ] Sun Rui commented on SPARK-6817: [~rxin] Row-oriented R UDF is for SQL and is similar to Python UDF. I am

[jira] [Created] (SPARK-12799) Simplify various string output for expressions

2016-01-12 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-12799: --- Summary: Simplify various string output for expressions Key: SPARK-12799 URL: https://issues.apache.org/jira/browse/SPARK-12799 Project: Spark Issue Type:

[jira] [Commented] (SPARK-12728) Integrate SQL generation feature with native view

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095478#comment-15095478 ] Apache Spark commented on SPARK-12728: -- User 'liancheng' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12791) Simplify CaseWhen by breaking "branches" into "conditions" and "values"

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12791: Assignee: Reynold Xin (was: Apache Spark) > Simplify CaseWhen by breaking "branches"

[jira] [Created] (SPARK-12791) Simplify CaseWhen by breaking "branches" into "conditions" and "values"

2016-01-12 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-12791: --- Summary: Simplify CaseWhen by breaking "branches" into "conditions" and "values" Key: SPARK-12791 URL: https://issues.apache.org/jira/browse/SPARK-12791 Project: Spark

[jira] [Commented] (SPARK-4226) SparkSQL - Add support for subqueries in predicates

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095695#comment-15095695 ] Apache Spark commented on SPARK-4226: - User 'davies' has created a pull request for this issue:

[jira] [Updated] (SPARK-12795) Whole stage codegen

2016-01-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-12795: --- Description: Whole stage codegen is used by some modern MPP databases to archive great performance.

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Antonio Piccolboni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095776#comment-15095776 ] Antonio Piccolboni commented on SPARK-6817: --- My question made sense only wrt the block or

[jira] [Updated] (SPARK-12795) Whole stage codegen

2016-01-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-12795: --- Summary: Whole stage codegen (was: Compile multiple operator into a single Java function to avoid

[jira] [Created] (SPARK-12795) Compile multiple operator into a single Java function to avoid the overhead from materialize rows and Scala iterator

2016-01-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12795: -- Summary: Compile multiple operator into a single Java function to avoid the overhead from materialize rows and Scala iterator Key: SPARK-12795 URL:

[jira] [Comment Edited] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095721#comment-15095721 ] Reynold Xin edited comment on SPARK-6817 at 1/13/16 6:57 AM: - [~sunrui] Why

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095721#comment-15095721 ] Reynold Xin commented on SPARK-6817: [~sunrui] Why are you focusing on a row-based API? I think a

[jira] [Comment Edited] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Antonio Piccolboni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095776#comment-15095776 ] Antonio Piccolboni edited comment on SPARK-6817 at 1/13/16 7:41 AM: My

[jira] [Updated] (SPARK-12770) Implement rules for branch elimination for CaseWhen in SimplifyConditionals

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12770: Description: There are a few things we can do: 1. If a branch's condition is a true literal,

[jira] [Commented] (SPARK-12449) Pushing down arbitrary logical plans to data sources

2016-01-12 Thread Santiago M. Mola (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15093533#comment-15093533 ] Santiago M. Mola commented on SPARK-12449: -- Implementing this interface or an equivalent one

[jira] [Created] (SPARK-12771) Improve code generation for CaseWhen

2016-01-12 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-12771: --- Summary: Improve code generation for CaseWhen Key: SPARK-12771 URL: https://issues.apache.org/jira/browse/SPARK-12771 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-2984) FileNotFoundException on _temporary directory

2016-01-12 Thread Konstantin Shaposhnikov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15093511#comment-15093511 ] Konstantin Shaposhnikov commented on SPARK-2984: I am seeing the same error message with

[jira] [Created] (SPARK-12772) Better error message for parsing failure?

2016-01-12 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-12772: --- Summary: Better error message for parsing failure? Key: SPARK-12772 URL: https://issues.apache.org/jira/browse/SPARK-12772 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-12689) Migrate DDL parsing to the newly absorbed parser

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12689: Assignee: (was: Apache Spark) > Migrate DDL parsing to the newly absorbed parser >

[jira] [Updated] (SPARK-12770) Implement rules for branch elimination for CaseWhen in SimplifyConditionals

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12770: Summary: Implement rules for branch elimination for CaseWhen in SimplifyConditionals (was:

[jira] [Updated] (SPARK-12770) Implement rules for removing unnecessary branches for CaseWhen in SimplifyConditionals

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12770: Description: There are a few things we can do: 1. If a branch is a true literal, remove the

[jira] [Updated] (SPARK-12768) Remove CaseKeyWhen expression

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12768: Summary: Remove CaseKeyWhen expression (was: Remove CaseKeyWhen) > Remove CaseKeyWhen expression

[jira] [Updated] (SPARK-12762) Add unit test for simplifying if expression

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12762: Issue Type: Sub-task (was: Improvement) Parent: SPARK-12767 > Add unit test for

[jira] [Created] (SPARK-12773) Impurity and Sample details for each node of a decision tree

2016-01-12 Thread Rahul Tanwani (JIRA)
Rahul Tanwani created SPARK-12773: - Summary: Impurity and Sample details for each node of a decision tree Key: SPARK-12773 URL: https://issues.apache.org/jira/browse/SPARK-12773 Project: Spark

[jira] [Updated] (SPARK-12774) DataFrame.mapPartitions apply function operates on Pandas DataFrame instead of a generator or rows

2016-01-12 Thread Josh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh updated SPARK-12774: - Description: Currently DataFrame.mapPatitions is analogous to DataFrame.rdd.mapPatitions in both Spark and

[jira] [Created] (SPARK-12774) DataFrame.mapPartitions apply function operates on Pandas DataFrame instead of a generator or rows

2016-01-12 Thread Josh (JIRA)
Josh created SPARK-12774: Summary: DataFrame.mapPartitions apply function operates on Pandas DataFrame instead of a generator or rows Key: SPARK-12774 URL: https://issues.apache.org/jira/browse/SPARK-12774

[jira] [Assigned] (SPARK-12689) Migrate DDL parsing to the newly absorbed parser

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12689: Assignee: Apache Spark > Migrate DDL parsing to the newly absorbed parser >

[jira] [Commented] (SPARK-12689) Migrate DDL parsing to the newly absorbed parser

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15093672#comment-15093672 ] Apache Spark commented on SPARK-12689: -- User 'viirya' has created a pull request for this issue:

[jira] [Updated] (SPARK-12769) Remove If expression

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12769: Description: If can be a simple factory method for CaseWhen, similar to CaseKeyWhen. We can then

[jira] [Assigned] (SPARK-12768) Remove CaseKeyWhen expression

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12768: Assignee: Apache Spark (was: Reynold Xin) > Remove CaseKeyWhen expression >

[jira] [Assigned] (SPARK-12768) Remove CaseKeyWhen expression

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12768: Assignee: Reynold Xin (was: Apache Spark) > Remove CaseKeyWhen expression >

[jira] [Commented] (SPARK-12768) Remove CaseKeyWhen expression

2016-01-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15093506#comment-15093506 ] Apache Spark commented on SPARK-12768: -- User 'rxin' has created a pull request for this issue:

[jira] [Updated] (SPARK-12760) inaccurate description for difference between local vs cluster mode in closure handling

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12760: -- Priority: Minor (was: Trivial) Issue Type: Bug (was: Question) Summary: inaccurate

[jira] [Resolved] (SPARK-12766) Unshaded google guava classes in spark-network-common jar

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12766. --- Resolution: Not A Problem This is on purpose. Some Guava classes are used in the public Java API

[jira] [Comment Edited] (SPARK-12764) XML Column type is not supported

2016-01-12 Thread Ewan Leith (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15093836#comment-15093836 ] Ewan Leith edited comment on SPARK-12764 at 1/12/16 12:53 PM: -- What are you

[jira] [Commented] (SPARK-12775) Couldn't find leader offsets exception when hostname can't be resolved

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15093858#comment-15093858 ] Sean Owen commented on SPARK-12775: --- Hm, I don't think that's a spark problem though. > Couldn't find

[jira] [Resolved] (SPARK-12582) IndexShuffleBlockResolverSuite fails in windows

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12582. --- Resolution: Fixed Fix Version/s: 1.6.1 2.0.0 Issue resolved by pull

[jira] [Resolved] (SPARK-7615) MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7615. -- Resolution: Fixed Assignee: Sean Owen Fix Version/s: 2.0.0 1.6.1

[jira] [Resolved] (SPARK-12773) Impurity and Sample details for each node of a decision tree

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12773. --- Resolution: Invalid Target Version/s: (was: 1.5.2) Please ask questions at

[jira] [Updated] (SPARK-12759) Spark should fail fast if --executor-memory is too small for spark to start

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12759: -- Component/s: Spark Submit Spark Core > Spark should fail fast if --executor-memory is

[jira] [Updated] (SPARK-12763) Spark gets stuck executing SSB query

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12763: -- Component/s: SQL > Spark gets stuck executing SSB query > > >

[jira] [Resolved] (SPARK-2516) Bootstrapping

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2516. -- Resolution: Won't Fix This is the only one left under this umbrella; I assume it's stale, or really

[jira] [Resolved] (SPARK-3669) Extract IndexedRDD interface

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3669. -- Resolution: Won't Fix Resolved for now per parent discussion > Extract IndexedRDD interface >

[jira] [Resolved] (SPARK-3668) Support for arbitrary key types in IndexedRDD

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3668. -- Resolution: Won't Fix Resolved for now per parent discussion > Support for arbitrary key types in

[jira] [Resolved] (SPARK-4043) Add a flag for stopping threads of cancelled tasks if Thread.interrupt doesn't kill them

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4043. -- Resolution: Won't Fix I think this never went anywhere specific, so closing it > Add a flag for

[jira] [Resolved] (SPARK-3818) Graph coarsening

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3818. -- Resolution: Won't Fix > Graph coarsening > > > Key: SPARK-3818 >

[jira] [Resolved] (SPARK-3360) Add RowMatrix.multiply(Vector)

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3360. -- Resolution: Won't Fix > Add RowMatrix.multiply(Vector) > -- > >

[jira] [Updated] (SPARK-12638) Parameter explaination not very accurate for rdd function "aggregate"

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12638: -- Assignee: Tommy Yu > Parameter explaination not very accurate for rdd function "aggregate" >

[jira] [Resolved] (SPARK-12638) Parameter explaination not very accurate for rdd function "aggregate"

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12638. --- Resolution: Fixed Fix Version/s: 1.6.1 2.0.0 Issue resolved by pull

[jira] [Resolved] (SPARK-1521) Take character set size into account when compressing in-memory string columns

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1521. -- Resolution: Won't Fix I assume this is obsolete or else already implemented in some sense by tungsten

[jira] [Resolved] (SPARK-873) Add a way to specify rack topology in Mesos and standalone modes

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-873. - Resolution: Won't Fix > Add a way to specify rack topology in Mesos and standalone modes >

[jira] [Resolved] (SPARK-1515) Specialized ColumnTypes for Array, Map and Struct

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1515. -- Resolution: Won't Fix Assuming this is obsolete > Specialized ColumnTypes for Array, Map and Struct >

[jira] [Resolved] (SPARK-1614) Move Mesos protobufs out of TaskState

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1614. -- Resolution: Won't Fix > Move Mesos protobufs out of TaskState > - >

[jira] [Resolved] (SPARK-3055) Stack trace logged in driver on job failure is usually uninformative

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3055. -- Resolution: Won't Fix > Stack trace logged in driver on job failure is usually uninformative >

[jira] [Resolved] (SPARK-2359) Supporting common statistical functions in MLlib

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2359. -- Resolution: Done > Supporting common statistical functions in MLlib >

[jira] [Resolved] (SPARK-3172) Distinguish between shuffle spill on the map and reduce side

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3172. -- Resolution: Won't Fix > Distinguish between shuffle spill on the map and reduce side >

[jira] [Resolved] (SPARK-809) Give newly registered apps a set of executors right away

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-809. - Resolution: Won't Fix I'm assuming this is WontFix at this point. > Give newly registered apps a set of

[jira] [Resolved] (SPARK-5273) Improve documentation examples for LinearRegression

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5273. -- Resolution: Fixed Assignee: Sean Owen Fix Version/s: 2.0.0 1.6.1

[jira] [Updated] (SPARK-12759) Spark should fail fast if --executor-memory is too small for spark to start

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12759: -- Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) > Spark should fail fast if

[jira] [Updated] (SPARK-12765) CountVectorizerModel.transform lost the transformSchema

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12765: -- Fix Version/s: (was: 1.6.1) (was: 1.6.0) [~sloth2012] don't set fix

[jira] [Resolved] (SPARK-2011) Eliminate duplicate join in Pregel

2016-01-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2011. -- Resolution: Won't Fix > Eliminate duplicate join in Pregel > -- > >

  1   2   3   >