[jira] [Updated] (SPARK-15825) sort-merge-join gives invalid results when joining on a tupled key

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15825: Assignee: Herman van Hovell > sort-merge-join gives invalid results when joining on a tupled key >

[jira] [Updated] (SPARK-15822) segmentation violation in o.a.s.unsafe.types.UTF8String with spark.memory.offHeap.enabled=true

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15822: Assignee: Herman van Hovell > segmentation violation in o.a.s.unsafe.types.UTF8String with > spark

[jira] [Commented] (SPARK-9838) Support Poisson family in SparkR:::glm

2016-06-09 Thread Zhang Mengqi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323953#comment-15323953 ] Zhang Mengqi commented on SPARK-9838: - Hi Xiangrui Meng, I'm a student who are worki

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-06-09 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323947#comment-15323947 ] Takeshi Yamamuro commented on SPARK-15585: -- okay, I'll push later. > Don't use

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323946#comment-15323946 ] Reynold Xin commented on SPARK-15585: - Great let's update the documentation that way.

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-06-09 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323944#comment-15323944 ] Takeshi Yamamuro commented on SPARK-15585: -- yea, I manually checked that it work

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323940#comment-15323940 ] Reynold Xin commented on SPARK-15585: - Looks good. Does empty string actually work?

[jira] [Updated] (SPARK-15864) Inconsistent Behaviors when Uncaching Non-cached Tables

2016-06-09 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-15864: Summary: Inconsistent Behaviors when Uncaching Non-cached Tables (was: Inconsistent Behaviors for Uncachin

[jira] [Created] (SPARK-15865) Blacklist should not result in job hanging with less than 4 executors

2016-06-09 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-15865: Summary: Blacklist should not result in job hanging with less than 4 executors Key: SPARK-15865 URL: https://issues.apache.org/jira/browse/SPARK-15865 Project: Spark

[jira] [Assigned] (SPARK-15864) Inconsistent Behaviors for Uncaching Non-cached Tables

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15864: Assignee: (was: Apache Spark) > Inconsistent Behaviors for Uncaching Non-cached Tables

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-06-09 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323936#comment-15323936 ] Takeshi Yamamuro commented on SPARK-15585: -- Understood. Anyway, I think it's oka

[jira] [Commented] (SPARK-15864) Inconsistent Behaviors for Uncaching Non-cached Tables

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323938#comment-15323938 ] Apache Spark commented on SPARK-15864: -- User 'gatorsmile' has created a pull request

[jira] [Assigned] (SPARK-15864) Inconsistent Behaviors for Uncaching Non-cached Tables

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15864: Assignee: Apache Spark > Inconsistent Behaviors for Uncaching Non-cached Tables >

[jira] [Assigned] (SPARK-15863) Update SQL programming guide for Spark 2.0

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15863: Assignee: Cheng Lian (was: Apache Spark) > Update SQL programming guide for Spark 2.0 > -

[jira] [Assigned] (SPARK-15863) Update SQL programming guide for Spark 2.0

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15863: Assignee: Apache Spark (was: Cheng Lian) > Update SQL programming guide for Spark 2.0 > -

[jira] [Commented] (SPARK-15863) Update SQL programming guide for Spark 2.0

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323929#comment-15323929 ] Apache Spark commented on SPARK-15863: -- User 'liancheng' has created a pull request

[jira] [Created] (SPARK-15864) Inconsistent Behaviors for Uncaching Non-cached Tables

2016-06-09 Thread Xiao Li (JIRA)
Xiao Li created SPARK-15864: --- Summary: Inconsistent Behaviors for Uncaching Non-cached Tables Key: SPARK-15864 URL: https://issues.apache.org/jira/browse/SPARK-15864 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-15863) Update SQL programming guide for Spark 2.0

2016-06-09 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-15863: -- Summary: Update SQL programming guide for Spark 2.0 Key: SPARK-15863 URL: https://issues.apache.org/jira/browse/SPARK-15863 Project: Spark Issue Type: Documentat

[jira] [Resolved] (SPARK-15696) Improve `crosstab` to have a consistent column order

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15696. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > Improve `crossta

[jira] [Resolved] (SPARK-15791) NPE in ScalarSubquery

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15791. - Resolution: Fixed Fix Version/s: 2.0.0 > NPE in ScalarSubquery > - > >

[jira] [Closed] (SPARK-15842) Add support for socket stream.

2016-06-09 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Sharma closed SPARK-15842. --- Resolution: Not A Problem > Add support for socket stream. > -- >

[jira] [Commented] (SPARK-15842) Add support for socket stream.

2016-06-09 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323837#comment-15323837 ] Prashant Sharma commented on SPARK-15842: - Thank you for making it clear. Actual

[jira] [Closed] (SPARK-15838) CACHE TABLE AS SELECT should not replace the existing Temp Table

2016-06-09 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li closed SPARK-15838. --- Resolution: Won't Fix > CACHE TABLE AS SELECT should not replace the existing Temp Table > --

[jira] [Commented] (SPARK-15862) Better Error Message When Having Database Name in CACHE TABLE AS SELECT

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323828#comment-15323828 ] Apache Spark commented on SPARK-15862: -- User 'gatorsmile' has created a pull request

[jira] [Assigned] (SPARK-15862) Better Error Message When Having Database Name in CACHE TABLE AS SELECT

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15862: Assignee: Apache Spark > Better Error Message When Having Database Name in CACHE TABLE AS

[jira] [Assigned] (SPARK-15862) Better Error Message When Having Database Name in CACHE TABLE AS SELECT

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15862: Assignee: (was: Apache Spark) > Better Error Message When Having Database Name in CACH

[jira] [Created] (SPARK-15862) Better Error Message When Having Database Name in CACHE TABLE AS SELECT

2016-06-09 Thread Xiao Li (JIRA)
Xiao Li created SPARK-15862: --- Summary: Better Error Message When Having Database Name in CACHE TABLE AS SELECT Key: SPARK-15862 URL: https://issues.apache.org/jira/browse/SPARK-15862 Project: Spark

[jira] [Updated] (SPARK-15838) CACHE TABLE AS SELECT should not replace the existing Temp Table

2016-06-09 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-15838: Description: -Currently, {{CACHE TABLE AS SELECT}} replaces the existing Temp Table, if existed. This beha

[jira] [Updated] (SPARK-15838) CACHE TABLE AS SELECT should not replace the existing Temp Table

2016-06-09 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-15838: Description: Currently, {{CACHE TABLE AS SELECT}} replaces the existing Temp Table, if existed. This behav

[jira] [Updated] (SPARK-15861) pyspark mapPartitions with none generator functions / functors

2016-06-09 Thread Greg Bowyer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Bowyer updated SPARK-15861: Description: Hi all, it appears that the method `rdd.mapPartitions` does odd things if it is fed a

[jira] [Created] (SPARK-15861) pyspark mapPartitions with none generator functions / functors

2016-06-09 Thread Greg Bowyer (JIRA)
Greg Bowyer created SPARK-15861: --- Summary: pyspark mapPartitions with none generator functions / functors Key: SPARK-15861 URL: https://issues.apache.org/jira/browse/SPARK-15861 Project: Spark

[jira] [Commented] (SPARK-15858) "evaluateEachIteration" will fail on trying to run it on a model with 500+ trees

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323797#comment-15323797 ] Apache Spark commented on SPARK-15858: -- User 'mhmoudr' has created a pull request fo

[jira] [Assigned] (SPARK-15825) sort-merge-join gives invalid results when joining on a tupled key

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15825: Assignee: Apache Spark > sort-merge-join gives invalid results when joining on a tupled ke

[jira] [Assigned] (SPARK-15822) segmentation violation in o.a.s.unsafe.types.UTF8String with spark.memory.offHeap.enabled=true

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15822: Assignee: Apache Spark > segmentation violation in o.a.s.unsafe.types.UTF8String with > s

[jira] [Assigned] (SPARK-15825) sort-merge-join gives invalid results when joining on a tupled key

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15825: Assignee: (was: Apache Spark) > sort-merge-join gives invalid results when joining on

[jira] [Commented] (SPARK-15825) sort-merge-join gives invalid results when joining on a tupled key

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323783#comment-15323783 ] Apache Spark commented on SPARK-15825: -- User 'hvanhovell' has created a pull request

[jira] [Assigned] (SPARK-15822) segmentation violation in o.a.s.unsafe.types.UTF8String with spark.memory.offHeap.enabled=true

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15822: Assignee: (was: Apache Spark) > segmentation violation in o.a.s.unsafe.types.UTF8Strin

[jira] [Commented] (SPARK-15822) segmentation violation in o.a.s.unsafe.types.UTF8String with spark.memory.offHeap.enabled=true

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323781#comment-15323781 ] Apache Spark commented on SPARK-15822: -- User 'hvanhovell' has created a pull request

[jira] [Assigned] (SPARK-15858) "evaluateEachIteration" will fail on trying to run it on a model with 500+ trees

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15858: Assignee: Apache Spark > "evaluateEachIteration" will fail on trying to run it on a model

[jira] [Assigned] (SPARK-15858) "evaluateEachIteration" will fail on trying to run it on a model with 500+ trees

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15858: Assignee: (was: Apache Spark) > "evaluateEachIteration" will fail on trying to run it

[jira] [Commented] (SPARK-15858) "evaluateEachIteration" will fail on trying to run it on a model with 500+ trees

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323777#comment-15323777 ] Apache Spark commented on SPARK-15858: -- User 'mhmoudr' has created a pull request fo

[jira] [Assigned] (SPARK-15860) Metrics for codegen size and perf

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15860: Assignee: Apache Spark > Metrics for codegen size and perf > -

[jira] [Assigned] (SPARK-15860) Metrics for codegen size and perf

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15860: Assignee: (was: Apache Spark) > Metrics for codegen size and perf > --

[jira] [Commented] (SPARK-15860) Metrics for codegen size and perf

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323761#comment-15323761 ] Apache Spark commented on SPARK-15860: -- User 'ericl' has created a pull request for

[jira] [Created] (SPARK-15860) Metrics for codegen size and perf

2016-06-09 Thread Eric Liang (JIRA)
Eric Liang created SPARK-15860: -- Summary: Metrics for codegen size and perf Key: SPARK-15860 URL: https://issues.apache.org/jira/browse/SPARK-15860 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-15850) Remove function grouping in SparkSession

2016-06-09 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-15850. --- Resolution: Resolved > Remove function grouping in SparkSession > ---

[jira] [Resolved] (SPARK-15853) HDFSMetadataLog.get leaks the input stream

2016-06-09 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-15853. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13583 [https://g

[jira] [Commented] (SPARK-15856) Revert API breaking changes made in DataFrameReader.text and SQLContext.range

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323715#comment-15323715 ] Reynold Xin commented on SPARK-15856: - cc [~koert] > Revert API breaking changes mad

[jira] [Commented] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-09 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323709#comment-15323709 ] Marcelo Vanzin commented on SPARK-15851: It should be simple to fix it to work wi

[jira] [Commented] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-09 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323704#comment-15323704 ] Alexander Ulanov commented on SPARK-15851: -- Sorry for confusion, I mean the shel

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-06-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323702#comment-15323702 ] Joseph K. Bradley commented on SPARK-15581: --- Synced some in person around the s

[jira] [Assigned] (SPARK-15859) Optimize the Partition Pruning with Disjunction

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15859: Assignee: Apache Spark > Optimize the Partition Pruning with Disjunction > ---

[jira] [Commented] (SPARK-15859) Optimize the Partition Pruning with Disjunction

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323683#comment-15323683 ] Apache Spark commented on SPARK-15859: -- User 'chenghao-intel' has created a pull req

[jira] [Assigned] (SPARK-15859) Optimize the Partition Pruning with Disjunction

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15859: Assignee: (was: Apache Spark) > Optimize the Partition Pruning with Disjunction >

[jira] [Resolved] (SPARK-15794) Should truncate toString() of very wide schemas

2016-06-09 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-15794. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13537 [https://github.

[jira] [Created] (SPARK-15859) Optimize the Partition Pruning with Disjunction

2016-06-09 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-15859: - Summary: Optimize the Partition Pruning with Disjunction Key: SPARK-15859 URL: https://issues.apache.org/jira/browse/SPARK-15859 Project: Spark Issue Type: Improve

[jira] [Commented] (SPARK-15855) dataframe.R example fails with "java.io.IOException: No input paths specified in job"

2016-06-09 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323667#comment-15323667 ] Shivaram Venkataraman commented on SPARK-15855: --- For the example to work in

[jira] [Commented] (SPARK-15509) R MLlib algorithms should support input columns "features" and "label"

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323666#comment-15323666 ] Apache Spark commented on SPARK-15509: -- User 'keypointt' has created a pull request

[jira] [Assigned] (SPARK-15509) R MLlib algorithms should support input columns "features" and "label"

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15509: Assignee: Apache Spark > R MLlib algorithms should support input columns "features" and "l

[jira] [Assigned] (SPARK-15509) R MLlib algorithms should support input columns "features" and "label"

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15509: Assignee: (was: Apache Spark) > R MLlib algorithms should support input columns "featu

[jira] [Commented] (SPARK-15858) "evaluateEachIteration" will fail on trying to run it on a model with 500+ trees

2016-06-09 Thread Mahmoud Rawas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323662#comment-15323662 ] Mahmoud Rawas commented on SPARK-15858: --- I am working on a solution. > "evaluateEa

[jira] [Resolved] (SPARK-15841) [SPARK REPL] REPLSuite has incorrect env set for a couple of tests.

2016-06-09 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-15841. -- Resolution: Fixed Assignee: Prashant Sharma Fix Version/s: 2.0.0 > [SPARK REPL]

[jira] [Created] (SPARK-15858) "evaluateEachIteration" will fail on trying to run it on a model with 500+ trees

2016-06-09 Thread Mahmoud Rawas (JIRA)
Mahmoud Rawas created SPARK-15858: - Summary: "evaluateEachIteration" will fail on trying to run it on a model with 500+ trees Key: SPARK-15858 URL: https://issues.apache.org/jira/browse/SPARK-15858 P

[jira] [Updated] (SPARK-15856) Revert API breaking changes made in DataFrameReader.text and SQLContext.range

2016-06-09 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15856: --- Description: In Spark 2.0, after unifying Datasets and DataFrames, we made two API breaking changes:

[jira] [Commented] (SPARK-15857) Add Caller Context in Spark

2016-06-09 Thread Weiqing Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323655#comment-15323655 ] Weiqing Yang commented on SPARK-15857: -- I will attach the design doc soon. > Add Ca

[jira] [Updated] (SPARK-15856) Revert API breaking changes made in DataFrameReader.text and SQLContext.range

2016-06-09 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15856: --- Description: In Spark 2.0, after unifying Datasets and DataFrames, we made two API breaking changes:

[jira] [Created] (SPARK-15857) Add Caller Context in Spark

2016-06-09 Thread Weiqing Yang (JIRA)
Weiqing Yang created SPARK-15857: Summary: Add Caller Context in Spark Key: SPARK-15857 URL: https://issues.apache.org/jira/browse/SPARK-15857 Project: Spark Issue Type: New Feature

[jira] [Resolved] (SPARK-12447) Only update AM's internal state when executor is successfully launched by NM

2016-06-09 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-12447. Resolution: Fixed Assignee: Saisai Shao (was: Apache Spark) Fix Version/s:

[jira] [Created] (SPARK-15856) Revert API breaking changes made in DataFrameReader.text and SQLContext.range

2016-06-09 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-15856: -- Summary: Revert API breaking changes made in DataFrameReader.text and SQLContext.range Key: SPARK-15856 URL: https://issues.apache.org/jira/browse/SPARK-15856 Project: Sp

[jira] [Updated] (SPARK-15822) segmentation violation in o.a.s.unsafe.types.UTF8String with spark.memory.offHeap.enabled=true

2016-06-09 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-15822: -- Priority: Blocker (was: Critical) > segmentation violation in o.a.s.unsafe.types.UTF8S

[jira] [Updated] (SPARK-15822) segmentation violation in o.a.s.unsafe.types.UTF8String with spark.memory.offHeap.enabled=true

2016-06-09 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-15822: -- Description: Executors fail with segmentation violation while running application with

[jira] [Commented] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-09 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323644#comment-15323644 ] Marcelo Vanzin commented on SPARK-15851: bq. "spark-build-info" can be rewritten

[jira] [Created] (SPARK-15855) dataframe.R example fails with "java.io.IOException: No input paths specified in job"

2016-06-09 Thread Yesha Vora (JIRA)
Yesha Vora created SPARK-15855: -- Summary: dataframe.R example fails with "java.io.IOException: No input paths specified in job" Key: SPARK-15855 URL: https://issues.apache.org/jira/browse/SPARK-15855 Pro

[jira] [Commented] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-09 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323638#comment-15323638 ] Alexander Ulanov commented on SPARK-15851: -- I can do that. However, it seems tha

[jira] [Created] (SPARK-15854) Spark History server gets null pointer exception

2016-06-09 Thread Yesha Vora (JIRA)
Yesha Vora created SPARK-15854: -- Summary: Spark History server gets null pointer exception Key: SPARK-15854 URL: https://issues.apache.org/jira/browse/SPARK-15854 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-09 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323629#comment-15323629 ] Marcelo Vanzin commented on SPARK-15851: Adding "bash" explicitly in the pom shou

[jira] [Assigned] (SPARK-15853) HDFSMetadataLog.get leaks the input stream

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15853: Assignee: Shixiong Zhu (was: Apache Spark) > HDFSMetadataLog.get leaks the input stream >

[jira] [Commented] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-09 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323624#comment-15323624 ] Alexander Ulanov commented on SPARK-15851: -- This does not work because Ant uses

[jira] [Assigned] (SPARK-15853) HDFSMetadataLog.get leaks the input stream

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15853: Assignee: Apache Spark (was: Shixiong Zhu) > HDFSMetadataLog.get leaks the input stream >

[jira] [Commented] (SPARK-15853) HDFSMetadataLog.get leaks the input stream

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323625#comment-15323625 ] Apache Spark commented on SPARK-15853: -- User 'zsxwing' has created a pull request fo

[jira] [Updated] (SPARK-15830) Spark application should get hive tokens only when it is required

2016-06-09 Thread Yesha Vora (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yesha Vora updated SPARK-15830: --- Affects Version/s: 1.6.1 > Spark application should get hive tokens only when it is required > --

[jira] [Created] (SPARK-15853) HDFSMetadataLog.get leaks the input stream

2016-06-09 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-15853: Summary: HDFSMetadataLog.get leaks the input stream Key: SPARK-15853 URL: https://issues.apache.org/jira/browse/SPARK-15853 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-09 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323620#comment-15323620 ] Marcelo Vanzin commented on SPARK-15851: [~tgraves] (and [~Dhruve Ashar]) hah we

[jira] [Updated] (SPARK-15794) Should truncate toString() of very wide schemas

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15794: Assignee: Eric Liang > Should truncate toString() of very wide schemas > --

[jira] [Updated] (SPARK-15764) Replace n^2 loop in BindReferences

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15764: Description: BindReferences contains a n^2 loop which causes performance issues when operating ove

[jira] [Updated] (SPARK-15794) Should truncate toString() of very wide schemas

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15794: Target Version/s: 2.0.0 > Should truncate toString() of very wide schemas > ---

[jira] [Updated] (SPARK-15764) Replace n^2 loop in BindReferences

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15764: Issue Type: Sub-task (was: Improvement) Parent: SPARK-15852 > Replace n^2 loop in BindRefe

[jira] [Updated] (SPARK-15742) Reduce collections allocations in Catalyst tree transformation methods

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15742: Issue Type: Sub-task (was: Improvement) Parent: SPARK-15852 > Reduce collections allocatio

[jira] [Updated] (SPARK-15748) Replace inefficient foldLeft() call in PartitionStatistics

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15748: Issue Type: Sub-task (was: Improvement) Parent: SPARK-15852 > Replace inefficient foldLeft

[jira] [Updated] (SPARK-15762) Cache Metadata.hashCode and use a singleton for Metadata.empty

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15762: Issue Type: Sub-task (was: Improvement) Parent: SPARK-15852 > Cache Metadata.hashCode and

[jira] [Updated] (SPARK-15764) Replace n^2 loop in BindReferences

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15764: Description: BindReferences contains a n^2 loop which causes performance issues when operating ove

[jira] [Updated] (SPARK-15794) Should truncate toString() of very wide schemas

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15794: Issue Type: Sub-task (was: Bug) Parent: SPARK-15852 > Should truncate toString() of very w

[jira] [Created] (SPARK-15852) Improve query planning performance for wide nested schema

2016-06-09 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15852: --- Summary: Improve query planning performance for wide nested schema Key: SPARK-15852 URL: https://issues.apache.org/jira/browse/SPARK-15852 Project: Spark Issue

[jira] [Resolved] (SPARK-14321) Reduce date format cost in date functions

2016-06-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14321. - Resolution: Fixed Assignee: Herman van Hovell Fix Version/s: 2.0.0 > Reduce date

[jira] [Updated] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-09 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Ulanov updated SPARK-15851: - Fix Version/s: 2.0.0 > Spark 2.0 does not compile in Windows 7 >

[jira] [Updated] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-09 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Ulanov updated SPARK-15851: - Target Version/s: 2.0.0 Fix Version/s: (was: 2.0.0) > Spark 2.0 does not compi

[jira] [Created] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-09 Thread Alexander Ulanov (JIRA)
Alexander Ulanov created SPARK-15851: Summary: Spark 2.0 does not compile in Windows 7 Key: SPARK-15851 URL: https://issues.apache.org/jira/browse/SPARK-15851 Project: Spark Issue Type: B

[jira] [Assigned] (SPARK-15850) Remove function grouping in SparkSession

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15850: Assignee: Reynold Xin (was: Apache Spark) > Remove function grouping in SparkSession > --

[jira] [Commented] (SPARK-15850) Remove function grouping in SparkSession

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323560#comment-15323560 ] Apache Spark commented on SPARK-15850: -- User 'rxin' has created a pull request for t

[jira] [Assigned] (SPARK-15850) Remove function grouping in SparkSession

2016-06-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15850: Assignee: Apache Spark (was: Reynold Xin) > Remove function grouping in SparkSession > --

  1   2   3   >