[jira] [Assigned] (SPARK-10720) Add a java wrapper to create dataframe from a local list of Java Beans.

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10720: Assignee: (was: Apache Spark) > Add a java wrapper to create dataframe from a local li

[jira] [Assigned] (SPARK-10720) Add a java wrapper to create dataframe from a local list of Java Beans.

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10720: Assignee: Apache Spark > Add a java wrapper to create dataframe from a local list of Java

[jira] [Commented] (SPARK-10720) Add a java wrapper to create dataframe from a local list of Java Beans.

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14904059#comment-14904059 ] Apache Spark commented on SPARK-10720: -- User 'holdenk' has created a pull request fo

[jira] [Created] (SPARK-10771) Implement the shuffle encryption with AES-CTR crypto using JCE key provider.

2015-09-22 Thread Ferdinand Xu (JIRA)
Ferdinand Xu created SPARK-10771: Summary: Implement the shuffle encryption with AES-CTR crypto using JCE key provider. Key: SPARK-10771 URL: https://issues.apache.org/jira/browse/SPARK-10771 Project:

[jira] [Commented] (SPARK-10770) SparkPlan.executeCollect/executeTake should return InternalRow rather than external Row

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14904046#comment-14904046 ] Apache Spark commented on SPARK-10770: -- User 'rxin' has created a pull request for t

[jira] [Assigned] (SPARK-10770) SparkPlan.executeCollect/executeTake should return InternalRow rather than external Row

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10770: Assignee: Reynold Xin (was: Apache Spark) > SparkPlan.executeCollect/executeTake should r

[jira] [Assigned] (SPARK-10770) SparkPlan.executeCollect/executeTake should return InternalRow rather than external Row

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10770: Assignee: Apache Spark (was: Reynold Xin) > SparkPlan.executeCollect/executeTake should r

[jira] [Created] (SPARK-10770) SparkPlan.executeCollect/executeTake should return InternalRow rather than external Row

2015-09-22 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-10770: --- Summary: SparkPlan.executeCollect/executeTake should return InternalRow rather than external Row Key: SPARK-10770 URL: https://issues.apache.org/jira/browse/SPARK-10770

[jira] [Resolved] (SPARK-10742) Add the ability to embed HTML relative links in job descriptions

2015-09-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-10742. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 > Add the ability to e

[jira] [Resolved] (SPARK-10652) Set meaningful job descriptions for streaming related jobs

2015-09-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-10652. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 > Set meaningful job d

[jira] [Assigned] (SPARK-10769) Fix o.a.s.streaming.CheckpointSuite.maintains rate controller

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10769: Assignee: (was: Apache Spark) > Fix o.a.s.streaming.CheckpointSuite.maintains rate con

[jira] [Assigned] (SPARK-10769) Fix o.a.s.streaming.CheckpointSuite.maintains rate controller

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10769: Assignee: Apache Spark > Fix o.a.s.streaming.CheckpointSuite.maintains rate controller > -

[jira] [Commented] (SPARK-10769) Fix o.a.s.streaming.CheckpointSuite.maintains rate controller

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903998#comment-14903998 ] Apache Spark commented on SPARK-10769: -- User 'zsxwing' has created a pull request fo

[jira] [Commented] (SPARK-10000) Consolidate cache memory management and execution memory management

2015-09-22 Thread Bowen Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903995#comment-14903995 ] Bowen Zhang commented on SPARK-10000: - [~rxin], I am very interested in this new stor

[jira] [Created] (SPARK-10769) Fix o.a.s.streaming.CheckpointSuite.maintains rate controller

2015-09-22 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-10769: Summary: Fix o.a.s.streaming.CheckpointSuite.maintains rate controller Key: SPARK-10769 URL: https://issues.apache.org/jira/browse/SPARK-10769 Project: Spark

[jira] [Created] (SPARK-10768) How to access columns with "." dot in their name in Spark SQL

2015-09-22 Thread Harut Martirosyan (JIRA)
Harut Martirosyan created SPARK-10768: - Summary: How to access columns with "." dot in their name in Spark SQL Key: SPARK-10768 URL: https://issues.apache.org/jira/browse/SPARK-10768 Project: Spar

[jira] [Created] (SPARK-10767) Make pyspark shared params codegen more consistent

2015-09-22 Thread holdenk (JIRA)
holdenk created SPARK-10767: --- Summary: Make pyspark shared params codegen more consistent Key: SPARK-10767 URL: https://issues.apache.org/jira/browse/SPARK-10767 Project: Spark Issue Type: Improve

[jira] [Commented] (SPARK-10668) Use WeightedLeastSquares in LinearRegression with L2 regularization if the number of features is small

2015-09-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903935#comment-14903935 ] Xiangrui Meng commented on SPARK-10668: --- [~lewuathe] any progress? > Use WeightedL

[jira] [Updated] (SPARK-10663) Change test.toDF to test in Spark ML Programming Guide

2015-09-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10663: -- Target Version/s: 1.6.0, 1.5.1 > Change test.toDF to test in Spark ML Programming Guide > -

[jira] [Updated] (SPARK-10663) Change test.toDF to test in Spark ML Programming Guide

2015-09-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10663: -- Assignee: Matt Hagen > Change test.toDF to test in Spark ML Programming Guide > ---

[jira] [Resolved] (SPARK-10663) Change test.toDF to test in Spark ML Programming Guide

2015-09-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10663. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull

[jira] [Created] (SPARK-10766) Add some configurations for the client process in yarn-cluster mode.

2015-09-22 Thread SaintBacchus (JIRA)
SaintBacchus created SPARK-10766: Summary: Add some configurations for the client process in yarn-cluster mode. Key: SPARK-10766 URL: https://issues.apache.org/jira/browse/SPARK-10766 Project: Spark

[jira] [Resolved] (SPARK-10310) [Spark SQL] All result records will be populated into ONE line during the script transform due to missing the correct line/field delimiter

2015-09-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-10310. -- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull request 8

[jira] [Updated] (SPARK-8882) A New Receiver Scheduling Mechanism to solve unbalanced receivers

2015-09-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-8882: - Summary: A New Receiver Scheduling Mechanism to solve unbalanced receivers (was: A New Receiver S

[jira] [Updated] (SPARK-8882) A New Receiver Scheduling Mechanism

2015-09-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-8882: - Description: There are some problems in the current mechanism: - If a task fails more than “spark

[jira] [Commented] (SPARK-10731) The head() implementation of dataframe is very slow

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903818#comment-14903818 ] Apache Spark commented on SPARK-10731: -- User 'rxin' has created a pull request for t

[jira] [Commented] (SPARK-7442) Spark 1.3.1 / Hadoop 2.6 prebuilt package has broken S3 filesystem access

2015-09-22 Thread Amey Ghadigaonkar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903750#comment-14903750 ] Amey Ghadigaonkar commented on SPARK-7442: -- Getting the same error with Spark 1.4

[jira] [Assigned] (SPARK-10663) Change test.toDF to test in Spark ML Programming Guide

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10663: Assignee: (was: Apache Spark) > Change test.toDF to test in Spark ML Programming Guide

[jira] [Assigned] (SPARK-10663) Change test.toDF to test in Spark ML Programming Guide

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10663: Assignee: Apache Spark > Change test.toDF to test in Spark ML Programming Guide >

[jira] [Commented] (SPARK-10663) Change test.toDF to test in Spark ML Programming Guide

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903749#comment-14903749 ] Apache Spark commented on SPARK-10663: -- User 'hagenhaus' has created a pull request

[jira] [Commented] (SPARK-10733) TungstenAggregation cannot acquire page after switching to sort-based

2015-09-22 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903740#comment-14903740 ] Yi Zhou commented on SPARK-10733: - yes. i still got error after applying the commit. >

[jira] [Updated] (SPARK-10705) Stop converting internal rows to external rows in DataFrame.toJSON

2015-09-22 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10705: --- Assignee: Liang-Chi Hsieh > Stop converting internal rows to external rows in DataFrame.toJSON >

[jira] [Resolved] (SPARK-10640) Spark history server fails to parse taskEndReasonFromJson TaskCommitDenied

2015-09-22 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-10640. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 > Spark history server fails t

[jira] [Commented] (SPARK-10748) Log error instead of crashing Spark Mesos dispatcher when a job is misconfigured

2015-09-22 Thread Alan Braithwaite (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903672#comment-14903672 ] Alan Braithwaite commented on SPARK-10748: -- This could be tricky with persistenc

[jira] [Commented] (SPARK-9798) CrossValidatorModel Documentation Improvements

2015-09-22 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903657#comment-14903657 ] Feynman Liang commented on SPARK-9798: -- The actual scala doc > CrossValidatorModel D

[jira] [Updated] (SPARK-9585) HiveHBaseTableInputFormat can't be cached

2015-09-22 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-9585: - Assignee: meiyoula > HiveHBaseTableInputFormat can't be cached > --- > >

[jira] [Commented] (SPARK-9798) CrossValidatorModel Documentation Improvements

2015-09-22 Thread rerngvit yanggratoke (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903653#comment-14903653 ] rerngvit yanggratoke commented on SPARK-9798: - [~fliang] By documentation, you

[jira] [Assigned] (SPARK-10765) use new aggregate interface for hive UDAF

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10765: Assignee: (was: Apache Spark) > use new aggregate interface for hive UDAF > --

[jira] [Commented] (SPARK-10765) use new aggregate interface for hive UDAF

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903639#comment-14903639 ] Apache Spark commented on SPARK-10765: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-10765) use new aggregate interface for hive UDAF

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10765: Assignee: Apache Spark > use new aggregate interface for hive UDAF > -

[jira] [Created] (SPARK-10765) use new aggregate interface for hive UDAF

2015-09-22 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-10765: --- Summary: use new aggregate interface for hive UDAF Key: SPARK-10765 URL: https://issues.apache.org/jira/browse/SPARK-10765 Project: Spark Issue Type: Improveme

[jira] [Created] (SPARK-10764) Add optional caching to Pipelines

2015-09-22 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10764: - Summary: Add optional caching to Pipelines Key: SPARK-10764 URL: https://issues.apache.org/jira/browse/SPARK-10764 Project: Spark Issue Type: Sub-t

[jira] [Commented] (SPARK-10409) Multilayer perceptron regression

2015-09-22 Thread Lauren Moos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903553#comment-14903553 ] Lauren Moos commented on SPARK-10409: - no problem! > Multilayer perceptron regressio

[jira] [Assigned] (SPARK-10403) UnsafeRowSerializer can't work with UnsafeShuffleManager (tungsten-sort)

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10403: Assignee: Josh Rosen (was: Apache Spark) > UnsafeRowSerializer can't work with UnsafeShuf

[jira] [Assigned] (SPARK-10403) UnsafeRowSerializer can't work with UnsafeShuffleManager (tungsten-sort)

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10403: Assignee: Apache Spark (was: Josh Rosen) > UnsafeRowSerializer can't work with UnsafeShuf

[jira] [Commented] (SPARK-10762) GenericRowWithSchema exception in casting ArrayBuffer to HashSet in DataFrame to RDD from Hive table

2015-09-22 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903551#comment-14903551 ] Glenn Strycker commented on SPARK-10762: Is this related? http://mail-archives.

[jira] [Commented] (SPARK-10403) UnsafeRowSerializer can't work with UnsafeShuffleManager (tungsten-sort)

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903552#comment-14903552 ] Apache Spark commented on SPARK-10403: -- User 'JoshRosen' has created a pull request

[jira] [Commented] (SPARK-4489) JavaPairRDD.collectAsMap from checkpoint RDD may fail with ClassCastException

2015-09-22 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903548#comment-14903548 ] Glenn Strycker commented on SPARK-4489: --- I believe I am getting this issue as well.

[jira] [Created] (SPARK-10763) Update Java MLLIB/ML tests to use simplified dataframe construction

2015-09-22 Thread holdenk (JIRA)
holdenk created SPARK-10763: --- Summary: Update Java MLLIB/ML tests to use simplified dataframe construction Key: SPARK-10763 URL: https://issues.apache.org/jira/browse/SPARK-10763 Project: Spark Is

[jira] [Commented] (SPARK-2737) ClassCastExceptions when collect()ing JavaRDDs' underlying Scala RDDs

2015-09-22 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903542#comment-14903542 ] Glenn Strycker commented on SPARK-2737: --- I am getting a similar error in Spark 1.3.0

[jira] [Commented] (SPARK-1040) Collect as Map throws a casting exception when run on a JavaPairRDD object

2015-09-22 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903541#comment-14903541 ] Glenn Strycker commented on SPARK-1040: --- I am getting a similar error in Spark 1.3.0

[jira] [Commented] (SPARK-10688) Python API for AFTSurvivalRegression

2015-09-22 Thread Kai Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903528#comment-14903528 ] Kai Jiang commented on SPARK-10688: --- May I take a try? Cause I am pretty interested in

[jira] [Created] (SPARK-10762) GenericRowWithSchema exception in casting ArrayBuffer to HashSet in DataFrame to RDD from Hive table

2015-09-22 Thread Glenn Strycker (JIRA)
Glenn Strycker created SPARK-10762: -- Summary: GenericRowWithSchema exception in casting ArrayBuffer to HashSet in DataFrame to RDD from Hive table Key: SPARK-10762 URL: https://issues.apache.org/jira/browse/SPARK

[jira] [Updated] (SPARK-10759) Missing Python code example in ML Programming guide

2015-09-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10759: -- Labels: starter (was: ) > Missing Python code example in ML Programming guide > --

[jira] [Updated] (SPARK-10333) Add user guide for linear-methods.md columns

2015-09-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10333: -- Assignee: Lauren Moos > Add user guide for linear-methods.md columns >

[jira] [Updated] (SPARK-10759) Missing Python code example in ML Programming guide

2015-09-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10759: -- Assignee: Lauren Moos > Missing Python code example in ML Programming guide > -

[jira] [Commented] (SPARK-10685) Misaligned data with RDD.zip and DataFrame.withColumn after repartition

2015-09-22 Thread Dan Brown (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903512#comment-14903512 ] Dan Brown commented on SPARK-10685: --- Thanks for fixing the python udf part of the issue

[jira] [Assigned] (SPARK-10403) UnsafeRowSerializer can't work with UnsafeShuffleManager (tungsten-sort)

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-10403: -- Assignee: Josh Rosen > UnsafeRowSerializer can't work with UnsafeShuffleManager (tungsten-sort

[jira] [Commented] (SPARK-10409) Multilayer perceptron regression

2015-09-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903480#comment-14903480 ] Xiangrui Meng commented on SPARK-10409: --- [~lmoos] This is a major feature. Could yo

[jira] [Updated] (SPARK-10607) Scheduler should include defensive measures against infinite loops due to task commit denial

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-10607: --- Target Version/s: (was: 1.3.2, 1.4.2, 1.5.1) > Scheduler should include defensive measures against

[jira] [Commented] (SPARK-10607) Scheduler should include defensive measures against infinite loops due to task commit denial

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903476#comment-14903476 ] Josh Rosen commented on SPARK-10607: Retargeting; this enhancement doesn't need to be

[jira] [Assigned] (SPARK-10749) Support multiple roles with Spark Mesos dispatcher

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10749: Assignee: Apache Spark > Support multiple roles with Spark Mesos dispatcher >

[jira] [Commented] (SPARK-10749) Support multiple roles with Spark Mesos dispatcher

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903474#comment-14903474 ] Apache Spark commented on SPARK-10749: -- User 'tnachen' has created a pull request fo

[jira] [Assigned] (SPARK-10749) Support multiple roles with Spark Mesos dispatcher

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10749: Assignee: (was: Apache Spark) > Support multiple roles with Spark Mesos dispatcher > -

[jira] [Updated] (SPARK-8447) Test external shuffle service with all shuffle managers

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-8447: -- Target Version/s: 1.6.0 (was: 1.5.1) > Test external shuffle service with all shuffle managers > --

[jira] [Updated] (SPARK-10058) Flaky test: HeartbeatReceiverSuite: normal heartbeat

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-10058: --- Target Version/s: 1.6.0 (was: 1.6.0, 1.5.1) > Flaky test: HeartbeatReceiverSuite: normal heartbeat >

[jira] [Updated] (SPARK-6701) Flaky test: o.a.s.deploy.yarn.YarnClusterSuite Python application

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-6701: -- Target Version/s: (was: 1.5.1) > Flaky test: o.a.s.deploy.yarn.YarnClusterSuite Python application > -

[jira] [Updated] (SPARK-6484) Ganglia metrics xml reporter doesn't escape correctly

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-6484: -- Target Version/s: (was: 1.5.1) I'm going to untarget this from 1.5.1 because, as far as I know, this i

[jira] [Updated] (SPARK-7420) Flaky test: o.a.s.streaming.JobGeneratorSuite "Do not clear received block data too soon"

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7420: -- Target Version/s: (was: 1.5.1) > Flaky test: o.a.s.streaming.JobGeneratorSuite "Do not clear received

[jira] [Updated] (SPARK-10685) Misaligned data with RDD.zip and DataFrame.withColumn after repartition

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-10685: --- Assignee: Reynold Xin > Misaligned data with RDD.zip and DataFrame.withColumn after repartition > ---

[jira] [Resolved] (SPARK-10685) Misaligned data with RDD.zip and DataFrame.withColumn after repartition

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-10685. Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull reque

[jira] [Resolved] (SPARK-10714) Refactor PythonRDD to decouple iterator computation from PythonRDD

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-10714. Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull reque

[jira] [Resolved] (SPARK-8632) Poor Python UDF performance because of RDD caching

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-8632. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull request

[jira] [Commented] (SPARK-10409) Multilayer perceptron regression

2015-09-22 Thread Lauren Moos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903451#comment-14903451 ] Lauren Moos commented on SPARK-10409: - I'd be happy to work on this > Multilayer per

[jira] [Assigned] (SPARK-10761) Refactor DiskBlockObjectWriter to not require BlockId

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10761: Assignee: Josh Rosen (was: Apache Spark) > Refactor DiskBlockObjectWriter to not require

[jira] [Assigned] (SPARK-10761) Refactor DiskBlockObjectWriter to not require BlockId

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10761: Assignee: Apache Spark (was: Josh Rosen) > Refactor DiskBlockObjectWriter to not require

[jira] [Commented] (SPARK-10333) Add user guide for linear-methods.md columns

2015-09-22 Thread Lauren Moos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903448#comment-14903448 ] Lauren Moos commented on SPARK-10333: - I'd be happy to work on this > Add user guid

[jira] [Commented] (SPARK-10761) Refactor DiskBlockObjectWriter to not require BlockId

2015-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903450#comment-14903450 ] Apache Spark commented on SPARK-10761: -- User 'JoshRosen' has created a pull request

[jira] [Created] (SPARK-10761) Refactor DiskBlockObjectWriter to not require BlockId

2015-09-22 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-10761: -- Summary: Refactor DiskBlockObjectWriter to not require BlockId Key: SPARK-10761 URL: https://issues.apache.org/jira/browse/SPARK-10761 Project: Spark Issue Type:

[jira] [Updated] (SPARK-10381) Infinite loop when OutputCommitCoordination is enabled and OutputCommitter.commitTask throws exception

2015-09-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-10381: --- Fix Version/s: 1.3.2 > Infinite loop when OutputCommitCoordination is enabled and > OutputCommitter.

[jira] [Resolved] (SPARK-10737) When using UnsafeRows, SortMergeJoin may return wrong results

2015-09-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-10737. -- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull request 8

[jira] [Resolved] (SPARK-10672) We should not fail to create a table If we cannot persist metadata of a data source table to metastore in a Hive compatible way

2015-09-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-10672. -- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull request 8

[jira] [Commented] (SPARK-7129) Add generic boosting algorithm to spark.ml

2015-09-22 Thread Meihua Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903375#comment-14903375 ] Meihua Wu commented on SPARK-7129: -- [~sethah] Thank you very much for the write up! That

[jira] [Commented] (SPARK-10688) Python API for AFTSurvivalRegression

2015-09-22 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903357#comment-14903357 ] Gayathri Murali commented on SPARK-10688: - If there isn't anyone working on it, i

[jira] [Commented] (SPARK-8418) Add single- and multi-value support to ML Transformers

2015-09-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903350#comment-14903350 ] Joseph K. Bradley commented on SPARK-8418: -- New idea: We could allow transformers

[jira] [Commented] (SPARK-10759) Missing Python code example in ML Programming guide

2015-09-22 Thread Lauren Moos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903263#comment-14903263 ] Lauren Moos commented on SPARK-10759: - I can work on this > Missing Python code exa

[jira] [Updated] (SPARK-10740) handle nondeterministic expressions correctly for set operations

2015-09-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10740: - Assignee: Wenchen Fan > handle nondeterministic expressions correctly for set operations > --

[jira] [Resolved] (SPARK-10740) handle nondeterministic expressions correctly for set operations

2015-09-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-10740. -- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull request 8

[jira] [Updated] (SPARK-10740) handle nondeterministic expressions correctly for set operations

2015-09-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10740: - Target Version/s: 1.6.0, 1.5.1 > handle nondeterministic expressions correctly for set operations > -

[jira] [Commented] (SPARK-9442) java.lang.ArithmeticException: / by zero when reading Parquet

2015-09-22 Thread Chris Heller (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903260#comment-14903260 ] Chris Heller commented on SPARK-9442: - Curious if the issue seen here was with a parqu

[jira] [Updated] (SPARK-10740) handle nondeterministic expressions correctly for set operations

2015-09-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10740: - Priority: Blocker (was: Major) > handle nondeterministic expressions correctly for set operations >

[jira] [Comment Edited] (SPARK-10732) Starting spark streaming from a specific point in time.

2015-09-22 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903231#comment-14903231 ] Cody Koeninger edited comment on SPARK-10732 at 9/22/15 7:02 PM: --

[jira] [Commented] (SPARK-10732) Starting spark streaming from a specific point in time.

2015-09-22 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903231#comment-14903231 ] Cody Koeninger commented on SPARK-10732: Yeah, even if that gets implemented it w

[jira] [Resolved] (SPARK-10704) Rename HashShufflereader to BlockStoreShuffleReader

2015-09-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-10704. - Resolution: Fixed Fix Version/s: 1.6.0 > Rename HashShufflereader to BlockStoreShuffleRead

[jira] [Resolved] (SPARK-10485) IF expression is not correctly resolved when one of the options have NullType

2015-09-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-10485. -- Resolution: Fixed I tested on 1.5 and it seems fixed to me. Please reopen if you have

[jira] [Commented] (SPARK-10732) Starting spark streaming from a specific point in time.

2015-09-22 Thread Bijay Singh Bisht (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903183#comment-14903183 ] Bijay Singh Bisht commented on SPARK-10732: --- I get it. Apparently there is a di

[jira] [Commented] (SPARK-7129) Add generic boosting algorithm to spark.ml

2015-09-22 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903180#comment-14903180 ] Seth Hendrickson commented on SPARK-7129: - I had some time to give this topic some

[jira] [Commented] (SPARK-7129) Add generic boosting algorithm to spark.ml

2015-09-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903136#comment-14903136 ] Joseph K. Bradley commented on SPARK-7129: -- It's not really on the roadmap for 1.

[jira] [Resolved] (SPARK-10593) sql lateral view same name gives wrong value

2015-09-22 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-10593. -- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull request 8

[jira] [Commented] (SPARK-7129) Add generic boosting algorithm to spark.ml

2015-09-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903119#comment-14903119 ] Joseph K. Bradley commented on SPARK-7129: -- Hi, I'd recommend starting with small
