[jira] [Assigned] (SPARK-21072) `TreeNode.mapChildren` should only apply to the children node.

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21072: Assignee: (was: Apache Spark) > `TreeNode.mapChildren` should only apply to the

[jira] [Commented] (SPARK-21072) `TreeNode.mapChildren` should only apply to the children node.

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047431#comment-16047431 ] Apache Spark commented on SPARK-21072: -- User 'ConeyLiu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-21072) `TreeNode.mapChildren` should only apply to the children node.

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21072: Assignee: Apache Spark > `TreeNode.mapChildren` should only apply to the children node.

[jira] [Updated] (SPARK-21072) `TreeNode.mapChildren` should only apply to the children node.

2017-06-12 Thread coneyliu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] coneyliu updated SPARK-21072: - Description: Just as the function name and comments of `TreeNode.mapChildren` mentioned, the function

[jira] [Updated] (SPARK-21072) `TreeNode.mapChildren` should only apply to the children node.

2017-06-12 Thread coneyliu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] coneyliu updated SPARK-21072: - Description: Just as the function name and comments of `TreeNode.mapChildren` mentioned, the function

[jira] [Updated] (SPARK-21072) `TreeNode.mapChildren` should only apply to the children node.

2017-06-12 Thread coneyliu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] coneyliu updated SPARK-21072: - Description: Just as the function name and comments of `TreeNode.mapChildren` mentioned, the function

[jira] [Updated] (SPARK-21072) `TreeNode.mapChildren` should only apply to the children node.

2017-06-12 Thread coneyliu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] coneyliu updated SPARK-21072: - Description: Just as the function name and comments of `TreeNode.mapChildren` mentioned, the function

[jira] [Created] (SPARK-21072) `TreeNode.mapChildren` should only apply to the children node.

2017-06-12 Thread coneyliu (JIRA)
coneyliu created SPARK-21072: Summary: `TreeNode.mapChildren` should only apply to the children node. Key: SPARK-21072 URL: https://issues.apache.org/jira/browse/SPARK-21072 Project: Spark

[jira] [Resolved] (SPARK-19910) `stack` should not reject NULL values due to type mismatch

2017-06-12 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-19910. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.3.0 > `stack` should not

[jira] [Updated] (SPARK-21045) Spark executor blocked instead of throwing exception because exception occur when python worker send exception info to Java Gateway in Python 2+

2017-06-12 Thread Joshuawangzj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joshuawangzj updated SPARK-21045: - Environment: It has problem only in Python 2+. Python 3+ is ok. Summary: Spark

[jira] [Commented] (SPARK-20988) Convert logistic regression to new aggregator framework

2017-06-12 Thread Vincent (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047370#comment-16047370 ] Vincent commented on SPARK-20988: - I can work on this if no one is working on it now :) > Convert

[jira] [Commented] (SPARK-21068) SparkR error message when passed an R object rather than Java object could be more informative

2017-06-12 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047338#comment-16047338 ] Felix Cheung commented on SPARK-21068: -- surely. what brings you to R land? :) > SparkR error

[jira] [Commented] (SPARK-21028) Parallel One vs. Rest Classifier Scala

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047258#comment-16047258 ] Apache Spark commented on SPARK-21028: -- User 'ajaysaini725' has created a pull request for this

[jira] [Updated] (SPARK-21027) Parallel One vs. Rest Classifier

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21027: -- Shepherd: Joseph K. Bradley > Parallel One vs. Rest Classifier >

[jira] [Created] (SPARK-21071) remove append APIs and simplify array writing logic

2017-06-12 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-21071: --- Summary: remove append APIs and simplify array writing logic Key: SPARK-21071 URL: https://issues.apache.org/jira/browse/SPARK-21071 Project: Spark Issue

[jira] [Closed] (SPARK-14450) Python OneVsRest should train multiple models at once

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-14450. - Resolution: Duplicate > Python OneVsRest should train multiple models at once >

[jira] [Commented] (SPARK-14450) Python OneVsRest should train multiple models at once

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047245#comment-16047245 ] Joseph K. Bradley commented on SPARK-14450: --- See linked JIRA for new issue. > Python OneVsRest

[jira] [Commented] (SPARK-14450) Python OneVsRest should train multiple models at once

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047244#comment-16047244 ] Joseph K. Bradley commented on SPARK-14450: --- Scala already has parallelization. I just

[jira] [Commented] (SPARK-21027) Parallel One vs. Rest Classifier

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047243#comment-16047243 ] Joseph K. Bradley commented on SPARK-21027: --- Copying from [ML-14450]: [SPARK-7861] adds a

[jira] [Comment Edited] (SPARK-21027) Parallel One vs. Rest Classifier

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047243#comment-16047243 ] Joseph K. Bradley edited comment on SPARK-21027 at 6/12/17 11:54 PM: -

[jira] [Commented] (SPARK-21027) Parallel One vs. Rest Classifier

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047241#comment-16047241 ] Joseph K. Bradley commented on SPARK-21027: --- Whoops! I realized I'd reported this long

[jira] [Commented] (SPARK-18294) Implement commit protocol to support `mapred` package's committer

2017-06-12 Thread Dayou Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047233#comment-16047233 ] Dayou Zhou commented on SPARK-18294: Thanks for responding. My colleague Aarati Khobare will provide

[jira] [Updated] (SPARK-21047) Add test suites for complicated cases in ColumnarBatchSuite

2017-06-12 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kazuaki Ishizaki updated SPARK-21047: - Summary: Add test suites for complicated cases in ColumnarBatchSuite (was: Add test

[jira] [Commented] (SPARK-18294) Implement commit protocol to support `mapred` package's committer

2017-06-12 Thread Jiang Xingbo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047229#comment-16047229 ] Jiang Xingbo commented on SPARK-18294: -- This is actually legacy code refactoring, it shouldn't

[jira] [Commented] (SPARK-18294) Implement commit protocol to support `mapred` package's committer

2017-06-12 Thread Dayou Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047196#comment-16047196 ] Dayou Zhou commented on SPARK-18294: Hi [~jiangxb1987][~jiangxb], Thank you for making this fix --

[jira] [Assigned] (SPARK-21070) Pick up cloudpickle upgrades from cloudpickle python module

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21070: Assignee: Apache Spark > Pick up cloudpickle upgrades from cloudpickle python module >

[jira] [Assigned] (SPARK-21070) Pick up cloudpickle upgrades from cloudpickle python module

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21070: Assignee: (was: Apache Spark) > Pick up cloudpickle upgrades from cloudpickle python

[jira] [Commented] (SPARK-21070) Pick up cloudpickle upgrades from cloudpickle python module

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047153#comment-16047153 ] Apache Spark commented on SPARK-21070: -- User 'rgbkrk' has created a pull request for this issue:

[jira] [Created] (SPARK-21070) Pick up cloudpickle upgrades from cloudpickle python module

2017-06-12 Thread Kyle Kelley (JIRA)
Kyle Kelley created SPARK-21070: --- Summary: Pick up cloudpickle upgrades from cloudpickle python module Key: SPARK-21070 URL: https://issues.apache.org/jira/browse/SPARK-21070 Project: Spark

[jira] [Updated] (SPARK-21069) Add rate source to programming guide

2017-06-12 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-21069: - Labels: starter (was: ) > Add rate source to programming guide >

[jira] [Updated] (SPARK-20979) Add a rate source to generate values for tests and benchmark

2017-06-12 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-20979: - Affects Version/s: (was: 2.2.0) 2.3.0 > Add a rate source to generate

[jira] [Commented] (SPARK-21027) Parallel One vs. Rest Classifier

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047127#comment-16047127 ] Apache Spark commented on SPARK-21027: -- User 'ajaysaini725' has created a pull request for this

[jira] [Assigned] (SPARK-21027) Parallel One vs. Rest Classifier

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21027: Assignee: (was: Apache Spark) > Parallel One vs. Rest Classifier >

[jira] [Assigned] (SPARK-21027) Parallel One vs. Rest Classifier

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21027: Assignee: Apache Spark > Parallel One vs. Rest Classifier >

[jira] [Created] (SPARK-21069) Add rate source to programming guide

2017-06-12 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-21069: Summary: Add rate source to programming guide Key: SPARK-21069 URL: https://issues.apache.org/jira/browse/SPARK-21069 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-20979) Add a rate source to generate values for tests and benchmark

2017-06-12 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-20979. -- Resolution: Fixed Fix Version/s: 2.3.0 > Add a rate source to generate values for tests

[jira] [Reopened] (SPARK-20379) Allow setting SSL-related passwords through env variables

2017-06-12 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reopened SPARK-20379: I'll re-open this since the SSL options don't really use the code path that can reference env

[jira] [Created] (SPARK-21068) SparkR error message when passed an R object rather than Java object could be more informative

2017-06-12 Thread holdenk (JIRA)
holdenk created SPARK-21068: --- Summary: SparkR error message when passed an R object rather than Java object could be more informative Key: SPARK-21068 URL: https://issues.apache.org/jira/browse/SPARK-21068

[jira] [Resolved] (SPARK-21050) ml word2vec write has overflow issue in calculating numPartitions

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21050. --- Resolution: Fixed Fix Version/s: 2.3.0 2.2.1 Issue

[jira] [Updated] (SPARK-20499) Spark MLlib, GraphX 2.2 QA umbrella

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20499: -- Fix Version/s: 2.2.0 > Spark MLlib, GraphX 2.2 QA umbrella >

[jira] [Updated] (SPARK-20507) Update MLlib, GraphX websites for 2.2

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20507: -- Fix Version/s: 2.2.0 > Update MLlib, GraphX websites for 2.2 >

[jira] [Resolved] (SPARK-21059) LikeSimplification can NPE on null pattern

2017-06-12 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21059. - Resolution: Fixed Fix Version/s: 2.3.0 2.2.1 > LikeSimplification can NPE on

[jira] [Resolved] (SPARK-20345) Fix STS error handling logic on HiveSQLException

2017-06-12 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-20345. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.3.0 2.2.1

[jira] [Created] (SPARK-21067) Thrift Server - CTAS fail with Unable to move source

2017-06-12 Thread Dominic Ricard (JIRA)
Dominic Ricard created SPARK-21067: -- Summary: Thrift Server - CTAS fail with Unable to move source Key: SPARK-21067 URL: https://issues.apache.org/jira/browse/SPARK-21067 Project: Spark

[jira] [Commented] (SPARK-17642) Support DESC FORMATTED TABLE COLUMN command to show column-level statistics

2017-06-12 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047045#comment-16047045 ] Zhenhua Wang commented on SPARK-17642: -- [~mbasmanova] I've reopened and rebased the above PR. IMO, I

[jira] [Resolved] (SPARK-17914) Spark SQL casting to TimestampType with nanosecond results in incorrect timestamp

2017-06-12 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-17914. --- Resolution: Fixed Fix Version/s: 2.3.0 2.2.1 Issue resolved by

[jira] [Assigned] (SPARK-17914) Spark SQL casting to TimestampType with nanosecond results in incorrect timestamp

2017-06-12 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin reassigned SPARK-17914: - Assignee: Anton Okolnychyi > Spark SQL casting to TimestampType with nanosecond results

[jira] [Updated] (SPARK-21065) Spark Streaming concurrentJobs + StreamingJobProgressListener conflict

2017-06-12 Thread Dan Dutrow (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Dutrow updated SPARK-21065: --- Description: My streaming application has 200+ output operations, many of them stateful and several

[jira] [Commented] (SPARK-18838) High latency of event processing for large jobs

2017-06-12 Thread Sital Kedia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046957#comment-16046957 ] Sital Kedia commented on SPARK-18838: - [~joshrosen] - The PR for my change to multi-thread the event

[jira] [Updated] (SPARK-20434) Move Hadoop delegation token code from yarn to core

2017-06-12 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Gummelt updated SPARK-20434: Summary: Move Hadoop delegation token code from yarn to core (was: Move Kerberos

[jira] [Assigned] (SPARK-20511) SparkR 2.2 QA: Check for new R APIs requiring example code

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-20511: - Assignee: Felix Cheung > SparkR 2.2 QA: Check for new R APIs requiring example

[jira] [Updated] (SPARK-18864) Changes of MLlib and SparkR behavior for 2.2

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18864: -- Fix Version/s: 2.2.0 > Changes of MLlib and SparkR behavior for 2.2 >

[jira] [Updated] (SPARK-20511) SparkR 2.2 QA: Check for new R APIs requiring example code

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20511: -- Fix Version/s: 2.2.0 > SparkR 2.2 QA: Check for new R APIs requiring example code >

[jira] [Assigned] (SPARK-20508) Spark R 2.2 QA umbrella

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-20508: - Assignee: Felix Cheung (was: Joseph K. Bradley) > Spark R 2.2 QA umbrella >

[jira] [Deleted] (SPARK-20513) Update SparkR website for 2.2

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley deleted SPARK-20513: -- > Update SparkR website for 2.2 > - > > Key:

[jira] [Updated] (SPARK-20508) Spark R 2.2 QA umbrella

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20508: -- Fix Version/s: 2.2.0 > Spark R 2.2 QA umbrella > --- > >

[jira] [Updated] (SPARK-20510) SparkR 2.2 QA: Update user guide for new features & APIs

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20510: -- Fix Version/s: 2.2.0 > SparkR 2.2 QA: Update user guide for new features & APIs >

[jira] [Assigned] (SPARK-20510) SparkR 2.2 QA: Update user guide for new features & APIs

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-20510: - Assignee: Felix Cheung > SparkR 2.2 QA: Update user guide for new features &

[jira] [Commented] (SPARK-20513) Update SparkR website for 2.2

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046843#comment-16046843 ] Joseph K. Bradley commented on SPARK-20513: --- whoops, i'll delete this... > Update SparkR

[jira] [Updated] (SPARK-20512) SparkR 2.2 QA: Programming guide, migration guide, vignettes updates

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20512: -- Fix Version/s: 2.2.0 > SparkR 2.2 QA: Programming guide, migration guide, vignettes

[jira] [Assigned] (SPARK-20512) SparkR 2.2 QA: Programming guide, migration guide, vignettes updates

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-20512: - Assignee: Felix Cheung > SparkR 2.2 QA: Programming guide, migration guide,

[jira] [Commented] (SPARK-21066) LibSVM load just one input file

2017-06-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046812#comment-16046812 ] Sean Owen commented on SPARK-21066: --- CC [~lian cheng] I don't immediately see why the relation can't

[jira] [Commented] (SPARK-17642) Support DESC FORMATTED TABLE COLUMN command to show column-level statistics

2017-06-12 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046798#comment-16046798 ] Dongjoon Hyun commented on SPARK-17642: --- Sorry, I cannot help you further with that because I'm not

[jira] [Commented] (SPARK-17642) Support DESC FORMATTED TABLE COLUMN command to show column-level statistics

2017-06-12 Thread Maria (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046794#comment-16046794 ] Maria commented on SPARK-17642: --- [~dongjoon], where can I read more on this? I found blog post [1]

[jira] [Commented] (SPARK-20927) Add cache operator to Unsupported Operations in Structured Streaming

2017-06-12 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046740#comment-16046740 ] Shixiong Zhu commented on SPARK-20927: -- Do nothing except logging a warning > Add cache operator to

[jira] [Commented] (SPARK-17642) Support DESC FORMATTED TABLE COLUMN command to show column-level statistics

2017-06-12 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046739#comment-16046739 ] Dongjoon Hyun commented on SPARK-17642: --- I see. Interesting. At a first glance, the comments seem

[jira] [Resolved] (SPARK-21046) simplify the array offset and length in ColumnVector

2017-06-12 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-21046. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18260

[jira] [Commented] (SPARK-17642) Support DESC FORMATTED TABLE COLUMN command to show column-level statistics

2017-06-12 Thread Maria (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046718#comment-16046718 ] Maria commented on SPARK-17642: --- [~dongjoon], thanks for such a quick response. Attached PR [1]

[jira] [Updated] (SPARK-21066) LibSVM load just one input file

2017-06-12 Thread darion yaphet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] darion yaphet updated SPARK-21066: -- Description: Currently, when using SVM to train on a dataset, we found the input files limit only

[jira] [Created] (SPARK-21066) LibSVM load just one input file

2017-06-12 Thread darion yaphet (JIRA)
darion yaphet created SPARK-21066: - Summary: LibSVM load just one input file Key: SPARK-21066 URL: https://issues.apache.org/jira/browse/SPARK-21066 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-17642) Support DESC FORMATTED TABLE COLUMN command to show column-level statistics

2017-06-12 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046694#comment-16046694 ] Dongjoon Hyun commented on SPARK-17642: --- Hi, [~mbasmanova]. Spark-LLAP is a third party library. Is

[jira] [Commented] (SPARK-17642) Support DESC FORMATTED TABLE COLUMN command to show column-level statistics

2017-06-12 Thread Maria (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046664#comment-16046664 ] Maria commented on SPARK-17642: --- Folks, It seems to me that column-level access control is implemented

[jira] [Resolved] (SPARK-20715) MapStatuses shouldn't be redundantly stored in both ShuffleMapStage and MapOutputTracker

2017-06-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-20715. Resolution: Fixed Fix Version/s: 2.3.0 Fixed for 2.3.0. > MapStatuses shouldn't be

[jira] [Commented] (SPARK-21065) Spark Streaming concurrentJobs + StreamingJobProgressListener conflict

2017-06-12 Thread Dan Dutrow (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046649#comment-16046649 ] Dan Dutrow commented on SPARK-21065: Something to note: If one batch's processing time exceeds the

[jira] [Updated] (SPARK-21065) Spark Streaming concurrentJobs + StreamingJobProgressListener conflict

2017-06-12 Thread Dan Dutrow (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Dutrow updated SPARK-21065: --- Component/s: DStreams > Spark Streaming concurrentJobs + StreamingJobProgressListener conflict >

[jira] [Updated] (SPARK-21065) Spark Streaming concurrentJobs + StreamingJobProgressListener conflict

2017-06-12 Thread Dan Dutrow (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Dutrow updated SPARK-21065: --- Description: My streaming application has 200+ output operations, many of them stateful and several

[jira] [Commented] (SPARK-21061) GMM Error : Matrix is not symmetric

2017-06-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046631#comment-16046631 ] Sean Owen commented on SPARK-21061: --- I get a slightly different error: NotConvergedException. The

[jira] [Updated] (SPARK-21063) Spark return an empty result from remote hadoop cluster

2017-06-12 Thread Peter Bykov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bykov updated SPARK-21063: Affects Version/s: 2.1.0 > Spark return an empty result from remote hadoop cluster >

[jira] [Commented] (SPARK-21063) Spark return an empty result from remote hadoop cluster

2017-06-12 Thread Peter Bykov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046596#comment-16046596 ] Peter Bykov commented on SPARK-21063: - [~sowen] data available in table, also i have empty result if

[jira] [Created] (SPARK-21065) Spark Streaming concurrentJobs + StreamingJobProgressListener conflict

2017-06-12 Thread Dan Dutrow (JIRA)
Dan Dutrow created SPARK-21065: -- Summary: Spark Streaming concurrentJobs + StreamingJobProgressListener conflict Key: SPARK-21065 URL: https://issues.apache.org/jira/browse/SPARK-21065 Project: Spark

[jira] [Commented] (SPARK-21063) Spark return an empty result from remote hadoop cluster

2017-06-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046581#comment-16046581 ] Sean Owen commented on SPARK-21063: --- Is there data in the table? are you pointing at the right cluster?

[jira] [Commented] (SPARK-21063) Spark return an empty result from remote hadoop cluster

2017-06-12 Thread Peter Bykov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046576#comment-16046576 ] Peter Bykov commented on SPARK-21063: - [~srowen] what do you mean by wrong config? what configuration

[jira] [Resolved] (SPARK-21041) With whole-stage codegen, SparkSession.range()'s behavior is inconsistent with SparkContext.range()

2017-06-12 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-21041. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.3.0

[jira] [Commented] (SPARK-21064) Fix the default value bug in NettyBlockTransferServiceSuite

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046493#comment-16046493 ] Apache Spark commented on SPARK-21064: -- User 'djvulee' has created a pull request for this issue:

[jira] [Commented] (SPARK-21019) read orc when some of the columns are missing in some files

2017-06-12 Thread Mahesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046485#comment-16046485 ] Mahesh commented on SPARK-21019: Can you include the exception in your defect? From what I know, Spark

[jira] [Assigned] (SPARK-21064) Fix the default value bug in NettyBlockTransferServiceSuite

2017-06-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-21064: - Assignee: DjvuLee Priority: Trivial (was: Major) Component/s: (was: Spark

[jira] [Commented] (SPARK-21064) Fix the default value bug in NettyBlockTransferServiceSuite

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046478#comment-16046478 ] Apache Spark commented on SPARK-21064: -- User 'djvulee' has created a pull request for this issue:

[jira] [Assigned] (SPARK-21064) Fix the default value bug in NettyBlockTransferServiceSuite

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21064: Assignee: (was: Apache Spark) > Fix the default value bug in

[jira] [Assigned] (SPARK-21064) Fix the default value bug in NettyBlockTransferServiceSuite

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21064: Assignee: Apache Spark > Fix the default value bug in NettyBlockTransferServiceSuite >

[jira] [Commented] (SPARK-20927) Add cache operator to Unsupported Operations in Structured Streaming

2017-06-12 Thread Chenzhao Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046476#comment-16046476 ] Chenzhao Guo commented on SPARK-20927: -- What exactly is 'no-op' ? Does that mean scala

[jira] [Commented] (SPARK-21064) Fix the default value bug in NettyBlockTransferServiceSuite

2017-06-12 Thread DjvuLee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046475#comment-16046475 ] DjvuLee commented on SPARK-21064: - The default value for `spark.port.maxRetries` is 100, but we use the

[jira] [Created] (SPARK-21064) Fix the default value bug in NettyBlockTransferServiceSuite

2017-06-12 Thread DjvuLee (JIRA)
DjvuLee created SPARK-21064: --- Summary: Fix the default value bug in NettyBlockTransferServiceSuite Key: SPARK-21064 URL: https://issues.apache.org/jira/browse/SPARK-21064 Project: Spark Issue

[jira] [Commented] (SPARK-21058) potential SVD optimization

2017-06-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046424#comment-16046424 ] Sean Owen commented on SPARK-21058: --- I think we discussed this separately. The Gramian method is indeed

[jira] [Assigned] (SPARK-20947) Encoding/decoding issue in PySpark pipe implementation

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20947: Assignee: Apache Spark > Encoding/decoding issue in PySpark pipe implementation >

[jira] [Commented] (SPARK-20947) Encoding/decoding issue in PySpark pipe implementation

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046420#comment-16046420 ] Apache Spark commented on SPARK-20947: -- User 'chaoslawful' has created a pull request for this

[jira] [Assigned] (SPARK-20947) Encoding/decoding issue in PySpark pipe implementation

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20947: Assignee: (was: Apache Spark) > Encoding/decoding issue in PySpark pipe

[jira] [Assigned] (SPARK-21057) Do not use a PascalDistribution in countApprox

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21057: Assignee: (was: Apache Spark) > Do not use a PascalDistribution in countApprox >

[jira] [Assigned] (SPARK-21057) Do not use a PascalDistribution in countApprox

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21057: Assignee: Apache Spark > Do not use a PascalDistribution in countApprox >

[jira] [Commented] (SPARK-21057) Do not use a PascalDistribution in countApprox

2017-06-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046402#comment-16046402 ] Apache Spark commented on SPARK-21057: -- User 'srowen' has created a pull request for this issue:

[jira] [Commented] (SPARK-21063) Spark return an empty result from remote hadoop cluster

2017-06-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046395#comment-16046395 ] Sean Owen commented on SPARK-21063: --- Nothing about this suggests a Spark problem. You may not have the
