[jira] [Created] (SPARK-19283) Application details UI not visible for completed actions

2017-01-18 Thread Deenbandhu Agarwal (JIRA)
Deenbandhu Agarwal created SPARK-19283: -- Summary: Application details UI not visible for completed actions Key: SPARK-19283 URL: https://issues.apache.org/jira/browse/SPARK-19283 Project: Spark

[jira] [Assigned] (SPARK-19276) FetchFailures can be hidden by user (or sql) exception handling

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19276: Assignee: (was: Apache Spark) > FetchFailures can be hidden by user (or sql)

[jira] [Assigned] (SPARK-19276) FetchFailures can be hidden by user (or sql) exception handling

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19276: Assignee: Apache Spark > FetchFailures can be hidden by user (or sql) exception handling

[jira] [Commented] (SPARK-19276) FetchFailures can be hidden by user (or sql) exception handling

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15829396#comment-15829396 ] Apache Spark commented on SPARK-19276: -- User 'squito' has created a pull request for this issue:

[jira] [Closed] (SPARK-19210) Add log level info into streaming checkpoint

2017-01-18 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Genmao Yu closed SPARK-19210. - Resolution: Won't Fix > Add log level info into streaming checkpoint >

[jira] [Created] (SPARK-19282) RandomForestRegressionModel should expose getMaxDepth

2017-01-18 Thread Nick Lothian (JIRA)
Nick Lothian created SPARK-19282: Summary: RandomForestRegressionModel should expose getMaxDepth Key: SPARK-19282 URL: https://issues.apache.org/jira/browse/SPARK-19282 Project: Spark Issue

[jira] [Updated] (SPARK-16968) Allow to add additional options when creating a new table in DF's JDBC writer.

2017-01-18 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-16968: Fix Version/s: 2.0.3 > Allow to add additional options when creating a new table in DF's JDBC > writer.

[jira] [Updated] (SPARK-19102) Accuracy error of spark SQL results

2017-01-18 Thread XiaodongCui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiaodongCui updated SPARK-19102: Description: the problem is cube6's second column named sumprice is 1 times bigger than the

[jira] [Updated] (SPARK-19102) Accuracy error of spark SQL results

2017-01-18 Thread XiaodongCui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiaodongCui updated SPARK-19102: Description: the problem is cube6's second column named sumprice is 1 times bigger than the

[jira] [Assigned] (SPARK-19115) SparkSQL unsupports the command " create external table if not exist new_tbl like old_tbl location '/warehouse/new_tbl' "

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19115: Assignee: Apache Spark (was: Xiao Li) > SparkSQL unsupports the command " create

[jira] [Assigned] (SPARK-19115) SparkSQL unsupports the command " create external table if not exist new_tbl like old_tbl location '/warehouse/new_tbl' "

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19115: Assignee: Xiao Li (was: Apache Spark) > SparkSQL unsupports the command " create

[jira] [Commented] (SPARK-19115) SparkSQL unsupports the command " create external table if not exist new_tbl like old_tbl location '/warehouse/new_tbl' "

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15829233#comment-15829233 ] Apache Spark commented on SPARK-19115: -- User 'ouyangxiaochen' has created a pull request for this

[jira] [Comment Edited] (SPARK-15023) Add support for testing against the `ProcessingTime(intervalMS > 0)` trigger and `ManualClock`

2017-01-18 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15829216#comment-15829216 ] Liwei Lin edited comment on SPARK-15023 at 1/19/17 3:01 AM: Hi

[jira] [Commented] (SPARK-15023) Add support for testing against the `ProcessingTime(intervalMS > 0)` trigger and `ManualClock`

2017-01-18 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15829216#comment-15829216 ] Liwei Lin commented on SPARK-15023: --- Hi [~hyukjin.kwon], yea this was resolved by the PR. Thanks for

[jira] [Resolved] (SPARK-19183) Add deleteWithJob hook to internal commit protocol API

2017-01-18 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-19183. - Resolution: Fixed Assignee: Eric Liang Fix Version/s: 2.2.0 > Add deleteWithJob

[jira] [Commented] (SPARK-18750) spark should be able to control the number of executor and should not throw stack overslow

2017-01-18 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15829132#comment-15829132 ] Marcelo Vanzin commented on SPARK-18750: I haven't been able to reproduce this yet, but

[jira] [Commented] (SPARK-19225) Spark SQL round constant double return null

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15829128#comment-15829128 ] Apache Spark commented on SPARK-19225: -- User 'discipleforteen' has created a pull request for this

[jira] [Comment Edited] (SPARK-14409) Investigate adding a RankingEvaluator to ML

2017-01-18 Thread Roberto Mirizzi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15826901#comment-15826901 ] Roberto Mirizzi edited comment on SPARK-14409 at 1/19/17 12:51 AM: ---

[jira] [Commented] (SPARK-19208) MaxAbsScaler and MinMaxScaler are very inefficient

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15829014#comment-15829014 ] Joseph K. Bradley commented on SPARK-19208: --- +1 for [~mlnick]'s suggestion. If we're

[jira] [Comment Edited] (SPARK-18750) spark should be able to control the number of executor and should not throw stack overslow

2017-01-18 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15829008#comment-15829008 ] Marcelo Vanzin edited comment on SPARK-18750 at 1/18/17 11:56 PM: -- Sean,

[jira] [Reopened] (SPARK-18750) spark should be able to control the number of executor and should not throw stack overslow

2017-01-18 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reopened SPARK-18750: Sean, this is a separate issue. Even if Spark is smart, it could decide to try to allocate a

[jira] [Updated] (SPARK-19208) MaxAbsScaler and MinMaxScaler are very inefficient

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19208: -- Assignee: (was: Apache Spark) > MaxAbsScaler and MinMaxScaler are very inefficient

[jira] [Commented] (SPARK-18085) Better History Server scalability for many / large applications

2017-01-18 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828980#comment-15828980 ] Marcelo Vanzin commented on SPARK-18085: I uploaded branch shs-ng/M4.1 to my repo which contains

[jira] [Updated] (SPARK-14975) Predicted Probability per training instance for Gradient Boosted Trees

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14975: -- Summary: Predicted Probability per training instance for Gradient Boosted Trees (was:

[jira] [Resolved] (SPARK-14975) Predicted Probability per training instance for Gradient Boosted Trees

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14975. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16441

[jira] [Resolved] (SPARK-10890) "Column count does not match; SQL statement:" error in JDBCWriteSuite

2017-01-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10890. --- Resolution: Cannot Reproduce Fix Version/s: (was: 2.2.0) > "Column count does not match;

[jira] [Updated] (SPARK-19180) the offset of short is 4 in OffHeapColumnVector's putShorts

2017-01-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19180: -- Assignee: yucai > the offset of short is 4 in OffHeapColumnVector's putShorts >

[jira] [Updated] (SPARK-19019) PySpark does not work with Python 3.6.0

2017-01-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19019: -- Assignee: Hyukjin Kwon > PySpark does not work with Python 3.6.0 >

[jira] [Reopened] (SPARK-10890) "Column count does not match; SQL statement:" error in JDBCWriteSuite

2017-01-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-10890: --- > "Column count does not match; SQL statement:" error in JDBCWriteSuite >

[jira] [Updated] (SPARK-18335) Add a numSlices parameter to SparkR's createDataFrame

2017-01-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18335: -- Assignee: Felix Cheung > Add a numSlices parameter to SparkR's createDataFrame >

[jira] [Commented] (SPARK-14503) spark.ml Scala API for FPGrowth

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828914#comment-15828914 ] Apache Spark commented on SPARK-14503: -- User 'hhbyyh' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14503) spark.ml Scala API for FPGrowth

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14503: Assignee: (was: Apache Spark) > spark.ml Scala API for FPGrowth >

[jira] [Assigned] (SPARK-14503) spark.ml Scala API for FPGrowth

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14503: Assignee: Apache Spark > spark.ml Scala API for FPGrowth >

[jira] [Commented] (SPARK-14501) spark.ml parity for fpm - frequent items

2017-01-18 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828909#comment-15828909 ] yuhao yang commented on SPARK-14501: Since this is the parent item. I'll change the PR to target

[jira] [Commented] (SPARK-18085) Better History Server scalability for many / large applications

2017-01-18 Thread Alex Bozarth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828866#comment-15828866 ] Alex Bozarth commented on SPARK-18085: -- I'm interested in helping with M4, but I am currently bogged

[jira] [Commented] (SPARK-18085) Better History Server scalability for many / large applications

2017-01-18 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828860#comment-15828860 ] Marcelo Vanzin commented on SPARK-18085: BTW if anyone is still following this, I updated the

[jira] [Commented] (SPARK-12650) No means to specify Xmx settings for spark-submit in cluster deploy mode for Spark on YARN

2017-01-18 Thread SriReddy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828845#comment-15828845 ] SriReddy commented on SPARK-12650: -- Voted up. We need similar functionality and using SPARK_SUBMIT_OPTS

[jira] [Closed] (SPARK-8855) Python API for Association Rules

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-8855. Resolution: Won't Fix > Python API for Association Rules >

[jira] [Created] (SPARK-19281) spark.ml Python API for FPGrowth

2017-01-18 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-19281: - Summary: spark.ml Python API for FPGrowth Key: SPARK-19281 URL: https://issues.apache.org/jira/browse/SPARK-19281 Project: Spark Issue Type:

[jira] [Commented] (SPARK-8855) Python API for Association Rules

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828808#comment-15828808 ] Joseph K. Bradley commented on SPARK-8855: -- I'm going to close this issue in favor of the

[jira] [Updated] (SPARK-14503) spark.ml Scala API for FPGrowth

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14503: -- Summary: spark.ml Scala API for FPGrowth (was: spark.ml API for FPGrowth) > spark.ml

[jira] [Commented] (SPARK-17136) Design optimizer interface for ML algorithms

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828803#comment-15828803 ] Joseph K. Bradley commented on SPARK-17136: --- CC [~avulanov], who has thought a lot about these

[jira] [Commented] (SPARK-5484) Pregel should checkpoint periodically to avoid StackOverflowError

2017-01-18 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828768#comment-15828768 ] Michael Allman commented on SPARK-5484: --- Hi Guys, @ding has rebased his PR, and it LGTM. Can a

[jira] [Updated] (SPARK-5256) Improving MLlib optimization APIs

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5256: - Description: *Goal*: Improve APIs for optimization *Motivation*: There have been several

[jira] [Commented] (SPARK-13610) Create a Transformer to disassemble vectors in DataFrames

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828756#comment-15828756 ] Joseph K. Bradley commented on SPARK-13610: --- One more: Would these selected subsets of elements

[jira] [Commented] (SPARK-19053) Supporting multiple evaluation metrics in DataFrame-based API: discussion

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828755#comment-15828755 ] Joseph K. Bradley commented on SPARK-19053: --- After thinking about this more and hearing your

[jira] [Updated] (SPARK-19280) Failed Recovery from checkpoint caused by the multi-threads issue in Spark Streaming scheduler

2017-01-18 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nan Zhu updated SPARK-19280: Description: In one of our applications, we found the following issue, the application recovering from a

[jira] [Commented] (SPARK-17602) PySpark - Performance Optimization Large Size of Broadcast Variable

2017-01-18 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828709#comment-15828709 ] holdenk commented on SPARK-17602: - Ah yes, sorry I've been pretty busy. I just had an interesting chat

[jira] [Commented] (SPARK-17602) PySpark - Performance Optimization Large Size of Broadcast Variable

2017-01-18 Thread Junfeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828674#comment-15828674 ] Junfeng commented on SPARK-17602: - [~holdenk] could you send me instruction how to move forward this?? It

[jira] [Resolved] (SPARK-19266) Ensure DiskStore properly encrypts cached data on disk.

2017-01-18 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-19266. Resolution: Not A Problem Alright, I think I understand the code now.

[jira] [Resolved] (SPARK-19278) Failed Recovery from checkpoint caused by the multi-threads issue in Spark Streaming scheduler

2017-01-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19278. --- Resolution: Duplicate > Failed Recovery from checkpoint caused by the multi-threads issue in Spark

[jira] [Commented] (SPARK-16293) SparkAppHandle.getState() returns wrong state in standalone mode if Spark application terminates unexpectedly

2017-01-18 Thread Adam Kramer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828613#comment-15828613 ] Adam Kramer commented on SPARK-16293: - Was this verified as fixed? The duplicate refers to an issue

[jira] [Commented] (SPARK-19278) Failed Recovery from checkpoint caused by the multi-threads issue in Spark Streaming scheduler

2017-01-18 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828621#comment-15828621 ] Nan Zhu commented on SPARK-19278: - any one would help to close this one? as it is a duplication of

[jira] [Commented] (SPARK-14659) OneHotEncoder support drop first category alphabetically in the encoded vector

2017-01-18 Thread Wayne Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828618#comment-15828618 ] Wayne Zhang commented on SPARK-14659: - [~yanboliang] [~josephkb] Has anyone been working on this

[jira] [Commented] (SPARK-19279) Disallow Users to Create a Hive Table With an Empty Schema

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828605#comment-15828605 ] Apache Spark commented on SPARK-19279: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19279) Disallow Users to Create a Hive Table With an Empty Schema

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19279: Assignee: Xiao Li (was: Apache Spark) > Disallow Users to Create a Hive Table With an

[jira] [Assigned] (SPARK-19279) Disallow Users to Create a Hive Table With an Empty Schema

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19279: Assignee: Apache Spark (was: Xiao Li) > Disallow Users to Create a Hive Table With an

[jira] [Commented] (SPARK-19280) Failed Recovery from checkpoint caused by the multi-threads issue in Spark Streaming scheduler

2017-01-18 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828602#comment-15828602 ] Nan Zhu commented on SPARK-19280: - [~zsxwing] would you mind confirming about this? it would be great if

[jira] [Created] (SPARK-19280) Failed Recovery from checkpoint caused by the multi-threads issue in Spark Streaming scheduler

2017-01-18 Thread Nan Zhu (JIRA)
Nan Zhu created SPARK-19280: --- Summary: Failed Recovery from checkpoint caused by the multi-threads issue in Spark Streaming scheduler Key: SPARK-19280 URL: https://issues.apache.org/jira/browse/SPARK-19280

[jira] [Updated] (SPARK-19279) Disallow Users to Create a Hive Table With an Empty Schema

2017-01-18 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-19279: Summary: Disallow Users to Create a Hive Table With an Empty Schema (was: Disallow Users to Create a Hive

[jira] [Created] (SPARK-19279) Disallow Users to Create a Hive table With an Empty Schema

2017-01-18 Thread Xiao Li (JIRA)
Xiao Li created SPARK-19279: --- Summary: Disallow Users to Create a Hive table With an Empty Schema Key: SPARK-19279 URL: https://issues.apache.org/jira/browse/SPARK-19279 Project: Spark Issue Type:

[jira] [Created] (SPARK-19278) Failed Recovery from checkpoint caused by the multi-threads issue in Spark Streaming scheduler

2017-01-18 Thread Nan Zhu (JIRA)
Nan Zhu created SPARK-19278: --- Summary: Failed Recovery from checkpoint caused by the multi-threads issue in Spark Streaming scheduler Key: SPARK-19278 URL: https://issues.apache.org/jira/browse/SPARK-19278

[jira] [Commented] (SPARK-19059) Unable to retrieve data from a parquet table whose name starts with underscore

2017-01-18 Thread Jayadevan M (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828593#comment-15828593 ] Jayadevan M commented on SPARK-19059: - @uncleGen @Eric Liang @cloud-fan Could you please review this

[jira] [Commented] (SPARK-19059) Unable to retrieve data from a parquet table whose name starts with underscore

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828574#comment-15828574 ] Apache Spark commented on SPARK-19059: -- User 'jayadevanmurali' has created a pull request for this

[jira] [Assigned] (SPARK-19059) Unable to retrieve data from a parquet table whose name starts with underscore

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19059: Assignee: (was: Apache Spark) > Unable to retrieve data from a parquet table whose

[jira] [Assigned] (SPARK-19059) Unable to retrieve data from a parquet table whose name starts with underscore

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19059: Assignee: Apache Spark > Unable to retrieve data from a parquet table whose name starts

[jira] [Updated] (SPARK-14975) Predicted Probability per training instance for Gradient Boosted Trees in mllib.

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14975: -- Shepherd: Joseph K. Bradley > Predicted Probability per training instance for Gradient

[jira] [Commented] (SPARK-19059) Unable to retrieve data from a parquet table whose name starts with underscore

2017-01-18 Thread Thomas Sebastian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828559#comment-15828559 ] Thomas Sebastian commented on SPARK-19059: -- Me and [~jayadevan.m] working on it. > Unable to

[jira] [Updated] (SPARK-14975) Predicted Probability per training instance for Gradient Boosted Trees in mllib.

2017-01-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14975: -- Assignee: Ilya Matiach > Predicted Probability per training instance for Gradient

[jira] [Resolved] (SPARK-19182) Optimize the lock in StreamingJobProgressListener to not block UI when generating Streaming jobs

2017-01-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-19182. -- Resolution: Fixed Assignee: Genmao Yu Fix Version/s: 2.2.0 > Optimize the lock

[jira] [Resolved] (SPARK-19168) StateStore should be aborted upon error

2017-01-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-19168. -- Resolution: Fixed Assignee: Liwei Lin Fix Version/s: 2.2.0

[jira] [Resolved] (SPARK-19113) Fix flaky test: o.a.s.sql.streaming.StreamSuite fatal errors from a source should be sent to the user

2017-01-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-19113. -- Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 > Fix flaky test:

[jira] [Resolved] (SPARK-18113) Sending AskPermissionToCommitOutput failed, driver enter into task deadloop

2017-01-18 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-18113. Resolution: Fixed Assignee: jin xing Fix Version/s: 2.2.0 > Sending

[jira] [Created] (SPARK-19277) YARN topology script configuration needs to be localized by Spark

2017-01-18 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-19277: -- Summary: YARN topology script configuration needs to be localized by Spark Key: SPARK-19277 URL: https://issues.apache.org/jira/browse/SPARK-19277 Project: Spark

[jira] [Commented] (SPARK-16554) Spark should kill executors when they are blacklisted

2017-01-18 Thread Jose Soltren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828522#comment-15828522 ] Jose Soltren commented on SPARK-16554: -- Builds on some BlacklistTracker changes that should land

[jira] [Updated] (SPARK-16554) Spark should kill executors when they are blacklisted

2017-01-18 Thread Jose Soltren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jose Soltren updated SPARK-16554: - Shepherd: Imran Rashid > Spark should kill executors when they are blacklisted >

[jira] [Commented] (SPARK-16554) Spark should kill executors when they are blacklisted

2017-01-18 Thread Jose Soltren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828520#comment-15828520 ] Jose Soltren commented on SPARK-16554: -- I have some changes ready, but I'm going to wait for

[jira] [Commented] (SPARK-19276) FetchFailures can be hidden be user (or sql) exception handling

2017-01-18 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828518#comment-15828518 ] Imran Rashid commented on SPARK-19276: -- I haven't been successful in creating a test case to

[jira] [Updated] (SPARK-19276) FetchFailures can be hidden by user (or sql) exception handling

2017-01-18 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-19276: - Summary: FetchFailures can be hidden by user (or sql) exception handling (was: FetchFailures

[jira] [Commented] (SPARK-8480) Add setName for Dataframe

2017-01-18 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828512#comment-15828512 ] Emlyn Corrin commented on SPARK-8480: - [~skp33] OK, I can see that could be useful, but I think it's

[jira] [Updated] (SPARK-19276) FetchFailures can be hidden be user (or sql) exception handling

2017-01-18 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-19276: - Description: The scheduler handles node failures by looking for a special

[jira] [Created] (SPARK-19276) FetchFailures can be hidden be user (or sql) exception handling

2017-01-18 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-19276: Summary: FetchFailures can be hidden be user (or sql) exception handling Key: SPARK-19276 URL: https://issues.apache.org/jira/browse/SPARK-19276 Project: Spark

[jira] [Updated] (SPARK-19270) Add summary table to GLM summary

2017-01-18 Thread Wayne Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wayne Zhang updated SPARK-19270: Shepherd: Yanbo Liang > Add summary table to GLM summary > > >

[jira] [Assigned] (SPARK-14536) NPE in JDBCRDD when array column contains nulls (postgresql)

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14536: Assignee: Apache Spark > NPE in JDBCRDD when array column contains nulls (postgresql) >

[jira] [Assigned] (SPARK-14536) NPE in JDBCRDD when array column contains nulls (postgresql)

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14536: Assignee: (was: Apache Spark) > NPE in JDBCRDD when array column contains nulls

[jira] [Updated] (SPARK-18569) Support R formula arithmetic

2017-01-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18569: - Shepherd: Felix Cheung > Support R formula arithmetic > - > >

[jira] [Commented] (SPARK-14536) NPE in JDBCRDD when array column contains nulls (postgresql)

2017-01-18 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828463#comment-15828463 ] Xiao Li commented on SPARK-14536: - This issue needs to be resolved. > NPE in JDBCRDD when array column

[jira] [Resolved] (SPARK-19231) SparkR hangs when there is download or untar failure

2017-01-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-19231. -- Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 Target

[jira] [Reopened] (SPARK-14536) NPE in JDBCRDD when array column contains nulls (postgresql)

2017-01-18 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reopened SPARK-14536: - > NPE in JDBCRDD when array column contains nulls (postgresql) >

[jira] [Updated] (SPARK-19275) Spark Streaming, Kafka receiver, "Failed to get records for ... after polling for 512"

2017-01-18 Thread Dmitry Ochnev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Ochnev updated SPARK-19275: -- Description: We have a Spark Streaming application reading records from Kafka 0.10. Some

[jira] [Created] (SPARK-19275) Spark Streaming, Kafka receiver, "Failed to get records for ... after polling for 512"

2017-01-18 Thread Dmitry Ochnev (JIRA)
Dmitry Ochnev created SPARK-19275: - Summary: Spark Streaming, Kafka receiver, "Failed to get records for ... after polling for 512" Key: SPARK-19275 URL: https://issues.apache.org/jira/browse/SPARK-19275

[jira] [Commented] (SPARK-18569) Support R formula arithmetic

2017-01-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828435#comment-15828435 ] Felix Cheung commented on SPARK-18569: -- Yes, I'll put together a proposal and shepherd this >

[jira] [Comment Edited] (SPARK-18570) Consider supporting other R formula operators

2017-01-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828432#comment-15828432 ] Felix Cheung edited comment on SPARK-18570 at 1/18/17 5:27 PM: ---

[jira] [Commented] (SPARK-18570) Consider supporting other R formula operators

2017-01-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828432#comment-15828432 ] Felix Cheung commented on SPARK-18570: -- [~KrishnaKalyan3]I think supporting x * y (a+b+c)^2 and

[jira] [Commented] (SPARK-18011) SparkR serialize "NA" throws exception

2017-01-18 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828426#comment-15828426 ] Miao Wang commented on SPARK-18011: --- OS and R information: R version 3.3.0 (2016-05-03) -- "Supposedly

[jira] [Commented] (SPARK-19264) Work should start driver, the same to AM of yarn

2017-01-18 Thread hustfxj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828402#comment-15828402 ] hustfxj commented on SPARK-19264: - @Sean Owen Why not solve it like AM of Yarn. I remember the

[jira] [Commented] (SPARK-19264) Work should start driver, the same to AM of yarn

2017-01-18 Thread hustfxj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828366#comment-15828366 ] hustfxj commented on SPARK-19264: - Maybe you are right. We can't hard-kill the driver. But I don't

[jira] [Commented] (SPARK-15573) Backwards-compatible persistence for spark.ml

2017-01-18 Thread Asher Krim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828346#comment-15828346 ] Asher Krim commented on SPARK-15573: Any thoughts on determining the version in the loading logic?

[jira] [Resolved] (SPARK-18559) Fix HLL++ with small relative error

2017-01-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18559. --- Resolution: Fixed Assignee: Zhenhua Wang Fix Version/s: 2.2.0

[jira] [Commented] (SPARK-16968) Allow to add additional options when creating a new table in DF's JDBC writer.

2017-01-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828291#comment-15828291 ] Apache Spark commented on SPARK-16968: -- User 'gatorsmile' has created a pull request for this issue:

  1   2   >