[jira] [Updated] (SPARK-18686) Several cleanup and improvements for spark.logit

2016-12-01 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18686: Description: Several cleanup and improvements for {{spark.logit}}: * {{summary}} should return

[jira] [Created] (SPARK-18686) Several cleanup and improvements for spark.logit

2016-12-01 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-18686: --- Summary: Several cleanup and improvements for spark.logit Key: SPARK-18686 URL: https://issues.apache.org/jira/browse/SPARK-18686 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-18685) Fix all tests in ExecutorClassLoaderSuite to pass on Windows

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18685: Assignee: (was: Apache Spark) > Fix all tests in ExecutorClassLoaderSuite to pass on

[jira] [Commented] (SPARK-18685) Fix all tests in ExecutorClassLoaderSuite to pass on Windows

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15714356#comment-15714356 ] Apache Spark commented on SPARK-18685: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-18685) Fix all tests in ExecutorClassLoaderSuite to pass on Windows

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18685: Assignee: Apache Spark > Fix all tests in ExecutorClassLoaderSuite to pass on Windows >

[jira] [Created] (SPARK-18685) Fix all tests in ExecutorClassLoaderSuite to pass on Windows

2016-12-01 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-18685: Summary: Fix all tests in ExecutorClassLoaderSuite to pass on Windows Key: SPARK-18685 URL: https://issues.apache.org/jira/browse/SPARK-18685 Project: Spark

[jira] [Commented] (SPARK-18165) Kinesis support in Structured Streaming

2016-12-01 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15714327#comment-15714327 ] Takeshi Yamamuro commented on SPARK-18165: -- Thanks for the reference! I'd like to discuss the

[jira] [Closed] (SPARK-17909) we should create table before writing out the data in CTAS

2016-12-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan closed SPARK-17909. --- Resolution: Invalid > we should create table before writing out the data in CTAS >

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-12-01 Thread Aral Can Kaymaz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15714209#comment-15714209 ] Aral Can Kaymaz commented on SPARK-16845: - I am currently out of office, and will be back on

[jira] [Updated] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16845: Component/s: (was: Java API) >

[jira] [Updated] (SPARK-18661) Creating a partitioned datasource table should not scan all files for table

2016-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18661: Issue Type: Sub-task (was: Bug) Parent: SPARK-17861 > Creating a partitioned datasource

[jira] [Updated] (SPARK-18679) Regression in file listing performance

2016-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18679: Issue Type: Sub-task (was: Bug) Parent: SPARK-17861 > Regression in file listing

[jira] [Updated] (SPARK-18659) Incorrect behaviors in overwrite table for datasource tables

2016-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18659: Issue Type: Sub-task (was: Bug) Parent: SPARK-17861 > Incorrect behaviors in overwrite

[jira] [Assigned] (SPARK-18667) input_file_name function does not work with UDF

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18667: Assignee: Apache Spark > input_file_name function does not work with UDF >

[jira] [Assigned] (SPARK-18667) input_file_name function does not work with UDF

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18667: Assignee: (was: Apache Spark) > input_file_name function does not work with UDF >

[jira] [Commented] (SPARK-18667) input_file_name function does not work with UDF

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15714200#comment-15714200 ] Apache Spark commented on SPARK-18667: -- User 'viirya' has created a pull request for this issue:

[jira] [Resolved] (SPARK-18640) Fix minor synchronization issue in TaskSchedulerImpl.runningTasksByExecutors

2016-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18640. - Resolution: Fixed Fix Version/s: 2.1.0 2.0.3 Target

[jira] [Commented] (SPARK-18640) Fix minor synchronization issue in TaskSchedulerImpl.runningTasksByExecutors

2016-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15714189#comment-15714189 ] Reynold Xin commented on SPARK-18640: - [~andrewor14] how come you didn't close the ticket? > Fix

[jira] [Resolved] (SPARK-17213) Parquet String Pushdown for Non-Eq Comparisons Broken

2016-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17213. - Resolution: Fixed Fix Version/s: 2.1.0 > Parquet String Pushdown for Non-Eq Comparisons

[jira] [Resolved] (SPARK-18658) Writing to a text DataSource buffers one or more lines in memory

2016-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18658. - Resolution: Fixed Fix Version/s: 2.2.0 > Writing to a text DataSource buffers one or more

[jira] [Resolved] (SPARK-18663) Simplify CountMinSketch aggregate implementation

2016-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18663. - Resolution: Fixed Fix Version/s: 2.2.0 > Simplify CountMinSketch aggregate implementation

[jira] [Assigned] (SPARK-18620) Spark Streaming + Kinesis : Receiver MaxRate is violated

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18620: Assignee: Apache Spark > Spark Streaming + Kinesis : Receiver MaxRate is violated >

[jira] [Assigned] (SPARK-18620) Spark Streaming + Kinesis : Receiver MaxRate is violated

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18620: Assignee: (was: Apache Spark) > Spark Streaming + Kinesis : Receiver MaxRate is

[jira] [Commented] (SPARK-18620) Spark Streaming + Kinesis : Receiver MaxRate is violated

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15714116#comment-15714116 ] Apache Spark commented on SPARK-18620: -- User 'maropu' has created a pull request for this issue:

[jira] [Resolved] (SPARK-18647) do not put provider in table properties for Hive serde table

2016-12-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-18647. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 16080

[jira] [Updated] (SPARK-18284) Scheme of DataFrame generated from RDD is diffrent between master and 2.0

2016-12-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-18284: Assignee: Kazuaki Ishizaki > Scheme of DataFrame generated from RDD is diffrent between master and

[jira] [Resolved] (SPARK-18284) Scheme of DataFrame generated from RDD is diffrent between master and 2.0

2016-12-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-18284. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15780

[jira] [Comment Edited] (SPARK-12216) Spark failed to delete temp directory

2016-12-01 Thread Brian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713978#comment-15713978 ] Brian edited comment on SPARK-12216 at 12/2/16 4:24 AM: Theory or no for what

[jira] [Commented] (SPARK-12216) Spark failed to delete temp directory

2016-12-01 Thread Brian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713978#comment-15713978 ] Brian commented on SPARK-12216: --- Theory or no for what caused it, it's a bug in spark. Other programs and

[jira] [Commented] (SPARK-12216) Spark failed to delete temp directory

2016-12-01 Thread Brian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713974#comment-15713974 ] Brian commented on SPARK-12216: --- Why is this closed / marked as resolved? It is not resolved at all - this

[jira] [Assigned] (SPARK-18668) Do not auto-generate query name

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18668: Assignee: Tathagata Das (was: Apache Spark) > Do not auto-generate query name >

[jira] [Assigned] (SPARK-18657) Persist UUID across query restart

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18657: Assignee: (was: Apache Spark) > Persist UUID across query restart >

[jira] [Assigned] (SPARK-18668) Do not auto-generate query name

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18668: Assignee: Apache Spark (was: Tathagata Das) > Do not auto-generate query name >

[jira] [Assigned] (SPARK-18657) Persist UUID across query restart

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18657: Assignee: Apache Spark > Persist UUID across query restart >

[jira] [Commented] (SPARK-18668) Do not auto-generate query name

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713943#comment-15713943 ] Apache Spark commented on SPARK-18668: -- User 'tdas' has created a pull request for this issue:

[jira] [Commented] (SPARK-18657) Persist UUID across query restart

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713942#comment-15713942 ] Apache Spark commented on SPARK-18657: -- User 'tdas' has created a pull request for this issue:

[jira] [Commented] (SPARK-18620) Spark Streaming + Kinesis : Receiver MaxRate is violated

2016-12-01 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713894#comment-15713894 ] Takeshi Yamamuro commented on SPARK-18620: -- yea, I'll make a pr in a day > Spark Streaming +

[jira] [Commented] (SPARK-18620) Spark Streaming + Kinesis : Receiver MaxRate is violated

2016-12-01 Thread david przybill (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713889#comment-15713889 ] david przybill commented on SPARK-18620: Looks good to me. Thanks for the prompt answer > Spark

[jira] [Closed] (SPARK-18680) Throw Filtering is supported only on partition keys of type string exception

2016-12-01 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang closed SPARK-18680. --- Resolution: Duplicate > Throw Filtering is supported only on partition keys of type string exception

[jira] [Resolved] (SPARK-18538) Concurrent Fetching DataFrameReader JDBC APIs Do Not Work

2016-12-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-18538. - Resolution: Fixed Fix Version/s: 2.1.0 > Concurrent Fetching DataFrameReader JDBC APIs Do

[jira] [Updated] (SPARK-18141) jdbc datasource read fails when quoted columns (eg:mixed case, reserved words) in source table are used in the filter.

2016-12-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-18141: Assignee: Suresh Thalamati > jdbc datasource read fails when quoted columns (eg:mixed case, reserved >

[jira] [Commented] (SPARK-18681) Throw Filtering is supported only on partition keys of type string exception

2016-12-01 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713871#comment-15713871 ] Liang-Chi Hsieh commented on SPARK-18681: - Looks like you created two Jiras (SPARK-18680,

[jira] [Resolved] (SPARK-18141) jdbc datasource read fails when quoted columns (eg:mixed case, reserved words) in source table are used in the filter.

2016-12-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-18141. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15662

[jira] [Comment Edited] (SPARK-18684) Spark Executors off-heap memory usage keeps increasing while running spark streaming

2016-12-01 Thread Krishna Gandra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713855#comment-15713855 ] Krishna Gandra edited comment on SPARK-18684 at 12/2/16 3:02 AM: -

[jira] [Commented] (SPARK-18684) Spark Executors off-heap memory usage keeps increasing while running spark streaming

2016-12-01 Thread Krishna Gandra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713855#comment-15713855 ] Krishna Gandra commented on SPARK-18684: Executor off-heap size keeps increasing and eventually

[jira] [Created] (SPARK-18684) Spark Executors off-heap memory usage keeps increasing while running spark streaming

2016-12-01 Thread Krishna Gandra (JIRA)
Krishna Gandra created SPARK-18684: -- Summary: Spark Executors off-heap memory usage keeps increasing while running spark streaming Key: SPARK-18684 URL: https://issues.apache.org/jira/browse/SPARK-18684

[jira] [Updated] (SPARK-18665) Spark ThriftServer jobs where are canceled are still “STARTED”

2016-12-01 Thread cen yuhai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] cen yuhai updated SPARK-18665: -- Description: I find that some jobs are canceled, but their state is still "STARTED". I think this bug

[jira] [Updated] (SPARK-18665) Spark ThriftServer jobs where are canceled are still “STARTED”

2016-12-01 Thread cen yuhai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] cen yuhai updated SPARK-18665: -- Description: I find that some jobs are canceled, but their state is still "STARTED". I think this bug

[jira] [Updated] (SPARK-18665) Spark ThriftServer jobs where are canceled are still “STARTED”

2016-12-01 Thread cen yuhai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] cen yuhai updated SPARK-18665: -- Description: I find that some jobs are canceled, but their state is still "STARTED". I think this bug

[jira] [Assigned] (SPARK-18679) Regression in file listing performance

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18679: Assignee: Apache Spark > Regression in file listing performance >

[jira] [Assigned] (SPARK-18679) Regression in file listing performance

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18679: Assignee: (was: Apache Spark) > Regression in file listing performance >

[jira] [Commented] (SPARK-18679) Regression in file listing performance

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713826#comment-15713826 ] Apache Spark commented on SPARK-18679: -- User 'ericl' has created a pull request for this issue:

[jira] [Commented] (SPARK-13287) Standalone REST API throttling?

2016-12-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713742#comment-15713742 ] Shixiong Zhu commented on SPARK-13287: -- Created SPARK-18683. But I don't have time to work on it

[jira] [Created] (SPARK-18683) REST APIs for standalone Master and Workers

2016-12-01 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-18683: Summary: REST APIs for standalone Master and Workers Key: SPARK-18683 URL: https://issues.apache.org/jira/browse/SPARK-18683 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-18639) Build only a single pip package

2016-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18639. - Resolution: Fixed Fix Version/s: 2.1.0 > Build only a single pip package >

[jira] [Commented] (SPARK-13287) Standalone REST API throttling?

2016-12-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713735#comment-15713735 ] Shixiong Zhu commented on SPARK-13287: -- Right now there is no REST API for master. You use the REST

[jira] [Resolved] (SPARK-13287) Standalone REST API throttling?

2016-12-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-13287. -- Resolution: Not A Bug > Standalone REST API throttling? > --- > >

[jira] [Commented] (SPARK-18323) Update MLlib, GraphX websites for 2.1

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713738#comment-15713738 ] Joseph K. Bradley commented on SPARK-18323: --- Recommendations: * Update "Calling MLlib in

[jira] [Updated] (SPARK-18234) Update mode in structured streaming

2016-12-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18234: - Target Version/s: 2.2.0 > Update mode in structured streaming >

[jira] [Created] (SPARK-18682) Batch Source for Kafka

2016-12-01 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-18682: Summary: Batch Source for Kafka Key: SPARK-18682 URL: https://issues.apache.org/jira/browse/SPARK-18682 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-17822) JVMObjectTracker.objMap may leak JVM objects

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17822: -- Target Version/s: 2.0.3, 2.1.1, 2.2.0 (was: 2.0.3, 2.1.0) > JVMObjectTracker.objMap

[jira] [Commented] (SPARK-17822) JVMObjectTracker.objMap may leak JVM objects

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713646#comment-15713646 ] Joseph K. Bradley commented on SPARK-17822: --- Since 2.1 is underway and this is not a

[jira] [Updated] (SPARK-17823) Make JVMObjectTracker.objMap thread-safe

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17823: -- Target Version/s: 2.0.3, 2.1.1, 2.2.0 (was: 2.0.3, 2.1.0) > Make

[jira] [Commented] (SPARK-17823) Make JVMObjectTracker.objMap thread-safe

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713644#comment-15713644 ] Joseph K. Bradley commented on SPARK-17823: --- Since 2.1 is underway and this is not a

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-12-01 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713636#comment-15713636 ] Nicholas Chammas commented on SPARK-13587: -- [~tsp]: {quote} Previously, I have had reasonable

[jira] [Commented] (SPARK-18681) Throw Filtering is supported only on partition keys of type string exception

2016-12-01 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713624#comment-15713624 ] Yuming Wang commented on SPARK-18681: - I will submit a pull request for this issue later. > Throw Filtering

[jira] [Created] (SPARK-18680) Throw Filtering is supported only on partition keys of type string exception

2016-12-01 Thread Yuming Wang (JIRA)
Yuming Wang created SPARK-18680: --- Summary: Throw Filtering is supported only on partition keys of type string exception Key: SPARK-18680 URL: https://issues.apache.org/jira/browse/SPARK-18680 Project:

[jira] [Created] (SPARK-18681) Throw Filtering is supported only on partition keys of type string exception

2016-12-01 Thread Yuming Wang (JIRA)
Yuming Wang created SPARK-18681: --- Summary: Throw Filtering is supported only on partition keys of type string exception Key: SPARK-18681 URL: https://issues.apache.org/jira/browse/SPARK-18681 Project:

[jira] [Commented] (SPARK-17822) JVMObjectTracker.objMap may leak JVM objects

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713604#comment-15713604 ] Joseph K. Bradley commented on SPARK-17822: --- I've been able to observe something like this bug

[jira] [Resolved] (SPARK-18506) kafka 0.10 with Spark 2.02 auto.offset.reset=earliest will only read from a single partition on a multi partition topic when kafka-clients 0.10.0.1 is used

2016-12-01 Thread Heji Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Heji Kim resolved SPARK-18506. -- Resolution: Not A Problem Just another library incompatibility issue. We just downgraded the

[jira] [Updated] (SPARK-18506) kafka 0.10 with Spark 2.02 auto.offset.reset=earliest will only read from a single partition on a multi partition topic when kafka-clients 0.10.1.0 is used

2016-12-01 Thread Heji Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Heji Kim updated SPARK-18506: - Summary: kafka 0.10 with Spark 2.02 auto.offset.reset=earliest will only read from a single partition on

[jira] [Commented] (SPARK-18476) SparkR Logistic Regression should should support output original label.

2016-12-01 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713515#comment-15713515 ] Miao Wang commented on SPARK-18476: --- spark.logit predict should output original label instead of a

[jira] [Commented] (SPARK-18506) kafka 0.10 with Spark 2.02 auto.offset.reset=earliest will only read from a single partition on a multi partition topic

2016-12-01 Thread Heji Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713476#comment-15713476 ] Heji Kim commented on SPARK-18506: -- Breaking news I finally found the source of the problem. Our

[jira] [Updated] (SPARK-18506) kafka 0.10 with Spark 2.02 auto.offset.reset=earliest will only read from a single partition on a multi partition topic when kafka-clients 0.10.0.1 is used

2016-12-01 Thread Heji Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Heji Kim updated SPARK-18506: - Summary: kafka 0.10 with Spark 2.02 auto.offset.reset=earliest will only read from a single partition on

[jira] [Commented] (SPARK-18538) Concurrent Fetching DataFrameReader JDBC APIs Do Not Work

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713459#comment-15713459 ] Apache Spark commented on SPARK-18538: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Commented] (SPARK-18131) Support returning Vector/Dense Vector from backend

2016-12-01 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713430#comment-15713430 ] Miao Wang commented on SPARK-18131: --- I can try to follow this discussion for an initial PR. > Support

[jira] [Updated] (SPARK-18679) Regression in file listing performance

2016-12-01 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Liang updated SPARK-18679: --- Component/s: SQL > Regression in file listing performance > -- >

[jira] [Created] (SPARK-18679) Regression in file listing performance

2016-12-01 Thread Eric Liang (JIRA)
Eric Liang created SPARK-18679: -- Summary: Regression in file listing performance Key: SPARK-18679 URL: https://issues.apache.org/jira/browse/SPARK-18679 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-18679) Regression in file listing performance

2016-12-01 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Liang updated SPARK-18679: --- Affects Version/s: 2.1.0 > Regression in file listing performance >

[jira] [Commented] (SPARK-18618) SparkR model predict should support type as a argument

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713358#comment-15713358 ] Joseph K. Bradley commented on SPARK-18618: --- [~yanboliang] Shall we get this into 2.1 as a fix

[jira] [Commented] (SPARK-18291) SparkR glm predict should output original label when family = "binomial"

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713341#comment-15713341 ] Joseph K. Bradley commented on SPARK-18291: --- I just saw the comment at the end of the PR and

[jira] [Commented] (SPARK-13534) Implement Apache Arrow serializer for Spark DataFrame for use in DataFrame.toPandas

2016-12-01 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713340#comment-15713340 ] Bryan Cutler commented on SPARK-13534: -- Hi [~icexelloss], that sounds great! We could definitely

[jira] [Issue Comment Deleted] (SPARK-18476) SparkR Logistic Regression should should support output original label.

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18476: -- Comment: was deleted (was: [~wangmiao1981] This changes the output schema and is an

[jira] [Commented] (SPARK-18674) improve the error message of using join

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713321#comment-15713321 ] Apache Spark commented on SPARK-18674: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Commented] (SPARK-18476) SparkR Logistic Regression should should support output original label.

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713296#comment-15713296 ] Joseph K. Bradley commented on SPARK-18476: --- [~wangmiao1981] This changes the output schema and

[jira] [Updated] (SPARK-18291) SparkR glm predict should output original label when family = "binomial"

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18291: -- Attachment: SparkR2.1decisionoutputschemaforGLMs.pdf I'm adding a little summary of

[jira] [Commented] (SPARK-18588) KafkaSourceStressForDontFailOnDataLossSuite is flaky

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713179#comment-15713179 ] Apache Spark commented on SPARK-18588: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18670) Limit the number of StreamingQueryListener.StreamProgressEvent when there is no data

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18670: Assignee: Shixiong Zhu (was: Apache Spark) > Limit the number of

[jira] [Commented] (SPARK-18670) Limit the number of StreamingQueryListener.StreamProgressEvent when there is no data

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713142#comment-15713142 ] Apache Spark commented on SPARK-18670: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18670) Limit the number of StreamingQueryListener.StreamProgressEvent when there is no data

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18670: Assignee: Apache Spark (was: Shixiong Zhu) > Limit the number of

[jira] [Updated] (SPARK-18274) Memory leak in PySpark StringIndexer

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18274: -- Target Version/s: 2.0.3, 2.1.0 (was: 2.0.3, 2.1.1, 2.2.0) > Memory leak in PySpark

[jira] [Resolved] (SPARK-18274) Memory leak in PySpark StringIndexer

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-18274. --- Resolution: Fixed Fix Version/s: 2.2.0 2.0.3

[jira] [Updated] (SPARK-18642) Spark SQL: Catalyst is scanning undesired columns

2016-12-01 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-18642: -- Affects Version/s: 1.6.3 > Spark SQL: Catalyst is scanning undesired columns >

[jira] [Closed] (SPARK-18641) Show databases NullPointerException while Sentry turned on

2016-12-01 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun closed SPARK-18641. - Resolution: Invalid I close this issue because the reported error message comes from Sentry

[jira] [Commented] (SPARK-18641) Show databases NullPointerException while Sentry turned on

2016-12-01 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713103#comment-15713103 ] Dongjoon Hyun commented on SPARK-18641: --- Thank you for reporting, [~zhangqw] But, I'm wondering

[jira] [Updated] (SPARK-18274) Memory leak in PySpark StringIndexer

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18274: -- Shepherd: Joseph K. Bradley > Memory leak in PySpark StringIndexer >

[jira] [Updated] (SPARK-18274) Memory leak in PySpark StringIndexer

2016-12-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18274: -- Assignee: Sandeep Singh > Memory leak in PySpark StringIndexer >

[jira] [Commented] (SPARK-18642) Spark SQL: Catalyst is scanning undesired columns

2016-12-01 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15713081#comment-15713081 ] Dongjoon Hyun commented on SPARK-18642: --- Thank you for reporting, [~mohitgargk]. It seems to be the

[jira] [Created] (SPARK-18678) Skewed feature subsampling in Random forest

2016-12-01 Thread Bjoern Toldbod (JIRA)
Bjoern Toldbod created SPARK-18678: -- Summary: Skewed feature subsampling in Random forest Key: SPARK-18678 URL: https://issues.apache.org/jira/browse/SPARK-18678 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-18677) Json path implementation fails to parse ['key']

2016-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18677: Assignee: (was: Apache Spark) > Json path implementation fails to parse ['key'] >
