[jira] [Commented] (SPARK-24206) Improve DataSource benchmark code for read and pushdown

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466902#comment-16466902 ] Apache Spark commented on SPARK-24206: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24206) Improve DataSource benchmark code for read and pushdown

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24206: Assignee: Apache Spark > Improve DataSource benchmark code for read and pushdown >

[jira] [Assigned] (SPARK-24206) Improve DataSource benchmark code for read and pushdown

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24206: Assignee: (was: Apache Spark) > Improve DataSource benchmark code for read and

[jira] [Commented] (SPARK-18165) Kinesis support in Structured Streaming

2018-05-07 Thread Vikram Agrawal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466897#comment-16466897 ] Vikram Agrawal commented on SPARK-18165: Thanks [~marmbrus] - Planning to start the work on

[jira] [Updated] (SPARK-20114) spark.ml parity for sequential pattern mining - PrefixSpan

2018-05-07 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weichen Xu updated SPARK-20114: --- Component/s: (was: PySpark) > spark.ml parity for sequential pattern mining - PrefixSpan >

[jira] [Updated] (SPARK-24146) spark.ml parity for sequential pattern mining - PrefixSpan: Python API

2018-05-07 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weichen Xu updated SPARK-24146: --- Component/s: PySpark > spark.ml parity for sequential pattern mining - PrefixSpan: Python API >

[jira] [Commented] (SPARK-24146) spark.ml parity for sequential pattern mining - PrefixSpan: Python API

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466896#comment-16466896 ] Apache Spark commented on SPARK-24146: -- User 'WeichenXu123' has created a pull request for this

[jira] [Assigned] (SPARK-24146) spark.ml parity for sequential pattern mining - PrefixSpan: Python API

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24146: Assignee: Apache Spark > spark.ml parity for sequential pattern mining - PrefixSpan:

[jira] [Updated] (SPARK-20114) spark.ml parity for sequential pattern mining - PrefixSpan

2018-05-07 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weichen Xu updated SPARK-20114: --- Component/s: PySpark > spark.ml parity for sequential pattern mining - PrefixSpan >

[jira] [Assigned] (SPARK-24146) spark.ml parity for sequential pattern mining - PrefixSpan: Python API

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24146: Assignee: (was: Apache Spark) > spark.ml parity for sequential pattern mining -

[jira] [Created] (SPARK-24206) Improve DataSource benchmark code for read and pushdown

2018-05-07 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-24206: Summary: Improve DataSource benchmark code for read and pushdown Key: SPARK-24206 URL: https://issues.apache.org/jira/browse/SPARK-24206 Project: Spark

[jira] [Assigned] (SPARK-24128) Mention spark.sql.crossJoin.enabled in implicit cartesian product error msg

2018-05-07 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-24128: Assignee: Henry Robinson > Mention spark.sql.crossJoin.enabled in implicit cartesian

[jira] [Resolved] (SPARK-24128) Mention spark.sql.crossJoin.enabled in implicit cartesian product error msg

2018-05-07 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24128. -- Resolution: Fixed Fix Version/s: 2.3.1 2.4.0 Issue resolved by pull

[jira] [Resolved] (SPARK-23975) Allow Clustering to take Arrays of Double as input features

2018-05-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-23975. --- Resolution: Fixed Fix Version/s: 2.4.0 > Allow Clustering to take Arrays of Double as

[jira] [Updated] (SPARK-24205) java.util.concurrent.locks.LockSupport.parkNanos

2018-05-07 Thread joy-m (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] joy-m updated SPARK-24205: -- Attachment: 屏幕快照 2018-05-08 上午10.58.08.png > java.util.concurrent.locks.LockSupport.parkNanos >

[jira] [Updated] (SPARK-24205) java.util.concurrent.locks.LockSupport.parkNanos

2018-05-07 Thread joy-m (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] joy-m updated SPARK-24205: -- Attachment: (was: 屏幕快照 2018-05-06 上午10.04.27.png) > java.util.concurrent.locks.LockSupport.parkNanos >

[jira] [Updated] (SPARK-24205) java.util.concurrent.locks.LockSupport.parkNanos

2018-05-07 Thread joy-m (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] joy-m updated SPARK-24205: -- Attachment: 屏幕快照 2018-05-06 上午10.04.27.png > java.util.concurrent.locks.LockSupport.parkNanos >

[jira] [Created] (SPARK-24205) java.util.concurrent.locks.LockSupport.parkNanos

2018-05-07 Thread joy-m (JIRA)
joy-m created SPARK-24205: - Summary: java.util.concurrent.locks.LockSupport.parkNanos Key: SPARK-24205 URL: https://issues.apache.org/jira/browse/SPARK-24205 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-24204) Verify a write schema in OrcFileFormat

2018-05-07 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466742#comment-16466742 ] Takeshi Yamamuro commented on SPARK-24204: -- This fix is like:

[jira] [Assigned] (SPARK-24084) Add job group id for query through spark-sql

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24084: Assignee: (was: Apache Spark) > Add job group id for query through spark-sql >

[jira] [Commented] (SPARK-24084) Add job group id for query through spark-sql

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466741#comment-16466741 ] Apache Spark commented on SPARK-24084: -- User 'caneGuy' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24084) Add job group id for query through spark-sql

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24084: Assignee: Apache Spark > Add job group id for query through spark-sql >

[jira] [Created] (SPARK-24204) Verify a write schema in OrcFileFormat

2018-05-07 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-24204: Summary: Verify a write schema in OrcFileFormat Key: SPARK-24204 URL: https://issues.apache.org/jira/browse/SPARK-24204 Project: Spark Issue Type:

[jira] [Commented] (SPARK-24200) Read subdirectories with out asterisks

2018-05-07 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466700#comment-16466700 ] Hyukjin Kwon commented on SPARK-24200: -- If it's a question for now, I would suggest to ask it to

[jira] [Resolved] (SPARK-24199) Structured Streaming

2018-05-07 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24199. -- Resolution: Invalid Questions should go to mailing list rather than filing it as an issue

[jira] [Commented] (SPARK-24172) we should not apply operator pushdown to data source v2 many times

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466638#comment-16466638 ] Apache Spark commented on SPARK-24172: -- User 'rdblue' has created a pull request for this issue:

[jira] [Resolved] (SPARK-20114) spark.ml parity for sequential pattern mining - PrefixSpan

2018-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-20114. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20973

[jira] [Resolved] (SPARK-22885) ML test for StructuredStreaming: spark.ml.tuning

2018-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-22885. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20261

[jira] [Updated] (SPARK-23291) SparkR : substr : In SparkR dataframe , starting and ending position arguments in "substr" is giving wrong result when the position is greater than 1

2018-05-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-23291: Fix Version/s: 2.3.1 > SparkR : substr : In SparkR dataframe , starting and ending position >

[jira] [Resolved] (SPARK-15750) Constructing FPGrowth fails when no numPartitions specified in pyspark

2018-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-15750. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 13493

[jira] [Commented] (SPARK-24152) SparkR CRAN feasibility check server problem

2018-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466513#comment-16466513 ] Joseph K. Bradley commented on SPARK-24152: --- Thank you all! > SparkR CRAN feasibility check

[jira] [Commented] (SPARK-24203) Make executor's bindAddress configurable

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466474#comment-16466474 ] Apache Spark commented on SPARK-24203: -- User 'lukmajercak' has created a pull request for this

[jira] [Assigned] (SPARK-24203) Make executor's bindAddress configurable

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24203: Assignee: (was: Apache Spark) > Make executor's bindAddress configurable >

[jira] [Assigned] (SPARK-24203) Make executor's bindAddress configurable

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24203: Assignee: Apache Spark > Make executor's bindAddress configurable >

[jira] [Created] (SPARK-24203) Make executor's bindAddress configurable

2018-05-07 Thread Lukas Majercak (JIRA)
Lukas Majercak created SPARK-24203: -- Summary: Make executor's bindAddress configurable Key: SPARK-24203 URL: https://issues.apache.org/jira/browse/SPARK-24203 Project: Spark Issue Type:

[jira] [Updated] (SPARK-24202) Separate SQLContext dependencies from SparkSession.implicits

2018-05-07 Thread Gerard Maas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gerard Maas updated SPARK-24202: Description: The current implementation of the implicits in SparkSession passes the current

[jira] [Updated] (SPARK-24202) Separate SQLContext dependencies from SparkSession.implicits

2018-05-07 Thread Gerard Maas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gerard Maas updated SPARK-24202: Description: The current implementation of the implicits in SparkSession passes the current

[jira] [Created] (SPARK-24202) Separate SQLContext dependencies from SparkSession.implicits

2018-05-07 Thread Gerard Maas (JIRA)
Gerard Maas created SPARK-24202: --- Summary: Separate SQLContext dependencies from SparkSession.implicits Key: SPARK-24202 URL: https://issues.apache.org/jira/browse/SPARK-24202 Project: Spark

[jira] [Commented] (SPARK-18165) Kinesis support in Structured Streaming

2018-05-07 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466410#comment-16466410 ] Michael Armbrust commented on SPARK-18165: -- This is great!  I'm glad there are more connectors

[jira] [Updated] (SPARK-18165) Kinesis support in Structured Streaming

2018-05-07 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18165: - Component/s: (was: DStreams) Structured Streaming > Kinesis support

[jira] [Created] (SPARK-24201) IllegalArgumentException originating from ClosureCleaner in Java 9+

2018-05-07 Thread Grant Henke (JIRA)
Grant Henke created SPARK-24201: --- Summary: IllegalArgumentException originating from ClosureCleaner in Java 9+ Key: SPARK-24201 URL: https://issues.apache.org/jira/browse/SPARK-24201 Project: Spark

[jira] [Commented] (SPARK-24176) The hdfs file path with wildcard can not be identified when loading data

2018-05-07 Thread kevin yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466263#comment-16466263 ] kevin yu commented on SPARK-24176: -- I am looking at this one, will provide a proposal fix soon.  > The

[jira] [Commented] (SPARK-23529) Specify hostpath volume and mount the volume in Spark driver and executor pods in Kubernetes

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466203#comment-16466203 ] Apache Spark commented on SPARK-23529: -- User 'andrusha' has created a pull request for this issue:

[jira] [Commented] (SPARK-24112) Add `spark.sql.hive.convertMetastoreTableProperty` for backward compatiblility

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466189#comment-16466189 ] Apache Spark commented on SPARK-24112: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Commented] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2018-05-07 Thread Paul Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466099#comment-16466099 ] Paul Wu commented on SPARK-22371: - Got the same problem with 2.3 and also the program stalled: {{ 

[jira] [Updated] (SPARK-23161) Add missing APIs to Python GBTClassifier

2018-05-07 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-23161: - Description: GBTClassifier is missing \{{featureSubsetStrategy}}.  This should be moved to

[jira] [Comment Edited] (SPARK-18673) Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version

2018-05-07 Thread Darek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466088#comment-16466088 ] Darek edited comment on SPARK-18673 at 5/7/18 4:09 PM: ---

[jira] [Commented] (SPARK-18673) Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version

2018-05-07 Thread Darek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466088#comment-16466088 ] Darek commented on SPARK-18673: --- PR20819 for Spark => Hive 2.x was done but not merged and deleted. >

[jira] [Commented] (SPARK-23458) Flaky test: OrcQuerySuite

2018-05-07 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466078#comment-16466078 ] Xiao Li commented on SPARK-23458: - Yeah. [~dongjoon] Please investigate why they still fail. After your

[jira] [Resolved] (SPARK-24170) [Spark SQL] json file format is not dropped after dropping table

2018-05-07 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-24170. - Resolution: Not A Bug > [Spark SQL] json file format is not dropped after dropping table >

[jira] [Commented] (SPARK-24170) [Spark SQL] json file format is not dropped after dropping table

2018-05-07 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466071#comment-16466071 ] Xiao Li commented on SPARK-24170: - They are external tables when you specify the path in CREATE TABLE.

[jira] [Resolved] (SPARK-24043) InterpretedPredicate.eval fails if expression tree contains Nondeterministic expressions

2018-05-07 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-24043. --- Resolution: Fixed Assignee: Bruce Robbins Fix Version/s: 2.4.0 >

[jira] [Commented] (SPARK-18673) Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version

2018-05-07 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465950#comment-16465950 ] Steve Loughran commented on SPARK-18673: Good Q, [~Bidek]. That SPARK-23807 POM fixes up the

[jira] [Comment Edited] (SPARK-18673) Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version

2018-05-07 Thread Darek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465934#comment-16465934 ] Darek edited comment on SPARK-18673 at 5/7/18 1:59 PM: --- Based on the recent PR, the

[jira] [Commented] (SPARK-18673) Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version

2018-05-07 Thread Darek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465934#comment-16465934 ] Darek commented on SPARK-18673: --- Based on the recent PR, the community is moving toward Hadoop 3.1, why do

[jira] [Commented] (SPARK-18673) Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version

2018-05-07 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465925#comment-16465925 ] Steve Loughran commented on SPARK-18673: Josh Rosen added some changes, particularly: *

[jira] [Commented] (SPARK-23977) Add commit protocol binding to Hadoop 3.1 PathOutputCommitter mechanism

2018-05-07 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465917#comment-16465917 ] Steve Loughran commented on SPARK-23977: It will need the hadoop-aws module and deoendencies as

[jira] [Updated] (SPARK-24200) Read subdirectories with out asterisks

2018-05-07 Thread kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kumar updated SPARK-24200: -- Description: String folder = "/Users/test/data/* /* "; sparkContext.textFile(folder, 1).toJavaRDD()  Is

[jira] [Updated] (SPARK-24200) Read subdirectories with out asterisks

2018-05-07 Thread kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kumar updated SPARK-24200: -- Description: String folder = "/Users/test/data/ */* "; sparkContext.textFile(folder, 1).toJavaRDD()  Is

[jira] [Updated] (SPARK-24200) Read subdirectories with out asterisks

2018-05-07 Thread kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kumar updated SPARK-24200: -- Description: String folder = "/Users/test/data/ ** /** "; sparkContext.textFile(folder, 1).toJavaRDD()  Is

[jira] [Updated] (SPARK-24200) Read subdirectories with out asterisks

2018-05-07 Thread kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kumar updated SPARK-24200: -- Description: String folder = "/Users/test/data/* /* "; sparkContext.textFile(folder, 1).toJavaRDD()  Is

[jira] [Updated] (SPARK-24200) Read subdirectories with out asterisks

2018-05-07 Thread kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kumar updated SPARK-24200: -- Description: String folder = "/Users/test/data/*/*"; sparkContext.textFile(folder, 1).toJavaRDD()  Is

[jira] [Updated] (SPARK-24200) Read subdirectories with out asterisks

2018-05-07 Thread kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kumar updated SPARK-24200: -- Description: String folder = "/Users/test/data/"; sparkContext.textFile(folder, 1).toJavaRDD()  Is asterisks

[jira] [Created] (SPARK-24200) Read subdirectories with out asterisks

2018-05-07 Thread kumar (JIRA)
kumar created SPARK-24200: - Summary: Read subdirectories with out asterisks Key: SPARK-24200 URL: https://issues.apache.org/jira/browse/SPARK-24200 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-23933) High-order function: map(array, array) → map<K,V>

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23933: Assignee: (was: Apache Spark) > High-order function: map(array, array) → map >

[jira] [Commented] (SPARK-23933) High-order function: map(array, array) → map<K,V>

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465746#comment-16465746 ] Apache Spark commented on SPARK-23933: -- User 'kiszk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23933) High-order function: map(array, array) → map<K,V>

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23933: Assignee: Apache Spark > High-order function: map(array, array) → map >

[jira] [Assigned] (SPARK-24194) HadoopFsRelation cannot overwrite a path that is also being read from

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24194: Assignee: Apache Spark > HadoopFsRelation cannot overwrite a path that is also being read

[jira] [Assigned] (SPARK-24194) HadoopFsRelation cannot overwrite a path that is also being read from

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24194: Assignee: (was: Apache Spark) > HadoopFsRelation cannot overwrite a path that is also

[jira] [Commented] (SPARK-24194) HadoopFsRelation cannot overwrite a path that is also being read from

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465732#comment-16465732 ] Apache Spark commented on SPARK-24194: -- User 'zheh12' has created a pull request for this issue:

[jira] [Commented] (SPARK-24177) Spark returning inconsistent rows and data in a join query when run using Spark SQL (using SQLContext.sql(...))

2018-05-07 Thread Ajay Monga (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465729#comment-16465729 ] Ajay Monga commented on SPARK-24177: Thanks Marco. We have a few systems running on the latest

[jira] [Created] (SPARK-24199) Structured Streaming

2018-05-07 Thread shuke (JIRA)
shuke created SPARK-24199: - Summary: Structured Streaming Key: SPARK-24199 URL: https://issues.apache.org/jira/browse/SPARK-24199 Project: Spark Issue Type: Bug Components: DStreams

[jira] [Resolved] (SPARK-16406) Reference resolution for large number of columns should be faster

2018-05-07 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-16406. --- Resolution: Fixed Fix Version/s: 2.4.0 > Reference resolution for large

[jira] [Updated] (SPARK-24197) add array_sort function

2018-05-07 Thread Marek Novotny (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marek Novotny updated SPARK-24197: -- Description: Add a SparkR equivalent function to 

[jira] [Updated] (SPARK-24197) add array_sort function

2018-05-07 Thread Marek Novotny (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marek Novotny updated SPARK-24197: -- Description: Add a SparkR equivalent function to SPARK-23921. (was: Add a SparkR equivalent

[jira] [Commented] (SPARK-24198) add slice function

2018-05-07 Thread Marek Novotny (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465627#comment-16465627 ] Marek Novotny commented on SPARK-24198: --- I will work on this. Thanks. > add slice function >

[jira] [Created] (SPARK-24198) add slice function

2018-05-07 Thread Marek Novotny (JIRA)
Marek Novotny created SPARK-24198: - Summary: add slice function Key: SPARK-24198 URL: https://issues.apache.org/jira/browse/SPARK-24198 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-24197) add array_sort function

2018-05-07 Thread Marek Novotny (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465626#comment-16465626 ] Marek Novotny commented on SPARK-24197: --- I will work on this. Thanks. > add array_sort function >

[jira] [Created] (SPARK-24197) add array_sort function

2018-05-07 Thread Marek Novotny (JIRA)
Marek Novotny created SPARK-24197: - Summary: add array_sort function Key: SPARK-24197 URL: https://issues.apache.org/jira/browse/SPARK-24197 Project: Spark Issue Type: Sub-task

[jira] [Resolved] (SPARK-23930) High-order function: slice(x, start, length) → array

2018-05-07 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-23930. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21040

[jira] [Assigned] (SPARK-23930) High-order function: slice(x, start, length) → array

2018-05-07 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin reassigned SPARK-23930: - Assignee: Marco Gaido > High-order function: slice(x, start, length) → array >

[jira] [Updated] (SPARK-24196) Spark Thrift Server - SQL Client connections does't show db artefacts

2018-05-07 Thread rr (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] rr updated SPARK-24196: --- Attachment: screenshot-1.png > Spark Thrift Server - SQL Client connections does't show db artefacts >

[jira] [Commented] (SPARK-24160) ShuffleBlockFetcherIterator should fail if it receives zero-size blocks

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465536#comment-16465536 ] Apache Spark commented on SPARK-24160: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Updated] (SPARK-24196) Spark Thrift Server - SQL Client connections does't show db artefacts

2018-05-07 Thread rr (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] rr updated SPARK-24196: --- Description: When connecting to Spark Thrift Server via JDBC artefacts(db objects are not showing up) whereas when

[jira] [Assigned] (SPARK-24186) add array reverse and concat

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24186: Assignee: (was: Apache Spark) > add array reverse and concat >

[jira] [Commented] (SPARK-24186) add array reverse and concat

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465526#comment-16465526 ] Apache Spark commented on SPARK-24186: -- User 'huaxingao' has created a pull request for this issue:

[jira] [Updated] (SPARK-24196) Spark Thrift Server - SQL Client connections does't show db artefacts

2018-05-07 Thread rr (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] rr updated SPARK-24196: --- Description: When connecting to Spark Thrift Server via JDBC artefacts(db objects are not showing up) whereas when

[jira] [Created] (SPARK-24196) Spark Thrift Server - SQL Client connections does't show db artefacts

2018-05-07 Thread rr (JIRA)
rr created SPARK-24196: -- Summary: Spark Thrift Server - SQL Client connections does't show db artefacts Key: SPARK-24196 URL: https://issues.apache.org/jira/browse/SPARK-24196 Project: Spark Issue

[jira] [Assigned] (SPARK-24186) add array reverse and concat

2018-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24186: Assignee: Apache Spark > add array reverse and concat > - >

[jira] [Assigned] (SPARK-23775) Flaky test: DataFrameRangeSuite

2018-05-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-23775: --- Assignee: Gabor Somogyi > Flaky test: DataFrameRangeSuite > ---

[jira] [Resolved] (SPARK-23775) Flaky test: DataFrameRangeSuite

2018-05-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23775. - Resolution: Fixed Fix Version/s: 2.3.1 2.4.0 Issue resolved by pull

[jira] [Resolved] (SPARK-24160) ShuffleBlockFetcherIterator should fail if it receives zero-size blocks

2018-05-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-24160. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21219

[jira] [Resolved] (SPARK-23921) High-order function: array_sort(x) → array

2018-05-07 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-23921. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21021

[jira] [Assigned] (SPARK-23921) High-order function: array_sort(x) → array

2018-05-07 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin reassigned SPARK-23921: - Assignee: Kazuaki Ishizaki > High-order function: array_sort(x) → array >

[jira] [Resolved] (SPARK-24143) filter empty blocks when convert mapstatus to (blockId, size) pair

2018-05-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-24143. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21212

[jira] [Assigned] (SPARK-24143) filter empty blocks when convert mapstatus to (blockId, size) pair

2018-05-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-24143: --- Assignee: jin xing > filter empty blocks when convert mapstatus to (blockId, size) pair >