[jira] [Commented] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property

2016-09-15 Thread Jonathan Taws (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15493395#comment-15493395 ] Jonathan Taws commented on SPARK-15917: --- Hi Andrew, Your 2 suggestions make a lot of sense, I'll

[jira] [Commented] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property

2016-09-15 Thread Jonathan Taws (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15493399#comment-15493399 ] Jonathan Taws commented on SPARK-15917: --- I didn't create a specific pull request for the moment as

[jira] [Created] (SPARK-17555) ExternalShuffleBlockResolver fails randomly with External Shuffle Service and Dynamic Resource Allocation on Mesos running under Marathon

2016-09-15 Thread Brad Willard (JIRA)
Brad Willard created SPARK-17555: Summary: ExternalShuffleBlockResolver fails randomly with External Shuffle Service and Dynamic Resource Allocation on Mesos running under Marathon Key: SPARK-17555 URL:

[jira] [Updated] (SPARK-17544) Timeout waiting for connection from pool, DataFrame Reader's not closing S3 connections?

2016-09-15 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-17544: -- Description: I have an application that loops through a text file to find files in S3

[jira] [Comment Edited] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property

2016-09-15 Thread Jonathan Taws (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15493395#comment-15493395 ] Jonathan Taws edited comment on SPARK-15917 at 9/15/16 4:05 PM: Hi

[jira] [Assigned] (SPARK-17458) Alias specified for aggregates in a pivot are not honored

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17458: Assignee: Apache Spark > Alias specified for aggregates in a pivot are not honored >

[jira] [Assigned] (SPARK-17458) Alias specified for aggregates in a pivot are not honored

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17458: Assignee: (was: Apache Spark) > Alias specified for aggregates in a pivot are not

[jira] [Commented] (SPARK-17458) Alias specified for aggregates in a pivot are not honored

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15493777#comment-15493777 ] Apache Spark commented on SPARK-17458: -- User 'aray' has created a pull request for this issue:

[jira] [Updated] (SPARK-17281) Add treeAggregateDepth parameter for AFTSurvivalRegression

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17281: -- Priority: Minor (was: Major) What's the use case, just consistency? why not on more jobs than just

[jira] [Commented] (SPARK-17406) Event Timeline will be very slow when there are too many executor events

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15492919#comment-15492919 ] Apache Spark commented on SPARK-17406: -- User 'srowen' has created a pull request for this issue:

[jira] [Commented] (SPARK-17538) sqlContext.registerDataFrameAsTable is not working sometimes in pyspark 2.0.0

2016-09-15 Thread Srinivas Rishindra Pothireddi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15492937#comment-15492937 ] Srinivas Rishindra Pothireddi commented on SPARK-17538: --- Hi [~srowen], I will fix

[jira] [Commented] (SPARK-16297) Mapping Boolean and string to BIT and NVARCHAR(MAX) for SQL Server jdbc dialect

2016-09-15 Thread Oussama Mekni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15492994#comment-15492994 ] Oussama Mekni commented on SPARK-16297: --- some updates about ? > Mapping Boolean and string to BIT

[jira] [Assigned] (SPARK-17549) InMemoryRelation doesn't scale to large tables

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17549: Assignee: Apache Spark > InMemoryRelation doesn't scale to large tables >

[jira] [Assigned] (SPARK-17549) InMemoryRelation doesn't scale to large tables

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17549: Assignee: (was: Apache Spark) > InMemoryRelation doesn't scale to large tables >

[jira] [Commented] (SPARK-17549) InMemoryRelation doesn't scale to large tables

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15493929#comment-15493929 ] Apache Spark commented on SPARK-17549: -- User 'vanzin' has created a pull request for this issue:

[jira] [Commented] (SPARK-16407) Allow users to supply custom StreamSinkProviders

2016-09-15 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15493936#comment-15493936 ] holdenk commented on SPARK-16407: - I think its important to keep in mind that these APIs are already

[jira] [Commented] (SPARK-17544) Timeout waiting for connection from pool, DataFrame Reader's not closing S3 connections?

2016-09-15 Thread Brady Auen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15493555#comment-15493555 ] Brady Auen commented on SPARK-17544: That looks like my issue too, thanks Josh! > Timeout waiting

[jira] [Resolved] (SPARK-17379) Upgrade netty-all to 4.0.41.Final (4.1.5-Final not compatible)

2016-09-15 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-17379. -- Resolution: Fixed Fix Version/s: 2.1.0 > Upgrade netty-all to 4.0.41.Final (4.1.5-Final

[jira] [Resolved] (SPARK-17451) CoarseGrainedExecutorBackend should inform driver before self-kill

2016-09-15 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-17451. -- Resolution: Fixed Assignee: Tejas Patil Fix Version/s: 2.1.0 >

[jira] [Created] (SPARK-17556) Executor side broadcast for broadcast joins

2016-09-15 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-17556: --- Summary: Executor side broadcast for broadcast joins Key: SPARK-17556 URL: https://issues.apache.org/jira/browse/SPARK-17556 Project: Spark Issue Type: New

[jira] [Created] (SPARK-17557) SQL query on parquet table java.lang.UnsupportedOperationException: org.apache.parquet.column.values.dictionary.PlainValuesDictionary

2016-09-15 Thread Egor Pahomov (JIRA)
Egor Pahomov created SPARK-17557: Summary: SQL query on parquet table java.lang.UnsupportedOperationException: org.apache.parquet.column.values.dictionary.PlainValuesDictionary Key: SPARK-17557 URL:

[jira] [Resolved] (SPARK-17547) Temporary shuffle data files may be leaked following exception in write

2016-09-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-17547. Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 1.6.3

[jira] [Resolved] (SPARK-17114) Adding a 'GROUP BY 1' where first column is literal results in wrong answer

2016-09-15 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-17114. --- Resolution: Fixed Assignee: Herman van Hovell Fix Version/s: 2.1.0

[jira] [Resolved] (SPARK-17429) spark sql length(1) return error

2016-09-15 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-17429. --- Resolution: Fixed Assignee: cen yuhai Fix Version/s: 2.1.0 > spark

[jira] [Updated] (SPARK-17483) Minor refactoring and cleanup in BlockManager block status reporting and block removal

2016-09-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-17483: --- Fix Version/s: 2.0.1 > Minor refactoring and cleanup in BlockManager block status reporting and >

[jira] [Resolved] (SPARK-17484) Race condition when cancelling a job during a cache write can lead to block fetch failures

2016-09-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-17484. Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 Issue resolved by pull

[jira] [Resolved] (SPARK-17364) Can not query hive table starting with number

2016-09-15 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-17364. --- Resolution: Fixed Assignee: Sean Zhong Fix Version/s: 2.1.0

[jira] [Resolved] (SPARK-17458) Alias specified for aggregates in a pivot are not honored

2016-09-15 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-17458. --- Resolution: Fixed Fix Version/s: 2.1.0 I cannot find the JIRA username of the

[jira] [Commented] (SPARK-17458) Alias specified for aggregates in a pivot are not honored

2016-09-15 Thread Andrew Ray (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494361#comment-15494361 ] Andrew Ray commented on SPARK-17458: It's a1ray > Alias specified for aggregates in a pivot are not

[jira] [Updated] (SPARK-17458) Alias specified for aggregates in a pivot are not honored

2016-09-15 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-17458: -- Assignee: Herman van Hovell > Alias specified for aggregates in a pivot are not

[jira] [Updated] (SPARK-10815) API design: data sources and sinks

2016-09-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10815: Description: The existing (in 2.0) source/sink interface for structured streaming depends on

[jira] [Commented] (SPARK-10816) API design: window and session specification

2016-09-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495011#comment-15495011 ] Reynold Xin commented on SPARK-10816: - I guess window specification was done, but session remains to

[jira] [Updated] (SPARK-10815) API design: data sources and sinks

2016-09-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10815: Description: The existing source/sink interface for structured streaming depends on RDDs. This

[jira] [Commented] (SPARK-5484) Pregel should checkpoint periodically to avoid StackOverflowError

2016-09-15 Thread ding (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495131#comment-15495131 ] ding commented on SPARK-5484: - I will work on the issue if nobody took it. > Pregel should checkpoint

[jira] [Commented] (SPARK-17477) SparkSQL cannot handle schema evolution from Int -> Long when parquet files have Int as its type while hive metastore has Long as its type

2016-09-15 Thread Gang Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494681#comment-15494681 ] Gang Wu commented on SPARK-17477: - Just confirmed that this also doesn't work with vectorized reader.

[jira] [Commented] (SPARK-16407) Allow users to supply custom StreamSinkProviders

2016-09-15 Thread Frederick Reiss (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494795#comment-15494795 ] Frederick Reiss commented on SPARK-16407: - With respect, I'm not seeing a whole lot of flux in

[jira] [Updated] (SPARK-15689) Data source API v2

2016-09-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15689: Target Version/s: 2.2.0 (was: 2.1.0) > Data source API v2 > -- > >

[jira] [Created] (SPARK-17558) Bump Hadoop 2.7 version from 2.7.2 to 2.7.3

2016-09-15 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-17558: --- Summary: Bump Hadoop 2.7 version from 2.7.2 to 2.7.3 Key: SPARK-17558 URL: https://issues.apache.org/jira/browse/SPARK-17558 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-17545) Spark SQL Catalyst doesn't handle ISO 8601 date without colon in offset

2016-09-15 Thread Nathan Beyer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494666#comment-15494666 ] Nathan Beyer commented on SPARK-17545: -- As a workaround, the following format can be set as an

[jira] [Commented] (SPARK-16534) Kafka 0.10 Python support

2016-09-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494955#comment-15494955 ] Reynold Xin commented on SPARK-16534: - [~maver1ck] thanks for the comment. That's a great point.

[jira] [Commented] (SPARK-10816) API design: window and session specification

2016-09-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495032#comment-15495032 ] Tathagata Das commented on SPARK-10816: --- Yeah, its yet to be designed. > API design: window and

[jira] [Updated] (SPARK-17545) Spark SQL Catalyst doesn't handle ISO 8601 date without colon in offset

2016-09-15 Thread Nathan Beyer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nathan Beyer updated SPARK-17545: - Summary: Spark SQL Catalyst doesn't handle ISO 8601 date without colon in offset (was: Spark

[jira] [Commented] (SPARK-17544) Timeout waiting for connection from pool, DataFrame Reader's not closing S3 connections?

2016-09-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494715#comment-15494715 ] Josh Rosen commented on SPARK-17544: 3.0.1 release fixing this should be available now:

[jira] [Commented] (SPARK-15472) Add support for writing in `csv`, `json`, `text` formats in Structured Streaming

2016-09-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495015#comment-15495015 ] Reynold Xin commented on SPARK-15472: - This is done in 2.0, isn't it? cc [~zsxwing] > Add support

[jira] [Assigned] (SPARK-17558) Bump Hadoop 2.7 version from 2.7.2 to 2.7.3

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17558: Assignee: Reynold Xin (was: Apache Spark) > Bump Hadoop 2.7 version from 2.7.2 to 2.7.3

[jira] [Assigned] (SPARK-17558) Bump Hadoop 2.7 version from 2.7.2 to 2.7.3

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17558: Assignee: Apache Spark (was: Reynold Xin) > Bump Hadoop 2.7 version from 2.7.2 to 2.7.3

[jira] [Commented] (SPARK-17558) Bump Hadoop 2.7 version from 2.7.2 to 2.7.3

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495125#comment-15495125 ] Apache Spark commented on SPARK-17558: -- User 'rxin' has created a pull request for this issue:

[jira] [Commented] (SPARK-16407) Allow users to supply custom StreamSinkProviders

2016-09-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495006#comment-15495006 ] Reynold Xin commented on SPARK-16407: - The source/sink interface currently depends on RDDs, doesn't

[jira] [Created] (SPARK-17559) PeriodicGraphCheckpointer didnot persist edges as expected in some cases

2016-09-15 Thread ding (JIRA)
ding created SPARK-17559: Summary: PeriodicGraphCheckpointer didnot persist edges as expected in some cases Key: SPARK-17559 URL: https://issues.apache.org/jira/browse/SPARK-17559 Project: Spark

[jira] [Commented] (SPARK-17545) Spark SQL Catalyst doesn't handle ISO 8601 date without colon in offset

2016-09-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495410#comment-15495410 ] Hyukjin Kwon commented on SPARK-17545: -- Therefore, IMHO, this is not an issue as we can workaround

[jira] [Comment Edited] (SPARK-17545) Spark SQL Catalyst doesn't handle ISO 8601 date without colon in offset

2016-09-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495408#comment-15495408 ] Hyukjin Kwon edited comment on SPARK-17545 at 9/16/16 5:20 AM: --- Hi

[jira] [Commented] (SPARK-17559) PeriodicGraphCheckpointer didnot persist edges as expected in some cases

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495370#comment-15495370 ] Apache Spark commented on SPARK-17559: -- User 'dding3' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17559) PeriodicGraphCheckpointer didnot persist edges as expected in some cases

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17559: Assignee: (was: Apache Spark) > PeriodicGraphCheckpointer didnot persist edges as

[jira] [Assigned] (SPARK-17559) PeriodicGraphCheckpointer didnot persist edges as expected in some cases

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17559: Assignee: Apache Spark > PeriodicGraphCheckpointer didnot persist edges as expected in

[jira] [Commented] (SPARK-17545) Spark SQL Catalyst doesn't handle ISO 8601 date without colon in offset

2016-09-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495413#comment-15495413 ] Hyukjin Kwon commented on SPARK-17545: -- FYI - this is related with

[jira] [Commented] (SPARK-17545) Spark SQL Catalyst doesn't handle ISO 8601 date without colon in offset

2016-09-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495408#comment-15495408 ] Hyukjin Kwon commented on SPARK-17545: -- Hi [~nbeyer], the basic ISO format currently follows

[jira] [Comment Edited] (SPARK-17522) [MESOS] More even distribution of executors on Mesos cluster

2016-09-15 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495243#comment-15495243 ] Saisai Shao edited comment on SPARK-17522 at 9/16/16 3:19 AM: -- [~sunrui] I

[jira] [Commented] (SPARK-17522) [MESOS] More even distribution of executors on Mesos cluster

2016-09-15 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495243#comment-15495243 ] Saisai Shao commented on SPARK-17522: - [~sunrui] I think the performance is depended on different

[jira] [Commented] (SPARK-17557) SQL query on parquet table java.lang.UnsupportedOperationException: org.apache.parquet.column.values.dictionary.PlainValuesDictionary

2016-09-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15495431#comment-15495431 ] Hyukjin Kwon commented on SPARK-17557: -- Do you mind if I ask a simple file so that I can reproduce

[jira] [Assigned] (SPARK-17543) Missing log4j config file for tests in common/network-shuffle

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17543: Assignee: Apache Spark > Missing log4j config file for tests in common/network-shuffle >

[jira] [Commented] (SPARK-17543) Missing log4j config file for tests in common/network-shuffle

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15492494#comment-15492494 ] Apache Spark commented on SPARK-17543: -- User 'jagadeesanas2' has created a pull request for this

[jira] [Assigned] (SPARK-17543) Missing log4j config file for tests in common/network-shuffle

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17543: Assignee: (was: Apache Spark) > Missing log4j config file for tests in

[jira] [Resolved] (SPARK-17440) Issue Exception when ALTER TABLE commands try to alter a VIEW

2016-09-15 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-17440. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15004

[jira] [Updated] (SPARK-17441) Issue Exceptions when ALTER TABLE RENAME PARTITION tries to alter a data source table

2016-09-15 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-17441: Assignee: Xiao Li > Issue Exceptions when ALTER TABLE RENAME PARTITION tries to alter a data >

[jira] [Resolved] (SPARK-17441) Issue Exceptions when ALTER TABLE RENAME PARTITION tries to alter a data source table

2016-09-15 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-17441. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15004

[jira] [Updated] (SPARK-17507) check weight vector size in ANN

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17507: -- Assignee: Weichen Xu Priority: Trivial (was: Major) > check weight vector size in ANN >

[jira] [Resolved] (SPARK-17507) check weight vector size in ANN

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17507. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15060

[jira] [Updated] (SPARK-17440) Issue Exception when ALTER TABLE commands try to alter a VIEW

2016-09-15 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-17440: Assignee: Xiao Li > Issue Exception when ALTER TABLE commands try to alter a VIEW >

[jira] [Resolved] (SPARK-17537) Improve performance for reading parquet schema

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17537. --- Resolution: Duplicate > Improve performance for reading parquet schema >

[jira] [Resolved] (SPARK-17552) Doubt about the double Synchronized in Object SparkSession.getOrCreate()

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17552. --- Resolution: Invalid Questions should go to user@. The two locks here are not the same one. > Doubt

[jira] [Commented] (SPARK-17552) Doubt about the double Synchronized in Object SparkSession.getOrCreate()

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15492689#comment-15492689 ] Sean Owen commented on SPARK-17552: --- The comments explain why one lock is used over the other > Doubt

[jira] [Commented] (SPARK-17546) start-* scripts should use hostname --fqdn

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15492685#comment-15492685 ] Sean Owen commented on SPARK-17546: --- {{hostname --fqdn}} doesn't work on OS X, so may be specific to

[jira] [Commented] (SPARK-16938) Cannot resolve column name after a join

2016-09-15 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15492643#comment-15492643 ] Dongjoon Hyun commented on SPARK-16938: --- Hi, [~cloud_fan]. Could you review the PR? Or, may I close

[jira] [Created] (SPARK-17552) Doubt about the double Synchronized in Object SparkSession.getOrCreate()

2016-09-15 Thread Jianfei Wang (JIRA)
Jianfei Wang created SPARK-17552: Summary: Doubt about the double Synchronized in Object SparkSession.getOrCreate() Key: SPARK-17552 URL: https://issues.apache.org/jira/browse/SPARK-17552 Project:

[jira] [Commented] (SPARK-17552) Doubt about the double Synchronized in Object SparkSession.getOrCreate()

2016-09-15 Thread Jianfei Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15492684#comment-15492684 ] Jianfei Wang commented on SPARK-17552: -- of course not the same one ,but only one thread can get into

[jira] [Resolved] (SPARK-17535) Performance Improvement of Signleton pattern in SparkContext

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17535. --- Resolution: Won't Fix I don't think volatile matters here. It's not the point I was making actually;

[jira] [Updated] (SPARK-17458) Alias specified for aggregates in a pivot are not honored

2016-09-15 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-17458: -- Assignee: (was: Herman van Hovell) > Alias specified for aggregates in a pivot are

[jira] [Comment Edited] (SPARK-16407) Allow users to supply custom StreamSinkProviders

2016-09-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494475#comment-15494475 ] Michael Armbrust edited comment on SPARK-16407 at 9/15/16 8:42 PM: ---

[jira] [Commented] (SPARK-16407) Allow users to supply custom StreamSinkProviders

2016-09-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494475#comment-15494475 ] Michael Armbrust commented on SPARK-16407: -- Sure, but the bar for compatibility is different for

[jira] [Resolved] (SPARK-17544) Timeout waiting for connection from pool, DataFrame Reader's not closing S3 connections?

2016-09-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-17544. Resolution: Invalid This is caused by a bug in the third-party {{spark-avro}} data source. I'm

[jira] [Commented] (SPARK-16264) Allow the user to use operators on the received DataFrame

2016-09-15 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494587#comment-15494587 ] Jakob Odersky commented on SPARK-16264: --- I just came across this issue through a comment in the

[jira] [Issue Comment Deleted] (SPARK-17458) Alias specified for aggregates in a pivot are not honored

2016-09-15 Thread Andrew Ray (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Ray updated SPARK-17458: --- Comment: was deleted (was: [~hvanhovell] It's a1ray) > Alias specified for aggregates in a pivot

[jira] [Assigned] (SPARK-17473) jdbc docker tests are failing with java.lang.AbstractMethodError:

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17473: Assignee: Apache Spark > jdbc docker tests are failing with

[jira] [Assigned] (SPARK-17473) jdbc docker tests are failing with java.lang.AbstractMethodError:

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17473: Assignee: (was: Apache Spark) > jdbc docker tests are failing with

[jira] [Commented] (SPARK-17473) jdbc docker tests are failing with java.lang.AbstractMethodError:

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494597#comment-15494597 ] Apache Spark commented on SPARK-17473: -- User 'sureshthalamati' has created a pull request for this

[jira] [Commented] (SPARK-17458) Alias specified for aggregates in a pivot are not honored

2016-09-15 Thread Andrew Ray (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494591#comment-15494591 ] Andrew Ray commented on SPARK-17458: [~hvanhovell]: My JIRA username is a1ray. > Alias specified for

[jira] [Comment Edited] (SPARK-17458) Alias specified for aggregates in a pivot are not honored

2016-09-15 Thread Andrew Ray (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494361#comment-15494361 ] Andrew Ray edited comment on SPARK-17458 at 9/15/16 8:09 PM: - [~hvanhovell]

[jira] [Commented] (SPARK-17508) Setting weightCol to None in ML library causes an error

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15494418#comment-15494418 ] Apache Spark commented on SPARK-17508: -- User 'BryanCutler' has created a pull request for this

[jira] [Assigned] (SPARK-17508) Setting weightCol to None in ML library causes an error

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17508: Assignee: (was: Apache Spark) > Setting weightCol to None in ML library causes an

[jira] [Assigned] (SPARK-17508) Setting weightCol to None in ML library causes an error

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17508: Assignee: Apache Spark > Setting weightCol to None in ML library causes an error >

[jira] [Comment Edited] (SPARK-17501) Re-register BlockManager again and again

2016-09-15 Thread cen yuhai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15484185#comment-15484185 ] cen yuhai edited comment on SPARK-17501 at 9/15/16 9:19 AM: I can't hardly

[jira] [Updated] (SPARK-17523) Cannot get Spark build info from spark-core package which built in Windows

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17523: -- Fix Version/s: (was: 2.0.1) Component/s: (was: Spark Core) > Cannot get Spark build info

[jira] [Assigned] (SPARK-17501) Re-register BlockManager again and again

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17501: Assignee: Apache Spark > Re-register BlockManager again and again >

[jira] [Commented] (SPARK-17501) Re-register BlockManager again and again

2016-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15492860#comment-15492860 ] Apache Spark commented on SPARK-17501: -- User 'cenyuhai' has created a pull request for this issue:

[jira] [Updated] (SPARK-17524) RowBasedKeyValueBatchSuite always uses 64 mb page size

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17524: -- Assignee: Adam Roberts Priority: Trivial (was: Minor) > RowBasedKeyValueBatchSuite always uses 64

[jira] [Resolved] (SPARK-17524) RowBasedKeyValueBatchSuite always uses 64 mb page size

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17524. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15079

[jira] [Resolved] (SPARK-17406) Event Timeline will be very slow when there are too many executor events

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17406. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14969

[jira] [Resolved] (SPARK-17505) Add setBins for BinaryClassificationMetrics in mlllb/evaluation

2016-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17505. --- Resolution: Won't Fix > Add setBins for BinaryClassificationMetrics in mlllb/evaluation >

[jira] [Created] (SPARK-17554) spark.executor.memory option not working

2016-09-15 Thread Sankar Mittapally (JIRA)
Sankar Mittapally created SPARK-17554: - Summary: spark.executor.memory option not working Key: SPARK-17554 URL: https://issues.apache.org/jira/browse/SPARK-17554 Project: Spark Issue

[jira] [Commented] (SPARK-17538) sqlContext.registerDataFrameAsTable is not working sometimes in pyspark 2.0.0

2016-09-15 Thread Srinivas Rishindra Pothireddi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15493075#comment-15493075 ] Srinivas Rishindra Pothireddi commented on SPARK-17538: --- Hi [~srowen], I updated

  1   2   >