[jira] [Commented] (SPARK-20067) Use treeString to print out the table schema for CatalogTable

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937764#comment-15937764 ] Apache Spark commented on SPARK-20067: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20067) Use treeString to print out the table schema for CatalogTable

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20067: Assignee: Xiao Li (was: Apache Spark) > Use treeString to print out the table schema for

[jira] [Assigned] (SPARK-20067) Use treeString to print out the table schema for CatalogTable

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20067: Assignee: Apache Spark (was: Xiao Li) > Use treeString to print out the table schema for

[jira] [Created] (SPARK-20067) Use treeString to print out the table schema for CatalogTable

2017-03-22 Thread Xiao Li (JIRA)
Xiao Li created SPARK-20067: --- Summary: Use treeString to print out the table schema for CatalogTable Key: SPARK-20067 URL: https://issues.apache.org/jira/browse/SPARK-20067 Project: Spark Issue

[jira] [Resolved] (SPARK-19913) Log warning rather than throw AnalysisException when output is partitioned although format is memory, console or foreach

2017-03-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19913. --- Resolution: Won't Fix > Log warning rather than throw AnalysisException when output is partitioned

[jira] [Commented] (SPARK-14083) Analyze JVM bytecode and turn closures into Catalyst expressions

2017-03-22 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937754#comment-15937754 ] Liang-Chi Hsieh commented on SPARK-14083: - [~kiszk] Thanks for rebasing it. It is more convenient

[jira] [Commented] (SPARK-16060) Vectorized Orc reader

2017-03-22 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937711#comment-15937711 ] Liang-Chi Hsieh commented on SPARK-16060: - cc [~rxin] If the approach based on Hive package is

[jira] [Commented] (SPARK-17556) Executor side broadcast for broadcast joins

2017-03-22 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937707#comment-15937707 ] Liang-Chi Hsieh commented on SPARK-17556: - We may need to change the Target Version/s for this.

[jira] [Resolved] (SPARK-19169) columns changed orc table encouter 'IndexOutOfBoundsException' when read the old schema files

2017-03-22 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19169. -- Resolution: Invalid It seems the reporter is inactive, I can't reproduce this, it seems this

[jira] [Commented] (SPARK-19136) Aggregator with case class as output type fails with ClassCastException

2017-03-22 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937680#comment-15937680 ] Hyukjin Kwon commented on SPARK-19136: -- [~a1ray], do you think this JIRA is resolvable? >

[jira] [Commented] (SPARK-20061) Reading a file with colon (:) from S3 fails with URISyntaxException

2017-03-22 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937675#comment-15937675 ] Genmao Yu commented on SPARK-20061: --- Colon is not supported in hadoop, see

[jira] [Commented] (SPARK-20023) Can not see table comment when describe formatted table

2017-03-22 Thread chenerlu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937670#comment-15937670 ] chenerlu commented on SPARK-20023: -- Hi, I review the PR and test this PR, then I found table comment can

[jira] [Assigned] (SPARK-20066) Add explicit SecurityManager(SparkConf) constructor for backwards compatibility with Java

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20066: Assignee: (was: Apache Spark) > Add explicit SecurityManager(SparkConf) constructor

[jira] [Commented] (SPARK-20066) Add explicit SecurityManager(SparkConf) constructor for backwards compatibility with Java

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937666#comment-15937666 ] Apache Spark commented on SPARK-20066: -- User 'markgrover' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20066) Add explicit SecurityManager(SparkConf) constructor for backwards compatibility with Java

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20066: Assignee: Apache Spark > Add explicit SecurityManager(SparkConf) constructor for

[jira] [Comment Edited] (SPARK-19927) SparkThriftServer2 can not get ''--hivevar" variables in spark 2.1

2017-03-22 Thread bruce xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937644#comment-15937644 ] bruce xu edited comment on SPARK-19927 at 3/23/17 3:46 AM: --- [~q79969786] Thx

[jira] [Comment Edited] (SPARK-19927) SparkThriftServer2 can not get ''--hivevar" variables in spark 2.1

2017-03-22 Thread bruce xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937644#comment-15937644 ] bruce xu edited comment on SPARK-19927 at 3/23/17 3:41 AM: --- [~q79969786] Thx

[jira] [Comment Edited] (SPARK-19927) SparkThriftServer2 can not get ''--hivevar" variables in spark 2.1

2017-03-22 Thread bruce xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937644#comment-15937644 ] bruce xu edited comment on SPARK-19927 at 3/23/17 3:39 AM: --- [~q79969786] Thx

[jira] [Commented] (SPARK-20066) Add explicit SecurityManager(SparkConf) constructor for backwards compatibility with Java

2017-03-22 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937658#comment-15937658 ] Mark Grover commented on SPARK-20066: - I have attached some simple test code here:

[jira] [Comment Edited] (SPARK-19927) SparkThriftServer2 can not get ''--hivevar" variables in spark 2.1

2017-03-22 Thread bruce xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937644#comment-15937644 ] bruce xu edited comment on SPARK-19927 at 3/23/17 3:37 AM: --- [~q79969786] Thx

[jira] [Comment Edited] (SPARK-19927) SparkThriftServer2 can not get ''--hivevar" variables in spark 2.1

2017-03-22 Thread bruce xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937644#comment-15937644 ] bruce xu edited comment on SPARK-19927 at 3/23/17 3:34 AM: --- [~q79969786] Thx

[jira] [Comment Edited] (SPARK-19927) SparkThriftServer2 can not get ''--hivevar" variables in spark 2.1

2017-03-22 Thread bruce xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937644#comment-15937644 ] bruce xu edited comment on SPARK-19927 at 3/23/17 3:35 AM: --- [~q79969786] Thx

[jira] [Comment Edited] (SPARK-19927) SparkThriftServer2 can not get ''--hivevar" variables in spark 2.1

2017-03-22 Thread bruce xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937644#comment-15937644 ] bruce xu edited comment on SPARK-19927 at 3/23/17 3:34 AM: --- [~q79969786] Thx

[jira] [Updated] (SPARK-19927) SparkThriftServer2 can not get ''--hivevar" variables in spark 2.1

2017-03-22 Thread bruce xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] bruce xu updated SPARK-19927: - Description: suppose the content of file test.sql: -

[jira] [Created] (SPARK-20066) Add explicit SecurityManager(SparkConf) constructor for backwards compatibility with Java

2017-03-22 Thread Mark Grover (JIRA)
Mark Grover created SPARK-20066: --- Summary: Add explicit SecurityManager(SparkConf) constructor for backwards compatibility with Java Key: SPARK-20066 URL: https://issues.apache.org/jira/browse/SPARK-20066

[jira] [Commented] (SPARK-19927) SparkThriftServer2 can not get ''--hivevar" variables in spark 2.1

2017-03-22 Thread bruce xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937644#comment-15937644 ] bruce xu commented on SPARK-19927: -- [~q79969786] Thx for response. it is half right. reason: - issue

[jira] [Created] (SPARK-20065) Empty output files created for aggregation query in append mode

2017-03-22 Thread Silvio Fiorito (JIRA)
Silvio Fiorito created SPARK-20065: -- Summary: Empty output files created for aggregation query in append mode Key: SPARK-20065 URL: https://issues.apache.org/jira/browse/SPARK-20065 Project: Spark

[jira] [Commented] (SPARK-20009) Use user-friendly DDL formats for defining a schema in user-facing APIs

2017-03-22 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937467#comment-15937467 ] Takeshi Yamamuro commented on SPARK-20009: -- okay, I'll do it later. Thanks! > Use user-friendly

[jira] [Created] (SPARK-20064) Bump the PySpark verison number to 2.2

2017-03-22 Thread holdenk (JIRA)
holdenk created SPARK-20064: --- Summary: Bump the PySpark verison number to 2.2 Key: SPARK-20064 URL: https://issues.apache.org/jira/browse/SPARK-20064 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-18970) FileSource failure during file list refresh doesn't cause an application to fail, but stops further processing

2017-03-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-18970. -- Resolution: Fixed Fix Version/s: 2.1.0 I'm going to close this, but please

[jira] [Closed] (SPARK-17344) Kafka 0.8 support for Structured Streaming

2017-03-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust closed SPARK-17344. Resolution: Won't Fix Unless someone really wants to work on this, i think the fact that

[jira] [Updated] (SPARK-19965) DataFrame batch reader may fail to infer partitions when reading FileStreamSink's output

2017-03-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-19965: - Target Version/s: 2.2.0 > DataFrame batch reader may fail to infer partitions when

[jira] [Updated] (SPARK-19767) API Doc pages for Streaming with Kafka 0.10 not current

2017-03-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-19767: - Component/s: (was: Structured Streaming) DStreams > API Doc pages

[jira] [Resolved] (SPARK-19013) java.util.ConcurrentModificationException when using s3 path as checkpointLocation

2017-03-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-19013. -- Resolution: Later It seems like [HADOOP-13345] is the right solution here, but since

[jira] [Assigned] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20008: Assignee: Xiao Li (was: Apache Spark) >

[jira] [Assigned] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20008: Assignee: Apache Spark (was: Xiao Li) >

[jira] [Commented] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937430#comment-15937430 ] Apache Spark commented on SPARK-20008: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Resolved] (SPARK-19788) DataStreamReader/DataStreamWriter.option shall accept user-defined type

2017-03-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-19788. -- Resolution: Won't Fix Thanks for the suggestion. However, as [~zsxwing] said, the

[jira] [Resolved] (SPARK-19932) Disallow a case that might cause OOM for steaming deduplication

2017-03-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-19932. -- Resolution: Won't Fix Thanks for working on this. While I think it would be helpful

[jira] [Assigned] (SPARK-19876) Add OneTime trigger executor

2017-03-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reassigned SPARK-19876: Assignee: Tyson Condie > Add OneTime trigger executor >

[jira] [Updated] (SPARK-19876) Add OneTime trigger executor

2017-03-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-19876: - Target Version/s: 2.2.0 > Add OneTime trigger executor > >

[jira] [Updated] (SPARK-19989) Flaky Test: org.apache.spark.sql.kafka010.KafkaSourceStressSuite

2017-03-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-19989: - Description: This test failed recently here:

[jira] [Updated] (SPARK-19989) Flaky Test: org.apache.spark.sql.kafka010.KafkaSourceStressSuite

2017-03-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-19989: - Target Version/s: 2.2.0 > Flaky Test:

[jira] [Commented] (SPARK-18085) Better History Server scalability for many / large applications

2017-03-22 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937377#comment-15937377 ] Marcelo Vanzin commented on SPARK-18085: Hi all, an update. I started working on this again and I

[jira] [Created] (SPARK-20063) Trigger without delay when falling behind

2017-03-22 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-20063: Summary: Trigger without delay when falling behind Key: SPARK-20063 URL: https://issues.apache.org/jira/browse/SPARK-20063 Project: Spark Issue

[jira] [Created] (SPARK-20062) Inconsistent checking on ML estimator/model copy in the unit tests.

2017-03-22 Thread yuhao yang (JIRA)
yuhao yang created SPARK-20062: -- Summary: Inconsistent checking on ML estimator/model copy in the unit tests. Key: SPARK-20062 URL: https://issues.apache.org/jira/browse/SPARK-20062 Project: Spark

[jira] [Commented] (SPARK-20044) Support Spark UI behind front-end reverse proxy using a path prefix

2017-03-22 Thread Alex Bozarth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937356#comment-15937356 ] Alex Bozarth commented on SPARK-20044: -- I took a look at your link and it looks like it's on the

[jira] [Updated] (SPARK-20037) impossible to set kafka offsets using kafka 0.10 and spark 2.0.0

2017-03-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20037: -- Target Version/s: (was: 2.0.0) Priority: Major (was: Critical) Fix Version/s:

[jira] [Updated] (SPARK-19876) Add OneTime trigger executor

2017-03-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19876: -- Fix Version/s: (was: 2.2.0) > Add OneTime trigger executor > > >

[jira] [Updated] (SPARK-20036) impossible to read a whole kafka topic using kafka 0.10 and spark 2.0.0

2017-03-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20036: -- Target Version/s: (was: 2.0.0) Priority: Major (was: Critical) Fix Version/s:

[jira] [Comment Edited] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accept

2017-03-22 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937269#comment-15937269 ] yuhao yang edited comment on SPARK-20043 at 3/22/17 10:25 PM: -- Looks like a

[jira] [Updated] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accepted

2017-03-22 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-20043: --- Labels: starter (was: ) > CrossValidatorModel loader does not recognize impurity "Gini" and

[jira] [Commented] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accepted

2017-03-22 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15937269#comment-15937269 ] yuhao yang commented on SPARK-20043: Looks like a bug for tree models load. a toLower should be added

[jira] [Resolved] (SPARK-19613) Flaky test: StateStoreRDDSuite.versioning and immutability

2017-03-22 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-19613. Resolution: Cannot Reproduce I'm closing this because, while it had a burst of failures

[jira] [Resolved] (SPARK-19612) Tests failing with timeout

2017-03-22 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-19612. Resolution: Cannot Reproduce Closing this for now because I haven't seen this issue in a

[jira] [Commented] (SPARK-13747) Concurrent execution in SQL doesn't work with Scala ForkJoinPool

2017-03-22 Thread Barry Becker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936997#comment-15936997 ] Barry Becker commented on SPARK-13747: -- We have hit this on rare instances in our production

[jira] [Resolved] (SPARK-20057) Renamed KeyedState to GroupState

2017-03-22 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-20057. --- Resolution: Fixed Issue resolved by pull request 17385

[jira] [Commented] (SPARK-20040) Python API for ml.stat.ChiSquareTest

2017-03-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936905#comment-15936905 ] Joseph K. Bradley commented on SPARK-20040: --- Sure, go ahead, thanks! > Python API for

[jira] [Commented] (SPARK-20040) Python API for ml.stat.ChiSquareTest

2017-03-22 Thread Bago Amirbekian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936902#comment-15936902 ] Bago Amirbekian commented on SPARK-20040: - I'd like to work on this. > Python API for

[jira] [Commented] (SPARK-20009) Use user-friendly DDL formats for defining a schema in user-facing APIs

2017-03-22 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936876#comment-15936876 ] Michael Armbrust commented on SPARK-20009: -- Yeah, the DDL format is certainly a lot easier to

[jira] [Updated] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1

2017-03-22 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-20008: Priority: Minor (was: Major) > hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count()

[jira] [Commented] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1

2017-03-22 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936847#comment-15936847 ] Xiao Li commented on SPARK-20008: - This sounds a general issue for our Spark SQL. For example,

[jira] [Commented] (SPARK-20054) [Mesos] Detectability for resource starvation

2017-03-22 Thread Kamal Gurala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936759#comment-15936759 ] Kamal Gurala commented on SPARK-20054: -- Yes, the logs do help detect the issue. Do you think having

[jira] [Commented] (SPARK-17204) Spark 2.0 off heap RDD persistence with replication factor 2 leads to in-memory data corruption

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936733#comment-15936733 ] Apache Spark commented on SPARK-17204: -- User 'mallman' has created a pull request for this issue:

[jira] [Resolved] (SPARK-20018) Pivot with timestamp and count should not print internal representation

2017-03-22 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-20018. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.2.0 > Pivot with timestamp and

[jira] [Resolved] (SPARK-19927) SparkThriftServer2 can not get ''--hivevar" variables in spark 2.1

2017-03-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19927. --- Resolution: Duplicate > SparkThriftServer2 can not get ''--hivevar" variables in spark 2.1 >

[jira] [Commented] (SPARK-19984) ERROR codegen.CodeGenerator: failed to compile: org.codehaus.commons.compiler.CompileException: File 'generated.java'

2017-03-22 Thread Andrey Yakovenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936623#comment-15936623 ] Andrey Yakovenko commented on SPARK-19984: -- Unfortunately i cannot provide code since company

[jira] [Commented] (SPARK-19927) SparkThriftServer2 can not get ''--hivevar" variables in spark 2.1

2017-03-22 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936601#comment-15936601 ] Yuming Wang commented on SPARK-19927: - Is this duplicated by

[jira] [Commented] (SPARK-19837) Fetch failure throws a SparkException in SparkHiveWriter

2017-03-22 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936597#comment-15936597 ] Imran Rashid commented on SPARK-19837: -- I think this is addressed by SPARK-19276, which handles the

[jira] [Commented] (SPARK-20019) spark can not load alluxio fileSystem after adding jar

2017-03-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936548#comment-15936548 ] Sean Owen commented on SPARK-20019: --- I don't know, because I'm not sure this is supposed to work the

[jira] [Commented] (SPARK-20019) spark can not load alluxio fileSystem after adding jar

2017-03-22 Thread roncenzhao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936541#comment-15936541 ] roncenzhao commented on SPARK-20019: [~srowen] Should I create a PR for this problem? > spark can

[jira] [Commented] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1

2017-03-22 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936516#comment-15936516 ] Xiao Li commented on SPARK-20008: - Sure, will do. >

[jira] [Assigned] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1

2017-03-22 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-20008: --- Assignee: Xiao Li > hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns

[jira] [Commented] (SPARK-15487) Spark Master UI to reverse proxy Application and Workers UI

2017-03-22 Thread Oliver Koeth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936501#comment-15936501 ] Oliver Koeth commented on SPARK-15487: -- Seems the follow-up issue was never opened. I created

[jira] [Created] (SPARK-20061) Reading a file with colon (:) from S3 fails with URISyntaxException

2017-03-22 Thread Michel Lemay (JIRA)
Michel Lemay created SPARK-20061: Summary: Reading a file with colon (:) from S3 fails with URISyntaxException Key: SPARK-20061 URL: https://issues.apache.org/jira/browse/SPARK-20061 Project: Spark

[jira] [Commented] (SPARK-20044) Support Spark UI behind front-end reverse proxy using a path prefix

2017-03-22 Thread Oliver Koeth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936482#comment-15936482 ] Oliver Koeth commented on SPARK-20044: -- I tried a few (actually 5) experimental changes, see

[jira] [Updated] (SPARK-20049) Writing data to Parquet with partitions takes very long after the job finishes

2017-03-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20049: -- Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) Possibly a doc issue, but I

[jira] [Commented] (SPARK-20049) Writing data to Parquet with partitions takes very long after the job finishes

2017-03-22 Thread Jakub Nowacki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936421#comment-15936421 ] Jakub Nowacki commented on SPARK-20049: --- I did a bit more and the writing and, as it came out,

[jira] [Commented] (SPARK-19999) Test failures in Spark Core due to java.nio.Bits.unaligned()

2017-03-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936349#comment-15936349 ] Sean Owen commented on SPARK-1: --- Although this is going to be a very niche problem, and eventually

[jira] [Resolved] (SPARK-20027) Compilation fixed in java docs.

2017-03-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20027. --- Resolution: Fixed Assignee: Prashant Sharma Fix Version/s: 2.2.0 Resolved by

[jira] [Resolved] (SPARK-14265) When stage is reRubmitted, DAG visualization does not render correctly for this stage

2017-03-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14265. --- Resolution: Not A Problem > When stage is reRubmitted, DAG visualization does not render correctly

[jira] [Commented] (SPARK-20059) HbaseCredentialProvider uses wrong classloader

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936329#comment-15936329 ] Apache Spark commented on SPARK-20059: -- User 'jerryshao' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20059) HbaseCredentialProvider uses wrong classloader

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20059: Assignee: Apache Spark > HbaseCredentialProvider uses wrong classloader >

[jira] [Assigned] (SPARK-20059) HbaseCredentialProvider uses wrong classloader

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20059: Assignee: (was: Apache Spark) > HbaseCredentialProvider uses wrong classloader >

[jira] [Commented] (SPARK-19712) EXISTS and Left Semi join do not produce the same plan

2017-03-22 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936303#comment-15936303 ] Nattavut Sutyanyong commented on SPARK-19712: - Another scenario of a missed opportunity to

[jira] [Assigned] (SPARK-20060) Spark On Non-Yarn Mode with Kerberized HDFS ProxyUser Fails Talking to Hive MetaStore

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20060: Assignee: (was: Apache Spark) > Spark On Non-Yarn Mode with Kerberized HDFS ProxyUser

[jira] [Commented] (SPARK-20060) Spark On Non-Yarn Mode with Kerberized HDFS ProxyUser Fails Talking to Hive MetaStore

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936293#comment-15936293 ] Apache Spark commented on SPARK-20060: -- User 'yaooqinn' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20060) Spark On Non-Yarn Mode with Kerberized HDFS ProxyUser Fails Talking to Hive MetaStore

2017-03-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20060: Assignee: Apache Spark > Spark On Non-Yarn Mode with Kerberized HDFS ProxyUser Fails

[jira] [Created] (SPARK-20060) Spark On Non-Yarn Mode with Kerberized HDFS ProxyUser Fails Talking to Hive MetaStore

2017-03-22 Thread Kent Yao (JIRA)
Kent Yao created SPARK-20060: Summary: Spark On Non-Yarn Mode with Kerberized HDFS ProxyUser Fails Talking to Hive MetaStore Key: SPARK-20060 URL: https://issues.apache.org/jira/browse/SPARK-20060

[jira] [Commented] (SPARK-20058) the running application status changed from running to waiting when a master is down and it change to another standy by master

2017-03-22 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936255#comment-15936255 ] Saisai Shao commented on SPARK-20058: - Please subscribe this spark user mail list and set the

[jira] [Comment Edited] (SPARK-20058) the running application status changed from running to waiting when a master is down and it change to another standy by master

2017-03-22 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936255#comment-15936255 ] Saisai Shao edited comment on SPARK-20058 at 3/22/17 1:03 PM: -- Please

[jira] [Created] (SPARK-20059) HbaseCredentialProvider uses wrong classloader

2017-03-22 Thread Saisai Shao (JIRA)
Saisai Shao created SPARK-20059: --- Summary: HbaseCredentialProvider uses wrong classloader Key: SPARK-20059 URL: https://issues.apache.org/jira/browse/SPARK-20059 Project: Spark Issue Type: Bug

[jira] [Comment Edited] (SPARK-5236) java.lang.ClassCastException: org.apache.spark.sql.catalyst.expressions.MutableAny cannot be cast to org.apache.spark.sql.catalyst.expressions.MutableInt

2017-03-22 Thread Jorge Machado (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936222#comment-15936222 ] Jorge Machado edited comment on SPARK-5236 at 3/22/17 12:52 PM:

[jira] [Comment Edited] (SPARK-19992) spark-submit on deployment-mode cluster

2017-03-22 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936241#comment-15936241 ] Saisai Shao edited comment on SPARK-19992 at 3/22/17 12:51 PM: --- Oh, I see.

[jira] [Commented] (SPARK-19992) spark-submit on deployment-mode cluster

2017-03-22 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936241#comment-15936241 ] Saisai Shao commented on SPARK-19992: - Oh, I see. Check the code again, looks like "/*" cannot be

[jira] [Comment Edited] (SPARK-5236) java.lang.ClassCastException: org.apache.spark.sql.catalyst.expressions.MutableAny cannot be cast to org.apache.spark.sql.catalyst.expressions.MutableInt

2017-03-22 Thread Jorge Machado (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936222#comment-15936222 ] Jorge Machado edited comment on SPARK-5236 at 3/22/17 12:47 PM:

[jira] [Commented] (SPARK-5236) java.lang.ClassCastException: org.apache.spark.sql.catalyst.expressions.MutableAny cannot be cast to org.apache.spark.sql.catalyst.expressions.MutableInt

2017-03-22 Thread Jorge Machado (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936222#comment-15936222 ] Jorge Machado commented on SPARK-5236: -- [~marmbrus] Hi Michael, so I'm experience the same issue. I'm

[jira] [Commented] (SPARK-3728) RandomForest: Learn models too large to store in memory

2017-03-22 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-3728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936208#comment-15936208 ] Yan Facai (颜发才) commented on SPARK-3728: RandomForest already use a stack to save node, as

[jira] [Resolved] (SPARK-19992) spark-submit on deployment-mode cluster

2017-03-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19992. --- Resolution: Not A Problem [~jerryshao] is "/*" even going to work? This must be an env problem

[jira] [Resolved] (SPARK-19934) code comments are not very clearly in BlackListTracker.scala

2017-03-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19934. --- Resolution: Not A Problem > code comments are not very clearly in BlackListTracker.scala >

  1   2   >