[jira] [Commented] (SPARK-18111) Wrong ApproximatePercentile answer when multiple records have the minimum value

2016-10-27 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611160#comment-15611160 ] Zhenhua Wang commented on SPARK-18111: -- [~srowen] Sorry for the ambiguous example. I've updated the

[jira] [Updated] (SPARK-18135) Failed to access Spark UI and HistoryServer UI

2016-10-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Genmao Yu updated SPARK-18135: -- Description: h2. Environment 1. Spark package: build with Hadoop-2.7.2

[jira] [Updated] (SPARK-18135) Failed to access Spark UI and HistoryServer UI

2016-10-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Genmao Yu updated SPARK-18135: -- Description: h2. Environment 1. Spark package: build with Hadoop-2.7.2

[jira] [Commented] (SPARK-18111) Wrong ApproximatePercentile answer when multiple records have the minimum value

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611187#comment-15611187 ] Sean Owen commented on SPARK-18111: --- Yes that's clear, thank you. Just checking. It really says '5',

[jira] [Updated] (SPARK-18137) RewriteDistinctAggregates UnresolvedException because of UDAF TypeCheckFailure

2016-10-27 Thread Song Jun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Song Jun updated SPARK-18137: - Description: when run a sql with distinct(on spark github master branch), it throw UnresolvedException.

[jira] [Created] (SPARK-18137) RewriteDistinctAggregates UnresolvedException because of UDAF TypeCheckFailure

2016-10-27 Thread Song Jun (JIRA)
Song Jun created SPARK-18137: Summary: RewriteDistinctAggregates UnresolvedException because of UDAF TypeCheckFailure Key: SPARK-18137 URL: https://issues.apache.org/jira/browse/SPARK-18137 Project:

[jira] [Commented] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-27 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610979#comment-15610979 ] Liang-Chi Hsieh commented on SPARK-18107: - I found a PR at Hive which should be the one to

[jira] [Commented] (SPARK-18133) Python ML Pipeline Example has syntax errors

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610976#comment-15610976 ] Sean Owen commented on SPARK-18133: --- He must be using Python 3, which no longer has long vs int, and

[jira] [Created] (SPARK-18135) Failed to access Spark UI and HistoryServer UI

2016-10-27 Thread Genmao Yu (JIRA)
Genmao Yu created SPARK-18135: - Summary: Failed to access Spark UI and HistoryServer UI Key: SPARK-18135 URL: https://issues.apache.org/jira/browse/SPARK-18135 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-18111) Wrong ApproximatePercentile answer when multiple records have the minimum value

2016-10-27 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611236#comment-15611236 ] Zhenhua Wang commented on SPARK-18111: -- Everytime it calls compress(), it will lose one duplicated

[jira] [Comment Edited] (SPARK-18125) Spark generated code causes CompileException when groupByKey, reduceGroups and map(_._2) are used

2016-10-27 Thread Zach Kull (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611142#comment-15611142 ] Zach Kull edited comment on SPARK-18125 at 10/27/16 8:21 AM: - Same here:

[jira] [Created] (SPARK-18136) Make PySpark pip install works on windows

2016-10-27 Thread holdenk (JIRA)
holdenk created SPARK-18136: --- Summary: Make PySpark pip install works on windows Key: SPARK-18136 URL: https://issues.apache.org/jira/browse/SPARK-18136 Project: Spark Issue Type: Improvement

[jira] [Issue Comment Deleted] (SPARK-15854) Spark History server gets null pointer exception

2016-10-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Genmao Yu updated SPARK-15854: -- Comment: was deleted (was: Got the same problem, resolved?) > Spark History server gets null pointer

[jira] [Commented] (SPARK-18135) Failed to access Spark UI and HistoryServer UI

2016-10-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611214#comment-15611214 ] Genmao Yu commented on SPARK-18135: --- [~srowen] I bulid the spark package by myself, the spark classpath

[jira] [Commented] (SPARK-18111) Wrong ApproximatePercentile answer when multiple records have the minimum value

2016-10-27 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611229#comment-15611229 ] Zhenhua Wang commented on SPARK-18111: -- Yes, it says '5'. Because the samples in QuantileSummaries

[jira] [Comment Edited] (SPARK-14927) DataFrame. saveAsTable creates RDD partitions but not Hive partitions

2016-10-27 Thread Raul Saez Tapia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610900#comment-15610900 ] Raul Saez Tapia edited comment on SPARK-14927 at 10/27/16 6:46 AM: ---

[jira] [Updated] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18134: -- Priority: Major (was: Blocker) > SQL: MapType in Group BY and Joins not working >

[jira] [Commented] (SPARK-15854) Spark History server gets null pointer exception

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1562#comment-1562 ] Sean Owen commented on SPARK-15854: --- That isn't an NPE underneath; it's just a knock-on error. It shows

[jira] [Comment Edited] (SPARK-18111) Wrong ApproximatePercentile answer when multiple records have the minimum value

2016-10-27 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611160#comment-15611160 ] Zhenhua Wang edited comment on SPARK-18111 at 10/27/16 8:35 AM: [~srowen]

[jira] [Updated] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-10-27 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-17157: - Affects Version/s: 2.0.0 Fix Version/s: 2.1.0 > Add multiclass logistic regression

[jira] [Resolved] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-10-27 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-17157. -- Resolution: Fixed Assignee: Miao Wang > Add multiclass logistic regression SparkR

[jira] [Comment Edited] (SPARK-15854) Spark History server gets null pointer exception

2016-10-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610839#comment-15610839 ] Genmao Yu edited comment on SPARK-15854 at 10/27/16 6:19 AM: - Got the same

[jira] [Commented] (SPARK-15854) Spark History server gets null pointer exception

2016-10-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610839#comment-15610839 ] Genmao Yu commented on SPARK-15854: --- Got the same problem > Spark History server gets null pointer

[jira] [Comment Edited] (SPARK-14927) DataFrame. saveAsTable creates RDD partitions but not Hive partitions

2016-10-27 Thread Raul Saez Tapia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610900#comment-15610900 ] Raul Saez Tapia edited comment on SPARK-14927 at 10/27/16 6:55 AM: ---

[jira] [Commented] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-27 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610962#comment-15610962 ] Liang-Chi Hsieh commented on SPARK-18107: - I checked the current codes for inserting into Hive

[jira] [Commented] (SPARK-18125) Spark generated code causes CompileException when groupByKey, reduceGroups and map(_._2) are used

2016-10-27 Thread Zach Kull (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611142#comment-15611142 ] Zach Kull commented on SPARK-18125: --- Same here: After groupByKey.reduceGroups.map(_._2):

[jira] [Comment Edited] (SPARK-18135) Failed to access Spark UI and HistoryServer UI

2016-10-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611214#comment-15611214 ] Genmao Yu edited comment on SPARK-18135 at 10/27/16 8:59 AM: - [~srowen] I

[jira] [Comment Edited] (SPARK-14927) DataFrame. saveAsTable creates RDD partitions but not Hive partitions

2016-10-27 Thread Raul Saez Tapia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610900#comment-15610900 ] Raul Saez Tapia edited comment on SPARK-14927 at 10/27/16 6:50 AM: ---

[jira] [Commented] (SPARK-18133) Python ML Pipeline Example has syntax errors

2016-10-27 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610961#comment-15610961 ] Miao Wang commented on SPARK-18133: --- Python 2.7.11 |Anaconda 2.4.0 (x86_64)| (default, Dec 6 2015,

[jira] [Commented] (SPARK-18127) Add hooks and extension points to Spark

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611124#comment-15611124 ] Sean Owen commented on SPARK-18127: --- There's not much of any detail here. Can you elaborate or close?

[jira] [Updated] (SPARK-18111) Wrong ApproximatePercentile answer when multiple records have the minimum value

2016-10-27 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-18111: - Description: When multiple records have the minimum value, the answer of ApproximatePercentile

[jira] [Assigned] (SPARK-18129) Sign pip artifacts

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18129: Assignee: Apache Spark > Sign pip artifacts > -- > > Key:

[jira] [Commented] (SPARK-1267) Add a pip installer for PySpark

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611202#comment-15611202 ] Apache Spark commented on SPARK-1267: - User 'holdenk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-1267) Add a pip installer for PySpark

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-1267: --- Assignee: (was: Apache Spark) > Add a pip installer for PySpark >

[jira] [Comment Edited] (SPARK-14927) DataFrame. saveAsTable creates RDD partitions but not Hive partitions

2016-10-27 Thread Raul Saez Tapia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610900#comment-15610900 ] Raul Saez Tapia edited comment on SPARK-14927 at 10/27/16 6:51 AM: ---

[jira] [Commented] (SPARK-18133) Python ML Pipeline Example has syntax errors

2016-10-27 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610968#comment-15610968 ] Miao Wang commented on SPARK-18133: --- Use Pyspark: >>> training = spark.createDataFrame([ ...

[jira] [Comment Edited] (SPARK-18135) Failed to access Spark UI and HistoryServer UI

2016-10-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611214#comment-15611214 ] Genmao Yu edited comment on SPARK-18135 at 10/27/16 8:58 AM: - [~srowen] I

[jira] [Comment Edited] (SPARK-18135) Failed to access Spark UI and HistoryServer UI

2016-10-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611214#comment-15611214 ] Genmao Yu edited comment on SPARK-18135 at 10/27/16 8:58 AM: - [~srowen] I

[jira] [Created] (SPARK-18133) Python ML Pipeline Example has syntax errors

2016-10-27 Thread Nirmal Fernando (JIRA)
Nirmal Fernando created SPARK-18133: --- Summary: Python ML Pipeline Example has syntax errors Key: SPARK-18133 URL: https://issues.apache.org/jira/browse/SPARK-18133 Project: Spark Issue

[jira] [Commented] (SPARK-14927) DataFrame. saveAsTable creates RDD partitions but not Hive partitions

2016-10-27 Thread Raul Saez Tapia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610900#comment-15610900 ] Raul Saez Tapia commented on SPARK-14927: - [~xwu0226] for me is working fine your example with

[jira] [Commented] (SPARK-18111) Wrong ApproximatePercentile answer when multiple records have the minimum value

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610948#comment-15610948 ] Sean Owen commented on SPARK-18111: --- You may have a point but this example still doesn't seem to show

[jira] [Created] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-10-27 Thread Christian Zorneck (JIRA)
Christian Zorneck created SPARK-18134: - Summary: SQL: MapType in Group BY and Joins not working Key: SPARK-18134 URL: https://issues.apache.org/jira/browse/SPARK-18134 Project: Spark

[jira] [Comment Edited] (SPARK-18111) Wrong ApproximatePercentile answer when multiple records have the minimum value

2016-10-27 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611160#comment-15611160 ] Zhenhua Wang edited comment on SPARK-18111 at 10/27/16 8:31 AM: [~srowen]

[jira] [Comment Edited] (SPARK-14927) DataFrame. saveAsTable creates RDD partitions but not Hive partitions

2016-10-27 Thread Raul Saez Tapia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15610900#comment-15610900 ] Raul Saez Tapia edited comment on SPARK-14927 at 10/27/16 7:36 AM: ---

[jira] [Assigned] (SPARK-13770) Document the ML feature Interaction

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13770: Assignee: Apache Spark > Document the ML feature Interaction >

[jira] [Commented] (SPARK-13770) Document the ML feature Interaction

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611040#comment-15611040 ] Apache Spark commented on SPARK-13770: -- User 'hayashidac' has created a pull request for this issue:

[jira] [Commented] (SPARK-13770) Document the ML feature Interaction

2016-10-27 Thread chie hayashida (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611044#comment-15611044 ] chie hayashida commented on SPARK-13770: I added examples and documentation. please check it. >

[jira] [Commented] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-27 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611083#comment-15611083 ] Liang-Chi Hsieh commented on SPARK-18107: - I can create a PR for this. But it may require

[jira] [Commented] (SPARK-18133) Python ML Pipeline Example has syntax errors

2016-10-27 Thread Jagadeesan A S (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1560#comment-1560 ] Jagadeesan A S commented on SPARK-18133: Started working on this. > Python ML Pipeline Example

[jira] [Resolved] (SPARK-17891) SQL-based three column join loses first column

2016-10-27 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-17891. - Resolution: Fixed > SQL-based three column join loses first column >

[jira] [Assigned] (SPARK-1267) Add a pip installer for PySpark

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-1267: --- Assignee: Apache Spark > Add a pip installer for PySpark > --- >

[jira] [Commented] (SPARK-18129) Sign pip artifacts

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611203#comment-15611203 ] Apache Spark commented on SPARK-18129: -- User 'holdenk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18129) Sign pip artifacts

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18129: Assignee: (was: Apache Spark) > Sign pip artifacts > -- > >

[jira] [Assigned] (SPARK-18133) Python ML Pipeline Example has syntax errors

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18133: Assignee: (was: Apache Spark) > Python ML Pipeline Example has syntax errors >

[jira] [Updated] (SPARK-17982) Spark 2.0.0 CREATE VIEW statement fails :: java.lang.RuntimeException: Failed to analyze the canonicalized SQL. It is possible there is a bug in Spark.

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17982: -- Target Version/s: (was: 2.0.2) > Spark 2.0.0 CREATE VIEW statement fails ::

[jira] [Commented] (SPARK-17436) dataframe.write sometimes does not keep sorting

2016-10-27 Thread Ran Haim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611457#comment-15611457 ] Ran Haim commented on SPARK-17436: -- Of course it does, every technology that supports partitioning

[jira] [Commented] (SPARK-17602) PySpark - Performance Optimization Large Size of Broadcast Variable

2016-10-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611611#comment-15611611 ] holdenk commented on SPARK-17602: - This certainly looks interesting, do you maybe have some code you

[jira] [Commented] (SPARK-18139) Dataset mapGroups with return typ Seq[Product] produces scala.ScalaReflectionException: object $line262.$read not found

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611674#comment-15611674 ] Sean Owen commented on SPARK-18139: --- I'm pretty sure this is just another instance of "case classes

[jira] [Updated] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-10-27 Thread Alok Bhandari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alok Bhandari updated SPARK-16473: -- Affects Version/s: 2.0.0 > BisectingKMeans Algorithm failing with

[jira] [Commented] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-10-27 Thread Alok Bhandari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611340#comment-15611340 ] Alok Bhandari commented on SPARK-16473: --- This issue continue to exist for spark 2.0 "ml" library.

[jira] [Updated] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-10-27 Thread Alok Bhandari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alok Bhandari updated SPARK-16473: -- Priority: Blocker (was: Major) > BisectingKMeans Algorithm failing with

[jira] [Updated] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-10-27 Thread Alok Bhandari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alok Bhandari updated SPARK-16473: -- Component/s: ML > BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key

[jira] [Updated] (SPARK-18127) Add hooks and extension points to Spark

2016-10-27 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18127: -- Target Version/s: 2.1.0 Description: As a Spark user I want to be able to

[jira] [Comment Edited] (SPARK-17436) dataframe.write sometimes does not keep sorting

2016-10-27 Thread Ran Haim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611457#comment-15611457 ] Ran Haim edited comment on SPARK-17436 at 10/27/16 10:22 AM: - Of course it

[jira] [Commented] (SPARK-18140) Parquet NPE / Update to 1.9

2016-10-27 Thread dori waldman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611691#comment-15611691 ] dori waldman commented on SPARK-18140: -- Is there any suggestion how to solve this issue now ? I cant

[jira] [Commented] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-10-27 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611712#comment-15611712 ] Herman van Hovell commented on SPARK-18134: --- Maps are not comparable. This makes them unusable

[jira] [Issue Comment Deleted] (SPARK-18135) Failed to access Spark UI and HistoryServer UI

2016-10-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Genmao Yu updated SPARK-18135: -- Comment: was deleted (was: [~srowen] I bulid the spark package by myself, the spark classpath contain

[jira] [Assigned] (SPARK-18133) Python ML Pipeline Example has syntax errors

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18133: Assignee: Apache Spark > Python ML Pipeline Example has syntax errors >

[jira] [Commented] (SPARK-18135) Failed to access Spark UI and HistoryServer UI

2016-10-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611344#comment-15611344 ] Genmao Yu commented on SPARK-18135: --- My fault, close it! > Failed to access Spark UI and HistoryServer

[jira] [Commented] (SPARK-13819) using a regexp_replace in a group by clause raises a nullpointerexception

2016-10-27 Thread zenglinxi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611589#comment-15611589 ] zenglinxi commented on SPARK-13819: --- We encountered the same problem, is there any progress? > using a

[jira] [Created] (SPARK-18138) Remove support for Python 2.6, Hadoop 2.6-, Java 7, and Scala 2.10

2016-10-27 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18138: --- Summary: Remove support for Python 2.6, Hadoop 2.6-, Java 7, and Scala 2.10 Key: SPARK-18138 URL: https://issues.apache.org/jira/browse/SPARK-18138 Project: Spark

[jira] [Comment Edited] (SPARK-17436) dataframe.write sometimes does not keep sorting

2016-10-27 Thread Ran Haim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611457#comment-15611457 ] Ran Haim edited comment on SPARK-17436 at 10/27/16 10:32 AM: - Of course it

[jira] [Commented] (SPARK-15042) ConnectedComponents fails to compute graph with 200 vertices (but long paths)

2016-10-27 Thread Jarno Rajala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611616#comment-15611616 ] Jarno Rajala commented on SPARK-15042: -- I was able to reproduce this issue with 2.1.0-snapshot

[jira] [Updated] (SPARK-18137) RewriteDistinctAggregates UnresolvedException when a UDAF has a foldable TypeCheck

2016-10-27 Thread Song Jun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Song Jun updated SPARK-18137: - Summary: RewriteDistinctAggregates UnresolvedException when a UDAF has a foldable TypeCheck (was:

[jira] [Updated] (SPARK-18140) Parquet NPE / Update to 1.9

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18140: -- Summary: Parquet NPE / Update to 1.9 (was: Spark - parquet - null pointer exception ) I think we

[jira] [Commented] (SPARK-17436) dataframe.write sometimes does not keep sorting

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611350#comment-15611350 ] Sean Owen commented on SPARK-17436: --- I don't think partitioning is supposed to maintain ordering. >

[jira] [Commented] (SPARK-17436) dataframe.write sometimes does not keep sorting

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611466#comment-15611466 ] Sean Owen commented on SPARK-17436: --- Yes, but a partitioning operation does not necessarily preserve

[jira] [Closed] (SPARK-18135) Failed to access Spark UI and HistoryServer UI

2016-10-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Genmao Yu closed SPARK-18135. - Resolution: Works for Me > Failed to access Spark UI and HistoryServer UI >

[jira] [Updated] (SPARK-18127) Add hooks and extension points to Spark

2016-10-27 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18127: -- Target Version/s: 2.2.0 (was: 2.1.0) > Add hooks and extension points to Spark >

[jira] [Updated] (SPARK-18137) RewriteDistinctAggregates UnresolvedException when a UDAF has a foldable TypeCheck

2016-10-27 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18137: -- Description: when run a sql with distinct(on spark github master branch), it throw

[jira] [Updated] (SPARK-18133) Python ML Pipeline Example has syntax errors

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18133: -- Target Version/s: (was: 2.0.2) > Python ML Pipeline Example has syntax errors >

[jira] [Updated] (SPARK-18054) Unexpected error from UDF that gets an element of a vector: argument 1 requires vector type, however, '`_column_`' is of vector type

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18054: -- Target Version/s: (was: 2.0.2) > Unexpected error from UDF that gets an element of a vector:

[jira] [Comment Edited] (SPARK-17436) dataframe.write sometimes does not keep sorting

2016-10-27 Thread Ran Haim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611483#comment-15611483 ] Ran Haim edited comment on SPARK-17436 at 10/27/16 10:35 AM: - Usually you

[jira] [Commented] (SPARK-17436) dataframe.write sometimes does not keep sorting

2016-10-27 Thread Ran Haim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611483#comment-15611483 ] Ran Haim commented on SPARK-17436: -- usually you partition the data, and then you order it - this way you

[jira] [Commented] (SPARK-18133) Python ML Pipeline Example has syntax errors

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611310#comment-15611310 ] Apache Spark commented on SPARK-18133: -- User 'jagadeesanas2' has created a pull request for this

[jira] [Commented] (SPARK-17436) dataframe.write sometimes does not keep sorting

2016-10-27 Thread Ran Haim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611331#comment-15611331 ] Ran Haim commented on SPARK-17436: -- anyone? > dataframe.write sometimes does not keep sorting >

[jira] [Updated] (SPARK-17770) Make ObjectType SQL Type Public

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17770: -- Assignee: Aleksander Eskilson > Make ObjectType SQL Type Public > --- > >

[jira] [Updated] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16473: -- Priority: Major (was: Blocker) > BisectingKMeans Algorithm failing with

[jira] [Issue Comment Deleted] (SPARK-18127) Add hooks and extension points to Spark

2016-10-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18127: -- Comment: was deleted (was: There's not much of any detail here. Can you elaborate or close?) > Add

[jira] [Updated] (SPARK-18127) Add hooks and extension points to Spark

2016-10-27 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18127: -- Description: As a Spark user I want to be able to customize my spark session. I

[jira] [Updated] (SPARK-18138) Remove support for Python 2.6, Hadoop 2.6-, Java 7, and Scala 2.10

2016-10-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18138: Priority: Blocker (was: Major) > Remove support for Python 2.6, Hadoop 2.6-, Java 7, and Scala

[jira] [Created] (SPARK-18139) Dataset mapGroups with return typ Seq[Product] produces scala.ScalaReflectionException: object $line262.$read not found

2016-10-27 Thread Zach Kull (JIRA)
Zach Kull created SPARK-18139: - Summary: Dataset mapGroups with return typ Seq[Product] produces scala.ScalaReflectionException: object $line262.$read not found Key: SPARK-18139 URL:

[jira] [Created] (SPARK-18140) Spark - parquet - null pointer exception

2016-10-27 Thread dori waldman (JIRA)
dori waldman created SPARK-18140: Summary: Spark - parquet - null pointer exception Key: SPARK-18140 URL: https://issues.apache.org/jira/browse/SPARK-18140 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-17822) JVMObjectTracker.objMap may leak JVM objects

2016-10-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-17822: - Target Version/s: 2.0.3, 2.1.0 (was: 2.0.2, 2.1.0) > JVMObjectTracker.objMap may leak JVM objects >

[jira] [Updated] (SPARK-17823) Make JVMObjectTracker.objMap thread-safe

2016-10-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-17823: - Target Version/s: 2.0.3, 2.1.0 (was: 2.0.2, 2.1.0) > Make JVMObjectTracker.objMap thread-safe >

[jira] [Assigned] (SPARK-18141) jdbc datasource read fails when quoted columns (eg:mixed case, reserved words) in source table are used in the filter.

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18141: Assignee: (was: Apache Spark) > jdbc datasource read fails when quoted columns

[jira] [Commented] (SPARK-18141) jdbc datasource read fails when quoted columns (eg:mixed case, reserved words) in source table are used in the filter.

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15612762#comment-15612762 ] Apache Spark commented on SPARK-18141: -- User 'sureshthalamati' has created a pull request for this

[jira] [Assigned] (SPARK-18141) jdbc datasource read fails when quoted columns (eg:mixed case, reserved words) in source table are used in the filter.

2016-10-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18141: Assignee: Apache Spark > jdbc datasource read fails when quoted columns (eg:mixed case,

[jira] [Updated] (SPARK-18144) StreamingQueryListener.QueryStartedEvent is not written to event log

2016-10-27 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18144: - Affects Version/s: 2.0.0 2.0.1 > StreamingQueryListener.QueryStartedEvent

[jira] [Created] (SPARK-18144) StreamingQueryListener.QueryStartedEvent is not written to event log

2016-10-27 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-18144: Summary: StreamingQueryListener.QueryStartedEvent is not written to event log Key: SPARK-18144 URL: https://issues.apache.org/jira/browse/SPARK-18144 Project: Spark

  1   2   >