[jira] [Assigned] (SPARK-7093) Using newPredicate in NestedLoopJoin to enable code generation

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7093: --- Assignee: Apache Spark Using newPredicate in NestedLoopJoin to enable code generation

[jira] [Assigned] (SPARK-7093) Using newPredicate in NestedLoopJoin to enable code generation

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7093: --- Assignee: (was: Apache Spark) Using newPredicate in NestedLoopJoin to enable code

[jira] [Commented] (SPARK-7093) Using newPredicate in NestedLoopJoin to enable code generation

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509136#comment-14509136 ] Apache Spark commented on SPARK-7093: - User 'scwf' has created a pull request for this

[jira] [Commented] (SPARK-6273) Got error when one table's alias name is the same with other table's column name

2015-04-23 Thread Shuai Zheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509164#comment-14509164 ] Shuai Zheng commented on SPARK-6273: I use 1.3.1, and I have similar issue. It is

[jira] [Comment Edited] (SPARK-6273) Got error when one table's alias name is the same with other table's column name

2015-04-23 Thread Shuai Zheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509164#comment-14509164 ] Shuai Zheng edited comment on SPARK-6273 at 4/23/15 2:49 PM: -

[jira] [Comment Edited] (SPARK-6273) Got error when one table's alias name is the same with other table's column name

2015-04-23 Thread Shuai Zheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509164#comment-14509164 ] Shuai Zheng edited comment on SPARK-6273 at 4/23/15 2:49 PM: -

[jira] [Created] (SPARK-7094) driver process will be suspend when driver network has down

2015-04-23 Thread yuemeng (JIRA)
yuemeng created SPARK-7094: -- Summary: driver process will be suspend when driver network has down Key: SPARK-7094 URL: https://issues.apache.org/jira/browse/SPARK-7094 Project: Spark Issue Type:

[jira] [Commented] (SPARK-6932) A Prototype of Parameter Server

2015-04-23 Thread Andy Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509142#comment-14509142 ] Andy Huang commented on SPARK-6932: --- [~mengxr] Here are the changes:

[jira] [Resolved] (SPARK-5252) Streaming StatefulNetworkWordCount example hangs

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5252. -- Resolution: Cannot Reproduce Streaming StatefulNetworkWordCount example hangs

[jira] [Updated] (SPARK-6924) driver hangs when net is broken

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6924: - Priority: Minor (was: Major) driver hangs when net is broken ---

[jira] [Updated] (SPARK-7094) driver process will be suspend when driver network has down

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7094: - Priority: Minor (was: Major) Target Version/s: (was: 1.2.0) Fix Version/s: (was:

[jira] [Resolved] (SPARK-7094) driver process will be suspend when driver network has down

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7094. -- Resolution: Duplicate driver process will be suspend when driver network has down

[jira] [Reopened] (SPARK-6924) driver hangs when net is broken

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-6924: -- Reopening because there is a PR now driver hangs when net is broken ---

[jira] [Commented] (SPARK-6067) Spark sql hive dynamic partitions job will fail if task fails

2015-04-23 Thread Jason Hubbard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509250#comment-14509250 ] Jason Hubbard commented on SPARK-6067: -- I was able to test this. I had problems

[jira] [Commented] (SPARK-6856) Make RDD information more useful in SparkR

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509399#comment-14509399 ] Apache Spark commented on SPARK-6856: - User 'His-name-is-Joof' has created a pull

[jira] [Assigned] (SPARK-6856) Make RDD information more useful in SparkR

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6856: --- Assignee: (was: Apache Spark) Make RDD information more useful in SparkR

[jira] [Assigned] (SPARK-6856) Make RDD information more useful in SparkR

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6856: --- Assignee: Apache Spark Make RDD information more useful in SparkR

[jira] [Updated] (SPARK-7044) [Spark SQL] query would hang when using scripts in SQL statement

2015-04-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7044: --- Description: Query with 'USING' operator like below would hang when using scripts in SQL statement

[jira] [Resolved] (SPARK-7044) [Spark SQL] query would hang when using scripts in SQL statement

2015-04-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7044. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Cheng Hao [Spark SQL] query would

[jira] [Updated] (SPARK-6921) Spark SQL API saveAsParquetFile will output tachyon file with different block size

2015-04-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6921: --- Priority: Critical (was: Blocker) Spark SQL API saveAsParquetFile will output tachyon file

[jira] [Created] (SPARK-7097) Partitioned tables should only consider referred partitions in query during size estimation for checking against autoBroadcastJoinThreshold

2015-04-23 Thread Yash Datta (JIRA)
Yash Datta created SPARK-7097: - Summary: Partitioned tables should only consider referred partitions in query during size estimation for checking against autoBroadcastJoinThreshold Key: SPARK-7097 URL:

[jira] [Commented] (SPARK-6921) Spark SQL API saveAsParquetFile will output tachyon file with different block size

2015-04-23 Thread zhangxiongfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509349#comment-14509349 ] zhangxiongfei commented on SPARK-6921: -- I think the root cause may be the following:

[jira] [Updated] (SPARK-5894) Add PolynomialMapper

2015-04-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5894: - Assignee: Xusen Yin Add PolynomialMapper Key: SPARK-5894

[jira] [Issue Comment Deleted] (SPARK-6921) Spark SQL API saveAsParquetFile will output tachyon file with different block size

2015-04-23 Thread zhangxiongfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangxiongfei updated SPARK-6921: - Comment: was deleted (was: I think the root cause may be the following: 1)When the

[jira] [Assigned] (SPARK-5553) Reimplement SQL binary type with more efficient data structure

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5553: --- Assignee: (was: Apache Spark) Reimplement SQL binary type with more efficient data

[jira] [Commented] (SPARK-5553) Reimplement SQL binary type with more efficient data structure

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509352#comment-14509352 ] Apache Spark commented on SPARK-5553: - User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-5553) Reimplement SQL binary type with more efficient data structure

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5553: --- Assignee: Apache Spark Reimplement SQL binary type with more efficient data structure

[jira] [Updated] (SPARK-6292) Add RDD methods to DataFrame to preserve schema

2015-04-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6292: --- Target Version/s: 1.5.0 (was: 1.4.0) Add RDD methods to DataFrame to preserve schema

[jira] [Created] (SPARK-7095) Pass DataType to source.Filter classes

2015-04-23 Thread Alex Liu (JIRA)
Alex Liu created SPARK-7095: --- Summary: Pass DataType to source.Filter classes Key: SPARK-7095 URL: https://issues.apache.org/jira/browse/SPARK-7095 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-7096) Java example for Streaming on site uses map instead of mapToPair

2015-04-23 Thread Edward Sargisson (JIRA)
Edward Sargisson created SPARK-7096: --- Summary: Java example for Streaming on site uses map instead of mapToPair Key: SPARK-7096 URL: https://issues.apache.org/jira/browse/SPARK-7096 Project: Spark

[jira] [Updated] (SPARK-6752) Allow StreamingContext to be recreated from checkpoint and existing SparkContext

2015-04-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-6752: - Fix Version/s: 1.4.0 Allow StreamingContext to be recreated from checkpoint and existing

[jira] [Resolved] (SPARK-6752) Allow StreamingContext to be recreated from checkpoint and existing SparkContext

2015-04-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-6752. -- Resolution: Fixed Allow StreamingContext to be recreated from checkpoint and existing

[jira] [Comment Edited] (SPARK-6921) Spark SQL API saveAsParquetFile will output tachyon file with different block size

2015-04-23 Thread zhangxiongfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509349#comment-14509349 ] zhangxiongfei edited comment on SPARK-6921 at 4/23/15 4:52 PM:

[jira] [Created] (SPARK-7093) Using newPredicate in NestedLoopJoin to enable code generation

2015-04-23 Thread Fei Wang (JIRA)
Fei Wang created SPARK-7093: --- Summary: Using newPredicate in NestedLoopJoin to enable code generation Key: SPARK-7093 URL: https://issues.apache.org/jira/browse/SPARK-7093 Project: Spark Issue

[jira] [Updated] (SPARK-7094) driver process will be suspend when driver network has down

2015-04-23 Thread yuemeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuemeng updated SPARK-7094: --- Description: Run a application with yarn-client mode base on spark on yarn.During the application is

[jira] [Commented] (SPARK-6999) infinite recursion with createDataFrame(JavaRDD[Row], java.util.List[String])

2015-04-23 Thread Justin Uang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509209#comment-14509209 ] Justin Uang commented on SPARK-6999: We might be able to use

[jira] [Updated] (SPARK-6781) sqlCtx - sqlContext in pyspark shell

2015-04-23 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6781: Issue Type: Task (was: Bug) sqlCtx - sqlContext in pyspark shell -

[jira] [Commented] (SPARK-7097) Partitioned tables should only consider referred partitions in query during size estimation for checking against autoBroadcastJoinThreshold

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509512#comment-14509512 ] Apache Spark commented on SPARK-7097: - User 'saucam' has created a pull request for

[jira] [Assigned] (SPARK-7097) Partitioned tables should only consider referred partitions in query during size estimation for checking against autoBroadcastJoinThreshold

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7097: --- Assignee: Apache Spark Partitioned tables should only consider referred partitions in query

[jira] [Assigned] (SPARK-7097) Partitioned tables should only consider referred partitions in query during size estimation for checking against autoBroadcastJoinThreshold

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7097: --- Assignee: (was: Apache Spark) Partitioned tables should only consider referred

[jira] [Resolved] (SPARK-7055) getContextOrSparkClassLoader is not used while loading JDBC driver class

2015-04-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7055. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5633

[jira] [Created] (SPARK-7099) Floating point literals cannot be specified using exponent

2015-04-23 Thread Peter Hagelund (JIRA)
Peter Hagelund created SPARK-7099: - Summary: Floating point literals cannot be specified using exponent Key: SPARK-7099 URL: https://issues.apache.org/jira/browse/SPARK-7099 Project: Spark

[jira] [Assigned] (SPARK-7100) GradientBoostTrees leaks a persisted RDD

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7100: --- Assignee: Apache Spark GradientBoostTrees leaks a persisted RDD

[jira] [Assigned] (SPARK-7100) GradientBoostTrees leaks a persisted RDD

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7100: --- Assignee: (was: Apache Spark) GradientBoostTrees leaks a persisted RDD

[jira] [Created] (SPARK-7102) update apache hosted graphx-programming-guide doc

2015-04-23 Thread Deborah Siegel (JIRA)
Deborah Siegel created SPARK-7102: - Summary: update apache hosted graphx-programming-guide doc Key: SPARK-7102 URL: https://issues.apache.org/jira/browse/SPARK-7102 Project: Spark Issue

[jira] [Comment Edited] (SPARK-6290) spark.ml.param.Params.checkInputColumn bug upon error

2015-04-23 Thread Glenn Weidner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509538#comment-14509538 ] Glenn Weidner edited comment on SPARK-6290 at 4/23/15 6:45 PM:

[jira] [Created] (SPARK-7098) Inconsistent Timestamp behavior when used in WHERE clause

2015-04-23 Thread Peter Hagelund (JIRA)
Peter Hagelund created SPARK-7098: - Summary: Inconsistent Timestamp behavior when used in WHERE clause Key: SPARK-7098 URL: https://issues.apache.org/jira/browse/SPARK-7098 Project: Spark

[jira] [Created] (SPARK-7100) GradientBoostTrees leaks a persisted RDD

2015-04-23 Thread Jim Carroll (JIRA)
Jim Carroll created SPARK-7100: -- Summary: GradientBoostTrees leaks a persisted RDD Key: SPARK-7100 URL: https://issues.apache.org/jira/browse/SPARK-7100 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-7100) GradientBoostTrees leaks a persisted RDD

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509625#comment-14509625 ] Apache Spark commented on SPARK-7100: - User 'jimfcarroll' has created a pull request

[jira] [Created] (SPARK-7101) Spark SQL should support java.sql.Time

2015-04-23 Thread Peter Hagelund (JIRA)
Peter Hagelund created SPARK-7101: - Summary: Spark SQL should support java.sql.Time Key: SPARK-7101 URL: https://issues.apache.org/jira/browse/SPARK-7101 Project: Spark Issue Type:

[jira] [Reopened] (SPARK-5252) Streaming StatefulNetworkWordCount example hangs

2015-04-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das reopened SPARK-5252: -- Streaming StatefulNetworkWordCount example hangs

[jira] [Issue Comment Deleted] (SPARK-6273) Got error when one table's alias name is the same with other table's column name

2015-04-23 Thread Shuai Zheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shuai Zheng updated SPARK-6273: --- Comment: was deleted (was: I use 1.3.1, and I have similar issue. It is still there. And I am using

[jira] [Updated] (SPARK-7085) Inconsistent default miniBatchFraction parameters in the train methods of RidgeRegression

2015-04-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7085: - Assignee: Nobuyuki Kuromatsu Inconsistent default miniBatchFraction parameters in the

[jira] [Resolved] (SPARK-7085) Inconsistent default miniBatchFraction parameters in the train methods of RidgeRegression

2015-04-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7085. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5658

[jira] [Resolved] (SPARK-7057) spark streaming kafka support multiline

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7057. -- Resolution: Invalid Target Version/s: (was: 1.4.0) spark streaming kafka support multiline

[jira] [Updated] (SPARK-6879) Check if the app is completed before clean it up

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6879: - Assignee: Tao Wang Check if the app is completed before clean it up

[jira] [Resolved] (SPARK-7029) Unable to use hive built-in functions in sparkSQL

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7029. -- Resolution: Invalid This is more of a question than valid issue report at this stage. It is likely

[jira] [Updated] (SPARK-7100) GradientBoostTrees leaks a persisted RDD

2015-04-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7100: - Target Version/s: 1.4.0 GradientBoostTrees leaks a persisted RDD

[jira] [Updated] (SPARK-7085) Inconsistent default miniBatchFraction parameters in the train methods of RidgeRegression

2015-04-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7085: - Target Version/s: 1.4.0 Inconsistent default miniBatchFraction parameters in the train

[jira] [Updated] (SPARK-7087) Scala Version Change script is dependent on current working directory

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7087: - Assignee: Tijo Thomas Scala Version Change script is dependent on current working directory

[jira] [Resolved] (SPARK-7087) Scala Version Change script is dependent on current working directory

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7087. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5656

[jira] [Updated] (SPARK-7099) Floating point literals cannot be specified using exponent

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7099: - Priority: Minor (was: Major) Dumb question, but is this commonly supported in SQL engines? Floating

[jira] [Updated] (SPARK-7051) Support Compression write for Parquet

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7051: - Issue Type: Improvement (was: Bug) Support Compression write for Parquet

[jira] [Created] (SPARK-7104) Support model save/load in Python's Word2Vec

2015-04-23 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7104: Summary: Support model save/load in Python's Word2Vec Key: SPARK-7104 URL: https://issues.apache.org/jira/browse/SPARK-7104 Project: Spark Issue

[jira] [Created] (SPARK-7103) SparkContext.union crashed when some RDDs have no partitioner

2015-04-23 Thread Steven She (JIRA)
Steven She created SPARK-7103: - Summary: SparkContext.union crashed when some RDDs have no partitioner Key: SPARK-7103 URL: https://issues.apache.org/jira/browse/SPARK-7103 Project: Spark Issue

[jira] [Resolved] (SPARK-7070) LDA.setBeta calls itself

2015-04-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7070. -- Resolution: Fixed Fix Version/s: 1.4.0 1.3.2 Issue resolved by pull

[jira] [Updated] (SPARK-7092) Update spark scala version to 2.11.6

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7092: - Priority: Minor (was: Major) Update spark scala version to 2.11.6

[jira] [Created] (SPARK-7106) Support model save/load in Python's FPGrowth

2015-04-23 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7106: Summary: Support model save/load in Python's FPGrowth Key: SPARK-7106 URL: https://issues.apache.org/jira/browse/SPARK-7106 Project: Spark Issue

[jira] [Commented] (SPARK-7049) File does not exist in checkpoint directory

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509913#comment-14509913 ] Sean Owen commented on SPARK-7049: -- Can you provide any info to reproduce this? there's

[jira] [Resolved] (SPARK-7102) update apache hosted graphx-programming-guide doc

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7102. -- Resolution: Not A Problem (You can just ask on u...@spark.apache.org) It is committed to master:

[jira] [Commented] (SPARK-7099) Floating point literals cannot be specified using exponent

2015-04-23 Thread Peter Hagelund (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509894#comment-14509894 ] Peter Hagelund commented on SPARK-7099: --- Yes, it's quite common in fact. It makes

[jira] [Updated] (SPARK-7086) Do not retry when public service start on port

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7086: - Priority: Minor (was: Major) Do not retry when public service start on port

[jira] [Created] (SPARK-7105) Support model save/load in Python's GaussianMixture

2015-04-23 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7105: Summary: Support model save/load in Python's GaussianMixture Key: SPARK-7105 URL: https://issues.apache.org/jira/browse/SPARK-7105 Project: Spark

[jira] [Commented] (SPARK-7103) SparkContext.union crashed when some RDDs have no partitioner

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509939#comment-14509939 ] Sean Owen commented on SPARK-7103: -- Looks like the check needs to be expanded. To this:

[jira] [Updated] (SPARK-6672) createDataFrame from RDD[Row] with UDTs cannot be saved

2015-04-23 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6672: Fix Version/s: 1.3.1 createDataFrame from RDD[Row] with UDTs cannot be saved

[jira] [Updated] (SPARK-7096) Java example for Streaming on site uses map instead of mapToPair

2015-04-23 Thread Edward Sargisson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Edward Sargisson updated SPARK-7096: Affects Version/s: (was: 1.1.0) 1.3.1 Java example for

[jira] [Closed] (SPARK-5553) Reimplement SQL binary type with more efficient data structure

2015-04-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-5553. -- Resolution: Later Let's close this for now since we will do it as part of

[jira] [Updated] (SPARK-7096) Java example for Streaming on site uses map instead of mapToPair

2015-04-23 Thread Edward Sargisson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Edward Sargisson updated SPARK-7096: Description: https://spark.apache.org/docs/latest/streaming-programming-guide.html Here

[jira] [Resolved] (SPARK-7096) Java example for Streaming on site uses map instead of mapToPair

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7096. -- Resolution: Duplicate Assignee: (was: Reynold Xin) Have a look at master in instances like

[jira] [Commented] (SPARK-6290) spark.ml.param.Params.checkInputColumn bug upon error

2015-04-23 Thread Glenn Weidner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14509538#comment-14509538 ] Glenn Weidner commented on SPARK-6290: -- Thank you Joseph for the quick reply and my

[jira] [Assigned] (SPARK-7109) Push down left side filter for left semi join

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7109: --- Assignee: (was: Apache Spark) Push down left side filter for left semi join

[jira] [Commented] (SPARK-7109) Push down left side filter for left semi join

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510417#comment-14510417 ] Apache Spark commented on SPARK-7109: - User 'scwf' has created a pull request for this

[jira] [Assigned] (SPARK-7109) Push down left side filter for left semi join

2015-04-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7109: --- Assignee: Apache Spark Push down left side filter for left semi join

[jira] [Created] (SPARK-7109) Push down left side filter for left semi join

2015-04-23 Thread Fei Wang (JIRA)
Fei Wang created SPARK-7109: --- Summary: Push down left side filter for left semi join Key: SPARK-7109 URL: https://issues.apache.org/jira/browse/SPARK-7109 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-7110) when use saveAsNewAPIHadoopFile, sometimes it throws Delegation Token can be issued only with kerberos or web authentication

2015-04-23 Thread gu-chi (JIRA)
gu-chi created SPARK-7110: - Summary: when use saveAsNewAPIHadoopFile, sometimes it throws Delegation Token can be issued only with kerberos or web authentication Key: SPARK-7110 URL:

[jira] [Commented] (SPARK-7110) when use saveAsNewAPIHadoopFile, sometimes it throws Delegation Token can be issued only with kerberos or web authentication

2015-04-23 Thread gu-chi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510420#comment-14510420 ] gu-chi commented on SPARK-7110: --- exception trace stack as below:

[jira] [Commented] (SPARK-7110) when use saveAsNewAPIHadoopFile, sometimes it throws Delegation Token can be issued only with kerberos or web authentication

2015-04-23 Thread gu-chi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510422#comment-14510422 ] gu-chi commented on SPARK-7110: --- As I searched the history patch, SPARK-1203 is showing the

[jira] [Updated] (SPARK-7100) GradientBoostTrees leaks a persisted RDD

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7100: - Priority: Minor (was: Major) Target Version/s: (was: 1.4.0) Fix Version/s: (was:

[jira] [Resolved] (SPARK-7058) Task deserialization time metric does not include time to deserialize broadcasted RDDs

2015-04-23 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-7058. --- Resolution: Fixed Fix Version/s: 1.4.0 Task deserialization time metric does not

[jira] [Resolved] (SPARK-7037) Inconsistent behavior for non-spark config properties in spark-shell and spark-submit

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7037. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5617

[jira] [Updated] (SPARK-7037) Inconsistent behavior for non-spark config properties in spark-shell and spark-submit

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7037: - Assignee: Cheolsoo Park Inconsistent behavior for non-spark config properties in spark-shell and

[jira] [Updated] (SPARK-7103) SparkContext.union crashed when some RDDs have no partitioner

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7103: - Component/s: Spark Core Priority: Minor (was: Major) SparkContext.union crashed when some RDDs

[jira] [Updated] (SPARK-6927) Sorting Error when codegen on

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6927: - Assignee: Chen Song Sorting Error when codegen on - Key:

[jira] [Updated] (SPARK-7055) getContextOrSparkClassLoader is not used while loading JDBC driver class

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7055: - Assignee: Vinod KC getContextOrSparkClassLoader is not used while loading JDBC driver class

[jira] [Updated] (SPARK-6881) Change the checkpoint directory name from checkpoints to checkpoint

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6881: - Assignee: Hao Change the checkpoint directory name from checkpoints to checkpoint

[jira] [Updated] (SPARK-6969) Refresh the cached table when REFRESH TABLE is used

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6969: - Assignee: Yin Huai Refresh the cached table when REFRESH TABLE is used

[jira] [Updated] (SPARK-6550) Add PreAnalyzer to keep logical plan consistent across DataFrame

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6550: - Assignee: Michael Armbrust Add PreAnalyzer to keep logical plan consistent across DataFrame

[jira] [Updated] (SPARK-6899) Type mismatch when using codegen

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6899: - Assignee: Liang-Chi Hsieh Type mismatch when using codegen

[jira] [Updated] (SPARK-6694) SparkSQL CLI must be able to specify an option --database on the command line.

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6694: - Assignee: Jin Adachi SparkSQL CLI must be able to specify an option --database on the command line.

[jira] [Updated] (SPARK-6647) Make trait StringComparison as BinaryPredicate and throw error when Predicate can't translate to data source Filter

2015-04-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6647: - Assignee: Liang-Chi Hsieh Make trait StringComparison as BinaryPredicate and throw error when Predicate

  1   2   3   >