[jira] [Updated] (SPARK-5180) Data source API improvement

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5180: Target Version/s: 1.4.0 (was: 1.3.0) Data source API improvement

[jira] [Updated] (SPARK-4768) Add Support For Impala Encoded Timestamp (INT96)

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4768: Priority: Blocker (was: Critical) Add Support For Impala Encoded Timestamp (INT96)

[jira] [Updated] (SPARK-4768) Add Support For Impala Encoded Timestamp (INT96)

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4768: Assignee: Yin Huai Add Support For Impala Encoded Timestamp (INT96)

[jira] [Updated] (SPARK-3851) Support for reading parquet files with different but compatible schema

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3851: Priority: Blocker (was: Critical) Support for reading parquet files with different but

[jira] [Updated] (SPARK-5425) ConcurrentModificationException during SparkConf creation

2015-02-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5425: -- Target Version/s: 1.2.2 I've merged [~jlewandowski]'s patch (https://github.com/apache/spark/pull/4222)

[jira] [Commented] (SPARK-5534) EdgeRDD, VertexRDD getStorageLevel return bad values

2015-02-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302043#comment-14302043 ] Joseph K. Bradley commented on SPARK-5534: -- Note: This is needed for

[jira] [Assigned] (SPARK-5534) EdgeRDD, VertexRDD getStorageLevel return bad values

2015-02-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-5534: Assignee: Joseph K. Bradley EdgeRDD, VertexRDD getStorageLevel return bad values

[jira] [Commented] (SPARK-5505) ConsumerRebalanceFailedException from Kafka consumer

2015-02-02 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302062#comment-14302062 ] Tathagata Das commented on SPARK-5505: -- Since this is a problem with the HighLevel

[jira] [Updated] (SPARK-5514) collect should call executeCollect

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5514: Assignee: Reynold Xin collect should call executeCollect

[jira] [Resolved] (SPARK-5491) Chi-square feature selection

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5491. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 1484

[jira] [Closed] (SPARK-5437) DriverSuite and SparkSubmitSuite incorrect timeout behavior

2015-02-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5437. Resolution: Fixed Fix Version/s: 1.3.0 Target Version/s: 1.3.0 DriverSuite and

[jira] [Updated] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5388: - Summary: Provide a stable application submission gateway in standalone cluster mode (was: Provide a

[jira] [Updated] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5388: - Description: The existing submission gateway in standalone mode is not compatible across Spark versions.

[jira] [Commented] (SPARK-5226) Add DBSCAN Clustering Algorithm to MLlib

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14301792#comment-14301792 ] Xiangrui Meng commented on SPARK-5226: -- [~alitouka] Thanks for implementing DBSCAN on

[jira] [Updated] (SPARK-4523) Improve handling of serialized schema information

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4523: Priority: Critical (was: Blocker) Improve handling of serialized schema information

[jira] [Updated] (SPARK-3851) Support for reading parquet files with different but compatible schema

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3851: Assignee: Cheng Lian Support for reading parquet files with different but compatible

[jira] [Updated] (SPARK-3575) Hive Schema is ignored when using convertMetastoreParquet

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3575: Priority: Blocker (was: Critical) Hive Schema is ignored when using

[jira] [Commented] (SPARK-3039) Spark assembly for new hadoop API (hadoop 2) contains avro-mapred for hadoop 1 API

2015-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302027#comment-14302027 ] Apache Spark commented on SPARK-3039: - User 'medale' has created a pull request for

[jira] [Updated] (SPARK-4497) HiveThriftServer2 does not exit properly on failure

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4497: Target Version/s: 1.4.0 (was: 1.3.0) HiveThriftServer2 does not exit properly on failure

[jira] [Updated] (SPARK-5530) ApplicationMaster can't kill executor when using dynamicAllocation

2015-02-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5530: - Affects Version/s: 1.3.0 ApplicationMaster can't kill executor when using dynamicAllocation

[jira] [Commented] (SPARK-5514) collect should call executeCollect

2015-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14301953#comment-14301953 ] Apache Spark commented on SPARK-5514: - User 'rxin' has created a pull request for this

[jira] [Updated] (SPARK-5501) Write support for the data source API

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5501: Assignee: Yin Huai Write support for the data source API

[jira] [Updated] (SPARK-5463) Fix Parquet filter push-down

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5463: Assignee: Cheng Lian Fix Parquet filter push-down

[jira] [Updated] (SPARK-5532) Repartitioning DataFrame causes saveAsParquetFile to fail with VectorUDT

2015-02-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5532: - Description: Deterministic failure: {code} import org.apache.spark.mllib.linalg._ import

[jira] [Updated] (SPARK-5463) Fix Parquet filter push-down

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5463: Priority: Blocker (was: Critical) Fix Parquet filter push-down

[jira] [Resolved] (SPARK-5184) Improve the performance of metadata operations

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5184. - Resolution: Won't Fix Improve the performance of metadata operations

[jira] [Comment Edited] (SPARK-3039) Spark assembly for new hadoop API (hadoop 2) contains avro-mapred for hadoop 1 API

2015-02-02 Thread Markus Dale (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14301780#comment-14301780 ] Markus Dale edited comment on SPARK-3039 at 2/2/15 8:40 PM:

[jira] [Commented] (SPARK-3039) Spark assembly for new hadoop API (hadoop 2) contains avro-mapred for hadoop 1 API

2015-02-02 Thread Markus Dale (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14301780#comment-14301780 ] Markus Dale commented on SPARK-3039: For me, Spark 1.2.0 either downloading

[jira] [Assigned] (SPARK-5518) Error messages for plans with invalid AttributeReferences

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reassigned SPARK-5518: --- Assignee: Michael Armbrust Error messages for plans with invalid

[jira] [Updated] (SPARK-3267) Deadlock between ScalaReflectionLock and Data type initialization

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3267: Target Version/s: 1.4.0 (was: 1.3.0) Deadlock between ScalaReflectionLock and Data type

[jira] [Updated] (SPARK-5258) Clean up exposed classes in sql.hive package

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5258: Priority: Blocker (was: Major) Clean up exposed classes in sql.hive package

[jira] [Created] (SPARK-5534) EdgeRDD, VertexRDD getStorageLevel return bad values

2015-02-02 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5534: Summary: EdgeRDD, VertexRDD getStorageLevel return bad values Key: SPARK-5534 URL: https://issues.apache.org/jira/browse/SPARK-5534 Project: Spark

[jira] [Created] (SPARK-5531) Spark download .tgz file does not get unpacked

2015-02-02 Thread DeepakVohra (JIRA)
DeepakVohra created SPARK-5531: -- Summary: Spark download .tgz file does not get unpacked Key: SPARK-5531 URL: https://issues.apache.org/jira/browse/SPARK-5531 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-4811) Custom UDTFs not working in Spark SQL

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4811: Target Version/s: 1.4.0 (was: 1.3.0) Custom UDTFs not working in Spark SQL

[jira] [Created] (SPARK-5532) Repartitioning DataFrame causes saveAsParquetFile to fail with VectorUDT

2015-02-02 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5532: Summary: Repartitioning DataFrame causes saveAsParquetFile to fail with VectorUDT Key: SPARK-5532 URL: https://issues.apache.org/jira/browse/SPARK-5532

[jira] [Updated] (SPARK-4553) query for parquet table with string fields in spark sql hive get binary result

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4553: Assignee: Cheng Lian query for parquet table with string fields in spark sql hive get

[jira] [Updated] (SPARK-4553) query for parquet table with string fields in spark sql hive get binary result

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4553: Priority: Blocker (was: Major) query for parquet table with string fields in spark sql

[jira] [Closed] (SPARK-4585) Spark dynamic executor allocation shouldn't use maxExecutors as initial number

2015-02-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4585. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Sandy Ryza Target Version/s:

[jira] [Updated] (SPARK-5425) ConcurrentModificationException during SparkConf creation

2015-02-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5425: -- Assignee: Jacek Lewandowski ConcurrentModificationException during SparkConf creation

[jira] [Updated] (SPARK-5425) ConcurrentModificationException during SparkConf creation

2015-02-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5425: -- Fix Version/s: 1.1.2 1.3.0 ConcurrentModificationException during SparkConf

[jira] [Updated] (SPARK-4986) Graceful shutdown for Spark Streaming does not work in Standalone cluster mode

2015-02-02 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-4986: - Priority: Blocker (was: Major) Graceful shutdown for Spark Streaming does not work in

[jira] [Updated] (SPARK-4986) Graceful shutdown for Spark Streaming does not work in Standalone cluster mode

2015-02-02 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-4986: - Fix Version/s: (was: 1.2.1) Graceful shutdown for Spark Streaming does not work in

[jira] [Updated] (SPARK-5027) add SVMWithLBFGS interface in MLLIB

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5027: - Assignee: zhengbing li add SVMWithLBFGS interface in MLLIB ---

[jira] [Updated] (SPARK-2206) Automatically infer the number of classification classes in multiclass classification

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2206: - Target Version/s: 1.4.0 (was: 1.3.0) Automatically infer the number of classification classes

[jira] [Updated] (SPARK-5027) add SVMWithLBFGS interface in MLLIB

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5027: - Target Version/s: 1.4.0 (was: 1.3.0) add SVMWithLBFGS interface in MLLIB

[jira] [Created] (SPARK-5536) Wrap the old ALS to use the new ALS implementation.

2015-02-02 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5536: Summary: Wrap the old ALS to use the new ALS implementation. Key: SPARK-5536 URL: https://issues.apache.org/jira/browse/SPARK-5536 Project: Spark Issue

[jira] [Commented] (SPARK-5542) Decouple publishing, packaging, and tagging in release script

2015-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302578#comment-14302578 ] Apache Spark commented on SPARK-5542: - User 'pwendell' has created a pull request for

[jira] [Resolved] (SPARK-3883) Provide SSL support for Akka and HttpServer based connections

2015-02-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-3883. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3571

[jira] [Updated] (SPARK-5532) Repartitioning DataFrame causes saveAsParquetFile to fail with VectorUDT

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5532: - Assignee: Cheng Lian Repartitioning DataFrame causes saveAsParquetFile to fail with VectorUDT

[jira] [Commented] (SPARK-3778) newAPIHadoopRDD doesn't properly pass credentials for secure hdfs on yarn

2015-02-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302290#comment-14302290 ] Patrick Wendell commented on SPARK-3778: /cc [~hshreedharan] newAPIHadoopRDD

[jira] [Updated] (SPARK-3778) newAPIHadoopRDD doesn't properly pass credentials for secure hdfs on yarn

2015-02-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3778: --- Priority: Blocker (was: Critical) newAPIHadoopRDD doesn't properly pass credentials for

[jira] [Updated] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4550: --- Target Version/s: 1.4.0 In sort-based shuffle, store map outputs in serialized form

[jira] [Updated] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4550: --- Priority: Critical (was: Major) In sort-based shuffle, store map outputs in serialized form

[jira] [Created] (SPARK-5537) Expand user guide for multinomial logistic regression

2015-02-02 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5537: Summary: Expand user guide for multinomial logistic regression Key: SPARK-5537 URL: https://issues.apache.org/jira/browse/SPARK-5537 Project: Spark Issue

[jira] [Resolved] (SPARK-4508) Native Date type for SQL92 Date

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4508. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3732

[jira] [Commented] (SPARK-5540) Hide ALS.solveLeastSquares.

2015-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302495#comment-14302495 ] Apache Spark commented on SPARK-5540: - User 'mengxr' has created a pull request for

[jira] [Resolved] (SPARK-5513) Add NMF option to the new ALS implementation

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5513. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4302

[jira] [Updated] (SPARK-5231) History Server shows wrong job submission time.

2015-02-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5231: -- Target Version/s: 1.3.0, 1.2.2 (was: 1.3.0) History Server shows wrong job submission time.

[jira] [Updated] (SPARK-5195) when hive table is query with alias the cache data lose effectiveness.

2015-02-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5195: --- Fix Version/s: (was: 1.2.1) when hive table is query with alias the cache data lose

[jira] [Updated] (SPARK-5231) History Server shows wrong job submission time.

2015-02-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5231: -- Labels: backport-needed (was: ) History Server shows wrong job submission time.

[jira] [Updated] (SPARK-5454) [SQL] Self join with ArrayType columns problems

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5454: Priority: Blocker (was: Major) [SQL] Self join with ArrayType columns problems

[jira] [Resolved] (SPARK-5514) collect should call executeCollect

2015-02-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5514. Resolution: Fixed Fix Version/s: 1.3.0 collect should call executeCollect

[jira] [Updated] (SPARK-4508) Native Date type for SQL92 Date

2015-02-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4508: --- Fix Version/s: (was: 1.3.0) Native Date type for SQL92 Date

[jira] [Created] (SPARK-5544) wholeTextFiles should recognize multiple input paths delimited by ,

2015-02-02 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5544: Summary: wholeTextFiles should recognize multiple input paths delimited by , Key: SPARK-5544 URL: https://issues.apache.org/jira/browse/SPARK-5544 Project: Spark

[jira] [Resolved] (SPARK-5500) Document that feeding hadoopFile into a shuffle operation will cause problems

2015-02-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5500. Resolution: Fixed Fix Version/s: 1.3.0 Document that feeding hadoopFile into a shuffle

[jira] [Resolved] (SPARK-2309) Generalize the binary logistic regression into multinomial logistic regression

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2309. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3833

[jira] [Updated] (SPARK-4980) Add decay factors to streaming linear methods

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4980: - Target Version/s: 1.4.0 (was: 1.3.0) Add decay factors to streaming linear methods

[jira] [Updated] (SPARK-5520) Make FP-Growth implementation take generic item types

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5520: - Priority: Critical (was: Major) Make FP-Growth implementation take generic item types

[jira] [Updated] (SPARK-4526) Gradient should be added batch computing interface

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4526: - Assignee: Guoqiang Li Gradient should be added batch computing interface

[jira] [Updated] (SPARK-4526) Gradient should be added batch computing interface

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4526: - Target Version/s: 1.4.0 (was: 1.3.0) Gradient should be added batch computing interface

[jira] [Updated] (SPARK-2309) Generalize the binary logistic regression into multinomial logistic regression

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2309: - Priority: Critical (was: Major) Generalize the binary logistic regression into multinomial

[jira] [Created] (SPARK-5539) User guide for LDA

2015-02-02 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5539: Summary: User guide for LDA Key: SPARK-5539 URL: https://issues.apache.org/jira/browse/SPARK-5539 Project: Spark Issue Type: Documentation

[jira] [Commented] (SPARK-2005) Investigate linux container-based solution

2015-02-02 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302554#comment-14302554 ] Nicholas Chammas commented on SPARK-2005: - [~mengxr] - Do you mind if I renamed

[jira] [Updated] (SPARK-5454) [SQL] Self join with ArrayType columns problems

2015-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5454: Target Version/s: 1.3.0 [SQL] Self join with ArrayType columns problems

[jira] [Resolved] (SPARK-5534) EdgeRDD, VertexRDD getStorageLevel return bad values

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5534. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4317

[jira] [Updated] (SPARK-1406) PMML model evaluation support via MLib

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1406: - Target Version/s: 1.4.0 (was: 1.3.0) PMML model evaluation support via MLib

[jira] [Updated] (SPARK-5520) Make FP-Growth implementation take generic item types

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5520: - Assignee: Jacky Li Make FP-Growth implementation take generic item types

[jira] [Updated] (SPARK-5519) Add user guide for FP-Growth

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5519: - Assignee: Jacky Li Add user guide for FP-Growth

[jira] [Updated] (SPARK-4285) Transpose RDD[Vector] to column store for ML

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4285: - Target Version/s: 1.4.0 (was: 1.3.0) Transpose RDD[Vector] to column store for ML

[jira] [Updated] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-02 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-4550: -- Attachment: SPARK-4550-design-v1.pdf In sort-based shuffle, store map outputs in serialized form

[jira] [Closed] (SPARK-3505) Augmenting SparkStreaming updateStateByKey API with timestamp

2015-02-02 Thread Xi Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xi Liu closed SPARK-3505. - Resolution: Won't Fix Close this issue for now. Will re-open later when I find time to work on it. Augmenting

[jira] [Created] (SPARK-5535) Add parameter for storage levels.

2015-02-02 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5535: Summary: Add parameter for storage levels. Key: SPARK-5535 URL: https://issues.apache.org/jira/browse/SPARK-5535 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-5538) CachedTableSuite failure due to unpersisting RDDs in a non-blocking way

2015-02-02 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-5538: - Summary: CachedTableSuite failure due to unpersisting RDDs in a non-blocking way Key: SPARK-5538 URL: https://issues.apache.org/jira/browse/SPARK-5538 Project: Spark

[jira] [Created] (SPARK-5540) Hide ALS.solveLeastSquares.

2015-02-02 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5540: Summary: Hide ALS.solveLeastSquares. Key: SPARK-5540 URL: https://issues.apache.org/jira/browse/SPARK-5540 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-5541) Allow running Maven or SBT in the Spark build

2015-02-02 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-5541: -- Summary: Allow running Maven or SBT in the Spark build Key: SPARK-5541 URL: https://issues.apache.org/jira/browse/SPARK-5541 Project: Spark Issue Type:

[jira] [Updated] (SPARK-5541) Allow running Maven or SBT in run-tests

2015-02-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5541: --- Summary: Allow running Maven or SBT in run-tests (was: Allow running Maven or SBT in the

[jira] [Reopened] (SPARK-4508) Native Date type for SQL92 Date

2015-02-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-4508: This has caused several date-related test failures in the master and pull request builds, so

[jira] [Commented] (SPARK-5536) Wrap the old ALS to use the new ALS implementation.

2015-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302653#comment-14302653 ] Apache Spark commented on SPARK-5536: - User 'mengxr' has created a pull request for

[jira] [Reopened] (SPARK-4986) Graceful shutdown for Spark Streaming does not work in Standalone cluster mode

2015-02-02 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das reopened SPARK-4986: -- Graceful shutdown for Spark Streaming does not work in Standalone cluster mode

[jira] [Updated] (SPARK-4588) Add API for feature attributes

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4588: - Target Version/s: 1.4.0 (was: 1.3.0) Add API for feature attributes

[jira] [Commented] (SPARK-5534) EdgeRDD, VertexRDD getStorageLevel return bad values

2015-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302285#comment-14302285 ] Apache Spark commented on SPARK-5534: - User 'jkbradley' has created a pull request for

[jira] [Commented] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302326#comment-14302326 ] Patrick Wendell commented on SPARK-4550: Yeah, this is a good idea. I don't see

[jira] [Commented] (SPARK-5131) A typo in configuration doc

2015-02-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302342#comment-14302342 ] Sean Owen commented on SPARK-5131: -- Sometimes site changes reflect changes not in the

[jira] [Created] (SPARK-5543) Remove unused import JsonUtil from from org.apache.spark.util.JsonProtocol.scala which fails builds with older versions of hadoop-core

2015-02-02 Thread Nathan M (JIRA)
Nathan M created SPARK-5543: --- Summary: Remove unused import JsonUtil from from org.apache.spark.util.JsonProtocol.scala which fails builds with older versions of hadoop-core Key: SPARK-5543 URL:

[jira] [Updated] (SPARK-3883) Provide SSL support for Akka and HttpServer based connections

2015-02-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-3883: -- Assignee: Jacek Lewandowski Provide SSL support for Akka and HttpServer based connections

[jira] [Commented] (SPARK-5543) Remove unused import JsonUtil from from org.apache.spark.util.JsonProtocol.scala which fails builds with older versions of hadoop-core

2015-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302641#comment-14302641 ] Apache Spark commented on SPARK-5543: - User 'nemccarthy' has created a pull request

[jira] [Resolved] (SPARK-5461) Graph should have isCheckpointed, getCheckpointFiles methods

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5461. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4253

[jira] [Updated] (SPARK-5532) Repartitioning DataFrame causes saveAsParquetFile to fail with VectorUDT

2015-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5532: - Priority: Critical (was: Blocker) Repartitioning DataFrame causes saveAsParquetFile to fail

[jira] [Commented] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-02 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302303#comment-14302303 ] Sandy Ryza commented on SPARK-4550: --- Just posted a design doc. Would love to get

[jira] [Commented] (SPARK-5541) Allow running Maven or SBT in run-tests

2015-02-02 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302525#comment-14302525 ] Nicholas Chammas commented on SPARK-5541: - Dup of SPARK-3355? Allow running

  1   2   >