[jira] [Resolved] (SPARK-7511) PySpark ML seed Param should be varied per class

2015-05-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7511. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6139 [https

[jira] [Assigned] (SPARK-7719) Java 6 code in UnsafeShuffleWriterSuite

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7719: --- Assignee: Josh Rosen (was: Apache Spark) > Java 6 code in UnsafeShuffleWriterSuite > ---

[jira] [Assigned] (SPARK-7719) Java 6 code in UnsafeShuffleWriterSuite

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7719: --- Assignee: Apache Spark (was: Josh Rosen) > Java 6 code in UnsafeShuffleWriterSuite > ---

[jira] [Commented] (SPARK-7565) Broken maps in jsonRDD

2015-05-20 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553216#comment-14553216 ] Davies Liu commented on SPARK-7565: --- [~tailhook] The patch is kind of workaround, it doe

[jira] [Commented] (SPARK-7719) Java 6 code in UnsafeShuffleWriterSuite

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553212#comment-14553212 ] Apache Spark commented on SPARK-7719: - User 'JoshRosen' has created a pull request for

[jira] [Assigned] (SPARK-7565) Broken maps in jsonRDD

2015-05-20 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-7565: - Assignee: Davies Liu > Broken maps in jsonRDD > -- > > Key: S

[jira] [Resolved] (SPARK-7769) How to represent a recursive data type in Spark SQL

2015-05-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7769. -- Resolution: Invalid Please ask questions at u...@spark.apache.org, not JIRA > How to represent a recurs

[jira] [Commented] (SPARK-6548) Adding stddev to DataFrame functions

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553191#comment-14553191 ] Apache Spark commented on SPARK-6548: - User 'JihongMA' has created a pull request for

[jira] [Assigned] (SPARK-7574) User guide update for OneVsRest

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7574: --- Assignee: Ram Sriharsha (was: Apache Spark) > User guide update for OneVsRest >

[jira] [Commented] (SPARK-7574) User guide update for OneVsRest

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553190#comment-14553190 ] Apache Spark commented on SPARK-7574: - User 'harsha2010' has created a pull request fo

[jira] [Assigned] (SPARK-7574) User guide update for OneVsRest

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7574: --- Assignee: Apache Spark (was: Ram Sriharsha) > User guide update for OneVsRest >

[jira] [Updated] (SPARK-7565) Broken maps in jsonRDD

2015-05-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-7565: Target Version/s: 1.4.0 > Broken maps in jsonRDD > -- > > Key: SPARK-756

[jira] [Updated] (SPARK-7565) Broken maps in jsonRDD

2015-05-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-7565: Priority: Blocker (was: Major) > Broken maps in jsonRDD > -- > > Key: S

[jira] [Commented] (SPARK-7600) Stopping Streaming Context (sometimes) crashes master

2015-05-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553178#comment-14553178 ] Josh Rosen commented on SPARK-7600: --- Is event logging enabled? This could be a duplicat

[jira] [Updated] (SPARK-7613) Serialization fails in pyspark for lambdas referencing class data members

2015-05-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7613: -- Description: The following code snippet works in pyspark 1.1.0, but fails post 1.2 with the indicated e

[jira] [Updated] (SPARK-7741) ContextCleaner not used by many DStream operations

2015-05-20 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7741: - Assignee: Andrew Or (was: Tathagata Das) > ContextCleaner not used by many DStream operations > -

[jira] [Created] (SPARK-7770) Should GBT validationTol be relative tolerance?

2015-05-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7770: Summary: Should GBT validationTol be relative tolerance? Key: SPARK-7770 URL: https://issues.apache.org/jira/browse/SPARK-7770 Project: Spark Issue T

[jira] [Created] (SPARK-7769) How to represent a recursive data type in Spark SQL

2015-05-20 Thread Jeremy A. Lucas (JIRA)
Jeremy A. Lucas created SPARK-7769: -- Summary: How to represent a recursive data type in Spark SQL Key: SPARK-7769 URL: https://issues.apache.org/jira/browse/SPARK-7769 Project: Spark Issue T

[jira] [Assigned] (SPARK-7606) Document all PySpark SQL/DataFrame public methods with @since tag

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7606: --- Assignee: (was: Apache Spark) > Document all PySpark SQL/DataFrame public methods with @s

[jira] [Commented] (SPARK-7606) Document all PySpark SQL/DataFrame public methods with @since tag

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553098#comment-14553098 ] Apache Spark commented on SPARK-7606: - User 'davies' has created a pull request for th

[jira] [Assigned] (SPARK-7606) Document all PySpark SQL/DataFrame public methods with @since tag

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7606: --- Assignee: Apache Spark > Document all PySpark SQL/DataFrame public methods with @since tag >

[jira] [Resolved] (SPARK-7757) mllib IndexedRowMatrix multiply IndexedRowMatrix

2015-05-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7757. -- Resolution: Done I'm resolving this, but if you think we need a larger discussion about

[jira] [Commented] (SPARK-5681) Calling graceful stop() immediately after start() on StreamingContext should not get stuck indefinitely

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553082#comment-14553082 ] Apache Spark commented on SPARK-5681: - User 'zsxwing' has created a pull request for t

[jira] [Commented] (SPARK-7757) mllib IndexedRowMatrix multiply IndexedRowMatrix

2015-05-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553080#comment-14553080 ] Joseph K. Bradley commented on SPARK-7757: -- This is supported by BlockMatrix, not

[jira] [Commented] (SPARK-7320) Add rollup and cube support to DataFrame DSL

2015-05-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553073#comment-14553073 ] Patrick Wendell commented on SPARK-7320: Hey [~liancheng] and [~chenghao] - I reve

[jira] [Commented] (SPARK-7724) Add support for Intersect and Except in Catalyst DSL

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553065#comment-14553065 ] Michael Armbrust commented on SPARK-7724: - If you make a patch feel free to reopen

[jira] [Comment Edited] (SPARK-7724) Add support for Intersect and Except in Catalyst DSL

2015-05-20 Thread Santiago M. Mola (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553059#comment-14553059 ] Santiago M. Mola edited comment on SPARK-7724 at 5/20/15 8:36 PM: --

[jira] [Commented] (SPARK-7724) Add support for Intersect and Except in Catalyst DSL

2015-05-20 Thread Santiago M. Mola (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553059#comment-14553059 ] Santiago M. Mola commented on SPARK-7724: - DataFrame is beyond the scope here. I d

[jira] [Updated] (SPARK-7768) Make user-defined type (UDT) API public

2015-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7768: - Priority: Critical (was: Major) > Make user-defined type (UDT) API public > -

[jira] [Created] (SPARK-7768) Make user-defined type (UDT) API public

2015-05-20 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7768: Summary: Make user-defined type (UDT) API public Key: SPARK-7768 URL: https://issues.apache.org/jira/browse/SPARK-7768 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-7760) Master & Worker json endpoints missing

2015-05-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553049#comment-14553049 ] Josh Rosen commented on SPARK-7760: --- I've added 1.4.0 as a target version so that this s

[jira] [Updated] (SPARK-7760) Master & Worker json endpoints missing

2015-05-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7760: -- Target Version/s: 1.4.0 > Master & Worker json endpoints missing > -

[jira] [Updated] (SPARK-7766) KryoSerializerInstance reuse is not safe when auto-reset is disabled

2015-05-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7766: -- Summary: KryoSerializerInstance reuse is not safe when auto-reset is disabled (was: KryoSerializerInsta

[jira] [Updated] (SPARK-7766) KryoSerializerInstance re-use is not safe when auto-reset is disabled

2015-05-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7766: -- Summary: KryoSerializerInstance re-use is not safe when auto-reset is disabled (was: KryoSerializerInst

[jira] [Assigned] (SPARK-7766) KryoSerializerInstance re-use is not safe when auto-flush is disabled

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7766: --- Assignee: Josh Rosen (was: Apache Spark) > KryoSerializerInstance re-use is not safe when au

[jira] [Commented] (SPARK-7766) KryoSerializerInstance re-use is not safe when auto-flush is disabled

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553043#comment-14553043 ] Apache Spark commented on SPARK-7766: - User 'JoshRosen' has created a pull request for

[jira] [Assigned] (SPARK-7766) KryoSerializerInstance re-use is not safe when auto-flush is disabled

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7766: --- Assignee: Apache Spark (was: Josh Rosen) > KryoSerializerInstance re-use is not safe when au

[jira] [Assigned] (SPARK-7767) Fail fast if the DStream checkpoint is not serializable

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7767: --- Assignee: Apache Spark (was: Tathagata Das) > Fail fast if the DStream checkpoint is not ser

[jira] [Assigned] (SPARK-7767) Fail fast if the DStream checkpoint is not serializable

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7767: --- Assignee: Tathagata Das (was: Apache Spark) > Fail fast if the DStream checkpoint is not ser

[jira] [Commented] (SPARK-7767) Fail fast if the DStream checkpoint is not serializable

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553035#comment-14553035 ] Apache Spark commented on SPARK-7767: - User 'tdas' has created a pull request for this

[jira] [Created] (SPARK-7767) Fail fast if the DStream checkpoint is not serializable

2015-05-20 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-7767: Summary: Fail fast if the DStream checkpoint is not serializable Key: SPARK-7767 URL: https://issues.apache.org/jira/browse/SPARK-7767 Project: Spark Issue T

[jira] [Created] (SPARK-7766) KryoSerializerInstance re-use is not safe when auto-flush is disabled

2015-05-20 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-7766: - Summary: KryoSerializerInstance re-use is not safe when auto-flush is disabled Key: SPARK-7766 URL: https://issues.apache.org/jira/browse/SPARK-7766 Project: Spark

[jira] [Resolved] (SPARK-7579) User guide update for OneHotEncoder

2015-05-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7579. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6126 [https

[jira] [Commented] (SPARK-6880) Spark Shutdowns with NoSuchElementException when running parallel collect on cachedRDD

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553008#comment-14553008 ] Apache Spark commented on SPARK-6880: - User 'markhamstra' has created a pull request f

[jira] [Commented] (SPARK-6980) Akka timeout exceptions indicate which conf controls them

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552997#comment-14552997 ] Apache Spark commented on SPARK-6980: - User 'hardmettle' has created a pull request fo

[jira] [Commented] (SPARK-5966) Spark-submit deploy-mode incorrectly affecting submission when master = local[4]

2015-05-20 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552988#comment-14552988 ] Tathagata Das commented on SPARK-5966: -- Yes, but its super non-intuitive error messag

[jira] [Resolved] (SPARK-7537) Audit new public Scala APIs for MLlib 1.4

2015-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7537. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6280 [https://githu

[jira] [Updated] (SPARK-7178) Improve DataFrame documentation and code samples

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7178: Target Version/s: 1.5.0 (was: 1.4.0) > Improve DataFrame documentation and code samples > -

[jira] [Updated] (SPARK-6956) Improve DataFrame API compatibility with Pandas

2015-05-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6956: --- Target Version/s: 1.5.0 (was: 1.4.0) > Improve DataFrame API compatibility with Pandas >

[jira] [Updated] (SPARK-5508) Arrays and Maps stored with Hive Parquet Serde may not be able to read by the Parquet support in the Data Souce API

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5508: Target Version/s: 1.5.0 (was: 1.3.2, 1.4.0) > Arrays and Maps stored with Hive Parquet Serd

[jira] [Updated] (SPARK-6831) Document how to use external data sources

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6831: Target Version/s: 1.5.0 (was: 1.4.0) > Document how to use external data sources >

[jira] [Updated] (SPARK-4131) Support "Writing data into the filesystem from queries"

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4131: Target Version/s: 1.5.0 (was: 1.4.0) > Support "Writing data into the filesystem from queri

[jira] [Updated] (SPARK-6964) Support Cancellation in the Thrift Server

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6964: Target Version/s: 1.5.0 (was: 1.4.0) > Support Cancellation in the Thrift Server >

[jira] [Updated] (SPARK-6775) Simplify CatalystConverter class hierarchy and pass in Parquet schema

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6775: Target Version/s: 1.5.0 (was: 1.4.0) > Simplify CatalystConverter class hierarchy and pass

[jira] [Updated] (SPARK-2973) Use LocalRelation for all ExecutedCommands, avoid job for take/collect()

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2973: Target Version/s: 1.5.0 (was: 1.4.0) > Use LocalRelation for all ExecutedCommands, avoid jo

[jira] [Updated] (SPARK-7394) Add Pandas style cast (astype)

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7394: Target Version/s: 1.5.0 (was: 1.4.0) > Add Pandas style cast (astype) > ---

[jira] [Updated] (SPARK-6774) Implement Parquet complex types backwards-compatiblity rules

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6774: Target Version/s: 1.5.0 (was: 1.4.0) > Implement Parquet complex types backwards-compatibli

[jira] [Updated] (SPARK-5295) Stabilize data types

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5295: Target Version/s: 1.5.0 (was: 1.4.0) > Stabilize data types > > >

[jira] [Updated] (SPARK-6189) Pandas to DataFrame conversion should check field names for periods

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6189: Target Version/s: 1.5.0 (was: 1.4.0) > Pandas to DataFrame conversion should check field na

[jira] [Updated] (SPARK-4485) Add broadcast outer join to optimize left outer join and right outer join

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4485: Target Version/s: 1.5.0 (was: 1.4.0) > Add broadcast outer join to optimize left outer joi

[jira] [Updated] (SPARK-6380) Resolution of equi-join key in post-join projection

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6380: Target Version/s: 1.5.0 (was: 1.4.0) > Resolution of equi-join key in post-join projection

[jira] [Updated] (SPARK-7200) Tungsten test suites should fail if memory leak is detected

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7200: Target Version/s: 1.5.0 (was: 1.4.0) > Tungsten test suites should fail if memory leak is d

[jira] [Updated] (SPARK-6777) Implement backwards-compatibility rules in Parquet schema converters

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6777: Target Version/s: 1.5.0 (was: 1.4.0) > Implement backwards-compatibility rules in Parquet s

[jira] [Updated] (SPARK-6795) Avoid reading Parquet footers on driver side when an global arbitrative schema is available

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6795: Target Version/s: 1.5.0 (was: 1.4.0) > Avoid reading Parquet footers on driver side when an

[jira] [Updated] (SPARK-6941) Provide a better error message to explain that tables created from RDDs are immutable

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6941: Target Version/s: 1.5.0 (was: 1.4.0) > Provide a better error message to explain that table

[jira] [Updated] (SPARK-6776) Implement backwards-compatibility rules in CatalystConverters

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6776: Target Version/s: 1.5.0 (was: 1.4.0) > Implement backwards-compatibility rules in CatalystC

[jira] [Updated] (SPARK-7158) collect and take return different results

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7158: Target Version/s: 1.5.0 (was: 1.4.0) > collect and take return different results >

[jira] [Updated] (SPARK-6923) Spark SQL CLI does not read Data Source schema correctly

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6923: Target Version/s: 1.5.0 (was: 1.4.0) > Spark SQL CLI does not read Data Source schema corre

[jira] [Updated] (SPARK-6759) Do not borrow/release a kryo instance for every value in a complex type value when doing serialization/deserialization in in-memory columnar store

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6759: Target Version/s: 1.5.0 (was: 1.4.0) > Do not borrow/release a kryo instance for every valu

[jira] [Updated] (SPARK-6914) DataFrame.withColumn should take metadata

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6914: Target Version/s: 1.5.0 (was: 1.4.0) > DataFrame.withColumn should take metadata >

[jira] [Updated] (SPARK-6411) PySpark DataFrames can't be created if any datetimes have timezones

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6411: Target Version/s: 1.5.0 (was: 1.4.0) > PySpark DataFrames can't be created if any datetimes

[jira] [Updated] (SPARK-6573) Convert inbound NaN values as null

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6573: Target Version/s: 1.5.0 (was: 1.4.0) > Convert inbound NaN values as null > ---

[jira] [Resolved] (SPARK-4689) Unioning 2 SchemaRDDs should return a SchemaRDD in Python, Scala, and Java

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4689. - Resolution: Fixed I think this is fixed by dataframes. Please reopen if you are still hav

[jira] [Resolved] (SPARK-7724) Add support for Intersect and Except in Catalyst DSL

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7724. - Resolution: Won't Fix These both exist in [dataframes|https://github.com/apache/spark/blo

[jira] [Commented] (SPARK-5966) Spark-submit deploy-mode incorrectly affecting submission when master = local[4]

2015-05-20 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552923#comment-14552923 ] Andrew Or commented on SPARK-5966: -- You're not supposed to run with cluster deploy mode u

[jira] [Closed] (SPARK-7644) Ensure all scoped RDD operations are tested and cleaned

2015-05-20 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7644. Resolution: Done Fix Version/s: 1.4.0 > Ensure all scoped RDD operations are tested and cleaned > ---

[jira] [Resolved] (SPARK-4331) SBT Scalastyle doesn't work for the sources under hive's v0.12.0 and v0.13.1

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4331. - Resolution: Not A Problem We are getting rid of the shim code, so I'm closing this issue.

[jira] [Closed] (SPARK-7627) DAG visualization: cached RDDs not shown on job page

2015-05-20 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7627. Resolution: Fixed Fix Version/s: 1.4.0 > DAG visualization: cached RDDs not shown on job page > -

[jira] [Closed] (SPARK-7472) DAG visualization: handle skipped stages differently

2015-05-20 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7472. Resolution: Fixed Fix Version/s: 1.4.0 > DAG visualization: handle skipped stages differently > -

[jira] [Resolved] (SPARK-6674) Use different types to represent rows used inside and outside Catalyst

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6674. - Resolution: Duplicate > Use different types to represent rows used inside and outside Cata

[jira] [Resolved] (SPARK-7564) performance bottleneck in SparkSQL using columnar storage

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7564. - Resolution: Duplicate > performance bottleneck in SparkSQL using columnar storage > --

[jira] [Resolved] (SPARK-5325) Simplifying Hive shim implementation

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5325. - Resolution: Not A Problem Obviated by isolated client loader. > Simplifying Hive shim imp

[jira] [Commented] (SPARK-7762) Set default value for outputCol based on UID

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552838#comment-14552838 ] Apache Spark commented on SPARK-7762: - User 'mengxr' has created a pull request for th

[jira] [Resolved] (SPARK-7713) Use shared broadcast hadoop conf for partitioned table scan.

2015-05-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-7713. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6252 [https://github.com/apac

[jira] [Closed] (SPARK-7765) Input vector should divide with the norm in Word2Vec's findSynonyms

2015-05-20 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-7765. -- Resolution: Duplicate > Input vector should divide with the norm in Word2Vec's findSynonyms > --

[jira] [Updated] (SPARK-1529) Support DFS based shuffle in addition to Netty shuffle

2015-05-20 Thread Kannan Rajah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kannan Rajah updated SPARK-1529: Description: In some environments, like with MapR, local volumes are accessed through the Hadoop fil

[jira] [Commented] (SPARK-7108) spark.local.dir is no longer honored in Standalone mode

2015-05-20 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552769#comment-14552769 ] Marcelo Vanzin commented on SPARK-7108: --- bq. This boils down to an issue around cle

[jira] [Resolved] (SPARK-6041) Compute shortest path for graph with edge distances

2015-05-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6041. -- Resolution: Won't Fix > Compute shortest path for graph with edge distances > --

[jira] [Updated] (SPARK-1529) Support DFS based shuffle in addition to Netty shuffle

2015-05-20 Thread Kannan Rajah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kannan Rajah updated SPARK-1529: Summary: Support DFS based shuffle in addition to Netty shuffle (was: Support setting spark.local.d

[jira] [Commented] (SPARK-7765) Input vector should divide with the norm in Word2Vec's findSynonyms

2015-05-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552755#comment-14552755 ] Sean Owen commented on SPARK-7765: -- This duplicates https://issues.apache.org/jira/browse

[jira] [Commented] (SPARK-7108) spark.local.dir is no longer honored in Standalone mode

2015-05-20 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552756#comment-14552756 ] Matt Cheah commented on SPARK-7108: --- Just wanted to add my two cents here. I've had seve

[jira] [Assigned] (SPARK-7765) Input vector should divide with the norm in Word2Vec's findSynonyms

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7765: --- Assignee: (was: Apache Spark) > Input vector should divide with the norm in Word2Vec's fi

[jira] [Assigned] (SPARK-7765) Input vector should divide with the norm in Word2Vec's findSynonyms

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7765: --- Assignee: Apache Spark > Input vector should divide with the norm in Word2Vec's findSynonyms

[jira] [Updated] (SPARK-7765) Input vector should divide with the norm in Word2Vec's findSynonyms

2015-05-20 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-7765: --- Summary: Input vector should divide with the norm in Word2Vec's findSynonyms (was: Vector sho

[jira] [Updated] (SPARK-7765) Vector should divide with the norm in Word2Vec's findSynonyms

2015-05-20 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-7765: --- Summary: Vector should divide with the norm in Word2Vec's findSynonyms (was: Given vector sho

[jira] [Commented] (SPARK-7765) Input vector should divide with the norm in Word2Vec's findSynonyms

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552749#comment-14552749 ] Apache Spark commented on SPARK-7765: - User 'viirya' has created a pull request for th

[jira] [Created] (SPARK-7765) Given vector should divide with the norm in Word2Vec's findSynonyms

2015-05-20 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-7765: -- Summary: Given vector should divide with the norm in Word2Vec's findSynonyms Key: SPARK-7765 URL: https://issues.apache.org/jira/browse/SPARK-7765 Project: Spark

[jira] [Commented] (SPARK-7749) Parquet metastore conversion does not use metastore cache

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552734#comment-14552734 ] Apache Spark commented on SPARK-7749: - User 'liancheng' has created a pull request for

[jira] [Assigned] (SPARK-7749) Parquet metastore conversion does not use metastore cache

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7749: --- Assignee: Apache Spark (was: Cheng Lian) > Parquet metastore conversion does not use metasto

[jira] [Assigned] (SPARK-7749) Parquet metastore conversion does not use metastore cache

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7749: --- Assignee: Cheng Lian (was: Apache Spark) > Parquet metastore conversion does not use metasto

<    1   2   3   >