[jira] [Updated] (SPARK-7537) Audit new public Scala APIs for MLlib 1.4

2015-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7537: - Issue Type: Umbrella (was: Sub-task) Parent: (was: SPARK-7443) Audit new public

[jira] [Created] (SPARK-7752) NaiveBayes.modelType should use lowercase letters

2015-05-20 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7752: Summary: NaiveBayes.modelType should use lowercase letters Key: SPARK-7752 URL: https://issues.apache.org/jira/browse/SPARK-7752 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-6094) Add MultilabelMetrics in PySpark/MLlib

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6094: --- Assignee: (was: Apache Spark) Add MultilabelMetrics in PySpark/MLlib

[jira] [Commented] (SPARK-6094) Add MultilabelMetrics in PySpark/MLlib

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551870#comment-14551870 ] Apache Spark commented on SPARK-6094: - User 'yanboliang' has created a pull request

[jira] [Assigned] (SPARK-6094) Add MultilabelMetrics in PySpark/MLlib

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6094: --- Assignee: Apache Spark Add MultilabelMetrics in PySpark/MLlib

[jira] [Comment Edited] (SPARK-7537) Audit new public Scala APIs for MLlib 1.4

2015-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551863#comment-14551863 ] Xiangrui Meng edited comment on SPARK-7537 at 5/20/15 6:05 AM:

[jira] [Commented] (SPARK-7752) NaiveBayes.modelType should use lowercase letters

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551917#comment-14551917 ] Apache Spark commented on SPARK-7752: - User 'mengxr' has created a pull request for

[jira] [Assigned] (SPARK-7711) startTime() is missing

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7711: --- Assignee: (was: Apache Spark) startTime() is missing --

[jira] [Commented] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2015-05-20 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551898#comment-14551898 ] Sandy Ryza commented on SPARK-4352: --- I don't think we should kill executors in order to

[jira] [Resolved] (SPARK-6333) saveAsObjectFile support for compression codec

2015-05-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6333. -- Resolution: Won't Fix saveAsObjectFile support for compression codec

[jira] [Commented] (SPARK-7711) startTime() is missing

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551931#comment-14551931 ] Apache Spark commented on SPARK-7711: - User 'holdenk' has created a pull request for

[jira] [Assigned] (SPARK-7711) startTime() is missing

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7711: --- Assignee: Apache Spark startTime() is missing --

[jira] [Resolved] (SPARK-2445) MesosExecutorBackend crashes in fine grained mode

2015-05-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2445. -- Resolution: Duplicate MesosExecutorBackend crashes in fine grained mode

[jira] [Resolved] (SPARK-7663) [MLLIB] feature.Word2Vec throws empty iterator error when the vocabulary size is zero

2015-05-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7663. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6228

[jira] [Created] (SPARK-7756) Ensure Spark runs clean on IBM Java implementation

2015-05-20 Thread Tim Ellison (JIRA)
Tim Ellison created SPARK-7756: -- Summary: Ensure Spark runs clean on IBM Java implementation Key: SPARK-7756 URL: https://issues.apache.org/jira/browse/SPARK-7756 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-7755) MetadataCache.refresh does not take into account _SUCCESS

2015-05-20 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552120#comment-14552120 ] Cheng Lian commented on SPARK-7755: --- Thanks for reporting, would you mind elaborating

[jira] [Commented] (SPARK-7756) Ensure Spark runs clean on IBM Java implementation

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552158#comment-14552158 ] Apache Spark commented on SPARK-7756: - User 'tellison' has created a pull request for

[jira] [Comment Edited] (SPARK-6981) [SQL] SparkPlanner and QueryExecution should be factored out from SQLContext

2015-05-20 Thread Edoardo Vacchi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552223#comment-14552223 ] Edoardo Vacchi edited comment on SPARK-6981 at 5/20/15 12:09 PM:

[jira] [Comment Edited] (SPARK-7755) MetadataCache.refresh does not take into account _SUCCESS

2015-05-20 Thread Rowan Chattaway (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552173#comment-14552173 ] Rowan Chattaway edited comment on SPARK-7755 at 5/20/15 12:00 PM:

[jira] [Assigned] (SPARK-7756) Ensure Spark runs clean on IBM Java implementation

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7756: --- Assignee: (was: Apache Spark) Ensure Spark runs clean on IBM Java implementation

[jira] [Assigned] (SPARK-7756) Ensure Spark runs clean on IBM Java implementation

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7756: --- Assignee: Apache Spark Ensure Spark runs clean on IBM Java implementation

[jira] [Commented] (SPARK-7755) MetadataCache.refresh does not take into account _SUCCESS

2015-05-20 Thread Rowan Chattaway (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552173#comment-14552173 ] Rowan Chattaway commented on SPARK-7755: These are the kinds of errors you can

[jira] [Commented] (SPARK-6981) [SQL] SparkPlanner and QueryExecution should be factored out from SQLContext

2015-05-20 Thread Edoardo Vacchi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552223#comment-14552223 ] Edoardo Vacchi commented on SPARK-6981: --- Besides, is there any particular reason why

[jira] [Created] (SPARK-7755) MetadataCache.refresh does not take into account _SUCCESS

2015-05-20 Thread Rowan Chattaway (JIRA)
Rowan Chattaway created SPARK-7755: -- Summary: MetadataCache.refresh does not take into account _SUCCESS Key: SPARK-7755 URL: https://issues.apache.org/jira/browse/SPARK-7755 Project: Spark

[jira] [Updated] (SPARK-7755) MetadataCache.refresh does not take into account _SUCCESS

2015-05-20 Thread Rowan Chattaway (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rowan Chattaway updated SPARK-7755: --- Affects Version/s: 1.3.1 MetadataCache.refresh does not take into account _SUCCESS

[jira] [Created] (SPARK-7753) Improve kernel density API

2015-05-20 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7753: Summary: Improve kernel density API Key: SPARK-7753 URL: https://issues.apache.org/jira/browse/SPARK-7753 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-7753) Improve kernel density API

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7753: --- Assignee: Xiangrui Meng (was: Apache Spark) Improve kernel density API

[jira] [Commented] (SPARK-7753) Improve kernel density API

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551996#comment-14551996 ] Apache Spark commented on SPARK-7753: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-7712) Native Spark Window Functions Performance Improvements

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551957#comment-14551957 ] Apache Spark commented on SPARK-7712: - User 'hvanhovell' has created a pull request

[jira] [Assigned] (SPARK-7712) Native Spark Window Functions Performance Improvements

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7712: --- Assignee: (was: Apache Spark) Native Spark Window Functions Performance Improvements

[jira] [Resolved] (SPARK-7439) Should delete temporary local directories

2015-05-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7439. -- Resolution: Duplicate Not sure what to do with this one as the dirs should already be cleaned up on

[jira] [Assigned] (SPARK-7753) Improve kernel density API

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7753: --- Assignee: Apache Spark (was: Xiangrui Meng) Improve kernel density API

[jira] [Commented] (SPARK-7537) Audit new public Scala APIs for MLlib 1.4

2015-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552022#comment-14552022 ] Xiangrui Meng commented on SPARK-7537: -- We need to expose EMLDAOptimizer and

[jira] [Created] (SPARK-7754) Use PartialFunction literals instead of objects in Catalyst

2015-05-20 Thread Edoardo Vacchi (JIRA)
Edoardo Vacchi created SPARK-7754: - Summary: Use PartialFunction literals instead of objects in Catalyst Key: SPARK-7754 URL: https://issues.apache.org/jira/browse/SPARK-7754 Project: Spark

[jira] [Assigned] (SPARK-7712) Native Spark Window Functions Performance Improvements

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7712: --- Assignee: Apache Spark Native Spark Window Functions Performance Improvements

[jira] [Commented] (SPARK-7654) DataFrameReader and DataFrameWriter for input/output API

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552018#comment-14552018 ] Apache Spark commented on SPARK-7654: - User 'mengxr' has created a pull request for

[jira] [Updated] (SPARK-7663) [MLLIB] feature.Word2Vec throws empty iterator error when the vocabulary size is zero

2015-05-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7663: - Assignee: Xusen Yin [MLLIB] feature.Word2Vec throws empty iterator error when the vocabulary size is

[jira] [Commented] (SPARK-7700) Spark 1.3.0 on YARN: Application failed 2 times due to AM Container

2015-05-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551983#comment-14551983 ] Sean Owen commented on SPARK-7700: -- I'm not quite sure what change you're proposing but

[jira] [Commented] (SPARK-7537) Audit new public Scala APIs for MLlib 1.4

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552009#comment-14552009 ] Apache Spark commented on SPARK-7537: - User 'mengxr' has created a pull request for

[jira] [Resolved] (SPARK-5220) keepPushingBlocks in BlockGenerator terminated when an exception occurs, which causes the block pushing thread to terminate and blocks receiver

2015-05-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5220. -- Resolution: Fixed Fix Version/s: 1.4.0 1.3.2 Assignee: Hari

[jira] [Resolved] (SPARK-7533) Decrease spacing between AM-RM heartbeats.

2015-05-20 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-7533. -- Resolution: Fixed Fix Version/s: 1.5.0 Assignee: Zoltán Zvara Decrease spacing

[jira] [Created] (SPARK-7757) mllib IndexedRowMatrix multiply IndexedRowMatrix

2015-05-20 Thread zhaoxiangyu (JIRA)
zhaoxiangyu created SPARK-7757: -- Summary: mllib IndexedRowMatrix multiply IndexedRowMatrix Key: SPARK-7757 URL: https://issues.apache.org/jira/browse/SPARK-7757 Project: Spark Issue Type:

[jira] [Created] (SPARK-7759) Failed to start thrift server when metastore is postgre sql

2015-05-20 Thread Tao Wang (JIRA)
Tao Wang created SPARK-7759: --- Summary: Failed to start thrift server when metastore is postgre sql Key: SPARK-7759 URL: https://issues.apache.org/jira/browse/SPARK-7759 Project: Spark Issue Type:

[jira] [Updated] (SPARK-7754) Use PartialFunction literals instead of objects in Catalyst

2015-05-20 Thread Edoardo Vacchi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Edoardo Vacchi updated SPARK-7754: -- Description: Catalyst rules extend two distinct rule types: {{Rule[LogicalPlan]}} and

[jira] [Created] (SPARK-7758) Failed to start thrift server when metastore is postgre sql

2015-05-20 Thread Tao Wang (JIRA)
Tao Wang created SPARK-7758: --- Summary: Failed to start thrift server when metastore is postgre sql Key: SPARK-7758 URL: https://issues.apache.org/jira/browse/SPARK-7758 Project: Spark Issue Type:

[jira] [Closed] (SPARK-7759) Failed to start thrift server when metastore is postgre sql

2015-05-20 Thread Tao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tao Wang closed SPARK-7759. --- Resolution: Duplicate Posted twice due to a network issue. Closing this. Failed to start thrift server when metastore

[jira] [Updated] (SPARK-7758) Failed to start thrift server when metastore is postgre sql

2015-05-20 Thread Tao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tao Wang updated SPARK-7758: Attachment: hive-site.xml Failed to start thrift server when metastore is postgre sql

[jira] [Closed] (SPARK-7701) UDT not working

2015-05-20 Thread bogdan baraila (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] bogdan baraila closed SPARK-7701. - UDT not working --- Key: SPARK-7701 URL:

[jira] [Commented] (SPARK-7746) SetFetchSize for JDBCRDD's prepareStatement

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552335#comment-14552335 ] Apache Spark commented on SPARK-7746: - User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-7746) SetFetchSize for JDBCRDD's prepareStatement

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7746: --- Assignee: (was: Apache Spark) SetFetchSize for JDBCRDD's prepareStatement

[jira] [Assigned] (SPARK-7746) SetFetchSize for JDBCRDD's prepareStatement

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7746: --- Assignee: Apache Spark SetFetchSize for JDBCRDD's prepareStatement

[jira] [Assigned] (SPARK-7760) Master Worker json endpoints missing

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7760: --- Assignee: Apache Spark (was: Imran Rashid) Master Worker json endpoints missing

[jira] [Closed] (SPARK-7540) PMML correctness check

2015-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-7540. Resolution: Done Fix Version/s: 1.4.0 PMML correctness check --

[jira] [Assigned] (SPARK-7535) Audit Pipeline APIs for 1.4

2015-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-7535: Assignee: Xiangrui Meng Audit Pipeline APIs for 1.4 ---

[jira] [Commented] (SPARK-7535) Audit Pipeline APIs for 1.4

2015-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552542#comment-14552542 ] Xiangrui Meng commented on SPARK-7535: -- ALS.train is exposed for developers who need

[jira] [Assigned] (SPARK-7763) Partition columns of data source tables should be persisted into metastore when creating persisted tables

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7763: --- Assignee: Apache Spark (was: Cheng Lian) Partition columns of data source tables should be

[jira] [Commented] (SPARK-7760) Master Worker json endpoints missing

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552503#comment-14552503 ] Apache Spark commented on SPARK-7760: - User 'squito' has created a pull request for

[jira] [Assigned] (SPARK-7760) Master Worker json endpoints missing

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7760: --- Assignee: Imran Rashid (was: Apache Spark) Master Worker json endpoints missing

[jira] [Created] (SPARK-7763) Partition columns of data source tables should be persisted into metastore when creating persisted tables

2015-05-20 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-7763: - Summary: Partition columns of data source tables should be persisted into metastore when creating persisted tables Key: SPARK-7763 URL: https://issues.apache.org/jira/browse/SPARK-7763

[jira] [Created] (SPARK-7762) Set default value for outputCol based on UID

2015-05-20 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7762: Summary: Set default value for outputCol based on UID Key: SPARK-7762 URL: https://issues.apache.org/jira/browse/SPARK-7762 Project: Spark Issue Type:

[jira] [Updated] (SPARK-7763) Partition columns of data source tables should be persisted into metastore when creating persisted tables

2015-05-20 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-7763: -- Component/s: SQL Description: Partition columns of {{HadoopFsRelation}} should be

[jira] [Created] (SPARK-7764) Add negative sampling to Word2Vec

2015-05-20 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-7764: -- Summary: Add negative sampling to Word2Vec Key: SPARK-7764 URL: https://issues.apache.org/jira/browse/SPARK-7764 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-7758) Failed to start thrift server when metastore is postgre sql

2015-05-20 Thread Tao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tao Wang updated SPARK-7758: Attachment: with no error.log with error.log Failed to start thrift server when metastore

[jira] [Commented] (SPARK-7540) PMML correctness check

2015-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552676#comment-14552676 ] Xiangrui Meng commented on SPARK-7540: -- Thanks everyone for the discussion and

[jira] [Commented] (SPARK-7763) Partition columns of data source tables should be persisted into metastore when creating persisted tables

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552689#comment-14552689 ] Apache Spark commented on SPARK-7763: - User 'liancheng' has created a pull request for

[jira] [Assigned] (SPARK-7763) Partition columns of data source tables should be persisted into metastore when creating persisted tables

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7763: --- Assignee: Cheng Lian (was: Apache Spark) Partition columns of data source tables should be

[jira] [Updated] (SPARK-7758) Failed to start thrift server when metastore is postgre sql

2015-05-20 Thread Tao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tao Wang updated SPARK-7758: Priority: Critical (was: Major) Failed to start thrift server when metastore is postgre sql

[jira] [Commented] (SPARK-7763) Partition columns of data source tables should be persisted into metastore when creating persisted tables

2015-05-20 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552698#comment-14552698 ] Cheng Lian commented on SPARK-7763: --- This bug also disables metastore relation cache for

[jira] [Updated] (SPARK-1529) Support DFS based shuffle in addition to Netty shuffle

2015-05-20 Thread Kannan Rajah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kannan Rajah updated SPARK-1529: Summary: Support DFS based shuffle in addition to Netty shuffle (was: Support setting

[jira] [Resolved] (SPARK-5325) Simplifying Hive shim implementation

2015-05-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5325. - Resolution: Not A Problem Obviated by isolated client loader. Simplifying Hive shim

[jira] [Closed] (SPARK-7472) DAG visualization: handle skipped stages differently

2015-05-20 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7472. Resolution: Fixed Fix Version/s: 1.4.0 DAG visualization: handle skipped stages differently

[jira] [Closed] (SPARK-7627) DAG visualization: cached RDDs not shown on job page

2015-05-20 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7627. Resolution: Fixed Fix Version/s: 1.4.0 DAG visualization: cached RDDs not shown on job page

[jira] [Commented] (SPARK-6880) Spark Shutdowns with NoSuchElementException when running parallel collect on cachedRDD

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553008#comment-14553008 ] Apache Spark commented on SPARK-6880: - User 'markhamstra' has created a pull request

[jira] [Commented] (SPARK-5681) Calling graceful stop() immediately after start() on StreamingContext should not get stuck indefinitely

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553082#comment-14553082 ] Apache Spark commented on SPARK-5681: - User 'zsxwing' has created a pull request for

[jira] [Created] (SPARK-7770) Should GBT validationTol be relative tolerance?

2015-05-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7770: Summary: Should GBT validationTol be relative tolerance? Key: SPARK-7770 URL: https://issues.apache.org/jira/browse/SPARK-7770 Project: Spark Issue

[jira] [Updated] (SPARK-7741) ContextCleaner not used by many DStream operations

2015-05-20 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7741: - Assignee: Andrew Or (was: Tathagata Das) ContextCleaner not used by many DStream operations

[jira] [Resolved] (SPARK-7769) How to represent a recursive data type in Spark SQL

2015-05-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7769. -- Resolution: Invalid Please ask questions at u...@spark.apache.org, not JIRA How to represent a

[jira] [Commented] (SPARK-6548) Adding stddev to DataFrame functions

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553191#comment-14553191 ] Apache Spark commented on SPARK-6548: - User 'JihongMA' has created a pull request for

[jira] [Resolved] (SPARK-7579) User guide update for OneHotEncoder

2015-05-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7579. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6126

[jira] [Created] (SPARK-7766) KryoSerializerInstance re-use is not safe when auto-flush is disabled

2015-05-20 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-7766: - Summary: KryoSerializerInstance re-use is not safe when auto-flush is disabled Key: SPARK-7766 URL: https://issues.apache.org/jira/browse/SPARK-7766 Project: Spark

[jira] [Created] (SPARK-7767) Fail fast if the DStream checkpoint is not serializable

2015-05-20 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-7767: Summary: Fail fast if the DStream checkpoint is not serializable Key: SPARK-7767 URL: https://issues.apache.org/jira/browse/SPARK-7767 Project: Spark Issue

[jira] [Commented] (SPARK-7760) Master Worker json endpoints missing

2015-05-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553049#comment-14553049 ] Josh Rosen commented on SPARK-7760: --- I've added 1.4.0 as a target version so that this

[jira] [Updated] (SPARK-7760) Master Worker json endpoints missing

2015-05-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7760: -- Target Version/s: 1.4.0 Master Worker json endpoints missing --

[jira] [Comment Edited] (SPARK-7724) Add support for Intersect and Except in Catalyst DSL

2015-05-20 Thread Santiago M. Mola (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553059#comment-14553059 ] Santiago M. Mola edited comment on SPARK-7724 at 5/20/15 8:36 PM:

[jira] [Commented] (SPARK-7724) Add support for Intersect and Except in Catalyst DSL

2015-05-20 Thread Santiago M. Mola (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553059#comment-14553059 ] Santiago M. Mola commented on SPARK-7724: - DataFrame is beyond the scope here. I

[jira] [Updated] (SPARK-7613) Serialization fails in pyspark for lambdas referencing class data members

2015-05-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7613: -- Description: The following code snippet works in pyspark 1.1.0, but fails post 1.2 with the indicated

[jira] [Assigned] (SPARK-7767) Fail fast if the DStream checkpoint is not serializable

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7767: --- Assignee: Apache Spark (was: Tathagata Das) Fail fast if the DStream checkpoint is not

[jira] [Assigned] (SPARK-7767) Fail fast if the DStream checkpoint is not serializable

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7767: --- Assignee: Tathagata Das (was: Apache Spark) Fail fast if the DStream checkpoint is not

[jira] [Commented] (SPARK-7767) Fail fast if the DStream checkpoint is not serializable

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553035#comment-14553035 ] Apache Spark commented on SPARK-7767: - User 'tdas' has created a pull request for this

[jira] [Commented] (SPARK-7320) Add rollup and cube support to DataFrame DSL

2015-05-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553073#comment-14553073 ] Patrick Wendell commented on SPARK-7320: Hey [~liancheng] and [~chenghao] - I

[jira] [Updated] (SPARK-7565) Broken maps in jsonRDD

2015-05-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-7565: Priority: Blocker (was: Major) Broken maps in jsonRDD -- Key:

[jira] [Assigned] (SPARK-7574) User guide update for OneVsRest

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7574: --- Assignee: Apache Spark (was: Ram Sriharsha) User guide update for OneVsRest

[jira] [Assigned] (SPARK-7574) User guide update for OneVsRest

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7574: --- Assignee: Ram Sriharsha (was: Apache Spark) User guide update for OneVsRest

[jira] [Commented] (SPARK-7574) User guide update for OneVsRest

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553190#comment-14553190 ] Apache Spark commented on SPARK-7574: - User 'harsha2010' has created a pull request

[jira] [Commented] (SPARK-7565) Broken maps in jsonRDD

2015-05-20 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553216#comment-14553216 ] Davies Liu commented on SPARK-7565: --- [~tailhook] The patch is kind of a workaround, it

[jira] [Assigned] (SPARK-7719) Java 6 code in UnsafeShuffleWriterSuite

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7719: --- Assignee: Apache Spark (was: Josh Rosen) Java 6 code in UnsafeShuffleWriterSuite

[jira] [Assigned] (SPARK-7719) Java 6 code in UnsafeShuffleWriterSuite

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7719: --- Assignee: Josh Rosen (was: Apache Spark) Java 6 code in UnsafeShuffleWriterSuite

[jira] [Created] (SPARK-7768) Make user-defined type (UDT) API public

2015-05-20 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7768: Summary: Make user-defined type (UDT) API public Key: SPARK-7768 URL: https://issues.apache.org/jira/browse/SPARK-7768 Project: Spark Issue Type: New

[jira] [Assigned] (SPARK-7606) Document all PySpark SQL/DataFrame public methods with @since tag

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7606: --- Assignee: Apache Spark Document all PySpark SQL/DataFrame public methods with @since tag

[jira] [Commented] (SPARK-7606) Document all PySpark SQL/DataFrame public methods with @since tag

2015-05-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553098#comment-14553098 ] Apache Spark commented on SPARK-7606: - User 'davies' has created a pull request for
