[jira] [Updated] (SPARK-5473) Expose SSH failures after status checks pass

2015-03-04 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5473: Description: If there is some fatal problem with launching a cluster, `spark-ec2` just

[jira] [Updated] (SPARK-3533) Add saveAsTextFileByKey() method to RDDs

2015-03-04 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-3533: Target Version/s: 1.4.0 Add saveAsTextFileByKey() method to RDDs

[jira] [Commented] (SPARK-6145) ORDER BY fails to resolve nested fields

2015-03-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347022#comment-14347022 ] Apache Spark commented on SPARK-6145: - User 'chenghao-intel' has created a pull

[jira] [Commented] (SPARK-6153) intellij import from maven cannot debug sparksqlclidriver

2015-03-04 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347054#comment-14347054 ] Adrian Wang commented on SPARK-6153: [~sowen]I have updated my steps in GitHub, can

[jira] [Commented] (SPARK-3533) Add saveAsTextFileByKey() method to RDDs

2015-03-04 Thread Aaron (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347155#comment-14347155 ] Aaron commented on SPARK-3533: -- [~ilganeli] FYI I'm pretty sure the `init()` means missing no

[jira] [Commented] (SPARK-6152) Spark does not support Java 8 compiled Scala classes

2015-03-04 Thread Ronald Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347220#comment-14347220 ] Ronald Chen commented on SPARK-6152: I've made the description more clear. The

[jira] [Commented] (SPARK-3533) Add saveAsTextFileByKey() method to RDDs

2015-03-04 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347216#comment-14347216 ] Ilya Ganelin commented on SPARK-3533: - [~aaronjosephs] - Let me see if that's it.

[jira] [Commented] (SPARK-5981) pyspark ML models should support predict/transform on vector within map

2015-03-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347237#comment-14347237 ] Joseph K. Bradley commented on SPARK-5981: -- When predict() is called on a single

[jira] [Comment Edited] (SPARK-5981) pyspark ML models should support predict/transform on vector within map

2015-03-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347231#comment-14347231 ] Manoj Kumar edited comment on SPARK-5981 at 3/4/15 5:47 PM: I

[jira] [Updated] (SPARK-6149) Spark SQL CLI doesn't work when compiled against Hive 12 with SBT because of runtime incompatibility issues caused by Guava 15

2015-03-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6149: --- Priority: Critical (was: Blocker) Spark SQL CLI doesn't work when compiled against Hive 12

[jira] [Commented] (SPARK-6149) Spark SQL CLI doesn't work when compiled against Hive 12 with SBT because of runtime incompatibility issues caused by Guava 15

2015-03-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347210#comment-14347210 ] Patrick Wendell commented on SPARK-6149: Since this only affects the sbt build and

[jira] [Commented] (SPARK-6144) When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail

2015-03-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347209#comment-14347209 ] Marcelo Vanzin commented on SPARK-6144: --- Separately, we should check the unit tests

[jira] [Updated] (SPARK-6152) Spark does not support Java 8 compiled Scala classes

2015-03-04 Thread Ronald Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ronald Chen updated SPARK-6152: --- Description: Spark uses reflectasm to check Scala closures which fails if the *user defined Scala

[jira] [Updated] (SPARK-6152) Spark does not support Java 8 compiled Scala classes

2015-03-04 Thread Ronald Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ronald Chen updated SPARK-6152: --- Description: Spark uses reflectasm to check Scala closures which fails if the *user defined Scala

[jira] [Commented] (SPARK-6158) Move private method boost in GradientBoostedTrees from Object to Class

2015-03-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347358#comment-14347358 ] Joseph K. Bradley commented on SPARK-6158: -- I'm not quite sure what the issue is.

[jira] [Commented] (SPARK-6158) Move private method boost in GradientBoostedTrees from Object to Class

2015-03-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347378#comment-14347378 ] Joseph K. Bradley commented on SPARK-6158: -- I'm also not clear on what you're

[jira] [Commented] (SPARK-6158) Move private method boost in GradientBoostedTrees from Object to Class

2015-03-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347308#comment-14347308 ] Manoj Kumar commented on SPARK-6158: [~josephkb] Is it necessary to link this with the

[jira] [Commented] (SPARK-6144) When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail

2015-03-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347369#comment-14347369 ] Apache Spark commented on SPARK-6144: - User 'vanzin' has created a pull request for

[jira] [Commented] (SPARK-5981) pyspark ML models should support predict/transform on vector within map

2015-03-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347305#comment-14347305 ] Manoj Kumar commented on SPARK-5981: Thanks for the quick response. I understand store

[jira] [Created] (SPARK-6161) sqlCtx.parquetFile(dataFilePath) throws NPE when using s3, but OK when using local filesystem

2015-03-04 Thread Marshall (JIRA)
Marshall created SPARK-6161: --- Summary: sqlCtx.parquetFile(dataFilePath) throws NPE when using s3, but OK when using local filesystem Key: SPARK-6161 URL: https://issues.apache.org/jira/browse/SPARK-6161

[jira] [Commented] (SPARK-5981) pyspark ML models should support predict/transform on vector within map

2015-03-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347477#comment-14347477 ] Joseph K. Bradley commented on SPARK-5981: -- {quote} I understand store the model

[jira] [Updated] (SPARK-6159) Distinguish between inprogress and abnormal event log history

2015-03-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6159: - Affects Version/s: 1.0.0 Distinguish between inprogress and abnormal event log history

[jira] [Created] (SPARK-6163) jsonFile should be backed by the data source API

2015-03-04 Thread Yin Huai (JIRA)
Yin Huai created SPARK-6163: --- Summary: jsonFile should be backed by the data source API Key: SPARK-6163 URL: https://issues.apache.org/jira/browse/SPARK-6163 Project: Spark Issue Type: Task

[jira] [Commented] (SPARK-2545) Add a diagnosis mode for closures to figure out what they're bringing in

2015-03-04 Thread Tobias Schlatter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347519#comment-14347519 ] Tobias Schlatter commented on SPARK-2545: - In the REPL, this tool would (as it

[jira] [Updated] (SPARK-6166) Add config to limit number of concurrent outbound connections for shuffle fetch

2015-03-04 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mridul Muralidharan updated SPARK-6166: --- Priority: Minor (was: Major) Add config to limit number of concurrent outbound

[jira] [Created] (SPARK-6166) Add config to limit number of concurrent outbound connections for shuffle fetch

2015-03-04 Thread Mridul Muralidharan (JIRA)
Mridul Muralidharan created SPARK-6166: -- Summary: Add config to limit number of concurrent outbound connections for shuffle fetch Key: SPARK-6166 URL: https://issues.apache.org/jira/browse/SPARK-6166

[jira] [Created] (SPARK-6167) Previous Commit Broke BroadcastTest

2015-03-04 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-6167: - Summary: Previous Commit Broke BroadcastTest Key: SPARK-6167 URL: https://issues.apache.org/jira/browse/SPARK-6167 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-6168) Expose some of the collection classes as DeveloperApi

2015-03-04 Thread Mridul Muralidharan (JIRA)
Mridul Muralidharan created SPARK-6168: -- Summary: Expose some of the collection classes as DeveloperApi Key: SPARK-6168 URL: https://issues.apache.org/jira/browse/SPARK-6168 Project: Spark

[jira] [Commented] (SPARK-6167) Previous Commit Broke BroadcastTest

2015-03-04 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347607#comment-14347607 ] RJ Nowling commented on SPARK-6167: --- This PR fixes the issue in master and the 1.3

[jira] [Updated] (SPARK-6168) Expose some of the collection classes as DeveloperApi

2015-03-04 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mridul Muralidharan updated SPARK-6168: --- Description: It would be very useful to mark some of the collection and util classes

[jira] [Created] (SPARK-6169) Shuffle based join

2015-03-04 Thread Mridul Muralidharan (JIRA)
Mridul Muralidharan created SPARK-6169: -- Summary: Shuffle based join Key: SPARK-6169 URL: https://issues.apache.org/jira/browse/SPARK-6169 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-6170) Add support for skew join

2015-03-04 Thread Mridul Muralidharan (JIRA)
Mridul Muralidharan created SPARK-6170: -- Summary: Add support for skew join Key: SPARK-6170 URL: https://issues.apache.org/jira/browse/SPARK-6170 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5143) spark-network-yarn 2.11 depends on spark-network-shuffle 2.10

2015-03-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347567#comment-14347567 ] Patrick Wendell commented on SPARK-5143: Yes - good catch Sean. Curious that this

[jira] [Updated] (SPARK-5954) Add topByKey to pair RDDs

2015-03-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5954: - Assignee: Shuo Xiang Add topByKey to pair RDDs - Key:

[jira] [Updated] (SPARK-5986) Model import/export for KMeansModel

2015-03-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5986: - Assignee: Xusen Yin Model import/export for KMeansModel ---

[jira] [Updated] (SPARK-6144) When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail

2015-03-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6144: --- Component/s: Spark Core When in cluster mode using ADD JAR with a hdfs:// sourced jar will

[jira] [Closed] (SPARK-6144) When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail

2015-03-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-6144. Resolution: Fixed Fix Version/s: 1.3.0 When in cluster mode using ADD JAR with a hdfs:// sourced

[jira] [Created] (SPARK-6165) Aggregate and reduce should spool to disk and complete

2015-03-04 Thread Mridul Muralidharan (JIRA)
Mridul Muralidharan created SPARK-6165: -- Summary: Aggregate and reduce should spool to disk and complete Key: SPARK-6165 URL: https://issues.apache.org/jira/browse/SPARK-6165 Project: Spark

[jira] [Updated] (SPARK-6165) Aggregate and reduce should be able to work with very large number of tasks.

2015-03-04 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mridul Muralidharan updated SPARK-6165: --- Summary: Aggregate and reduce should be able to work with very large number of tasks.

[jira] [Updated] (SPARK-6158) Move private method boost in GradientBoostedTrees from Object to Class

2015-03-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-6158: --- Description: 1. a] When an instance of GradientBoostedTrees class is called and run, the boost

[jira] [Updated] (SPARK-6160) ChiSqSelector should keep test statistic info

2015-03-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6160: - Description: It is useful to have the test statistics explaining selected features, but

[jira] [Created] (SPARK-6160) ChiSqSelector should keep test statistic info

2015-03-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6160: Summary: ChiSqSelector should keep test statistic info Key: SPARK-6160 URL: https://issues.apache.org/jira/browse/SPARK-6160 Project: Spark Issue

[jira] [Created] (SPARK-6162) Handle missing values in GBM

2015-03-04 Thread Devesh Parekh (JIRA)
Devesh Parekh created SPARK-6162: Summary: Handle missing values in GBM Key: SPARK-6162 URL: https://issues.apache.org/jira/browse/SPARK-6162 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-5692) Model import/export for Word2Vec

2015-03-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-5692: Assignee: Xiangrui Meng Model import/export for Word2Vec

[jira] [Updated] (SPARK-4588) Add API for feature attributes

2015-03-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4588: - Description: Feature attributes, e.g., continuous/categorical, feature names, feature dimension,

[jira] [Created] (SPARK-6171) No class def found for HiveConf in Spark shell

2015-03-04 Thread Andrew Or (JIRA)
Andrew Or created SPARK-6171: Summary: No class def found for HiveConf in Spark shell Key: SPARK-6171 URL: https://issues.apache.org/jira/browse/SPARK-6171 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6171) No class def found for HiveConf in Spark shell

2015-03-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347669#comment-14347669 ] Andrew Or commented on SPARK-6171: -- Marking this as a blocker because I believe it's a

[jira] [Updated] (SPARK-6137) G-Means clustering algorithm implementation

2015-03-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6137: - Labels: clustering (was: ) G-Means clustering algorithm implementation

[jira] [Created] (SPARK-6172) NoClassDefFoundError when launching spark shell w/o hive

2015-03-04 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-6172: --- Summary: NoClassDefFoundError when launching spark shell w/o hive Key: SPARK-6172 URL: https://issues.apache.org/jira/browse/SPARK-6172 Project: Spark

[jira] [Updated] (SPARK-6171) No class def found for HiveConf in Spark shell

2015-03-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6171: - Description: I ran `build/sbt clean assembly` and then started the Spark shell, then I hit this huge

[jira] [Commented] (SPARK-3533) Add saveAsTextFileByKey() method to RDDs

2015-03-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347678#comment-14347678 ] Apache Spark commented on SPARK-3533: - User 'ilganeli' has created a pull request for

[jira] [Resolved] (SPARK-6172) NoClassDefFoundError when launching spark shell w/o hive

2015-03-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6172. - Resolution: Duplicate NoClassDefFoundError when launching spark shell w/o hive

[jira] [Commented] (SPARK-6122) Upgrade Tachyon dependency to 0.6.0

2015-03-04 Thread Calvin Jia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348050#comment-14348050 ] Calvin Jia commented on SPARK-6122: --- I've also updated the tachyon client to use the new

[jira] [Closed] (SPARK-4872) Provide sample format of training/test data in MLlib programming guide

2015-03-04 Thread zhang jun wei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhang jun wei closed SPARK-4872. Resolution: Fixed Provide sample format of training/test data in MLlib programming guide

[jira] [Commented] (SPARK-3066) Support recommendAll in matrix factorization model

2015-03-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347742#comment-14347742 ] Joseph K. Bradley commented on SPARK-3066: -- Are there approximate methods which

[jira] [Commented] (SPARK-6163) jsonFile should be backed by the data source API

2015-03-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347815#comment-14347815 ] Apache Spark commented on SPARK-6163: - User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-6155) Build with Scala 2.11.5 failed for Spark v1.3.0-rc2

2015-03-04 Thread Jianshi Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347969#comment-14347969 ] Jianshi Huang commented on SPARK-6155: -- Yeah, please add the feature request. Just a

[jira] [Created] (SPARK-6175) Executor log links are using internal addresses in EC2

2015-03-04 Thread Yin Huai (JIRA)
Yin Huai created SPARK-6175: --- Summary: Executor log links are using internal addresses in EC2 Key: SPARK-6175 URL: https://issues.apache.org/jira/browse/SPARK-6175 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5271) PySpark History Web UI issues

2015-03-04 Thread zzc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5271?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348067#comment-14348067 ] zzc commented on SPARK-5271: hi, [~sowen], [~ZEMUSHKA], SPARK-3898 occured with Spark

[jira] [Created] (SPARK-6173) Python doc parity with Scala/Java in MLlib

2015-03-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6173: Summary: Python doc parity with Scala/Java in MLlib Key: SPARK-6173 URL: https://issues.apache.org/jira/browse/SPARK-6173 Project: Spark Issue Type:

[jira] [Commented] (SPARK-6171) No class def found for HiveConf in Spark shell

2015-03-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347899#comment-14347899 ] Andrew Or commented on SPARK-6171: -- Closing as cannot reproduced. It must have been

[jira] [Closed] (SPARK-6171) No class def found for HiveConf in Spark shell

2015-03-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-6171. Resolution: Cannot Reproduce No class def found for HiveConf in Spark shell

[jira] [Commented] (SPARK-4872) Provide sample format of training/test data in MLlib programming guide

2015-03-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347745#comment-14347745 ] Joseph K. Bradley commented on SPARK-4872: -- Can this issue be closed? Provide

[jira] [Commented] (SPARK-6167) Previous Commit Broke BroadcastTest

2015-03-04 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347773#comment-14347773 ] RJ Nowling commented on SPARK-6167: --- Great! Thanks! Previous Commit Broke

[jira] [Commented] (SPARK-5929) Pyspark: Register a pip requirements file with spark_context

2015-03-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347863#comment-14347863 ] Apache Spark commented on SPARK-5929: - User 'buckheroux' has created a pull request

[jira] [Updated] (SPARK-6154) Build error with Scala 2.11 for v1.3.0-rc2

2015-03-04 Thread Jianshi Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jianshi Huang updated SPARK-6154: - Component/s: SQL Build error with Scala 2.11 for v1.3.0-rc2

[jira] [Created] (SPARK-6176) Expose abstract DataTypes for DataFrames

2015-03-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6176: Summary: Expose abstract DataTypes for DataFrames Key: SPARK-6176 URL: https://issues.apache.org/jira/browse/SPARK-6176 Project: Spark Issue Type:

[jira] [Commented] (SPARK-6177) LDA should check partitions size of the input

2015-03-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348166#comment-14348166 ] Apache Spark commented on SPARK-6177: - User 'hhbyyh' has created a pull request for

[jira] [Updated] (SPARK-5692) Model import/export for Word2Vec

2015-03-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5692: - Assignee: ANUPAM MEDIRATTA Model import/export for Word2Vec

[jira] [Created] (SPARK-6178) Remove unused imports from java classes

2015-03-04 Thread Vinod KC (JIRA)
Vinod KC created SPARK-6178: --- Summary: Remove unused imports from java classes Key: SPARK-6178 URL: https://issues.apache.org/jira/browse/SPARK-6178 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-03-04 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-5791: --- Attachment: Physcial_Plan_Hive.txt [Spark SQL] show poor performance when multiple table do join operation

[jira] [Comment Edited] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-03-04 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348293#comment-14348293 ] Cheng Hao edited comment on SPARK-5791 at 3/5/15 7:08 AM: -- I

[jira] [Commented] (SPARK-6061) File source dstream can not include the old file which timestamp is before the system time

2015-03-04 Thread Jack Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348093#comment-14348093 ] Jack Hu commented on SPARK-6061: [~srowen] The issue is: I want to process the old files

[jira] [Comment Edited] (SPARK-5124) Standardize internal RPC interface

2015-03-04 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348215#comment-14348215 ] Shixiong Zhu edited comment on SPARK-5124 at 3/5/15 6:10 AM: -

[jira] [Commented] (SPARK-5389) spark-shell.cmd does not run from DOS Windows 7

2015-03-04 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348309#comment-14348309 ] Masayoshi TSUZUKI commented on SPARK-5389: -- The crashed program findstr.exe in

[jira] [Created] (SPARK-6179) Support SHOW PRINCIPALS role_name;

2015-03-04 Thread DoingDone9 (JIRA)
DoingDone9 created SPARK-6179: - Summary: Support SHOW PRINCIPALS role_name; Key: SPARK-6179 URL: https://issues.apache.org/jira/browse/SPARK-6179 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-6182) spark-parent pom needs to be published for both 2.10 and 2.11

2015-03-04 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-6182: -- Summary: spark-parent pom needs to be published for both 2.10 and 2.11 Key: SPARK-6182 URL: https://issues.apache.org/jira/browse/SPARK-6182 Project: Spark

[jira] [Created] (SPARK-6177) LDA should check partitions size of the input

2015-03-04 Thread yuhao yang (JIRA)
yuhao yang created SPARK-6177: - Summary: LDA should check partitions size of the input Key: SPARK-6177 URL: https://issues.apache.org/jira/browse/SPARK-6177 Project: Spark Issue Type:

[jira] [Commented] (SPARK-6061) File source dstream can not include the old file which timestamp is before the system time

2015-03-04 Thread Yi Tian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348160#comment-14348160 ] Yi Tian commented on SPARK-6061: In spark 1.2.1, when you set the {{newFilesOnly}} to

[jira] [Comment Edited] (SPARK-5124) Standardize internal RPC interface

2015-03-04 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348215#comment-14348215 ] Shixiong Zhu edited comment on SPARK-5124 at 3/5/15 6:09 AM: -

[jira] [Comment Edited] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-03-04 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348293#comment-14348293 ] Cheng Hao edited comment on SPARK-5791 at 3/5/15 7:07 AM: -- I

[jira] [Commented] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-03-04 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348293#comment-14348293 ] Cheng Hao commented on SPARK-5791: -- I think this is a typical case that we need to

[jira] [Updated] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-03-04 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-5791: --- Attachment: Physcial_Plan_SparkSQL_Updated.txt [Spark SQL] show poor performance when multiple table do join

[jira] [Commented] (SPARK-6115) Description for SparkSQL Jobs doesn't show up correctly until after the job finishes

2015-03-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348102#comment-14348102 ] Apache Spark commented on SPARK-6115: - User 'Leolh' has created a pull request for

[jira] [Resolved] (SPARK-6149) Spark SQL CLI doesn't work when compiled against Hive 12 with SBT because of runtime incompatibility issues caused by Guava 15

2015-03-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-6149. Resolution: Fixed Fix Version/s: 1.3.0 Spark SQL CLI doesn't work when compiled

[jira] [Commented] (SPARK-5692) Model import/export for Word2Vec

2015-03-04 Thread ANUPAM MEDIRATTA (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348177#comment-14348177 ] ANUPAM MEDIRATTA commented on SPARK-5692: - Hey Xiangrui Please assign the ticket

[jira] [Updated] (SPARK-6177) LDA should check partitions size of the input

2015-03-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-6177: -- Description: sc.textFile will create RDD with one partition for each file, and the possible massive

[jira] [Commented] (SPARK-6061) File source dstream can not include the old file which timestamp is before the system time

2015-03-04 Thread Jack Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348230#comment-14348230 ] Jack Hu commented on SPARK-6061: [~tianyi] Do you know why the

[jira] [Created] (SPARK-6180) Error logged into log4j when use the HiveMetastoreCatalog::tableExists

2015-03-04 Thread Jack Hu (JIRA)
Jack Hu created SPARK-6180: -- Summary: Error logged into log4j when use the HiveMetastoreCatalog::tableExists Key: SPARK-6180 URL: https://issues.apache.org/jira/browse/SPARK-6180 Project: Spark

[jira] [Commented] (SPARK-6179) Support SHOW PRINCIPALS role_name;

2015-03-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348313#comment-14348313 ] Apache Spark commented on SPARK-6179: - User 'DoingDone9' has created a pull request

[jira] [Updated] (SPARK-6180) Error logged into log4j when use the HiveMetastoreCatalog::tableExists

2015-03-04 Thread Jack Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jack Hu updated SPARK-6180: --- Description: When using {{HiveMetastoreCatalog.tableExists}} to check a table that does not exist in hive

[jira] [Commented] (SPARK-6175) Executor log links are using internal addresses in EC2

2015-03-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348336#comment-14348336 ] Apache Spark commented on SPARK-6175: - User 'JoshRosen' has created a pull request for

[jira] [Comment Edited] (SPARK-5271) PySpark History Web UI issues

2015-03-04 Thread zzc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5271?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348067#comment-14348067 ] zzc edited comment on SPARK-5271 at 3/5/15 3:35 AM: hi, [~sowen],

[jira] [Commented] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-03-04 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348225#comment-14348225 ] Yin Huai commented on SPARK-5791: - I see. In Hive's plan, all of item, warehouse, and

[jira] [Comment Edited] (SPARK-6084) spark-shell broken on Windows

2015-03-04 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348310#comment-14348310 ] Masayoshi TSUZUKI edited comment on SPARK-6084 at 3/5/15 7:23 AM:

[jira] [Commented] (SPARK-6084) spark-shell broken on Windows

2015-03-04 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348310#comment-14348310 ] Masayoshi TSUZUKI commented on SPARK-6084: -- Sorry for the late reply.

[jira] [Updated] (SPARK-6179) Support SHOW PRINCIPALS role_name;

2015-03-04 Thread DoingDone9 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DoingDone9 updated SPARK-6179: -- Description: SHOW PRINCIPALS role_name; Lists all roles and users who belong to this role. Only the

[jira] [Created] (SPARK-6181) Support SHOW COMPACTIONS;

2015-03-04 Thread DoingDone9 (JIRA)
DoingDone9 created SPARK-6181: - Summary: Support SHOW COMPACTIONS; Key: SPARK-6181 URL: https://issues.apache.org/jira/browse/SPARK-6181 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-5143) spark-network-yarn 2.11 depends on spark-network-shuffle 2.10

2015-03-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5143. Resolution: Fixed Fix Version/s: 1.3.0 spark-network-yarn 2.11 depends on

[jira] [Commented] (SPARK-5692) Model import/export for Word2Vec

2015-03-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348212#comment-14348212 ] Xiangrui Meng commented on SPARK-5692: -- Done. The Parquet data file should have two

  1   2   >