[jira] [Created] (SPARK-2189) Method for removing temp tables created by registerAsTable

2014-06-19 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-2189: --- Summary: Method for removing temp tables created by registerAsTable Key: SPARK-2189 URL: https://issues.apache.org/jira/browse/SPARK-2189 Project: Spark

[jira] [Created] (SPARK-2190) Specialized ColumnType for Timestamp

2014-06-19 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-2190: --- Summary: Specialized ColumnType for Timestamp Key: SPARK-2190 URL: https://issues.apache.org/jira/browse/SPARK-2190 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-2191) Double execution with CREATE TABLE AS SELECT

2014-06-19 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-2191: --- Summary: Double execution with CREATE TABLE AS SELECT Key: SPARK-2191 URL: https://issues.apache.org/jira/browse/SPARK-2191 Project: Spark Issue Type:

[jira] [Updated] (SPARK-2193) Improve tasks‘ preferred locality by sorting tasks partial ordering

2014-06-19 Thread Zhihui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhihui updated SPARK-2193: -- Description: Now, the last executor(s) maybe not get it’s preferred task(s), although these tasks have build

[jira] [Updated] (SPARK-2193) Improve tasks‘ preferred locality by sorting tasks partial ordering

2014-06-19 Thread Zhihui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhihui updated SPARK-2193: -- Attachment: Improve Tasks Preferred Locality.pptx Improve tasks‘ preferred locality by sorting tasks partial

[jira] [Commented] (SPARK-2193) Improve tasks‘ preferred locality by sorting tasks partial ordering

2014-06-19 Thread Zhihui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037121#comment-14037121 ] Zhihui commented on SPARK-2193: --- PR 1131 https://github.com/apache/spark/pull/1131 Improve

[jira] [Created] (SPARK-2194) EC2 Scripts don't work in europe

2014-06-19 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-2194: --- Summary: EC2 Scripts don't work in europe Key: SPARK-2194 URL: https://issues.apache.org/jira/browse/SPARK-2194 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-2195) Parquet extraMetadata can contain key information

2014-06-19 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-2195: --- Summary: Parquet extraMetadata can contain key information Key: SPARK-2195 URL: https://issues.apache.org/jira/browse/SPARK-2195 Project: Spark Issue

[jira] [Created] (SPARK-2196) Fix nullability of CaseWhen.

2014-06-19 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-2196: Summary: Fix nullability of CaseWhen. Key: SPARK-2196 URL: https://issues.apache.org/jira/browse/SPARK-2196 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-2196) Fix nullability of CaseWhen.

2014-06-19 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin updated SPARK-2196: - Description: {{CaseWhen}} should use {{branches.length}} to check if {{elseValue}} is provided

[jira] [Commented] (SPARK-2196) Fix nullability of CaseWhen.

2014-06-19 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037189#comment-14037189 ] Takuya Ueshin commented on SPARK-2196: -- PRed:

[jira] [Created] (SPARK-2197) Spark invoke DecisionTree by Java

2014-06-19 Thread wulin (JIRA)
wulin created SPARK-2197: Summary: Spark invoke DecisionTree by Java Key: SPARK-2197 URL: https://issues.apache.org/jira/browse/SPARK-2197 Project: Spark Issue Type: Bug Components: MLlib

[jira] [Created] (SPARK-2198) Partition the scala build file so that it is easier to maintain

2014-06-19 Thread Helena Edelson (JIRA)
Helena Edelson created SPARK-2198: - Summary: Partition the scala build file so that it is easier to maintain Key: SPARK-2198 URL: https://issues.apache.org/jira/browse/SPARK-2198 Project: Spark

[jira] [Updated] (SPARK-2198) Partition the scala build file so that it is easier to maintain

2014-06-19 Thread Helena Edelson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Helena Edelson updated SPARK-2198: -- Description: Partition to standard Dependencies, Version, Settings, Publish.scala. keeping

[jira] [Updated] (SPARK-2198) Partition the scala build file so that it is easier to maintain

2014-06-19 Thread Helena Edelson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Helena Edelson updated SPARK-2198: -- Remaining Estimate: 2h (was: 1m) Original Estimate: 2h (was: 1m) Partition the scala

[jira] [Updated] (SPARK-2198) Partition the scala build file so that it is easier to maintain

2014-06-19 Thread Helena Edelson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Helena Edelson updated SPARK-2198: -- Remaining Estimate: 3h (was: 2h) Original Estimate: 3h (was: 2h) Partition the scala

[jira] [Resolved] (SPARK-2194) EC2 Scripts don't work in europe

2014-06-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2194. - Resolution: Cannot Reproduce After waiting a few hours the error message went away.

[jira] [Created] (SPARK-2199) Distributed probabilistic latent semantic analysis in MLlib

2014-06-19 Thread Denis Turdakov (JIRA)
Denis Turdakov created SPARK-2199: - Summary: Distributed probabilistic latent semantic analysis in MLlib Key: SPARK-2199 URL: https://issues.apache.org/jira/browse/SPARK-2199 Project: Spark

[jira] [Updated] (SPARK-2199) Distributed probabilistic latent semantic analysis in MLlib

2014-06-19 Thread Denis Turdakov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Denis Turdakov updated SPARK-2199: -- Description: Probabilistic latent semantic analysis (PLSA) is a topic model which extracts

[jira] [Updated] (SPARK-2199) Distributed probabilistic latent semantic analysis in MLlib

2014-06-19 Thread Denis Turdakov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Denis Turdakov updated SPARK-2199: -- Description: Probabilistic latent semantic analysis (PLSA) is a topic model which extracts

[jira] [Commented] (SPARK-2200) breeze DenseVector not serializable with KryoSerializer

2014-06-19 Thread Neville Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037424#comment-14037424 ] Neville Li commented on SPARK-2200: --- https://github.com/apache/spark/pull/940 addresses

[jira] [Created] (SPARK-2200) breeze DenseVector not serializable with KryoSerializer

2014-06-19 Thread Neville Li (JIRA)
Neville Li created SPARK-2200: - Summary: breeze DenseVector not serializable with KryoSerializer Key: SPARK-2200 URL: https://issues.apache.org/jira/browse/SPARK-2200 Project: Spark Issue Type:

[jira] [Commented] (SPARK-2198) Partition the scala build file so that it is easier to maintain

2014-06-19 Thread Mark Hamstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037431#comment-14037431 ] Mark Hamstra commented on SPARK-2198: - While this is an admirable goal, I'm afraid

[jira] [Created] (SPARK-2201) Improve FlumeInputDStream

2014-06-19 Thread sunshangchun (JIRA)
sunshangchun created SPARK-2201: --- Summary: Improve FlumeInputDStream Key: SPARK-2201 URL: https://issues.apache.org/jira/browse/SPARK-2201 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-2198) Partition the scala build file so that it is easier to maintain

2014-06-19 Thread Helena Edelson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037467#comment-14037467 ] Helena Edelson commented on SPARK-2198: --- I am sad to hear that the Maven POMs will

[jira] [Resolved] (SPARK-2051) spark.yarn.dist.* configs are not supported in yarn-cluster mode

2014-06-19 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-2051. -- Resolution: Fixed Fix Version/s: 1.1.0 spark.yarn.dist.* configs are not supported in

[jira] [Created] (SPARK-2202) saveAsTextFile hangs on final 2 tasks

2014-06-19 Thread Suren Hiraman (JIRA)
Suren Hiraman created SPARK-2202: Summary: saveAsTextFile hangs on final 2 tasks Key: SPARK-2202 URL: https://issues.apache.org/jira/browse/SPARK-2202 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-2200) breeze DenseVector not serializable with KryoSerializer

2014-06-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037637#comment-14037637 ] Xiangrui Meng commented on SPARK-2200: -- [~neville] Do you know the root cause and how

[jira] [Updated] (SPARK-2126) Move MapOutputTracker behind ShuffleManager interface

2014-06-19 Thread Mark Hamstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Hamstra updated SPARK-2126: Assignee: Nan Zhu Move MapOutputTracker behind ShuffleManager interface

[jira] [Commented] (SPARK-2199) Distributed probabilistic latent semantic analysis in MLlib

2014-06-19 Thread Valeriy Avanesov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037659#comment-14037659 ] Valeriy Avanesov commented on SPARK-2199: - Here is the implementation we currently

[jira] [Commented] (SPARK-2126) Move MapOutputTracker behind ShuffleManager interface

2014-06-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037692#comment-14037692 ] Patrick Wendell commented on SPARK-2126: Hey All, This proposal is a fairly hairy

[jira] [Commented] (SPARK-2180) HiveQL doesn't support GROUP BY with HAVING clauses

2014-06-19 Thread William Benton (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037697#comment-14037697 ] William Benton commented on SPARK-2180: --- PR is here:

[jira] [Commented] (SPARK-2202) saveAsTextFile hangs on final 2 tasks

2014-06-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037698#comment-14037698 ] Patrick Wendell commented on SPARK-2202: I changed the priority because we usually

[jira] [Commented] (SPARK-2202) saveAsTextFile hangs on final 2 tasks

2014-06-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037696#comment-14037696 ] Patrick Wendell commented on SPARK-2202: When the tasks are hanging. Could you go

[jira] [Updated] (SPARK-2202) saveAsTextFile hangs on final 2 tasks

2014-06-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2202: --- Priority: Major (was: Blocker) saveAsTextFile hangs on final 2 tasks

[jira] [Commented] (SPARK-2038) Don't shadow conf variable in saveAsHadoop functions

2014-06-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037703#comment-14037703 ] Patrick Wendell commented on SPARK-2038: Hey [~CodingCat] - I realized there is

[jira] [Reopened] (SPARK-2038) Don't shadow conf variable in saveAsHadoop functions

2014-06-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-2038: Don't shadow conf variable in saveAsHadoop functions

[jira] [Commented] (SPARK-2038) Don't shadow conf variable in saveAsHadoop functions

2014-06-19 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037739#comment-14037739 ] Nan Zhu commented on SPARK-2038: [~pwendell] Yeah, it's a good idea, just submit a new PR:

[jira] [Commented] (SPARK-2126) Move MapOutputTracker behind ShuffleManager interface

2014-06-19 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037746#comment-14037746 ] Nan Zhu commented on SPARK-2126: [~pwendell] Yes, [~markhamstra] just emailed me Yes, I

[jira] [Commented] (SPARK-2177) describe table result contains only one column

2014-06-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037810#comment-14037810 ] Yin Huai commented on SPARK-2177: - We should also put what cases we support in the release

[jira] [Commented] (SPARK-2177) describe table result contains only one column

2014-06-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037809#comment-14037809 ] Yin Huai commented on SPARK-2177: - Generally Hive generates results of DDL statements as

[jira] [Created] (SPARK-2204) Scheduler for Mesos in fine-grained mode launches tasks on random executors

2014-06-19 Thread Sebastien Rainville (JIRA)
Sebastien Rainville created SPARK-2204: -- Summary: Scheduler for Mesos in fine-grained mode launches tasks on random executors Key: SPARK-2204 URL: https://issues.apache.org/jira/browse/SPARK-2204

[jira] [Updated] (SPARK-2204) Scheduler for Mesos in fine-grained mode launches tasks on random executors

2014-06-19 Thread Sebastien Rainville (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastien Rainville updated SPARK-2204: --- Fix Version/s: (was: 1.0.1) Scheduler for Mesos in fine-grained mode launches

[jira] [Commented] (SPARK-1800) Add broadcast hash join operator

2014-06-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037880#comment-14037880 ] Yin Huai commented on SPARK-1800: - Maybe add an improvement in future that tasks in the

[jira] [Updated] (SPARK-2204) Scheduler for Mesos in fine-grained mode launches tasks on random executors

2014-06-19 Thread Sebastien Rainville (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastien Rainville updated SPARK-2204: --- Description: MesosSchedulerBackend.resourceOffers(SchedulerDriver, List[Offer]) is

[jira] [Created] (SPARK-2205) Unnecessary exchange operators in a join on multiple tables with the same join key.

2014-06-19 Thread Yin Huai (JIRA)
Yin Huai created SPARK-2205: --- Summary: Unnecessary exchange operators in a join on multiple tables with the same join key. Key: SPARK-2205 URL: https://issues.apache.org/jira/browse/SPARK-2205 Project:

[jira] [Resolved] (SPARK-2191) Double execution with CREATE TABLE AS SELECT

2014-06-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-2191. Resolution: Fixed Fix Version/s: 1.1.0 1.0.1 Assignee: Michael

[jira] [Closed] (SPARK-1544) Add support for deep decision trees.

2014-06-19 Thread Manish Amde (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manish Amde closed SPARK-1544. -- The PR has been accepted. Add support for deep decision trees.

[jira] [Created] (SPARK-2206) Automatically infer the number of classification classes in multiclass classification

2014-06-19 Thread Manish Amde (JIRA)
Manish Amde created SPARK-2206: -- Summary: Automatically infer the number of classification classes in multiclass classification Key: SPARK-2206 URL: https://issues.apache.org/jira/browse/SPARK-2206

[jira] [Created] (SPARK-2207) Add minimum info gain and min instances per node as training parameters for decision tree

2014-06-19 Thread Manish Amde (JIRA)
Manish Amde created SPARK-2207: -- Summary: Add minimum info gain and min instances per node as training parameters for decision tree Key: SPARK-2207 URL: https://issues.apache.org/jira/browse/SPARK-2207

[jira] [Updated] (SPARK-2206) Automatically infer the number of classification classes in multiclass classification

2014-06-19 Thread Manish Amde (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manish Amde updated SPARK-2206: --- Target Version/s: 1.1.0 Affects Version/s: 1.0.0 Automatically infer the number of

[jira] [Updated] (SPARK-2207) Add minimum info gain and min instances per node as training parameters for decision tree

2014-06-19 Thread Manish Amde (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manish Amde updated SPARK-2207: --- Target Version/s: 1.1.0 Add minimum info gain and min instances per node as training parameters for

[jira] [Updated] (SPARK-2207) Add minimum information gain and minimum instances per node as training parameters for decision tree.

2014-06-19 Thread Manish Amde (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manish Amde updated SPARK-2207: --- Summary: Add minimum information gain and minimum instances per node as training parameters for

[jira] [Commented] (SPARK-2202) saveAsTextFile hangs on final 2 tasks

2014-06-19 Thread Suren Hiraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037979#comment-14037979 ] Suren Hiraman commented on SPARK-2202: -- So it turns out that when we remove all of

[jira] [Updated] (SPARK-2206) Automatically infer the number of classification classes in multiclass classification

2014-06-19 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2206: - Assignee: Manish Amde Automatically infer the number of classification classes in multiclass

[jira] [Updated] (SPARK-2207) Add minimum information gain and minimum instances per node as training parameters for decision tree.

2014-06-19 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2207: - Assignee: Manish Amde Add minimum information gain and minimum instances per node as training

[jira] [Updated] (SPARK-1547) Add gradient boosting algorithm to MLlib

2014-06-19 Thread Manish Amde (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manish Amde updated SPARK-1547: --- Target Version/s: 1.1.0 Add gradient boosting algorithm to MLlib

[jira] [Updated] (SPARK-1546) Add AdaBoost algorithm to Spark MLlib

2014-06-19 Thread Manish Amde (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manish Amde updated SPARK-1546: --- Affects Version/s: (was: 1.0.0) 1.1.0 Add AdaBoost algorithm to Spark

[jira] [Updated] (SPARK-1545) Add Random Forest algorithm to MLlib

2014-06-19 Thread Manish Amde (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manish Amde updated SPARK-1545: --- Target Version/s: 1.1.0 Add Random Forest algorithm to MLlib

[jira] [Updated] (SPARK-1536) Add multiclass classification support to MLlib

2014-06-19 Thread Manish Amde (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manish Amde updated SPARK-1536: --- Target Version/s: 1.1.0 Add multiclass classification support to MLlib

[jira] [Updated] (SPARK-2204) Scheduler for Mesos in fine-grained mode launches tasks on wrong executors

2014-06-19 Thread Sebastien Rainville (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastien Rainville updated SPARK-2204: --- Summary: Scheduler for Mesos in fine-grained mode launches tasks on wrong executors

[jira] [Commented] (SPARK-2202) saveAsTextFile hangs on final 2 tasks

2014-06-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038101#comment-14038101 ] Patrick Wendell commented on SPARK-2202: Yes, please do! saveAsTextFile hangs on

[jira] [Resolved] (SPARK-2151) spark-submit issue (int format expected for memory parameter)

2014-06-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-2151. Resolution: Fixed Fix Version/s: 1.1.0 1.0.1 Assignee: Nishkam

[jira] [Updated] (SPARK-2151) spark-submit issue (int format expected for memory parameter)

2014-06-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2151: --- Description: Get this exception when invoking spark-submit in standalone cluster mode: {code}

[jira] [Commented] (SPARK-2202) saveAsTextFile hangs on final 2 tasks

2014-06-19 Thread Suren Hiraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038156#comment-14038156 ] Suren Hiraman commented on SPARK-2202: -- Will do tomorrow. Interesting problem.

[jira] [Commented] (SPARK-2192) Examples Data Not in Binary Distribution

2014-06-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038200#comment-14038200 ] Patrick Wendell commented on SPARK-2192: It might be good to have all the example

[jira] [Created] (SPARK-2208) local metrics tests can fail on fast machines

2014-06-19 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-2208: -- Summary: local metrics tests can fail on fast machines Key: SPARK-2208 URL: https://issues.apache.org/jira/browse/SPARK-2208 Project: Spark Issue Type:

[jira] [Updated] (SPARK-2208) local metrics tests can fail on fast machines

2014-06-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2208: --- Labels: starter (was: ) local metrics tests can fail on fast machines

[jira] [Created] (SPARK-2209) Cast shouldn't do null check twice

2014-06-19 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-2209: -- Summary: Cast shouldn't do null check twice Key: SPARK-2209 URL: https://issues.apache.org/jira/browse/SPARK-2209 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-2209) Cast shouldn't do null check twice

2014-06-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038295#comment-14038295 ] Reynold Xin commented on SPARK-2209: https://github.com/apache/spark/pull/1143 Cast

[jira] [Commented] (SPARK-768) Fail a task when the remote block it is fetching is not serializable

2014-06-19 Thread Raymond Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038307#comment-14038307 ] Raymond Liu commented on SPARK-768: --- And for case 2, the problem is that current code

[jira] [Commented] (SPARK-1209) SparkHadoopUtil should not use package org.apache.hadoop

2014-06-19 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038328#comment-14038328 ] Mark Grover commented on SPARK-1209: ok, I will take over. Thanks Sandy.

[jira] [Commented] (SPARK-2208) local metrics tests can fail on fast machines

2014-06-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038414#comment-14038414 ] Patrick Wendell commented on SPARK-2208: A hotfix was merged here, but we should

[jira] [Commented] (SPARK-1949) Servlet 2.5 vs 3.0 conflict in SBT build

2014-06-19 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038441#comment-14038441 ] Andrew Ash commented on SPARK-1949: --- Sean's PR: https://github.com/apache/spark/pull/906

[jira] [Created] (SPARK-2210) cast to boolean on boolean value gets turned into NOT((boolean_condition) = 0)

2014-06-19 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-2210: -- Summary: cast to boolean on boolean value gets turned into NOT((boolean_condition) = 0) Key: SPARK-2210 URL: https://issues.apache.org/jira/browse/SPARK-2210 Project:

[jira] [Created] (SPARK-2212) HashJoin

2014-06-19 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-2212: Summary: HashJoin Key: SPARK-2212 URL: https://issues.apache.org/jira/browse/SPARK-2212 Project: Spark Issue Type: Sub-task Reporter: Cheng Hao

[jira] [Created] (SPARK-2211) Join Optimization

2014-06-19 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-2211: Summary: Join Optimization Key: SPARK-2211 URL: https://issues.apache.org/jira/browse/SPARK-2211 Project: Spark Issue Type: Improvement Components: SQL

[jira] [Created] (SPARK-2213) Sort Merge Join

2014-06-19 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-2213: Summary: Sort Merge Join Key: SPARK-2213 URL: https://issues.apache.org/jira/browse/SPARK-2213 Project: Spark Issue Type: Sub-task Reporter: Cheng Hao

[jira] [Created] (SPARK-2215) Multi-way join

2014-06-19 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-2215: Summary: Multi-way join Key: SPARK-2215 URL: https://issues.apache.org/jira/browse/SPARK-2215 Project: Spark Issue Type: Sub-task Components: SQL

[jira] [Created] (SPARK-2218) rename Equals to EqualsTo in Spark SQL expressions

2014-06-19 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-2218: -- Summary: rename Equals to EqualsTo in Spark SQL expressions Key: SPARK-2218 URL: https://issues.apache.org/jira/browse/SPARK-2218 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-2215) Multi-way join

2014-06-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038497#comment-14038497 ] Reynold Xin commented on SPARK-2215: I personally find multiway join operator

[jira] [Commented] (SPARK-2216) Cost-based join reordering

2014-06-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038494#comment-14038494 ] Reynold Xin commented on SPARK-2216: The prerequisite of this change is to design the

[jira] [Updated] (SPARK-2215) Multi-way join

2014-06-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2215: --- Priority: Minor (was: Major) Multi-way join -- Key: SPARK-2215

[jira] [Updated] (SPARK-2214) Broadcast Join (aka map join)

2014-06-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2214: --- Summary: Broadcast Join (aka map join) (was: MapSide Join) Broadcast Join (aka map join)