Michael Armbrust created SPARK-2189:
---
Summary: Method for removing temp tables created by registerAsTable
Key: SPARK-2189
URL: https://issues.apache.org/jira/browse/SPARK-2189
Project: Spark
Michael Armbrust created SPARK-2190:
---
Summary: Specialized ColumnType for Timestamp
Key: SPARK-2190
URL: https://issues.apache.org/jira/browse/SPARK-2190
Project: Spark
Issue Type: Bug
Michael Armbrust created SPARK-2191:
---
Summary: Double execution with CREATE TABLE AS SELECT
Key: SPARK-2191
URL: https://issues.apache.org/jira/browse/SPARK-2191
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-2193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Zhihui updated SPARK-2193:
--
Description:
Now, the last executor(s) may not get their preferred task(s), although these
tasks have build
[
https://issues.apache.org/jira/browse/SPARK-2193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Zhihui updated SPARK-2193:
--
Attachment: Improve Tasks Preferred Locality.pptx
Improve tasks' preferred locality by sorting tasks partial
[
https://issues.apache.org/jira/browse/SPARK-2193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037121#comment-14037121
]
Zhihui commented on SPARK-2193:
---
PR 1131 https://github.com/apache/spark/pull/1131
Improve
Michael Armbrust created SPARK-2194:
---
Summary: EC2 Scripts don't work in europe
Key: SPARK-2194
URL: https://issues.apache.org/jira/browse/SPARK-2194
Project: Spark
Issue Type: Bug
Michael Armbrust created SPARK-2195:
---
Summary: Parquet extraMetadata can contain key information
Key: SPARK-2195
URL: https://issues.apache.org/jira/browse/SPARK-2195
Project: Spark
Issue
Takuya Ueshin created SPARK-2196:
Summary: Fix nullability of CaseWhen.
Key: SPARK-2196
URL: https://issues.apache.org/jira/browse/SPARK-2196
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-2196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Takuya Ueshin updated SPARK-2196:
-
Description: {{CaseWhen}} should use {{branches.length}} to check if
{{elseValue}} is provided
[
https://issues.apache.org/jira/browse/SPARK-2196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037189#comment-14037189
]
Takuya Ueshin commented on SPARK-2196:
--
PRed:
wulin created SPARK-2197:
Summary: Spark invoke DecisionTree by Java
Key: SPARK-2197
URL: https://issues.apache.org/jira/browse/SPARK-2197
Project: Spark
Issue Type: Bug
Components: MLlib
Helena Edelson created SPARK-2198:
-
Summary: Partition the scala build file so that it is easier to
maintain
Key: SPARK-2198
URL: https://issues.apache.org/jira/browse/SPARK-2198
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Helena Edelson updated SPARK-2198:
--
Description:
Partition to standard Dependencies, Version, Settings, Publish.scala. keeping
[
https://issues.apache.org/jira/browse/SPARK-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Helena Edelson updated SPARK-2198:
--
Remaining Estimate: 2h (was: 1m)
Original Estimate: 2h (was: 1m)
Partition the scala
[
https://issues.apache.org/jira/browse/SPARK-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Helena Edelson updated SPARK-2198:
--
Remaining Estimate: 3h (was: 2h)
Original Estimate: 3h (was: 2h)
Partition the scala
[
https://issues.apache.org/jira/browse/SPARK-2194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Armbrust resolved SPARK-2194.
-
Resolution: Cannot Reproduce
After waiting a few hours the error message went away.
Denis Turdakov created SPARK-2199:
-
Summary: Distributed probabilistic latent semantic analysis in
MLlib
Key: SPARK-2199
URL: https://issues.apache.org/jira/browse/SPARK-2199
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-2199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Denis Turdakov updated SPARK-2199:
--
Description:
Probabilistic latent semantic analysis (PLSA) is a topic model which extracts
[
https://issues.apache.org/jira/browse/SPARK-2199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Denis Turdakov updated SPARK-2199:
--
Description:
Probabilistic latent semantic analysis (PLSA) is a topic model which extracts
[
https://issues.apache.org/jira/browse/SPARK-2200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037424#comment-14037424
]
Neville Li commented on SPARK-2200:
---
https://github.com/apache/spark/pull/940 addresses
Neville Li created SPARK-2200:
-
Summary: breeze DenseVector not serializable with KryoSerializer
Key: SPARK-2200
URL: https://issues.apache.org/jira/browse/SPARK-2200
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037431#comment-14037431
]
Mark Hamstra commented on SPARK-2198:
-
While this is an admirable goal, I'm afraid
sunshangchun created SPARK-2201:
---
Summary: Improve FlumeInputDStream
Key: SPARK-2201
URL: https://issues.apache.org/jira/browse/SPARK-2201
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037467#comment-14037467
]
Helena Edelson commented on SPARK-2198:
---
I am sad to hear that the Maven POMs will
[
https://issues.apache.org/jira/browse/SPARK-2051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Thomas Graves resolved SPARK-2051.
--
Resolution: Fixed
Fix Version/s: 1.1.0
spark.yarn.dist.* configs are not supported in
Suren Hiraman created SPARK-2202:
Summary: saveAsTextFile hangs on final 2 tasks
Key: SPARK-2202
URL: https://issues.apache.org/jira/browse/SPARK-2202
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-2200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037637#comment-14037637
]
Xiangrui Meng commented on SPARK-2200:
--
[~neville] Do you know the root cause and how
[
https://issues.apache.org/jira/browse/SPARK-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mark Hamstra updated SPARK-2126:
Assignee: Nan Zhu
Move MapOutputTracker behind ShuffleManager interface
[
https://issues.apache.org/jira/browse/SPARK-2199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037659#comment-14037659
]
Valeriy Avanesov commented on SPARK-2199:
-
Here is the implementation we currently
[
https://issues.apache.org/jira/browse/SPARK-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037692#comment-14037692
]
Patrick Wendell commented on SPARK-2126:
Hey All,
This proposal is a fairly hairy
[
https://issues.apache.org/jira/browse/SPARK-2180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037697#comment-14037697
]
William Benton commented on SPARK-2180:
---
PR is here:
[
https://issues.apache.org/jira/browse/SPARK-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037698#comment-14037698
]
Patrick Wendell commented on SPARK-2202:
I changed the priority because we usually
[
https://issues.apache.org/jira/browse/SPARK-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037696#comment-14037696
]
Patrick Wendell commented on SPARK-2202:
When the tasks are hanging, could you go
[
https://issues.apache.org/jira/browse/SPARK-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell updated SPARK-2202:
---
Priority: Major (was: Blocker)
saveAsTextFile hangs on final 2 tasks
[
https://issues.apache.org/jira/browse/SPARK-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037703#comment-14037703
]
Patrick Wendell commented on SPARK-2038:
Hey [~CodingCat] - I realized there is
[
https://issues.apache.org/jira/browse/SPARK-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell reopened SPARK-2038:
Don't shadow conf variable in saveAsHadoop functions
[
https://issues.apache.org/jira/browse/SPARK-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037739#comment-14037739
]
Nan Zhu commented on SPARK-2038:
[~pwendell] Yeah, it's a good idea; I just submitted a new PR:
[
https://issues.apache.org/jira/browse/SPARK-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037746#comment-14037746
]
Nan Zhu commented on SPARK-2126:
[~pwendell] Yes, [~markhamstra] just emailed me
Yes, I
[
https://issues.apache.org/jira/browse/SPARK-2177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037810#comment-14037810
]
Yin Huai commented on SPARK-2177:
-
We should also put what cases we support in the release
[
https://issues.apache.org/jira/browse/SPARK-2177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037809#comment-14037809
]
Yin Huai commented on SPARK-2177:
-
Generally Hive generates results of DDL statements as
Sebastien Rainville created SPARK-2204:
--
Summary: Scheduler for Mesos in fine-grained mode launches tasks
on random executors
Key: SPARK-2204
URL: https://issues.apache.org/jira/browse/SPARK-2204
[
https://issues.apache.org/jira/browse/SPARK-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastien Rainville updated SPARK-2204:
---
Fix Version/s: (was: 1.0.1)
Scheduler for Mesos in fine-grained mode launches
[
https://issues.apache.org/jira/browse/SPARK-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037880#comment-14037880
]
Yin Huai commented on SPARK-1800:
-
Maybe add an improvement in future that tasks in the
[
https://issues.apache.org/jira/browse/SPARK-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastien Rainville updated SPARK-2204:
---
Description: MesosSchedulerBackend.resourceOffers(SchedulerDriver,
List[Offer]) is
Yin Huai created SPARK-2205:
---
Summary: Unnecessary exchange operators in a join on multiple
tables with the same join key.
Key: SPARK-2205
URL: https://issues.apache.org/jira/browse/SPARK-2205
Project:
[
https://issues.apache.org/jira/browse/SPARK-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Reynold Xin resolved SPARK-2191.
Resolution: Fixed
Fix Version/s: 1.1.0
1.0.1
Assignee: Michael
[
https://issues.apache.org/jira/browse/SPARK-1544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Manish Amde closed SPARK-1544.
--
The PR has been accepted.
Add support for deep decision trees.
Manish Amde created SPARK-2206:
--
Summary: Automatically infer the number of classification classes
in multiclass classification
Key: SPARK-2206
URL: https://issues.apache.org/jira/browse/SPARK-2206
Manish Amde created SPARK-2207:
--
Summary: Add minimum info gain and min instances per node as
training parameters for decision tree
Key: SPARK-2207
URL: https://issues.apache.org/jira/browse/SPARK-2207
[
https://issues.apache.org/jira/browse/SPARK-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Manish Amde updated SPARK-2206:
---
Target Version/s: 1.1.0
Affects Version/s: 1.0.0
Automatically infer the number of
[
https://issues.apache.org/jira/browse/SPARK-2207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Manish Amde updated SPARK-2207:
---
Target Version/s: 1.1.0
Add minimum info gain and min instances per node as training parameters for
[
https://issues.apache.org/jira/browse/SPARK-2207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Manish Amde updated SPARK-2207:
---
Summary: Add minimum information gain and minimum instances per node as
training parameters for
[
https://issues.apache.org/jira/browse/SPARK-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14037979#comment-14037979
]
Suren Hiraman commented on SPARK-2202:
--
So it turns out that when we remove all of
[
https://issues.apache.org/jira/browse/SPARK-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matei Zaharia updated SPARK-2206:
-
Assignee: Manish Amde
Automatically infer the number of classification classes in multiclass
[
https://issues.apache.org/jira/browse/SPARK-2207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matei Zaharia updated SPARK-2207:
-
Assignee: Manish Amde
Add minimum information gain and minimum instances per node as training
[
https://issues.apache.org/jira/browse/SPARK-1547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Manish Amde updated SPARK-1547:
---
Target Version/s: 1.1.0
Add gradient boosting algorithm to MLlib
[
https://issues.apache.org/jira/browse/SPARK-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Manish Amde updated SPARK-1546:
---
Affects Version/s: (was: 1.0.0)
1.1.0
Add AdaBoost algorithm to Spark
[
https://issues.apache.org/jira/browse/SPARK-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Manish Amde updated SPARK-1545:
---
Target Version/s: 1.1.0
Add Random Forest algorithm to MLlib
[
https://issues.apache.org/jira/browse/SPARK-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Manish Amde updated SPARK-1536:
---
Target Version/s: 1.1.0
Add multiclass classification support to MLlib
[
https://issues.apache.org/jira/browse/SPARK-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastien Rainville updated SPARK-2204:
---
Summary: Scheduler for Mesos in fine-grained mode launches tasks on wrong
executors
[
https://issues.apache.org/jira/browse/SPARK-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038101#comment-14038101
]
Patrick Wendell commented on SPARK-2202:
Yes, please do!
saveAsTextFile hangs on
[
https://issues.apache.org/jira/browse/SPARK-2151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Reynold Xin resolved SPARK-2151.
Resolution: Fixed
Fix Version/s: 1.1.0
1.0.1
Assignee: Nishkam
[
https://issues.apache.org/jira/browse/SPARK-2151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Reynold Xin updated SPARK-2151:
---
Description:
Get this exception when invoking spark-submit in standalone cluster mode:
{code}
[
https://issues.apache.org/jira/browse/SPARK-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038156#comment-14038156
]
Suren Hiraman commented on SPARK-2202:
--
Will do tomorrow. Interesting problem.
[
https://issues.apache.org/jira/browse/SPARK-2192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038200#comment-14038200
]
Patrick Wendell commented on SPARK-2192:
It might be good to have all the example
Patrick Wendell created SPARK-2208:
--
Summary: local metrics tests can fail on fast machines
Key: SPARK-2208
URL: https://issues.apache.org/jira/browse/SPARK-2208
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-2208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell updated SPARK-2208:
---
Labels: starter (was: )
local metrics tests can fail on fast machines
Reynold Xin created SPARK-2209:
--
Summary: Cast shouldn't do null check twice
Key: SPARK-2209
URL: https://issues.apache.org/jira/browse/SPARK-2209
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-2209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038295#comment-14038295
]
Reynold Xin commented on SPARK-2209:
https://github.com/apache/spark/pull/1143
Cast
[
https://issues.apache.org/jira/browse/SPARK-768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038307#comment-14038307
]
Raymond Liu commented on SPARK-768:
---
And for case 2, the problem is that current code
[
https://issues.apache.org/jira/browse/SPARK-1209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038328#comment-14038328
]
Mark Grover commented on SPARK-1209:
ok, I will take over. Thanks Sandy.
[
https://issues.apache.org/jira/browse/SPARK-2208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038414#comment-14038414
]
Patrick Wendell commented on SPARK-2208:
A hotfix was merged here, but we should
[
https://issues.apache.org/jira/browse/SPARK-1949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038441#comment-14038441
]
Andrew Ash commented on SPARK-1949:
---
Sean's PR: https://github.com/apache/spark/pull/906
Reynold Xin created SPARK-2210:
--
Summary: cast to boolean on boolean value gets turned into
NOT((boolean_condition) = 0)
Key: SPARK-2210
URL: https://issues.apache.org/jira/browse/SPARK-2210
Project:
Cheng Hao created SPARK-2212:
Summary: HashJoin
Key: SPARK-2212
URL: https://issues.apache.org/jira/browse/SPARK-2212
Project: Spark
Issue Type: Sub-task
Reporter: Cheng Hao
Cheng Hao created SPARK-2211:
Summary: Join Optimization
Key: SPARK-2211
URL: https://issues.apache.org/jira/browse/SPARK-2211
Project: Spark
Issue Type: Improvement
Components: SQL
Cheng Hao created SPARK-2213:
Summary: Sort Merge Join
Key: SPARK-2213
URL: https://issues.apache.org/jira/browse/SPARK-2213
Project: Spark
Issue Type: Sub-task
Reporter: Cheng Hao
Cheng Hao created SPARK-2215:
Summary: Multi-way join
Key: SPARK-2215
URL: https://issues.apache.org/jira/browse/SPARK-2215
Project: Spark
Issue Type: Sub-task
Components: SQL
Reynold Xin created SPARK-2218:
--
Summary: rename Equals to EqualsTo in Spark SQL expressions
Key: SPARK-2218
URL: https://issues.apache.org/jira/browse/SPARK-2218
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038497#comment-14038497
]
Reynold Xin commented on SPARK-2215:
I personally find multiway join operator
[
https://issues.apache.org/jira/browse/SPARK-2216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038494#comment-14038494
]
Reynold Xin commented on SPARK-2216:
The prerequisite of this change is to design the
[
https://issues.apache.org/jira/browse/SPARK-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Reynold Xin updated SPARK-2215:
---
Priority: Minor (was: Major)
Multi-way join
--
Key: SPARK-2215
[
https://issues.apache.org/jira/browse/SPARK-2214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Reynold Xin updated SPARK-2214:
---
Summary: Broadcast Join (aka map join) (was: MapSide Join)
Broadcast Join (aka map join)