[jira] [Commented] (SPARK-5387) parquet writer runs into OOM during writing when number of rows is large

2015-03-18 Thread Chaozhong Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14366973#comment-14366973 ] Chaozhong Yang commented on SPARK-5387: --- I also encountered the same issue.

[jira] [Commented] (SPARK-5818) unable to use add jar in hql

2015-03-18 Thread Venkata Ramana G (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14366811#comment-14366811 ] Venkata Ramana G commented on SPARK-5818: - TranslatingClassLoader is used for

[jira] [Created] (SPARK-6396) Add timeout control for broadcast

2015-03-18 Thread Jun Fang (JIRA)
Jun Fang created SPARK-6396: --- Summary: Add timeout control for broadcast Key: SPARK-6396 URL: https://issues.apache.org/jira/browse/SPARK-6396 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-6388) Spark 1.3 + Hadoop 2.6 Can't work on Java 8_40

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14366982#comment-14366982 ] Sean Owen commented on SPARK-6388: -- I am just using JDK8 to compile not targeting Java 8

[jira] [Created] (SPARK-6397) Check the missingInput simply

2015-03-18 Thread Yadong Qi (JIRA)
Yadong Qi created SPARK-6397: Summary: Check the missingInput simply Key: SPARK-6397 URL: https://issues.apache.org/jira/browse/SPARK-6397 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-6396) Add timeout control for broadcast

2015-03-18 Thread Dale Richardson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14366799#comment-14366799 ] Dale Richardson commented on SPARK-6396: If nobody else is looking at this one

[jira] [Commented] (SPARK-6397) Check the missingInput simply

2015-03-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14366918#comment-14366918 ] Apache Spark commented on SPARK-6397: - User 'watermen' has created a pull request for

[jira] [Updated] (SPARK-6195) Specialized in-memory column type for fixed-precision decimal

2015-03-18 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6195: -- Summary: Specialized in-memory column type for fixed-precision decimal (was: Specialized in-memory

[jira] [Updated] (SPARK-6195) Specialized in-memory column type for decimal

2015-03-18 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6195: -- Description: When building in-memory columnar representation, decimal values are currently serialized

[jira] [Commented] (SPARK-6396) Add timeout control for broadcast

2015-03-18 Thread Jun Fang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14366836#comment-14366836 ] Jun Fang commented on SPARK-6396: - I am working on it right now, sooner i will give a pull

[jira] [Updated] (SPARK-6397) Check the missingInput simply

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6397: - Priority: Minor (was: Major) There's no description and no real info in the title. JIRAs need to state

[jira] [Commented] (SPARK-6396) Add timeout control for broadcast

2015-03-18 Thread Dale Richardson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14366849#comment-14366849 ] Dale Richardson commented on SPARK-6396: No problems. Add timeout control for

[jira] [Commented] (SPARK-6398) Improve utility of GaussianMixture for higer dimensional data

2015-03-18 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367028#comment-14367028 ] Travis Galoppo commented on SPARK-6398: --- Please assign to me Improve utility of

[jira] [Updated] (SPARK-6372) spark-submit --conf is not being propagated to child processes

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6372: - Assignee: Marcelo Vanzin spark-submit --conf is not being propagated to child processes

[jira] [Commented] (SPARK-5874) How to improve the current ML pipeline API?

2015-03-18 Thread Abou Haydar Elias (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367089#comment-14367089 ] Abou Haydar Elias commented on SPARK-5874: -- The tokenizer as for now converts the

[jira] [Updated] (SPARK-6325) YarnAllocator crash with dynamic allocation on

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6325: - Assignee: Marcelo Vanzin YarnAllocator crash with dynamic allocation on

[jira] [Resolved] (SPARK-6325) YarnAllocator crash with dynamic allocation on

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6325. -- Resolution: Fixed Fix Version/s: 1.3.1 1.4.0 Issue resolved by pull request

[jira] [Created] (SPARK-6398) Improve utility of GaussianMixture for higer dimensional data

2015-03-18 Thread Travis Galoppo (JIRA)
Travis Galoppo created SPARK-6398: - Summary: Improve utility of GaussianMixture for higer dimensional data Key: SPARK-6398 URL: https://issues.apache.org/jira/browse/SPARK-6398 Project: Spark

[jira] [Created] (SPARK-6399) Code compiled against 1.3.0 may not run against older Spark versions

2015-03-18 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-6399: - Summary: Code compiled against 1.3.0 may not run against older Spark versions Key: SPARK-6399 URL: https://issues.apache.org/jira/browse/SPARK-6399 Project: Spark

[jira] [Commented] (SPARK-6096) Support model save/load in Python's naive Bayes

2015-03-18 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367125#comment-14367125 ] Xusen Yin commented on SPARK-6096: -- [~mengxr] Pls assign it to me. Support model

[jira] [Resolved] (SPARK-6286) Handle TASK_ERROR in TaskState

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6286. -- Resolution: Fixed Fix Version/s: 1.3.1 1.4.0 Assignee: Iulian Dragos

[jira] [Resolved] (SPARK-6372) spark-submit --conf is not being propagated to child processes

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6372. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5057

[jira] [Resolved] (SPARK-6389) YARN app diagnostics report doesn't report NPEs

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6389. -- Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Steve Loughran Resolved by

[jira] [Resolved] (SPARK-4416) Support Mesos framework authentication

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4416. -- Resolution: Duplicate Support Mesos framework authentication --

[jira] [Commented] (SPARK-5874) How to improve the current ML pipeline API?

2015-03-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367309#comment-14367309 ] Xiangrui Meng commented on SPARK-5874: -- [~Elie A.] Thanks for your feedback! This

[jira] [Created] (SPARK-6400) It would be great if you could share your test jars in Maven central repository for the Spark SQL module

2015-03-18 Thread JIRA
Óscar Puertas created SPARK-6400: Summary: It would be great if you could share your test jars in Maven central repository for the Spark SQL module Key: SPARK-6400 URL:

[jira] [Updated] (SPARK-6096) Support model save/load in Python's naive Bayes

2015-03-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6096: - Assignee: Xusen Yin Support model save/load in Python's naive Bayes

[jira] [Commented] (SPARK-6096) Support model save/load in Python's naive Bayes

2015-03-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367280#comment-14367280 ] Xiangrui Meng commented on SPARK-6096: -- Done. Support model save/load in Python's

[jira] [Created] (SPARK-6401) Unable to load a old API input format in Spark streaming

2015-03-18 Thread JIRA
Rémy DUBOIS created SPARK-6401: -- Summary: Unable to load a old API input format in Spark streaming Key: SPARK-6401 URL: https://issues.apache.org/jira/browse/SPARK-6401 Project: Spark Issue

[jira] [Created] (SPARK-6402) EC2 script and job scheduling documentation still refer to Shark

2015-03-18 Thread Pierre Borckmans (JIRA)
Pierre Borckmans created SPARK-6402: --- Summary: EC2 script and job scheduling documentation still refer to Shark Key: SPARK-6402 URL: https://issues.apache.org/jira/browse/SPARK-6402 Project: Spark

[jira] [Commented] (SPARK-6402) EC2 script and job scheduling documentation still refer to Shark

2015-03-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367440#comment-14367440 ] Apache Spark commented on SPARK-6402: - User 'pierre-borckmans' has created a pull

[jira] [Commented] (SPARK-6397) Check the missingInput simply

2015-03-18 Thread Santiago M. Mola (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367437#comment-14367437 ] Santiago M. Mola commented on SPARK-6397: - I think a proper title would be:

[jira] [Resolved] (SPARK-6374) Add getter for GeneralizedLinearAlgorithm

2015-03-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6374. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5058

[jira] [Commented] (SPARK-3632) ConnectionManager can run out of receive threads with authentication on

2015-03-18 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367863#comment-14367863 ] Thomas Graves commented on SPARK-3632: -- [~andrewor14] At this point doesn't seem like

[jira] [Created] (SPARK-6403) Launch master as spot instance on EC2

2015-03-18 Thread Adam Vogel (JIRA)
Adam Vogel created SPARK-6403: - Summary: Launch master as spot instance on EC2 Key: SPARK-6403 URL: https://issues.apache.org/jira/browse/SPARK-6403 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-6192) Enhance MLlib's Python API (GSoC 2015)

2015-03-18 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367701#comment-14367701 ] Manoj Kumar commented on SPARK-6192: Thanks for your feedback. I've fixed it up (same

[jira] [Commented] (SPARK-5078) Allow setting Akka host name from env vars

2015-03-18 Thread Timothy St. Clair (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367704#comment-14367704 ] Timothy St. Clair commented on SPARK-5078: -- Cross listing details of the issue

[jira] [Created] (SPARK-6395) Rebuild the schema from a GenericRow

2015-03-18 Thread Chen Song (JIRA)
Chen Song created SPARK-6395: Summary: Rebuild the schema from a GenericRow Key: SPARK-6395 URL: https://issues.apache.org/jira/browse/SPARK-6395 Project: Spark Issue Type: Task

[jira] [Created] (SPARK-6404) Call broadcast() in each interval for spark streaming programs.

2015-03-18 Thread Yifan Wang (JIRA)
Yifan Wang created SPARK-6404: - Summary: Call broadcast() in each interval for spark streaming programs. Key: SPARK-6404 URL: https://issues.apache.org/jira/browse/SPARK-6404 Project: Spark

[jira] [Commented] (SPARK-6406) Launcher backward compatibility issues

2015-03-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368380#comment-14368380 ] Apache Spark commented on SPARK-6406: - User 'nishkamravi2' has created a pull request

[jira] [Comment Edited] (SPARK-6152) Spark does not support Java 8 compiled Scala classes

2015-03-18 Thread Jonathan Neufeld (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368138#comment-14368138 ] Jonathan Neufeld edited comment on SPARK-6152 at 3/18/15 11:31 PM:

[jira] [Created] (SPARK-6405) Spark Kryo buffer should be forced to be max. 2GB

2015-03-18 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-6405: - Summary: Spark Kryo buffer should be forced to be max. 2GB Key: SPARK-6405 URL: https://issues.apache.org/jira/browse/SPARK-6405 Project: Spark Issue Type:

[jira] [Commented] (SPARK-6152) Spark does not support Java 8 compiled Scala classes

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368242#comment-14368242 ] Sean Owen commented on SPARK-6152: -- To your deleted comment -- yes indeed it looks like

[jira] [Commented] (SPARK-6404) Call broadcast() in each interval for spark streaming programs.

2015-03-18 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368283#comment-14368283 ] Saisai Shao commented on SPARK-6404: Hi [~heavens...@gmail.com], I think for current

[jira] [Updated] (SPARK-6397) Override QueryPlan.missingInput when necessary and rely on CheckAnalysis

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6397: - Description: Currently, some LogicalPlans do not override missingInput, but they should. Then, the lack

[jira] [Commented] (SPARK-6401) Unable to load a old API input format in Spark streaming

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368241#comment-14368241 ] Sean Owen commented on SPARK-6401: -- Yeah it would be more consistent. I suppose I'd be

[jira] [Commented] (SPARK-6152) Spark does not support Java 8 compiled Scala classes

2015-03-18 Thread Jonathan Neufeld (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368138#comment-14368138 ] Jonathan Neufeld commented on SPARK-6152: - The exception is raised in

[jira] [Commented] (SPARK-6404) Call broadcast() in each interval for spark streaming programs.

2015-03-18 Thread Yifan Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368383#comment-14368383 ] Yifan Wang commented on SPARK-6404: --- I got an error. Is that expected? {code} Traceback

[jira] [Comment Edited] (SPARK-6404) Call broadcast() in each interval for spark streaming programs.

2015-03-18 Thread Yifan Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368383#comment-14368383 ] Yifan Wang edited comment on SPARK-6404 at 3/19/15 2:28 AM: I

[jira] [Created] (SPARK-6406) Launcher backward compatibility issues

2015-03-18 Thread Nishkam Ravi (JIRA)
Nishkam Ravi created SPARK-6406: --- Summary: Launcher backward compatibility issues Key: SPARK-6406 URL: https://issues.apache.org/jira/browse/SPARK-6406 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-6394) cleanup BlockManager companion object and improve the getCacheLocs method in DAGScheduler

2015-03-18 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6394. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5043

[jira] [Updated] (SPARK-6394) cleanup BlockManager companion object and improve the getCacheLocs method in DAGScheduler

2015-03-18 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-6394: -- Assignee: Wenchen Fan cleanup BlockManager companion object and improve the getCacheLocs method in

[jira] [Issue Comment Deleted] (SPARK-6152) Spark does not support Java 8 compiled Scala classes

2015-03-18 Thread Jonathan Neufeld (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Neufeld updated SPARK-6152: Comment: was deleted (was: The exception is raised in

[jira] [Updated] (SPARK-6146) Support more datatype in SqlParser

2015-03-18 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6146: Target Version/s: 1.3.1 (was: 1.4.0) Support more datatype in SqlParser

[jira] [Updated] (SPARK-5911) Make Column.cast(to: String) support fixed precision and scale decimal type

2015-03-18 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5911: Target Version/s: 1.3.1 (was: 1.4.0) Make Column.cast(to: String) support fixed precision and scale

[jira] [Commented] (SPARK-6146) Support more datatype in SqlParser

2015-03-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368204#comment-14368204 ] Apache Spark commented on SPARK-6146: - User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-5911) Make Column.cast(to: String) support fixed precision and scale decimal type

2015-03-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368205#comment-14368205 ] Apache Spark commented on SPARK-5911: - User 'yhuai' has created a pull request for

[jira] [Comment Edited] (SPARK-5508) Arrays and Maps stored with Hive Parquet Serde may not be able to read by the Parquet support in the Data Souce API

2015-03-18 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368505#comment-14368505 ] Yin Huai edited comment on SPARK-5508 at 3/19/15 5:00 AM: -- Seems

[jira] [Commented] (SPARK-5508) Arrays and Maps stored with Hive Parquet Serde may not be able to read by the Parquet support in the Data Souce API

2015-03-18 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368493#comment-14368493 ] Yin Huai commented on SPARK-5508: - I tried the following snippet in sparkShell {code}

[jira] [Updated] (SPARK-6407) Streaming ALS for Collaborative Filtering

2015-03-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-6407: Description: Like MLLib's ALS implementation for recommendation, and applying to streaming. Similar

[jira] [Commented] (SPARK-5508) Arrays and Maps stored with Hive Parquet Serde may not be able to read by the Parquet support in the Data Souce API

2015-03-18 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368505#comment-14368505 ] Yin Huai commented on SPARK-5508: - Seems the root cause of this problem is the array

[jira] [Commented] (SPARK-6404) Call broadcast() in each interval for spark streaming programs.

2015-03-18 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368409#comment-14368409 ] Saisai Shao commented on SPARK-6404: Would you please paste your code snippet to

[jira] [Commented] (SPARK-6200) Support dialect in SQL

2015-03-18 Thread haiyang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368486#comment-14368486 ] haiyang commented on SPARK-6200: You are right! In fact,I haven't add the corresponding

[jira] [Created] (SPARK-6407) Streaming ALS for Collaborative Filtering

2015-03-18 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-6407: --- Summary: Streaming ALS for Collaborative Filtering Key: SPARK-6407 URL: https://issues.apache.org/jira/browse/SPARK-6407 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-4258) NPE with new Parquet Filters

2015-03-18 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368528#comment-14368528 ] Yin Huai commented on SPARK-4258: - [~liancheng] Does the version that we are currently

[jira] [Updated] (SPARK-4485) Add broadcast outer join to optimize left outer join and right outer join

2015-03-18 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4485: Target Version/s: 1.4.0 (was: 1.1.0) Add broadcast outer join to optimize left outer join and right

[jira] [Updated] (SPARK-6374) Add getter for GeneralizedLinearAlgorithm

2015-03-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6374: - Assignee: yuhao yang Add getter for GeneralizedLinearAlgorithm

[jira] [Updated] (SPARK-6401) Unable to load a old API input format in Spark streaming

2015-03-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6401: - Priority: Minor (was: Major) You mean the .mapred. MapReduce API right? I think it's not unreasonable to

[jira] [Comment Edited] (SPARK-6401) Unable to load a old API input format in Spark streaming

2015-03-18 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-6401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367789#comment-14367789 ] Rémy DUBOIS edited comment on SPARK-6401 at 3/18/15 8:18 PM: -

[jira] [Commented] (SPARK-6168) Expose some of the collection classes as DeveloperApi

2015-03-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367787#comment-14367787 ] Apache Spark commented on SPARK-6168: - User 'mridulm' has created a pull request for

[jira] [Commented] (SPARK-6401) Unable to load a old API input format in Spark streaming

2015-03-18 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-6401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367789#comment-14367789 ] Rémy DUBOIS commented on SPARK-6401: Yes I mean the mapred API. All our input formats

[jira] [Updated] (SPARK-6404) Call broadcast() in each interval for spark streaming programs.

2015-03-18 Thread Yifan Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yifan Wang updated SPARK-6404: -- Description: If I understand it correctly, Spark’s broadcast() function will be called only once at the

[jira] [Commented] (SPARK-5945) Spark should not retry a stage infinitely on a FetchFailedException

2015-03-18 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367891#comment-14367891 ] Ilya Ganelin commented on SPARK-5945: - Hi Imran - I'd be happy to tackle this. Could

[jira] [Commented] (SPARK-5932) Use consistent naming for byte properties

2015-03-18 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367919#comment-14367919 ] Ilya Ganelin commented on SPARK-5932: - [~andrewor14] - I can take this out. Thanks.

[jira] [Comment Edited] (SPARK-5931) Use consistent naming for time properties

2015-03-18 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367915#comment-14367915 ] Ilya Ganelin edited comment on SPARK-5931 at 3/18/15 9:09 PM: --

[jira] [Commented] (SPARK-5931) Use consistent naming for time properties

2015-03-18 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367915#comment-14367915 ] Ilya Ganelin commented on SPARK-5931: - @andrewor - I can take this out. Thanks. Use

[jira] [Commented] (SPARK-6364) hashCode and equals for Matrices

2015-03-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14366720#comment-14366720 ] Apache Spark commented on SPARK-6364: - User 'MechCoder' has created a pull request for