[jira] [Commented] (SPARK-1394) calling system.platform on worker raises IOError

2014-04-04 Thread Idan Zalzberg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13959735#comment-13959735 ] Idan Zalzberg commented on SPARK-1394: -- This seems to be related to the way the

[jira] [Commented] (SPARK-1413) Parquet messes up stdout and stdin when used in Spark REPL

2014-04-04 Thread witgo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13959784#comment-13959784 ] witgo commented on SPARK-1413: -- Try [the PR 325|https://github.com/apache/spark/pull/325]

[jira] [Commented] (SPARK-1391) BlockManager cannot transfer blocks larger than 2G in size

2014-04-04 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13960100#comment-13960100 ] Shivaram Venkataraman commented on SPARK-1391: -- Thanks for the patch. I will

[jira] [Resolved] (SPARK-1383) Spark-SQL: ParquetRelation improvements

2014-04-04 Thread Andre Schumacher (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andre Schumacher resolved SPARK-1383. - Resolution: Fixed Fixed by

[jira] [Resolved] (SPARK-1133) Add a new small files input for MLlib, which will return an RDD[(fileName, content)]

2014-04-04 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1133. -- Resolution: Fixed Fix Version/s: 1.0.0 Add a new small files input for MLlib, which

[jira] [Commented] (SPARK-1366) The sql function should be consistent between different types of SQLContext

2014-04-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13960292#comment-13960292 ] Michael Armbrust commented on SPARK-1366: -

[jira] [Assigned] (SPARK-1414) Python API for SparkContext.wholeTextFiles

2014-04-04 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia reassigned SPARK-1414: Assignee: Matei Zaharia Python API for SparkContext.wholeTextFiles

[jira] [Created] (SPARK-1416) Add support for SequenceFiles in PySpark

2014-04-04 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1416: Summary: Add support for SequenceFiles in PySpark Key: SPARK-1416 URL: https://issues.apache.org/jira/browse/SPARK-1416 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-1056) Header comment in Executor incorrectly implies it's not used for YARN

2014-04-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza reassigned SPARK-1056: - Assignee: Sandy Ryza (was: Sandy Pérez González) Header comment in Executor incorrectly

[jira] [Assigned] (SPARK-1033) Ask for cores in Yarn container requests

2014-04-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza reassigned SPARK-1033: - Assignee: Sandy Ryza (was: Sandy Pérez González) Ask for cores in Yarn container requests

[jira] [Assigned] (SPARK-1211) In ApplicationMaster, set spark.master system property to yarn-cluster

2014-04-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza reassigned SPARK-1211: - Assignee: Sandy Ryza (was: Sandy Pérez González) In ApplicationMaster, set spark.master system

[jira] [Assigned] (SPARK-1197) Rename yarn-standalone and fix up docs for running on YARN

2014-04-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza reassigned SPARK-1197: - Assignee: Sandy Ryza (was: Sandy Pérez González) Rename yarn-standalone and fix up docs for

[jira] [Assigned] (SPARK-1417) Spark on Yarn - spark UI link from resourcemanager is broken

2014-04-04 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves reassigned SPARK-1417: Assignee: Thomas Graves Spark on Yarn - spark UI link from resourcemanager is broken

[jira] [Commented] (SPARK-1399) Reason for Stage Failure should be shown in UI

2014-04-04 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13960790#comment-13960790 ] Kay Ousterhout commented on SPARK-1399: --- FYI this outstanding pull request changes

[jira] [Created] (SPARK-1419) Apache parent POM to version 14

2014-04-04 Thread Mark Hamstra (JIRA)
Mark Hamstra created SPARK-1419: --- Summary: Apache parent POM to version 14 Key: SPARK-1419 URL: https://issues.apache.org/jira/browse/SPARK-1419 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-1198) Allow pipes tasks to run in different sub-directories

2014-04-04 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1198. -- Resolution: Fixed Fix Version/s: 1.0.0 Allow pipes tasks to run in different

[jira] [Assigned] (SPARK-1415) Add a minSplits parameter to wholeTextFiles

2014-04-04 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin reassigned SPARK-1415: Assignee: Xusen Yin Add a minSplits parameter to wholeTextFiles

[jira] [Assigned] (SPARK-1216) Add a OneHotEncoder for handling categorical features

2014-04-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza reassigned SPARK-1216: - Assignee: Sandy Ryza (was: Sandy Pérez González) Add a OneHotEncoder for handling categorical

[jira] [Commented] (SPARK-1415) Add a minSplits parameter to wholeTextFiles

2014-04-04 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13960908#comment-13960908 ] Xusen Yin commented on SPARK-1415: -- Hi Matei, I just looked around in those Hadoop APIs.

[jira] [Resolved] (SPARK-1419) Apache parent POM to version 14

2014-04-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-1419. Resolution: Fixed Fix Version/s: 1.0.0 Apache parent POM to version 14

[jira] [Commented] (SPARK-1402) 3 more compression algorithms for in-memory columnar storage

2014-04-04 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13960937#comment-13960937 ] Cheng Lian commented on SPARK-1402: --- Corresponding PR:

[jira] [Updated] (SPARK-922) Update Spark AMI to Python 2.7

2014-04-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-922: -- Issue Type: Task (was: Improvement) Update Spark AMI to Python 2.7

[jira] [Resolved] (SPARK-1305) Support persisting RDD's directly to Tachyon

2014-04-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-1305. Resolution: Fixed Support persisting RDD's directly to Tachyon