[jira] [Commented] (SPARK-6118) making package name of deploy.worker.CommandUtils and deploy.CommandUtilsSuite consistent

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343791#comment-14343791 ] Apache Spark commented on SPARK-6118: - User 'CodingCat' has created a pull request for

[jira] [Commented] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2015-03-02 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343829#comment-14343829 ] RJ Nowling commented on SPARK-2308: --- Ok, we should mark the status of the JIRA as won't

[jira] [Commented] (SPARK-6121) Python DataFrame type inference for LabeledPoint gets wrong type

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343944#comment-14343944 ] Apache Spark commented on SPARK-6121: - User 'mengxr' has created a pull request for

[jira] [Updated] (SPARK-1391) BlockManager cannot transfer blocks larger than 2G in size

2015-03-02 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-1391: Attachment: BlockLimitDesign.pdf design doc BlockManager cannot transfer blocks larger than 2G in

[jira] [Updated] (SPARK-6115) Description for SparkSQL Jobs doesn't show up correctly until after the job finishes

2015-03-02 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-6115: -- Attachment: description_sql.tiff Description for SparkSQL Jobs doesn't show up correctly until

[jira] [Created] (SPARK-6116) Making DataFrame API non-experimental

2015-03-02 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-6116: -- Summary: Making DataFrame API non-experimental Key: SPARK-6116 URL: https://issues.apache.org/jira/browse/SPARK-6116 Project: Spark Issue Type: Task

[jira] [Commented] (SPARK-1391) BlockManager cannot transfer blocks larger than 2G in size

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343817#comment-14343817 ] Apache Spark commented on SPARK-1391: - User 'squito' has created a pull request for

[jira] [Commented] (SPARK-6114) Explode on nested field fails

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343754#comment-14343754 ] Apache Spark commented on SPARK-6114: - User 'marmbrus' has created a pull request for

[jira] [Commented] (SPARK-3151) DiskStore attempts to map any size BlockId without checking MappedByteBuffer limit

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343818#comment-14343818 ] Apache Spark commented on SPARK-3151: - User 'squito' has created a pull request for

[jira] [Created] (SPARK-6120) DecisionTree.save uses too much Java heap space for default spark shell settings

2015-03-02 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6120: Summary: DecisionTree.save uses too much Java heap space for default spark shell settings Key: SPARK-6120 URL: https://issues.apache.org/jira/browse/SPARK-6120

[jira] [Updated] (SPARK-6121) Python DataFrame type inference for LabeledPoint gets wrong type

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6121: - Priority: Critical (was: Minor) Python DataFrame type inference for LabeledPoint gets

[jira] [Updated] (SPARK-6121) Python DataFrame type inference for LabeledPoint gets wrong type

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6121: - Assignee: Xiangrui Meng Python DataFrame type inference for LabeledPoint gets wrong type

[jira] [Created] (SPARK-6122) Upgrade Tachyon dependency to 0.6.0

2015-03-02 Thread Haoyuan Li (JIRA)
Haoyuan Li created SPARK-6122: - Summary: Upgrade Tachyon dependency to 0.6.0 Key: SPARK-6122 URL: https://issues.apache.org/jira/browse/SPARK-6122 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-3621) Provide a way to broadcast an RDD (instead of just a variable made of the RDD) so that a job can access

2015-03-02 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343977#comment-14343977 ] Xuefu Zhang commented on SPARK-3621: {quote} you can go a step further if you wanted

[jira] [Resolved] (SPARK-6040) Fix the percent bug in tablesample

2015-03-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6040. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4789

[jira] [Created] (SPARK-6118) making package name of deploy.worker.CommandUtils and deploy.CommandUtilsSuite consistent

2015-03-02 Thread Nan Zhu (JIRA)
Nan Zhu created SPARK-6118: -- Summary: making package name of deploy.worker.CommandUtils and deploy.CommandUtilsSuite consistent Key: SPARK-6118 URL: https://issues.apache.org/jira/browse/SPARK-6118 Project:

[jira] [Commented] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2015-03-02 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343804#comment-14343804 ] RJ Nowling commented on SPARK-2308: --- [~derrickburns] and [~mengxr] Is work still being

[jira] [Issue Comment Deleted] (SPARK-5953) NoSuchMethodException with a Kafka input stream and custom decoder in Scala

2015-03-02 Thread Aleksandar Stojadinovic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aleksandar Stojadinovic updated SPARK-5953: --- Comment: was deleted (was: I'm sorry to report that putting the decoder in

[jira] [Resolved] (SPARK-6050) Spark on YARN does not work --executor-cores is specified

2015-03-02 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-6050. -- Resolution: Fixed Fix Version/s: 1.4.0 1.3.0 Spark on YARN does not

[jira] [Updated] (SPARK-6120) DecisionTree.save uses too much Java heap space for default spark shell settings

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6120: - Description: When the Python DecisionTree example in the programming guide is run, it

[jira] [Updated] (SPARK-6120) DecisionTree.save uses too much Java heap space for default spark shell settings

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6120: - Description: When the Python DecisionTree example in the programming guide is run, it

[jira] [Created] (SPARK-6123) Parquet reader should use the schema of every file to create converter

2015-03-02 Thread Yin Huai (JIRA)
Yin Huai created SPARK-6123: --- Summary: Parquet reader should use the schema of every file to create converter Key: SPARK-6123 URL: https://issues.apache.org/jira/browse/SPARK-6123 Project: Spark

[jira] [Commented] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2015-03-02 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343823#comment-14343823 ] Derrick Burns commented on SPARK-2308: -- I'm not proceeding with the PR. Fyi, you

[jira] [Updated] (SPARK-6121) Python DataFrame type inference for LabeledPoint gets wrong type

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6121: - Target Version/s: 1.3.0 (was: 1.4.0) Python DataFrame type inference for LabeledPoint

[jira] [Created] (SPARK-6117) describe function for summary statistics

2015-03-02 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-6117: -- Summary: describe function for summary statistics Key: SPARK-6117 URL: https://issues.apache.org/jira/browse/SPARK-6117 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-5953) NoSuchMethodException with a Kafka input stream and custom decoder in Scala

2015-03-02 Thread Aleksandar Stojadinovic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343826#comment-14343826 ] Aleksandar Stojadinovic commented on SPARK-5953: I'm sorry to report that

[jira] [Commented] (SPARK-5953) NoSuchMethodException with a Kafka input stream and custom decoder in Scala

2015-03-02 Thread Aleksandar Stojadinovic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343825#comment-14343825 ] Aleksandar Stojadinovic commented on SPARK-5953: I'm sorry to report that

[jira] [Resolved] (SPARK-4992) Prominent Python example has bad, beginner style

2015-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4992. -- Resolution: Fixed Assignee: Sean Owen I committed this patch to the web site. I haven't marked a

[jira] [Commented] (SPARK-1391) BlockManager cannot transfer blocks larger than 2G in size

2015-03-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343900#comment-14343900 ] Reynold Xin commented on SPARK-1391: @squito if you want to attempt something this

[jira] [Commented] (SPARK-1548) Add Partial Random Forest algorithm to MLlib

2015-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343980#comment-14343980 ] Sean Owen commented on SPARK-1548: -- Same question, curious about the status as there

[jira] [Resolved] (SPARK-5390) Encourage users to post on Stack Overflow in Community Docs

2015-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5390. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4843

[jira] [Updated] (SPARK-6117) describe function for summary statistics

2015-03-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6117: --- Description: DataFrame.describe should return a DataFrame with summary statistics. {code} def

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2015-03-02 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343806#comment-14343806 ] RJ Nowling commented on SPARK-2429: --- [~yuu.ishik...@gmail.com] are you still working on

[jira] [Resolved] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2015-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2308. -- Resolution: Won't Fix Yes I think there are a few factors that may mean a number of these JIRAs are

[jira] [Commented] (SPARK-1546) Add AdaBoost algorithm to Spark MLlib

2015-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343974#comment-14343974 ] Sean Owen commented on SPARK-1546: -- I'm curious about the status of this and other

[jira] [Created] (SPARK-6119) missing data support

2015-03-02 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-6119: -- Summary: missing data support Key: SPARK-6119 URL: https://issues.apache.org/jira/browse/SPARK-6119 Project: Spark Issue Type: Sub-task Reporter:

[jira] [Comment Edited] (SPARK-1391) BlockManager cannot transfer blocks larger than 2G in size

2015-03-02 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14338648#comment-14338648 ] Imran Rashid edited comment on SPARK-1391 at 3/2/15 10:10 PM: --

[jira] [Updated] (SPARK-6120) DecisionTree.save uses too much Java heap space for default spark shell settings

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6120: - Description: When the Python DecisionTree example in the programming guide is run, it

[jira] [Created] (SPARK-6115) Description for SparkSQL Jobs doesn't show up correctly until after the job finishes

2015-03-02 Thread Kay Ousterhout (JIRA)
Kay Ousterhout created SPARK-6115: - Summary: Description for SparkSQL Jobs doesn't show up correctly until after the job finishes Key: SPARK-6115 URL: https://issues.apache.org/jira/browse/SPARK-6115

[jira] [Commented] (SPARK-6112) Leverage HDFS RAM_DISK capacity to provide off_heap feature similar to Tachyon

2015-03-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343808#comment-14343808 ] Zhan Zhang commented on SPARK-6112: --- Will start scoping it. Leverage HDFS RAM_DISK

[jira] [Comment Edited] (SPARK-1391) BlockManager cannot transfer blocks larger than 2G in size

2015-03-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343900#comment-14343900 ] Reynold Xin edited comment on SPARK-1391 at 3/2/15 10:34 PM: -

[jira] [Created] (SPARK-6121) Python DataFrame type inference for LabeledPoint gets wrong type

2015-03-02 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6121: Summary: Python DataFrame type inference for LabeledPoint gets wrong type Key: SPARK-6121 URL: https://issues.apache.org/jira/browse/SPARK-6121 Project:

[jira] [Commented] (SPARK-6124) Support jdbc connection properties in OPTIONS part of the query

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344071#comment-14344071 ] Apache Spark commented on SPARK-6124: - User 'vlyubin' has created a pull request for

[jira] [Updated] (SPARK-6126) Support UDTs in JSON

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6126: - Issue Type: Improvement (was: Bug) Support UDTs in JSON

[jira] [Updated] (SPARK-6126) Support UDTs in JSON

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6126: - Summary: Support UDTs in JSON (was: RDD[LabeledPoint].toDF().toJSON() fails) Support

[jira] [Updated] (SPARK-6126) Support UDTs in JSON

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6126: - Description: This would be nice to do (in Python): {code} from pyspark.mllib.util import

[jira] [Created] (SPARK-6128) Update Spark Streaming Guide for Spark 1.3

2015-03-02 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-6128: Summary: Update Spark Streaming Guide for Spark 1.3 Key: SPARK-6128 URL: https://issues.apache.org/jira/browse/SPARK-6128 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5537) Expand user guide for multinomial logistic regression

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344221#comment-14344221 ] Apache Spark commented on SPARK-5537: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-5310) Update SQL programming guide for 1.3

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344259#comment-14344259 ] Apache Spark commented on SPARK-5310: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-3203) ClassNotFoundException in spark-shell with Cassandra

2015-03-02 Thread Hung Duong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344396#comment-14344396 ] Hung Duong commented on SPARK-3203: --- I ran into the same exception in cluster mode (it

[jira] [Commented] (SPARK-6122) Upgrade Tachyon dependency to 0.6.0

2015-03-02 Thread Calvin Jia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344462#comment-14344462 ] Calvin Jia commented on SPARK-6122: --- New dependencies in Tachyon 0.6.0 include

[jira] [Resolved] (SPARK-3915) backport 'spark.localExecution.enabled' to 1.0

2015-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3915. -- Resolution: Not a Problem Resolving per comment from Patrick backport 'spark.localExecution.enabled'

[jira] [Assigned] (SPARK-6120) DecisionTree.save uses too much Java heap space for default spark shell settings

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-6120: Assignee: Joseph K. Bradley DecisionTree.save uses too much Java heap space for

[jira] [Updated] (SPARK-6120) DecisionTree.save uses too much Java heap space for default spark shell settings

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6120: - Target Version/s: 1.3.0 (was: 1.4.0) DecisionTree.save uses too much Java heap space

[jira] [Commented] (SPARK-6120) DecisionTree.save uses too much Java heap space for default spark shell settings

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344252#comment-14344252 ] Joseph K. Bradley commented on SPARK-6120: -- Rather than one of the difficult

[jira] [Commented] (SPARK-6123) Parquet reader should use the schema of every file to create converter

2015-03-02 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344278#comment-14344278 ] Yin Huai commented on SPARK-6123: - To workaround this issue, users need to load the

[jira] [Commented] (SPARK-6090) Add BinaryClassificationMetrics in PySpark/MLlib

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344317#comment-14344317 ] Apache Spark commented on SPARK-6090: - User 'mengxr' has created a pull request for

[jira] [Updated] (SPARK-6122) Upgrade Tachyon dependency to 0.6.0

2015-03-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6122: --- Assignee: Calvin Jia Upgrade Tachyon dependency to 0.6.0

[jira] [Commented] (SPARK-5537) Expand user guide for multinomial logistic regression

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344429#comment-14344429 ] Apache Spark commented on SPARK-5537: - User 'dbtsai' has created a pull request for

[jira] [Updated] (SPARK-6125) Custom Loss function

2015-03-02 Thread Joanne Shin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joanne Shin updated SPARK-6125: --- Description: Currently, there are only a small selection of loss functions available for the SGD

[jira] [Resolved] (SPARK-5950) Insert array into a metastore table saved as parquet should work when using datasource api

2015-03-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5950. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4826

[jira] [Updated] (SPARK-6112) Leverage HDFS RAM_DISK capacity to provide off_heap feature similar to Tachyon

2015-03-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhan Zhang updated SPARK-6112: -- Component/s: (was: Spark Core) Block Manager Leverage HDFS RAM_DISK capacity to

[jira] [Closed] (SPARK-5977) PySpark SPARK_CLASSPATH doesn't distribute jars to executors

2015-03-02 Thread Michael Nazario (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Nazario closed SPARK-5977. -- Resolution: Not a Problem This was my misunderstanding of how setting in spark are supposed to

[jira] [Comment Edited] (SPARK-3071) Increase default driver memory

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344186#comment-14344186 ] Joseph K. Bradley edited comment on SPARK-3071 at 3/3/15 1:00 AM:

[jira] [Commented] (SPARK-6130) support if not exists for insert overwrite into partition in hiveQl

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344419#comment-14344419 ] Apache Spark commented on SPARK-6130: - User 'adrian-wang' has created a pull request

[jira] [Commented] (SPARK-6122) Upgrade Tachyon dependency to 0.6.0

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344460#comment-14344460 ] Apache Spark commented on SPARK-6122: - User 'calvinjia' has created a pull request for

[jira] [Commented] (SPARK-1548) Add Partial Random Forest algorithm to MLlib

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344037#comment-14344037 ] Joseph K. Bradley commented on SPARK-1548: -- I think the idea is to train a

[jira] [Resolved] (SPARK-3551) Remove redundant putting FetchResult which means Fetch Fail when Remote fetching

2015-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3551. -- Resolution: Not a Problem It looks like this code was subsequently changed anyway, and no longer

[jira] [Commented] (SPARK-3071) Increase default driver memory

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344186#comment-14344186 ] Joseph K. Bradley commented on SPARK-3071: -- +1000 for increasing default driver

[jira] [Closed] (SPARK-4777) Some block memory after unrollSafely not count into used memory(memoryStore.entrys or unrollMemory)

2015-03-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4777. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: SuYan Target Version/s: 1.4.0

[jira] [Resolved] (SPARK-6121) Python DataFrame type inference for LabeledPoint gets wrong type

2015-03-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6121. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4858

[jira] [Resolved] (SPARK-5537) Expand user guide for multinomial logistic regression

2015-03-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5537. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4861

[jira] [Created] (SPARK-6124) Support jdbc connection properties in OPTIONS part of the query

2015-03-02 Thread Volodymyr Lyubinets (JIRA)
Volodymyr Lyubinets created SPARK-6124: -- Summary: Support jdbc connection properties in OPTIONS part of the query Key: SPARK-6124 URL: https://issues.apache.org/jira/browse/SPARK-6124 Project:

[jira] [Resolved] (SPARK-6114) Explode on nested field fails

2015-03-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6114. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4855

[jira] [Updated] (SPARK-6126) RDD[LabeledPoint].toDF().toJSON() fails

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6126: - Target Version/s: 1.4.0 (was: 1.3.0) RDD[LabeledPoint].toDF().toJSON() fails

[jira] [Updated] (SPARK-6126) RDD[LabeledPoint].toDF().toJSON() fails

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6126: - Priority: Major (was: Blocker) RDD[LabeledPoint].toDF().toJSON() fails

[jira] [Updated] (SPARK-6127) Kafka API not visible in Python API docs

2015-03-02 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-6127: - Summary: Kafka API not visible in Python API docs (was: Kafka API not visible in Python docs)

[jira] [Created] (SPARK-6127) Kafka API not visible in Python docs

2015-03-02 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-6127: Summary: Kafka API not visible in Python docs Key: SPARK-6127 URL: https://issues.apache.org/jira/browse/SPARK-6127 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-6129) Add section in user guide for model evaluation

2015-03-02 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6129: Summary: Add section in user guide for model evaluation Key: SPARK-6129 URL: https://issues.apache.org/jira/browse/SPARK-6129 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-3530) Pipeline and Parameters

2015-03-02 Thread Evan Sparks (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344291#comment-14344291 ] Evan Sparks commented on SPARK-3530: We have looked at integrating Caffe with spark -

[jira] [Commented] (SPARK-2769) Ganglia Support Broken / Not working

2015-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344039#comment-14344039 ] Sean Owen commented on SPARK-2769: -- [~DarkSlice] do you believe this is still an issue?

[jira] [Created] (SPARK-6125) Custom Loss function

2015-03-02 Thread Joanne Shin (JIRA)
Joanne Shin created SPARK-6125: -- Summary: Custom Loss function Key: SPARK-6125 URL: https://issues.apache.org/jira/browse/SPARK-6125 Project: Spark Issue Type: Question Components:

[jira] [Resolved] (SPARK-6048) SparkConf.translateConfKey should not translate on set

2015-03-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-6048. Resolution: Fixed Fix Version/s: 1.3.0 SparkConf.translateConfKey should not

[jira] [Updated] (SPARK-5522) Accelerate the History Server start

2015-03-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5522: - Affects Version/s: 1.0.0 Accelerate the History Server start ---

[jira] [Updated] (SPARK-5522) Accelerate the History Server start

2015-03-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5522: - Assignee: Liangliang Gu Accelerate the History Server start ---

[jira] [Updated] (SPARK-5522) Accelerate the History Server start

2015-03-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5522: - Fix Version/s: 1.4.0 Accelerate the History Server start ---

[jira] [Commented] (SPARK-6127) Kafka API not visible in Python API docs

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344210#comment-14344210 ] Apache Spark commented on SPARK-6127: - User 'tdas' has created a pull request for this

[jira] [Commented] (SPARK-6120) DecisionTree.save uses too much Java heap space for default spark shell settings

2015-03-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344243#comment-14344243 ] Joseph K. Bradley commented on SPARK-6120: -- This also affects tree ensembles, of

[jira] [Resolved] (SPARK-6127) Kafka API not visible in Python API docs

2015-03-02 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-6127. -- Resolution: Fixed Fix Version/s: 1.3.0 Kafka API not visible in Python API docs

[jira] [Updated] (SPARK-6122) Upgrade Tachyon dependency to 0.6.0

2015-03-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6122: --- Assignee: (was: Patrick Wendell) Upgrade Tachyon dependency to 0.6.0

[jira] [Updated] (SPARK-6122) Upgrade Tachyon dependency to 0.6.0

2015-03-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6122: --- Target Version/s: 1.4.0 Upgrade Tachyon dependency to 0.6.0

[jira] [Updated] (SPARK-6122) Upgrade Tachyon dependency to 0.6.0

2015-03-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6122: --- Fix Version/s: (was: 1.3.0) Upgrade Tachyon dependency to 0.6.0

[jira] [Comment Edited] (SPARK-3203) ClassNotFoundException in spark-shell with Cassandra

2015-03-02 Thread Hung Duong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344396#comment-14344396 ] Hung Duong edited comment on SPARK-3203 at 3/3/15 2:58 AM: --- I

[jira] [Updated] (SPARK-6122) Upgrade Tachyon dependency to 0.6.0

2015-03-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6122: --- Assignee: Patrick Wendell Upgrade Tachyon dependency to 0.6.0

[jira] [Created] (SPARK-6130) support if not exists for insert overwrite into partition in hiveQl

2015-03-02 Thread Adrian Wang (JIRA)
Adrian Wang created SPARK-6130: -- Summary: support if not exists for insert overwrite into partition in hiveQl Key: SPARK-6130 URL: https://issues.apache.org/jira/browse/SPARK-6130 Project: Spark

[jira] [Commented] (SPARK-3684) Can't configure local dirs in Yarn mode

2015-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344088#comment-14344088 ] Sean Owen commented on SPARK-3684: -- Wouldn't this amount to just configuring YARN to use

[jira] [Resolved] (SPARK-6066) Metadata in event log makes it very difficult for external libraries to parse event log

2015-03-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-6066. Resolution: Fixed Fix Version/s: 1.3.0 Thanks Andrew and Marcelo for your work on

[jira] [Commented] (SPARK-1391) BlockManager cannot transfer blocks larger than 2G in size

2015-03-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344320#comment-14344320 ] Reynold Xin commented on SPARK-1391: - I absolutely agree that better error messages

[jira] [Updated] (SPARK-6129) Add a section in user guide for model evaluation

2015-03-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6129: - Summary: Add a section in user guide for model evaluation (was: Add section in user guide for

[jira] [Commented] (SPARK-6120) DecisionTree.save uses too much Java heap space for default spark shell settings

2015-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14344323#comment-14344323 ] Apache Spark commented on SPARK-6120: - User 'jkbradley' has created a pull request for

  1   2   3   >