[jira] [Created] (SPARK-9010) Improve the Spark Configuration document about `spark.kryoserializer.buffer`

2015-07-13 Thread StanZhai (JIRA)
StanZhai created SPARK-9010: --- Summary: Improve the Spark Configuration document about `spark.kryoserializer.buffer` Key: SPARK-9010 URL: https://issues.apache.org/jira/browse/SPARK-9010 Project: Spark

[jira] [Resolved] (SPARK-8941) Standalone cluster worker does not accept multiple masters on launch

2015-07-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8941. -- Resolution: Duplicate Standalone cluster worker does not accept multiple masters on launch

[jira] [Commented] (SPARK-9011) Spark 1.4.0| Spark.ML Classifier Output Formats Inconsistent -- Grid search working on LR but not on RF

2015-07-13 Thread Shivam Verma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624483#comment-14624483 ] Shivam Verma commented on SPARK-9011: - Thanks Sean, I did some more experiments. It

[jira] [Commented] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore

2015-07-13 Thread kumar ranganathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624484#comment-14624484 ] kumar ranganathan commented on SPARK-9009: -- Yes, all this in a single machine

[jira] [Commented] (SPARK-9012) Accumulators in the task table should be escaped

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624572#comment-14624572 ] Apache Spark commented on SPARK-9012: - User 'zsxwing' has created a pull request for

[jira] [Assigned] (SPARK-9012) Accumulators in the task table should be escaped

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9012: --- Assignee: (was: Apache Spark) Accumulators in the task table should be escaped

[jira] [Assigned] (SPARK-9012) Accumulators in the task table should be escaped

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9012: --- Assignee: Apache Spark Accumulators in the task table should be escaped

[jira] [Assigned] (SPARK-7751) Add @since to stable and experimental methods in MLlib

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7751: --- Assignee: Apache Spark (was: Xiangrui Meng) Add @since to stable and experimental methods

[jira] [Assigned] (SPARK-7751) Add @since to stable and experimental methods in MLlib

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7751: --- Assignee: Xiangrui Meng (was: Apache Spark) Add @since to stable and experimental methods

[jira] [Commented] (SPARK-7751) Add @since to stable and experimental methods in MLlib

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624590#comment-14624590 ] Apache Spark commented on SPARK-7751: - User 'petz2000' has created a pull request for

[jira] [Comment Edited] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore

2015-07-13 Thread kumar ranganathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624484#comment-14624484 ] kumar ranganathan edited comment on SPARK-9009 at 7/13/15 10:27 AM:

[jira] [Updated] (SPARK-9011) Spark 1.4.0| Spark.ML Classifier Output Formats Inconsistent -- Grid search working on LR but not on RF

2015-07-13 Thread Shivam Verma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivam Verma updated SPARK-9011: Description: Hi, I ran into this bug while using pyspark.ml.tuning.CrossValidator on an RF

[jira] [Commented] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore

2015-07-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624487#comment-14624487 ] Sean Owen commented on SPARK-9009: -- Try {{file:///C:/Spark/conf/...}} Don't use

[jira] [Updated] (SPARK-9011) Spark 1.4.0| Spark.ML Classifier Output Formats Inconsistent -- Grid search working on LR but not on RF

2015-07-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9011: - Priority: Minor (was: Critical) Spark 1.4.0| Spark.ML Classifier Output Formats Inconsistent -- Grid

[jira] [Updated] (SPARK-9012) Accumulators in the task table should be escaped

2015-07-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-9012: Attachment: (was: screenshot-1.png) Accumulators in the task table should be escaped

[jira] [Updated] (SPARK-9012) Accumulators in the task table should be escaped

2015-07-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-9012: Attachment: Screen Shot 2015-07-13 at 8.02.44 PM.png Accumulators in the task table should be

[jira] [Updated] (SPARK-9012) Accumulators in the task table should be escaped

2015-07-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-9012: Attachment: screenshot-1.png Accumulators in the task table should be escaped

[jira] [Commented] (SPARK-8915) Add @since tags to mllib.classification

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624594#comment-14624594 ] Apache Spark commented on SPARK-8915: - User 'petz2000' has created a pull request for

[jira] [Assigned] (SPARK-8915) Add @since tags to mllib.classification

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8915: --- Assignee: Apache Spark Add @since tags to mllib.classification

[jira] [Assigned] (SPARK-8915) Add @since tags to mllib.classification

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8915: --- Assignee: (was: Apache Spark) Add @since tags to mllib.classification

[jira] [Assigned] (SPARK-9010) Improve the Spark Configuration document about `spark.kryoserializer.buffer`

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9010: --- Assignee: Apache Spark Improve the Spark Configuration document about

[jira] [Assigned] (SPARK-9010) Improve the Spark Configuration document about `spark.kryoserializer.buffer`

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9010: --- Assignee: (was: Apache Spark) Improve the Spark Configuration document about

[jira] [Commented] (SPARK-9010) Improve the Spark Configuration document about `spark.kryoserializer.buffer`

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624381#comment-14624381 ] Apache Spark commented on SPARK-9010: - User 'stanzhai' has created a pull request for

[jira] [Updated] (SPARK-9011) Issue with running CrossValidator with RandomForestClassifier on dataset

2015-07-13 Thread Shivam Verma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivam Verma updated SPARK-9011: Description: Hi I'm a beginner to Spark, and am trying to run grid search on an RF classifier to

[jira] [Created] (SPARK-9011) Issue with running CrossValidator with RandomForestClassifier on dataset

2015-07-13 Thread Shivam Verma (JIRA)
Shivam Verma created SPARK-9011: --- Summary: Issue with running CrossValidator with RandomForestClassifier on dataset Key: SPARK-9011 URL: https://issues.apache.org/jira/browse/SPARK-9011 Project: Spark

[jira] [Updated] (SPARK-9008) Stop and remove driver from supervised mode in spark-master interface

2015-07-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9008: - Priority: Minor (was: Major) Component/s: Deploy Can you not just kill -9 the driver process? You

[jira] [Comment Edited] (SPARK-9011) Spark 1.4.0| Spark.ML Classifier Output Formats Inconsistent -- Grid search working on LR but not on RF

2015-07-13 Thread Shivam Verma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624485#comment-14624485 ] Shivam Verma edited comment on SPARK-9011 at 7/13/15 10:24 AM:

[jira] [Reopened] (SPARK-9011) Spark 1.4.0| Spark.ML Classifier Output Formats Inconsistent -- Grid search working on LR but not on RF

2015-07-13 Thread Shivam Verma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivam Verma reopened SPARK-9011: - I did some more experiments. It is really a bug because pyspark.ml.tuning.CrossValidator seems to

[jira] [Comment Edited] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore

2015-07-13 Thread kumar ranganathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624484#comment-14624484 ] kumar ranganathan edited comment on SPARK-9009 at 7/13/15 10:24 AM:

[jira] [Issue Comment Deleted] (SPARK-9011) Spark 1.4.0| Spark.ML Classifier Output Formats Inconsistent -- Grid search working on LR but not on RF

2015-07-13 Thread Shivam Verma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivam Verma updated SPARK-9011: Comment: was deleted (was: Thanks Sean, I did some more experiments. It is really a bug because

[jira] [Comment Edited] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore

2015-07-13 Thread kumar ranganathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624484#comment-14624484 ] kumar ranganathan edited comment on SPARK-9009 at 7/13/15 10:26 AM:

[jira] [Commented] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore

2015-07-13 Thread kumar ranganathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624523#comment-14624523 ] kumar ranganathan commented on SPARK-9009: -- I have tried the below code and gets

[jira] [Updated] (SPARK-9010) Improve the Spark Configuration document about `spark.kryoserializer.buffer`

2015-07-13 Thread StanZhai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] StanZhai updated SPARK-9010: Component/s: (was: SQL) Documentation Improve the Spark Configuration document about

[jira] [Updated] (SPARK-9010) Improve the Spark Configuration document about `spark.kryoserializer.buffer`

2015-07-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9010: - Target Version/s: 1.4.2, 1.5.0 Priority: Trivial (was: Minor) Improve the Spark

[jira] [Updated] (SPARK-9007) start-slave.sh changed API in 1.4 and the documentation got updated to mention the old API

2015-07-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9007: - Priority: Trivial (was: Major) Component/s: (was: Deploy) Documentation

[jira] [Commented] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore

2015-07-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624434#comment-14624434 ] Sean Owen commented on SPARK-9009: -- Is this all on one machine? because the file would

[jira] [Updated] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore

2015-07-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9009: - Priority: Minor (was: Major) Component/s: (was: YARN) SPARK Encryption FileNotFoundException

[jira] [Commented] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore

2015-07-13 Thread kumar ranganathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624499#comment-14624499 ] kumar ranganathan commented on SPARK-9009: -- Yes i tried with file:/ and file:///

[jira] [Commented] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore

2015-07-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624516#comment-14624516 ] Sean Owen commented on SPARK-9009: -- Can you paste exactly what worked? I'm still not sure

[jira] [Commented] (SPARK-8646) PySpark does not run on YARN

2015-07-13 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624521#comment-14624521 ] Lianhui Wang commented on SPARK-8646: - [~juliet] from your spark1.4-verbose.log, i

[jira] [Updated] (SPARK-9011) Issue with running CrossValidator with RandomForestClassifier on dataset

2015-07-13 Thread Shivam Verma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivam Verma updated SPARK-9011: Description: Hi I'm a beginner to Spark, and am trying to run grid search on an RF classifier to

[jira] [Resolved] (SPARK-9011) Issue with running CrossValidator with RandomForestClassifier on dataset

2015-07-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-9011. -- Resolution: Invalid This is really a question, which you should ask on user@ first. Until you have

[jira] [Created] (SPARK-9012) Accumulators in the task table should be escaped

2015-07-13 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-9012: --- Summary: Accumulators in the task table should be escaped Key: SPARK-9012 URL: https://issues.apache.org/jira/browse/SPARK-9012 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-7751) Add @since to stable and experimental methods in MLlib

2015-07-13 Thread Patrick Baier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624598#comment-14624598 ] Patrick Baier commented on SPARK-7751: -- sorry, wrong ticket number Add @since to

[jira] [Commented] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore

2015-07-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624501#comment-14624501 ] Sean Owen commented on SPARK-9009: -- Try a small Java program using the File object to see

[jira] [Commented] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore

2015-07-13 Thread kumar ranganathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624514#comment-14624514 ] kumar ranganathan commented on SPARK-9009: -- Yes I tried, I could read the file

[jira] [Commented] (SPARK-6851) Wrong answers for self joins of converted parquet relations

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625755#comment-14625755 ] Apache Spark commented on SPARK-6851: - User 'adrian-wang' has created a pull request

[jira] [Created] (SPARK-9030) Add Kinesis.createStream unit tests that actual send data

2015-07-13 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-9030: Summary: Add Kinesis.createStream unit tests that actual send data Key: SPARK-9030 URL: https://issues.apache.org/jira/browse/SPARK-9030 Project: Spark

[jira] [Commented] (SPARK-9027) Generalize predicate pushdown into the metastore

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625733#comment-14625733 ] Apache Spark commented on SPARK-9027: - User 'marmbrus' has created a pull request for

[jira] [Assigned] (SPARK-9026) SimpleFutureAction.onComplete should not tie up a separate thread for each callback

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9026: --- Assignee: Apache Spark (was: Josh Rosen) SimpleFutureAction.onComplete should not tie up a

[jira] [Created] (SPARK-9027) Generalize predicate pushdown into the metastore

2015-07-13 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-9027: --- Summary: Generalize predicate pushdown into the metastore Key: SPARK-9027 URL: https://issues.apache.org/jira/browse/SPARK-9027 Project: Spark Issue

[jira] [Assigned] (SPARK-9026) SimpleFutureAction.onComplete should not tie up a separate thread for each callback

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9026: --- Assignee: Josh Rosen (was: Apache Spark) SimpleFutureAction.onComplete should not tie up a

[jira] [Commented] (SPARK-9026) SimpleFutureAction.onComplete should not tie up a separate thread for each callback

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625730#comment-14625730 ] Apache Spark commented on SPARK-9026: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-8646) PySpark does not run on YARN

2015-07-13 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625725#comment-14625725 ] Lianhui Wang commented on SPARK-8646: - [~juliet] can you provide your spark-submit

[jira] [Resolved] (SPARK-6910) Support for pushing predicates down to metastore for partition pruning

2015-07-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6910. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7216

[jira] [Assigned] (SPARK-9027) Generalize predicate pushdown into the metastore

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9027: --- Assignee: Apache Spark (was: Michael Armbrust) Generalize predicate pushdown into the

[jira] [Created] (SPARK-9028) Add CountVectorizer as an estimator to generate CountVectorizerModel

2015-07-13 Thread yuhao yang (JIRA)
yuhao yang created SPARK-9028: - Summary: Add CountVectorizer as an estimator to generate CountVectorizerModel Key: SPARK-9028 URL: https://issues.apache.org/jira/browse/SPARK-9028 Project: Spark

[jira] [Commented] (SPARK-9021) Have pyspark's RDD.aggregate() make a deepcopy of zeroValue for each partition

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625789#comment-14625789 ] Apache Spark commented on SPARK-9021: - User 'njhwang' has created a pull request for

[jira] [Assigned] (SPARK-9021) Have pyspark's RDD.aggregate() make a deepcopy of zeroValue for each partition

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9021: --- Assignee: (was: Apache Spark) Have pyspark's RDD.aggregate() make a deepcopy of

[jira] [Assigned] (SPARK-9021) Have pyspark's RDD.aggregate() make a deepcopy of zeroValue for each partition

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9021: --- Assignee: Apache Spark Have pyspark's RDD.aggregate() make a deepcopy of zeroValue for each

[jira] [Commented] (SPARK-8965) Add ml-guide Python Example: Estimator, Transformer, and Param

2015-07-13 Thread Arijit Saha (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625831#comment-14625831 ] Arijit Saha commented on SPARK-8965: Hi Joseph, I would like to take up this task.

[jira] [Commented] (SPARK-3703) Ensemble learning methods

2015-07-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625859#comment-14625859 ] Manoj Kumar commented on SPARK-3703: Hi, I am interested in working on ensemble

[jira] [Updated] (SPARK-9028) Add CountVectorizer as an estimator to generate CountVectorizerModel

2015-07-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-9028: -- Description: Add an estimator for CountVectorizerModel. The estimator will extract a vocabulary from

[jira] [Assigned] (SPARK-9029) shortcut CaseKeyWhen if key is null

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9029: --- Assignee: Apache Spark shortcut CaseKeyWhen if key is null

[jira] [Commented] (SPARK-9029) shortcut CaseKeyWhen if key is null

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625778#comment-14625778 ] Apache Spark commented on SPARK-9029: - User 'cloud-fan' has created a pull request for

[jira] [Assigned] (SPARK-9029) shortcut CaseKeyWhen if key is null

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9029: --- Assignee: (was: Apache Spark) shortcut CaseKeyWhen if key is null

[jira] [Resolved] (SPARK-1403) Spark on Mesos does not set Thread's context class loader

2015-07-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-1403. Resolution: Fixed Target Version/s: (was: 1.5.0) Hey All, This issue should

[jira] [Comment Edited] (SPARK-1403) Spark on Mesos does not set Thread's context class loader

2015-07-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625739#comment-14625739 ] Patrick Wendell edited comment on SPARK-1403 at 7/14/15 2:59 AM:

[jira] [Assigned] (SPARK-9028) Add CountVectorizer as an estimator to generate CountVectorizerModel

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9028: --- Assignee: Apache Spark Add CountVectorizer as an estimator to generate CountVectorizerModel

[jira] [Created] (SPARK-9029) shortcut CaseKeyWhen if key is null

2015-07-13 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-9029: -- Summary: shortcut CaseKeyWhen if key is null Key: SPARK-9029 URL: https://issues.apache.org/jira/browse/SPARK-9029 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-9013) generate MutableProjection directly instead of return a function

2015-07-13 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-9013: -- Summary: generate MutableProjection directly instead of return a function Key: SPARK-9013 URL: https://issues.apache.org/jira/browse/SPARK-9013 Project: Spark

[jira] [Assigned] (SPARK-9013) generate MutableProjection directly instead of return a function

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9013: --- Assignee: (was: Apache Spark) generate MutableProjection directly instead of return a

[jira] [Comment Edited] (SPARK-3155) Support DecisionTree pruning

2015-07-13 Thread Walter Petersen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14622041#comment-14622041 ] Walter Petersen edited comment on SPARK-3155 at 7/13/15 12:57 PM:

[jira] [Assigned] (SPARK-9013) generate MutableProjection directly instead of return a function

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9013: --- Assignee: Apache Spark generate MutableProjection directly instead of return a function

[jira] [Commented] (SPARK-7549) Support aggregating over nested fields

2015-07-13 Thread Chen Song (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625704#comment-14625704 ] Chen Song commented on SPARK-7549: -- I prefer the former. I thought about using explode,

[jira] [Updated] (SPARK-7126) For spark.ml Classifiers, automatically index labels if they are not yet indexed

2015-07-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7126: - Target Version/s: (was: 1.5.0) For spark.ml Classifiers, automatically index labels if

[jira] [Commented] (SPARK-7126) For spark.ml Classifiers, automatically index labels if they are not yet indexed

2015-07-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625708#comment-14625708 ] Joseph K. Bradley commented on SPARK-7126: -- I agree we should emulate

[jira] [Commented] (SPARK-6884) Random forest: predict class probabilities

2015-07-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625710#comment-14625710 ] Joseph K. Bradley commented on SPARK-6884: -- Once [SPARK-7131] gets merged, then

[jira] [Commented] (SPARK-8998) Collect enough frequent prefixes before projection in PrefixSpan

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625716#comment-14625716 ] Apache Spark commented on SPARK-8998: - User 'zhangjiajin' has created a pull request

[jira] [Assigned] (SPARK-8998) Collect enough frequent prefixes before projection in PrefixSpan

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8998: --- Assignee: Zhang JiaJin (was: Apache Spark) Collect enough frequent prefixes before

[jira] [Assigned] (SPARK-8998) Collect enough frequent prefixes before projection in PrefixSpan

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8998: --- Assignee: Apache Spark (was: Zhang JiaJin) Collect enough frequent prefixes before

[jira] [Created] (SPARK-9026) SimpleFutureAction.onComplete should not tie up a separate thread for each callback

2015-07-13 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-9026: - Summary: SimpleFutureAction.onComplete should not tie up a separate thread for each callback Key: SPARK-9026 URL: https://issues.apache.org/jira/browse/SPARK-9026 Project:

[jira] [Assigned] (SPARK-9026) SimpleFutureAction.onComplete should not tie up a separate thread for each callback

2015-07-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-9026: - Assignee: Josh Rosen SimpleFutureAction.onComplete should not tie up a separate thread for each

[jira] [Issue Comment Deleted] (SPARK-9015) Maven cleanup / Clean Project Import in scala-ide

2015-07-13 Thread Jan Prach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jan Prach updated SPARK-9015: - Comment: was deleted (was: PR #7375) Maven cleanup / Clean Project Import in scala-ide

[jira] [Updated] (SPARK-6319) DISTINCT doesn't work for binary type

2015-07-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-6319: -- Priority: Critical (was: Major) DISTINCT doesn't work for binary type

[jira] [Commented] (SPARK-6319) DISTINCT doesn't work for binary type

2015-07-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625045#comment-14625045 ] Josh Rosen commented on SPARK-6319: --- I think that we should revisit this issue. It

[jira] [Commented] (SPARK-8907) Speed up path construction in DynamicPartitionWriterContainer.outputWriterForRow

2015-07-13 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625150#comment-14625150 ] Ilya Ganelin commented on SPARK-8907: - [~rxin] The code for this in master has

[jira] [Commented] (SPARK-4362) Make prediction probability available in NaiveBayesModel

2015-07-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625157#comment-14625157 ] Apache Spark commented on SPARK-4362: - User 'srowen' has created a pull request for

[jira] [Resolved] (SPARK-8954) Building Docker Images Fails in 1.4 branch

2015-07-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-8954. --- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7346

[jira] [Resolved] (SPARK-8991) Update SharedParamsCodeGen's Generated Documentation

2015-07-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-8991. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7367

[jira] [Created] (SPARK-9017) More timers for MLlib algorithms

2015-07-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-9017: Summary: More timers for MLlib algorithms Key: SPARK-9017 URL: https://issues.apache.org/jira/browse/SPARK-9017 Project: Spark Issue Type: Umbrella

[jira] [Created] (SPARK-9018) Implement a generic Timer utility for ML algorithms

2015-07-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-9018: Summary: Implement a generic Timer utility for ML algorithms Key: SPARK-9018 URL: https://issues.apache.org/jira/browse/SPARK-9018 Project: Spark Issue

[jira] [Updated] (SPARK-9005) RegressionMetrics computing incorrect explainedVariance and r2

2015-07-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9005: - Shepherd: Joseph K. Bradley Assignee: Feynman Liang RegressionMetrics computing

[jira] [Updated] (SPARK-8954) Building Docker Images Fails in 1.4 branch

2015-07-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-8954: -- Assignee: Yong Tang Building Docker Images Fails in 1.4 branch

[jira] [Updated] (SPARK-8991) Update SharedParamsCodeGen's Generated Documentation

2015-07-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8991: - Assignee: Vinod KC Update SharedParamsCodeGen's Generated Documentation

[jira] [Updated] (SPARK-8838) Add config to enable/disable merging part-files when merging parquet schema

2015-07-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8838: Shepherd: Cheng Lian Add config to enable/disable merging part-files when merging parquet

[jira] [Commented] (SPARK-6319) DISTINCT doesn't work for binary type

2015-07-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625082#comment-14625082 ] Michael Armbrust commented on SPARK-6319: - +1 to throwing an {{AnalysisException}}

[jira] [Commented] (SPARK-8646) PySpark does not run on YARN

2015-07-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625087#comment-14625087 ] Marcelo Vanzin commented on SPARK-8646: --- [~j_houg] could you also run the command

[jira] [Resolved] (SPARK-8950) Correct the calculation of SchedulerDelayTime in StagePage

2015-07-13 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-8950. --- Resolution: Fixed Assignee: Carson Wang Fix Version/s: 1.5.0 Correct the

[jira] [Created] (SPARK-9016) Make the random forest classifiers implement classification trait

2015-07-13 Thread holdenk (JIRA)
holdenk created SPARK-9016: -- Summary: Make the random forest classifiers implement classification trait Key: SPARK-9016 URL: https://issues.apache.org/jira/browse/SPARK-9016 Project: Spark Issue

  1   2   >