[jira] [Created] (SPARK-8891) Calling aggregation expressions on null literals fails at runtime

2015-07-08 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-8891: - Summary: Calling aggregation expressions on null literals fails at runtime Key: SPARK-8891 URL: https://issues.apache.org/jira/browse/SPARK-8891 Project: Spark

[jira] [Assigned] (SPARK-8600) Naive Bayes API for spark.ml Pipelines

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8600: --- Assignee: Apache Spark (was: Yanbo Liang) Naive Bayes API for spark.ml Pipelines

[jira] [Commented] (SPARK-8600) Naive Bayes API for spark.ml Pipelines

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618219#comment-14618219 ] Apache Spark commented on SPARK-8600: - User 'yanboliang' has created a pull request

[jira] [Assigned] (SPARK-8600) Naive Bayes API for spark.ml Pipelines

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8600: --- Assignee: Yanbo Liang (was: Apache Spark) Naive Bayes API for spark.ml Pipelines

[jira] [Resolved] (SPARK-7050) Fix Python Kafka test assembly jar not found issue under Maven build

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7050. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 5632

[jira] [Created] (SPARK-8894) Example code errors in SparkR documentation

2015-07-08 Thread Sun Rui (JIRA)
Sun Rui created SPARK-8894: -- Summary: Example code errors in SparkR documentation Key: SPARK-8894 URL: https://issues.apache.org/jira/browse/SPARK-8894 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-8895) MetricsSystem.removeSource not called in StreamingContext.stop

2015-07-08 Thread Aniket Bhatnagar (JIRA)
Aniket Bhatnagar created SPARK-8895: --- Summary: MetricsSystem.removeSource not called in StreamingContext.stop Key: SPARK-8895 URL: https://issues.apache.org/jira/browse/SPARK-8895 Project: Spark

[jira] [Created] (SPARK-8896) StreamingSource should choose a unique name

2015-07-08 Thread Aniket Bhatnagar (JIRA)
Aniket Bhatnagar created SPARK-8896: --- Summary: StreamingSource should choose a unique name Key: SPARK-8896 URL: https://issues.apache.org/jira/browse/SPARK-8896 Project: Spark Issue Type:

[jira] [Updated] (SPARK-7050) Fix Python Kafka test assembly jar not found issue under Maven build

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7050: - Assignee: Saisai Shao Fix Python Kafka test assembly jar not found issue under Maven build

[jira] [Assigned] (SPARK-8894) Example code errors in SparkR documentation

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8894: --- Assignee: (was: Apache Spark) Example code errors in SparkR documentation

[jira] [Commented] (SPARK-8894) Example code errors in SparkR documentation

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618508#comment-14618508 ] Apache Spark commented on SPARK-8894: - User 'sun-rui' has created a pull request for

[jira] [Assigned] (SPARK-8894) Example code errors in SparkR documentation

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8894: --- Assignee: Apache Spark Example code errors in SparkR documentation

[jira] [Updated] (SPARK-8896) StreamingSource should choose a unique name

2015-07-08 Thread Aniket Bhatnagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aniket Bhatnagar updated SPARK-8896: Description: If 2 instances of StreamingContext are created and run using the same

[jira] [Updated] (SPARK-8895) MetricsSystem.removeSource not called in StreamingContext.stop

2015-07-08 Thread Aniket Bhatnagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aniket Bhatnagar updated SPARK-8895: Description: StreamingContext calls env.metricsSystem.registerSource during its

[jira] [Updated] (SPARK-8896) StreamingSource should choose a unique name

2015-07-08 Thread Aniket Bhatnagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aniket Bhatnagar updated SPARK-8896: Description: If 2 instances of StreamingContext are created and run using the same

[jira] [Commented] (SPARK-8068) Add confusionMatrix method at class MulticlassMetrics in pyspark/mllib

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618403#comment-14618403 ] Apache Spark commented on SPARK-8068: - User 'yanboliang' has created a pull request

[jira] [Assigned] (SPARK-8068) Add confusionMatrix method at class MulticlassMetrics in pyspark/mllib

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8068: --- Assignee: Apache Spark Add confusionMatrix method at class MulticlassMetrics in

[jira] [Updated] (SPARK-8895) MetricsSystem.removeSource not called in StreamingContext.stop

2015-07-08 Thread Aniket Bhatnagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aniket Bhatnagar updated SPARK-8895: Description: StreamingContext calls env.metricsSystem.registerSource during its

[jira] [Commented] (SPARK-8881) Scheduling fails if num_executors < num_workers

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618313#comment-14618313 ] Sean Owen commented on SPARK-8881: -- Yes, the punchline is that each worker is asked for

[jira] [Commented] (SPARK-8872) Improve FPGrowthSuite with equivalent R code

2015-07-08 Thread Kashif Rasul (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618352#comment-14618352 ] Kashif Rasul commented on SPARK-8872: - Ok should be ready for review. Improve

[jira] [Updated] (SPARK-8893) Require positive partition counts in RDD.repartition

2015-07-08 Thread Daniel Darabos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Darabos updated SPARK-8893: -- Description: What does {{sc.parallelize(1 to 3).repartition(p).collect}} return? I would

[jira] [Commented] (SPARK-8866) Use 1 microsecond (us) precision for TimestampType

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618142#comment-14618142 ] Apache Spark commented on SPARK-8866: - User 'yijieshen' has created a pull request for

[jira] [Assigned] (SPARK-8866) Use 1 microsecond (us) precision for TimestampType

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8866: --- Assignee: Apache Spark (was: Yijie Shen) Use 1 microsecond (us) precision for

[jira] [Issue Comment Deleted] (SPARK-8864) Date/time function and data type design

2015-07-08 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-8864: - Comment: was deleted (was: Thanks for explanation. The design looks good to me now.) Date/time function

[jira] [Resolved] (SPARK-7917) Spark doesn't clean up Application Directories (local dirs)

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7917. -- Resolution: Duplicate A-ha, that's the ticket. I knew there was something like this already resolved.

[jira] [Commented] (SPARK-8881) Scheduling fails if num_executors < num_workers

2015-07-08 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618362#comment-14618362 ] Nishkam Ravi commented on SPARK-8881: - There's more to it. Consider the following:

[jira] [Commented] (SPARK-8893) Require positive partition counts in RDD.repartition

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618402#comment-14618402 ] Apache Spark commented on SPARK-8893: - User 'darabos' has created a pull request for

[jira] [Assigned] (SPARK-8893) Require positive partition counts in RDD.repartition

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8893: --- Assignee: Apache Spark Require positive partition counts in RDD.repartition

[jira] [Assigned] (SPARK-8893) Require positive partition counts in RDD.repartition

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8893: --- Assignee: (was: Apache Spark) Require positive partition counts in RDD.repartition

[jira] [Commented] (SPARK-8514) LU factorization on BlockMatrix

2015-07-08 Thread Qian Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618172#comment-14618172 ] Qian Huang commented on SPARK-8514: --- [~zhaoxiangyu] Not yet. [~shivaram] shared a

[jira] [Commented] (SPARK-8864) Date/time function and data type design

2015-07-08 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618200#comment-14618200 ] Cheng Hao commented on SPARK-8864: -- Thanks for explanation. The design looks good to me

[jira] [Commented] (SPARK-8864) Date/time function and data type design

2015-07-08 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618201#comment-14618201 ] Cheng Hao commented on SPARK-8864: -- Thanks for explanation. The design looks good to me

[jira] [Commented] (SPARK-7442) Spark 1.3.1 / Hadoop 2.6 prebuilt package has broken S3 filesystem access

2015-07-08 Thread Vinay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618222#comment-14618222 ] Vinay commented on SPARK-7442: -- Tried and tested-- Steps to submit spark job when jar file

[jira] [Commented] (SPARK-5159) Thrift server does not respect hive.server2.enable.doAs=true

2015-07-08 Thread Ma Xiaoyu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618277#comment-14618277 ] Ma Xiaoyu commented on SPARK-5159: -- I was investigating this issue and it seems doAs in

[jira] [Commented] (SPARK-8514) LU factorization on BlockMatrix

2015-07-08 Thread zhaoxiangyu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618155#comment-14618155 ] zhaoxiangyu commented on SPARK-8514: I want to know if you have implemented the LU

[jira] [Commented] (SPARK-8514) LU factorization on BlockMatrix

2015-07-08 Thread zhaoxiangyu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618191#comment-14618191 ] zhaoxiangyu commented on SPARK-8514: can you give me the link of Shivaram

[jira] [Created] (SPARK-8892) Column.cast(LongType) does not work for large values

2015-07-08 Thread Jason Moore (JIRA)
Jason Moore created SPARK-8892: -- Summary: Column.cast(LongType) does not work for large values Key: SPARK-8892 URL: https://issues.apache.org/jira/browse/SPARK-8892 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-8885) libgplcompression.so already loaded in another classloader

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8885. -- Resolution: Invalid [~cenyuhai] There is a lot wrong with this JIRA, most importantly that it's a

[jira] [Created] (SPARK-8893) Require positive partition counts in RDD.repartition

2015-07-08 Thread Daniel Darabos (JIRA)
Daniel Darabos created SPARK-8893: - Summary: Require positive partition counts in RDD.repartition Key: SPARK-8893 URL: https://issues.apache.org/jira/browse/SPARK-8893 Project: Spark Issue

[jira] [Assigned] (SPARK-8068) Add confusionMatrix method at class MulticlassMetrics in pyspark/mllib

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8068: --- Assignee: (was: Apache Spark) Add confusionMatrix method at class MulticlassMetrics in

[jira] [Created] (SPARK-8897) SparkR DataFrame fail to return data of float type

2015-07-08 Thread Sun Rui (JIRA)
Sun Rui created SPARK-8897: -- Summary: SparkR DataFrame fail to return data of float type Key: SPARK-8897 URL: https://issues.apache.org/jira/browse/SPARK-8897 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-6266) PySpark SparseVector missing doc for size, indices, values

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6266: --- Assignee: (was: Apache Spark) PySpark SparseVector missing doc for size, indices,

[jira] [Assigned] (SPARK-6266) PySpark SparseVector missing doc for size, indices, values

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6266: --- Assignee: Apache Spark PySpark SparseVector missing doc for size, indices, values

[jira] [Created] (SPARK-8898) Jets3t hangs with more than 1 core

2015-07-08 Thread Daniel Darabos (JIRA)
Daniel Darabos created SPARK-8898: - Summary: Jets3t hangs with more than 1 core Key: SPARK-8898 URL: https://issues.apache.org/jira/browse/SPARK-8898 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5159) Thrift server does not respect hive.server2.enable.doAs=true

2015-07-08 Thread Greg Senia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618624#comment-14618624 ] Greg Senia commented on SPARK-5159: --- Yes that is the exact issue. It doesnt execute as

[jira] [Assigned] (SPARK-8897) SparkR DataFrame fail to return data of float type

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8897: --- Assignee: Apache Spark SparkR DataFrame fail to return data of float type

[jira] [Commented] (SPARK-8896) StreamingSource should choose a unique name

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618567#comment-14618567 ] Sean Owen commented on SPARK-8896: -- Why do you have multiple contexts? I think that's not

[jira] [Commented] (SPARK-8897) SparkR DataFrame fail to return data of float type

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618607#comment-14618607 ] Apache Spark commented on SPARK-8897: - User 'sun-rui' has created a pull request for

[jira] [Assigned] (SPARK-8897) SparkR DataFrame fail to return data of float type

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8897: --- Assignee: (was: Apache Spark) SparkR DataFrame fail to return data of float type

[jira] [Updated] (SPARK-8600) Naive Bayes API for spark.ml Pipelines

2015-07-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8600: - Shepherd: Joseph K. Bradley Naive Bayes API for spark.ml Pipelines

[jira] [Commented] (SPARK-8896) StreamingSource should choose a unique name

2015-07-08 Thread Aniket Bhatnagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618764#comment-14618764 ] Aniket Bhatnagar commented on SPARK-8896: - As per the documentation (scala docs),

[jira] [Commented] (SPARK-8899) remove duplicated equals method for Row

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618744#comment-14618744 ] Apache Spark commented on SPARK-8899: - User 'cloud-fan' has created a pull request for

[jira] [Assigned] (SPARK-8899) remove duplicated equals method for Row

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8899: --- Assignee: (was: Apache Spark) remove duplicated equals method for Row

[jira] [Assigned] (SPARK-8899) remove duplicated equals method for Row

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8899: --- Assignee: Apache Spark remove duplicated equals method for Row

[jira] [Commented] (SPARK-8898) Jets3t hangs with more than 1 core

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618747#comment-14618747 ] Sean Owen commented on SPARK-8898: -- Yeah, this is a jets3t problem. You will have to

[jira] [Created] (SPARK-8899) remove duplicated equals method for Row

2015-07-08 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-8899: -- Summary: remove duplicated equals method for Row Key: SPARK-8899 URL: https://issues.apache.org/jira/browse/SPARK-8899 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-8118) Turn off noisy log output produced by Parquet 1.7.0

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8118: - Target Version/s: 1.5.0 (was: 1.4.1, 1.5.0) Turn off noisy log output produced by Parquet 1.7.0

[jira] [Updated] (SPARK-8872) Improve FPGrowthSuite with equivalent R code

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8872: - Assignee: Kashif Rasul Improve FPGrowthSuite with equivalent R code

[jira] [Updated] (SPARK-8873) Support cleaning up shuffle files for drivers launched with Mesos

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8873: - Priority: Minor (was: Major) Component/s: Mesos Support cleaning up shuffle files for drivers

[jira] [Resolved] (SPARK-8872) Improve FPGrowthSuite with equivalent R code

2015-07-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-8872. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7269

[jira] [Commented] (SPARK-8872) Improve FPGrowthSuite with equivalent R code

2015-07-08 Thread Kashif Rasul (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618822#comment-14618822 ] Kashif Rasul commented on SPARK-8872: - [~mengxr] my jira username is krasul Improve

[jira] [Commented] (SPARK-8514) LU factorization on BlockMatrix

2015-07-08 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618835#comment-14618835 ] Shivaram Venkataraman commented on SPARK-8514: -- I posted a link to a paper on

[jira] [Updated] (SPARK-8893) Require positive partition counts in RDD.repartition

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8893: - Component/s: Spark Core Require positive partition counts in RDD.repartition

[jira] [Resolved] (SPARK-8897) SparkR DataFrame fail to return data of float type

2015-07-08 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-8897. -- Resolution: Duplicate SparkR DataFrame fail to return data of float type

[jira] [Commented] (SPARK-8596) Install and configure RStudio server on Spark EC2

2015-07-08 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618863#comment-14618863 ] Shivaram Venkataraman commented on SPARK-8596: -- Thanks for the PR. Will

[jira] [Commented] (SPARK-8897) SparkR DataFrame fail to return data of float type

2015-07-08 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618849#comment-14618849 ] Shivaram Venkataraman commented on SPARK-8897: -- [~sunrui] I'm closing this as

[jira] [Assigned] (SPARK-8866) Use 1 microsecond (us) precision for TimestampType

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8866: --- Assignee: Yijie Shen (was: Apache Spark) Use 1 microsecond (us) precision for

[jira] [Commented] (SPARK-7917) Spark doesn't clean up Application Directories (local dirs)

2015-07-08 Thread Mingyu Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618179#comment-14618179 ] Mingyu Kim commented on SPARK-7917: --- This looks like a duplicate of SPARK-5970, which is

[jira] [Commented] (SPARK-8881) Scheduling fails if num_executors < num_workers

2015-07-08 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618369#comment-14618369 ] Nishkam Ravi commented on SPARK-8881: - This isn't the best example because the third

[jira] [Commented] (SPARK-8571) spark streaming hanging processes upon build exit

2015-07-08 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14619098#comment-14619098 ] shane knapp commented on SPARK-8571: ok, upon auditing all of the spark builds, i

[jira] [Commented] (SPARK-7263) Add new shuffle manager which stores shuffle blocks in Parquet

2015-07-08 Thread Matt Massie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14619100#comment-14619100 ] Matt Massie commented on SPARK-7263: The Spark shuffle manager APIs, in their current

[jira] [Commented] (SPARK-7263) Add new shuffle manager which stores shuffle blocks in Parquet

2015-07-08 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14619105#comment-14619105 ] Reynold Xin commented on SPARK-7263: Can you please list the issues that made Spark

[jira] [Resolved] (SPARK-8657) Fail to upload conf archive to viewfs

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8657. -- Resolution: Fixed Fail to upload conf archive to viewfs -

[jira] [Updated] (SPARK-8657) Fail to upload conf archive to viewfs

2015-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8657: - Assignee: Tao Li Target Version/s: (was: 1.5.0) Priority: Minor (was: Major)

[jira] [Commented] (SPARK-8571) spark streaming hanging processes upon build exit

2015-07-08 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14619135#comment-14619135 ] shane knapp commented on SPARK-8571: basically the code would look something like:

[jira] [Commented] (SPARK-3644) REST API for Spark application info (jobs / stages / tasks / storage info)

2015-07-08 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14619144#comment-14619144 ] RJ Nowling commented on SPARK-3644: --- [~joshrosen] The issue and corresponding PR you

[jira] [Resolved] (SPARK-8894) Example code errors in SparkR documentation

2015-07-08 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-8894. -- Resolution: Fixed Fix Version/s: 1.5.0 1.4.2 Issue

[jira] [Updated] (SPARK-8894) Example code errors in SparkR documentation

2015-07-08 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-8894: - Assignee: Sun Rui Example code errors in SparkR documentation

[jira] [Commented] (SPARK-8900) sparkPackages flag name is wrong in the documentation

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618928#comment-14618928 ] Apache Spark commented on SPARK-8900: - User 'shivaram' has created a pull request for

[jira] [Assigned] (SPARK-8900) sparkPackages flag name is wrong in the documentation

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8900: --- Assignee: (was: Apache Spark) sparkPackages flag name is wrong in the documentation

[jira] [Updated] (SPARK-8785) Improve Parquet schema merging

2015-07-08 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-8785: -- Assignee: Liang-Chi Hsieh Improve Parquet schema merging --

[jira] [Resolved] (SPARK-8785) Improve Parquet schema merging

2015-07-08 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-8785. --- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7182

[jira] [Resolved] (SPARK-6912) Throw an AnalysisException when unsupported Java Map<K,V> types used in Hive UDF

2015-07-08 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6912. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7257

[jira] [Created] (SPARK-8900) sparkPackages flag name is wrong in the documentation

2015-07-08 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-8900: Summary: sparkPackages flag name is wrong in the documentation Key: SPARK-8900 URL: https://issues.apache.org/jira/browse/SPARK-8900 Project: Spark

[jira] [Assigned] (SPARK-8900) sparkPackages flag name is wrong in the documentation

2015-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8900: --- Assignee: Apache Spark sparkPackages flag name is wrong in the documentation

[jira] [Updated] (SPARK-6266) PySpark SparseVector missing doc for size, indices, values

2015-07-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6266: - Shepherd: Xiangrui Meng PySpark SparseVector missing doc for size, indices, values

[jira] [Commented] (SPARK-7736) Exception not failing Python applications (in yarn cluster mode)

2015-07-08 Thread Neelesh Srinivas Salian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14618885#comment-14618885 ] Neelesh Srinivas Salian commented on SPARK-7736: My 2 cents: To have a

[jira] [Updated] (SPARK-8785) Improve Parquet schema merging

2015-07-08 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-8785: -- Shepherd: Cheng Lian Improve Parquet schema merging --

[jira] [Updated] (SPARK-6266) PySpark SparseVector missing doc for size, indices, values

2015-07-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6266: - Assignee: Kai Sasaki PySpark SparseVector missing doc for size, indices, values

[jira] [Updated] (SPARK-6912) Throw an AnalysisException when unsupported Java Map<K,V> types used in Hive UDF

2015-07-08 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6912: Assignee: Takeshi Yamamuro Throw an AnalysisException when unsupported Java Map<K,V> types

[jira] [Resolved] (SPARK-5707) Enabling spark.sql.codegen throws ClassNotFound exception

2015-07-08 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5707. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7272

[jira] [Commented] (SPARK-8900) sparkPackages flag name is wrong in the documentation

2015-07-08 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14619036#comment-14619036 ] Shivaram Venkataraman commented on SPARK-8900: -- [~bashyal] You can take a

[jira] [Updated] (SPARK-5427) Add support for floor function in Spark SQL

2015-07-08 Thread Ted Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated SPARK-5427: -- Description: floor() function is supported in Hive SQL. This issue is to add floor() function to Spark SQL.

[jira] [Updated] (SPARK-8571) spark streaming hanging processes upon build exit

2015-07-08 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-8571: -- Assignee: shane knapp spark streaming hanging processes upon build exit

[jira] [Created] (SPARK-8901) [SparkR] Documentation Incorrect for sparkR.init

2015-07-08 Thread Pradeep Bashyal (JIRA)
Pradeep Bashyal created SPARK-8901: -- Summary: [SparkR] Documentation Incorrect for sparkR.init Key: SPARK-8901 URL: https://issues.apache.org/jira/browse/SPARK-8901 Project: Spark Issue

[jira] [Resolved] (SPARK-8901) [SparkR] Documentation Incorrect for sparkR.init

2015-07-08 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-8901. --- Resolution: Duplicate [SparkR] Documentation Incorrect for sparkR.init

[jira] [Resolved] (SPARK-8753) Create an IntervalType data type

2015-07-08 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-8753. Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 1.5.0 Create an

[jira] [Commented] (SPARK-3164) Store DecisionTree Split.categories as Set

2015-07-08 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14619215#comment-14619215 ] Rekha Joshi commented on SPARK-3164: Its all good, but thanks for taking the feedback

[jira] [Commented] (SPARK-7263) Add new shuffle manager which stores shuffle blocks in Parquet

2015-07-08 Thread Matt Massie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14619246#comment-14619246 ] Matt Massie commented on SPARK-7263: Also, let me know if you'd like me to split the

[jira] [Commented] (SPARK-8905) Spark Streaming receiving socket data sporadically on Mesos 0.22.1

2015-07-08 Thread Brandon Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14619256#comment-14619256 ] Brandon Bradley commented on SPARK-8905: I also tested Spark 1.3.1 against Mesos
