[jira] [Commented] (SPARK-7229) SpecificMutableRow should take integer type as internal representation for DateType

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14518924#comment-14518924 ] Apache Spark commented on SPARK-7229: - User 'chenghao-intel' has created a pull

[jira] [Assigned] (SPARK-7222) Added mathematical derivation in comment to LinearRegression with ElasticNet

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7222: --- Assignee: Apache Spark Added mathematical derivation in comment to LinearRegression with

[jira] [Commented] (SPARK-7159) Support multiclass logistic regression in spark.ml

2015-04-29 Thread Selim Namsi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14518952#comment-14518952 ] Selim Namsi commented on SPARK-7159: I'll Work on it Support multiclass logistic

[jira] [Commented] (SPARK-7222) Added mathematical derivation in comment to LinearRegression with ElasticNet

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14518769#comment-14518769 ] Apache Spark commented on SPARK-7222: - User 'dbtsai' has created a pull request for

[jira] [Updated] (SPARK-6824) Fill the docs for DataFrame API in SparkR

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6824: - Issue Type: Sub-task (was: New Feature) Parent: SPARK-7228 Fill the

[jira] [Updated] (SPARK-7225) CombineLimits in Optimizer do not works

2015-04-29 Thread Zhongshuai Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhongshuai Pei updated SPARK-7225: -- Summary: CombineLimits in Optimizer do not works (was: CombineLimits do not works)

[jira] [Updated] (SPARK-6815) Support accumulators in R

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6815: - Target Version/s: 1.5.0 (was: 1.4.0) Support accumulators in R

[jira] [Created] (SPARK-7229) SpecificMutableRow should take integer type as internal representation for DateType

2015-04-29 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-7229: Summary: SpecificMutableRow should take integer type as internal representation for DateType Key: SPARK-7229 URL: https://issues.apache.org/jira/browse/SPARK-7229 Project:

[jira] [Updated] (SPARK-7225) CombineLimits do not works

2015-04-29 Thread Zhongshuai Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhongshuai Pei updated SPARK-7225: -- Description: The optimized logical plan of select key from (select key from src limit 100)

[jira] [Assigned] (SPARK-7223) Rename RPC askWithReply -> askWithReply, sendWithReply -> ask

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7223: --- Assignee: Apache Spark (was: Reynold Xin) Rename RPC askWithReply -> askWithReply,

[jira] [Assigned] (SPARK-7202) Add SparseMatrixPickler to SerDe

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7202: --- Assignee: Apache Spark Add SparseMatrixPickler to SerDe

[jira] [Updated] (SPARK-7157) Add approximate stratified sampling to DataFrame

2015-04-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7157: --- Summary: Add approximate stratified sampling to DataFrame (was: Add sampleByKey, sampleByKeyExact

[jira] [Commented] (SPARK-6752) Allow StreamingContext to be recreated from checkpoint and existing SparkContext

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14518927#comment-14518927 ] Apache Spark commented on SPARK-6752: - User 'tdas' has created a pull request for this

[jira] [Updated] (SPARK-6825) Data sources implementation to support `sequenceFile`

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6825: - Target Version/s: 1.5.0 (was: 1.4.0) Data sources implementation to support

[jira] [Updated] (SPARK-7133) Implement struct, array, and map field accessor using apply in Scala and __getitem__ in Python

2015-04-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7133: --- Assignee: Wenchen Fan Implement struct, array, and map field accessor using apply in Scala and

[jira] [Updated] (SPARK-7225) CombineLimits in Optimizer does not works

2015-04-29 Thread Zhongshuai Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhongshuai Pei updated SPARK-7225: -- Summary: CombineLimits in Optimizer does not works (was: CombineLimits in Optimizer do not

[jira] [Created] (SPARK-7223) Rename RPC askWithReply -> askWithReply, sendWithReply -> ask

2015-04-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-7223: -- Summary: Rename RPC askWithReply -> askWithReply, sendWithReply -> ask Key: SPARK-7223 URL: https://issues.apache.org/jira/browse/SPARK-7223 Project: Spark Issue

[jira] [Commented] (SPARK-7225) CombineLimits optimizer does not work

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14518813#comment-14518813 ] Apache Spark commented on SPARK-7225: - User 'DoingDone9' has created a pull request

[jira] [Commented] (SPARK-3808) PySpark fails to start in Windows

2015-04-29 Thread eminent (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14518795#comment-14518795 ] eminent commented on SPARK-3808: Yes. It's the cause. After updating the %PATH%, spark

[jira] [Commented] (SPARK-7035) Drop __getattr__ on pyspark.sql.DataFrame

2015-04-29 Thread Kalle Jepsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14518925#comment-14518925 ] Kalle Jepsen commented on SPARK-7035: - I've created a PR to fix the error message in

[jira] [Updated] (SPARK-6816) Add SparkConf API to configure SparkR

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6816: - Target Version/s: 1.5.0 (was: 1.4.0) Add SparkConf API to configure SparkR

[jira] [Assigned] (SPARK-7223) Rename RPC askWithReply -> askWithReply, sendWithReply -> ask

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7223: --- Assignee: Reynold Xin (was: Apache Spark) Rename RPC askWithReply -> askWithReply,

[jira] [Assigned] (SPARK-7232) Add a Substitution batch for spark sql analyzer

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7232: --- Assignee: (was: Apache Spark) Add a Substitution batch for spark sql analyzer

[jira] [Created] (SPARK-7234) When codegen on DateType defaultPrimitive will throw type mismatch exception

2015-04-29 Thread Chen Song (JIRA)
Chen Song created SPARK-7234: Summary: When codegen on DateType defaultPrimitive will throw type mismatch exception Key: SPARK-7234 URL: https://issues.apache.org/jira/browse/SPARK-7234 Project: Spark

[jira] [Assigned] (SPARK-7234) When codegen on DateType defaultPrimitive will throw type mismatch exception

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7234: --- Assignee: (was: Apache Spark) When codegen on DateType defaultPrimitive will throw type

[jira] [Commented] (SPARK-7234) When codegen on DateType defaultPrimitive will throw type mismatch exception

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14519176#comment-14519176 ] Apache Spark commented on SPARK-7234: - User 'kaka1992' has created a pull request for

[jira] [Assigned] (SPARK-7234) When codegen on DateType defaultPrimitive will throw type mismatch exception

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7234: --- Assignee: Apache Spark When codegen on DateType defaultPrimitive will throw type mismatch

[jira] [Created] (SPARK-7233) ClosureCleaner#clean blocks concurrent job submitter threads

2015-04-29 Thread Oleksii Kostyliev (JIRA)
Oleksii Kostyliev created SPARK-7233: Summary: ClosureCleaner#clean blocks concurrent job submitter threads Key: SPARK-7233 URL: https://issues.apache.org/jira/browse/SPARK-7233 Project: Spark

[jira] [Commented] (SPARK-6913) No suitable driver found loading JDBC dataframe using driver added by through SparkContext.addJar

2015-04-29 Thread Vyacheslav Baranov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14519124#comment-14519124 ] Vyacheslav Baranov commented on SPARK-6913: --- The problem is in

[jira] [Commented] (SPARK-7196) decimal precision lost when loading DataFrame from JDBC

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14519158#comment-14519158 ] Apache Spark commented on SPARK-7196: - User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-7196) decimal precision lost when loading DataFrame from JDBC

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7196: --- Assignee: Apache Spark decimal precision lost when loading DataFrame from JDBC

[jira] [Assigned] (SPARK-7196) decimal precision lost when loading DataFrame from JDBC

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7196: --- Assignee: (was: Apache Spark) decimal precision lost when loading DataFrame from JDBC

[jira] [Commented] (SPARK-7233) ClosureCleaner#clean blocks concurrent job submitter threads

2015-04-29 Thread Oleksii Kostyliev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14519169#comment-14519169 ] Oleksii Kostyliev commented on SPARK-7233: -- To illustrate the issue, I performed

[jira] [Updated] (SPARK-7233) ClosureCleaner#clean blocks concurrent job submitter threads

2015-04-29 Thread Oleksii Kostyliev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oleksii Kostyliev updated SPARK-7233: - Attachment: blocked_threads_closurecleaner.png ClosureCleaner#clean blocks concurrent

[jira] [Resolved] (SPARK-7077) Binary processing hash table for aggregation

2015-04-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7077. Resolution: Fixed Fix Version/s: 1.4.0 Binary processing hash table for aggregation

[jira] [Updated] (SPARK-6838) Explore using Reference Classes instead of S4 objects

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6838: - Target Version/s: 1.5.0 (was: 1.4.0) Explore using Reference Classes instead of

[jira] [Updated] (SPARK-7157) Add approximate stratified sampling to DataFrame

2015-04-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7157: --- Description: def sampleBy(c Add approximate stratified sampling to DataFrame

[jira] [Commented] (SPARK-7202) Add SparseMatrixPickler to SerDe

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14519021#comment-14519021 ] Apache Spark commented on SPARK-7202: - User 'MechCoder' has created a pull request for

[jira] [Created] (SPARK-7227) Support fillna / dropna in R DataFrame

2015-04-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-7227: -- Summary: Support fillna / dropna in R DataFrame Key: SPARK-7227 URL: https://issues.apache.org/jira/browse/SPARK-7227 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-7232) Add a Substitution batch for spark sql analyzer

2015-04-29 Thread Fei Wang (JIRA)
Fei Wang created SPARK-7232: --- Summary: Add a Substitution batch for spark sql analyzer Key: SPARK-7232 URL: https://issues.apache.org/jira/browse/SPARK-7232 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-7225) CombineLimits optimizer does not work

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7225: --- Assignee: (was: Apache Spark) CombineLimits optimizer does not work

[jira] [Resolved] (SPARK-7205) Support local ivy cache in --packages

2015-04-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7205. Resolution: Fixed Assignee: Burak Yavuz Support local ivy cache in --packages

[jira] [Updated] (SPARK-7222) Added mathematical derivation in comment and compressed the model to LinearRegression with ElasticNet

2015-04-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7222: --- Description: Added detailed mathematical derivation of how scaling and LeastSquaresAggregator work. Also

[jira] [Updated] (SPARK-7213) Exception while copying Hadoop config files due to permission issues

2015-04-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7213: --- Component/s: YARN Exception while copying Hadoop config files due to permission issues

[jira] [Commented] (SPARK-5945) Spark should not retry a stage infinitely on a FetchFailedException

2015-04-29 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14518807#comment-14518807 ] Imran Rashid commented on SPARK-5945: - [~kayousterhout] can you please clarify -- did

[jira] [Updated] (SPARK-6803) [SparkR] Support SparkR Streaming

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6803: - Target Version/s: 1.5.0 (was: 1.4.0) [SparkR] Support SparkR Streaming

[jira] [Commented] (SPARK-3601) Kryo NPE for output operations on Avro complex Objects even after registering.

2015-04-29 Thread Nicolas PHUNG (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14519005#comment-14519005 ] Nicolas PHUNG commented on SPARK-3601: -- For GenericData.Array Avro, I use the

[jira] [Updated] (SPARK-6833) Extend `addPackage` so that any given R file can be sourced in the worker before functions are run.

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6833: - Target Version/s: 1.5.0 (was: 1.4.0) Extend `addPackage` so that any given R

[jira] [Updated] (SPARK-6813) SparkR style guide

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6813: - Target Version/s: 1.5.0 (was: 1.4.0) SparkR style guide --

[jira] [Created] (SPARK-7222) Added mathematical derivation in comment to LinearRegression with ElasticNet

2015-04-29 Thread DB Tsai (JIRA)
DB Tsai created SPARK-7222: -- Summary: Added mathematical derivation in comment to LinearRegression with ElasticNet Key: SPARK-7222 URL: https://issues.apache.org/jira/browse/SPARK-7222 Project: Spark

[jira] [Assigned] (SPARK-7225) CombineLimits optimizer does not work

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7225: --- Assignee: Apache Spark CombineLimits optimizer does not work

[jira] [Resolved] (SPARK-7080) Binary processing based aggregate operator

2015-04-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7080. Resolution: Fixed Fix Version/s: 1.4.0 Binary processing based aggregate operator

[jira] [Updated] (SPARK-6809) Make numPartitions optional in pairRDD APIs

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6809: - Priority: Major (was: Critical) Make numPartitions optional in pairRDD APIs

[jira] [Created] (SPARK-7224) Mock repositories for testing with --packages

2015-04-29 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7224: -- Summary: Mock repositories for testing with --packages Key: SPARK-7224 URL: https://issues.apache.org/jira/browse/SPARK-7224 Project: Spark Issue Type: Test

[jira] [Updated] (SPARK-6820) Convert NAs to null type in SparkR DataFrames

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6820: - Priority: Critical (was: Major) Convert NAs to null type in SparkR DataFrames

[jira] [Updated] (SPARK-6799) Add dataframe examples for SparkR

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6799: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-7228 Add

[jira] [Assigned] (SPARK-7202) Add SparseMatrixPickler to SerDe

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7202: --- Assignee: (was: Apache Spark) Add SparseMatrixPickler to SerDe

[jira] [Updated] (SPARK-6809) Make numPartitions optional in pairRDD APIs

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6809: - Target Version/s: 1.5.0 (was: 1.4.0) Make numPartitions optional in pairRDD

[jira] [Created] (SPARK-7225) CombineLimits do not works

2015-04-29 Thread Zhongshuai Pei (JIRA)
Zhongshuai Pei created SPARK-7225: - Summary: CombineLimits do not works Key: SPARK-7225 URL: https://issues.apache.org/jira/browse/SPARK-7225 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-6826) `hashCode` support for arbitrary R objects

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6826: - Target Version/s: 1.5.0 (was: 1.4.0) `hashCode` support for arbitrary R objects

[jira] [Commented] (SPARK-7032) SparkSQL incorrect results when using UNION/EXCEPT with GROUP BY clause

2015-04-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14518944#comment-14518944 ] Reynold Xin commented on SPARK-7032: cc [~cloud_fan] would you have time to take a

[jira] [Created] (SPARK-7230) Make RDD API private in SparkR for Spark 1.4

2015-04-29 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-7230: Summary: Make RDD API private in SparkR for Spark 1.4 Key: SPARK-7230 URL: https://issues.apache.org/jira/browse/SPARK-7230 Project: Spark

[jira] [Updated] (SPARK-7224) Mock repositories for testing with --packages

2015-04-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7224: --- Assignee: Burak Yavuz Mock repositories for testing with --packages

[jira] [Resolved] (SPARK-7076) Binary processing compact tuple representation

2015-04-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7076. Resolution: Fixed Fix Version/s: 1.4.0 Binary processing compact tuple representation

[jira] [Commented] (SPARK-7232) Add a Substitution batch for spark sql analyzer

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14519090#comment-14519090 ] Apache Spark commented on SPARK-7232: - User 'scwf' has created a pull request for this

[jira] [Created] (SPARK-7226) Support math functions in R DataFrame

2015-04-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-7226: -- Summary: Support math functions in R DataFrame Key: SPARK-7226 URL: https://issues.apache.org/jira/browse/SPARK-7226 Project: Spark Issue Type: Sub-task

[jira] [Resolved] (SPARK-7204) Call sites in UI are not accurate for DataFrame operations

2015-04-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7204. Resolution: Fixed Fix Version/s: 1.4.0 1.3.2 Call sites in UI are not

[jira] [Assigned] (SPARK-7232) Add a Substitution batch for spark sql analyzer

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7232: --- Assignee: Apache Spark Add a Substitution batch for spark sql analyzer

[jira] [Commented] (SPARK-7223) Rename RPC askWithReply -> askWithReply, sendWithReply -> ask

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14518778#comment-14518778 ] Apache Spark commented on SPARK-7223: - User 'rxin' has created a pull request for this

[jira] [Assigned] (SPARK-7229) SpecificMutableRow should take integer type as internal representation for DateType

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7229: --- Assignee: Apache Spark SpecificMutableRow should take integer type as internal

[jira] [Commented] (SPARK-5890) Add FeatureDiscretizer

2015-04-29 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14518862#comment-14518862 ] Xusen Yin commented on SPARK-5890: -- I start to do it. Add FeatureDiscretizer

[jira] [Updated] (SPARK-7222) Added mathematical derivation in comment and compressed the model to LinearRegression with ElasticNet

2015-04-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7222: --- Summary: Added mathematical derivation in comment and compressed the model to LinearRegression with

[jira] [Resolved] (SPARK-7215) Make repartition and coalesce a part of the query plan

2015-04-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7215. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Burak Yavuz Make repartition and

[jira] [Updated] (SPARK-6814) Support sorting for any data type in SparkR

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6814: - Target Version/s: 1.5.0 (was: 1.4.0) Support sorting for any data type in

[jira] [Assigned] (SPARK-7229) SpecificMutableRow should take integer type as internal representation for DateType

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7229: --- Assignee: (was: Apache Spark) SpecificMutableRow should take integer type as internal

[jira] [Updated] (SPARK-7225) CombineLimits do not works

2015-04-29 Thread Zhongshuai Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhongshuai Pei updated SPARK-7225: -- Description: The optimized logical plan of select key from (select key from src limit 100)

[jira] [Created] (SPARK-7228) SparkR public API for 1.4 release

2015-04-29 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-7228: Summary: SparkR public API for 1.4 release Key: SPARK-7228 URL: https://issues.apache.org/jira/browse/SPARK-7228 Project: Spark Issue Type:

[jira] [Updated] (SPARK-6832) Handle partial reads in SparkR JVM to worker communication

2015-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6832: - Target Version/s: 1.5.0 (was: 1.4.0) Handle partial reads in SparkR JVM to

[jira] [Resolved] (SPARK-7188) Support math functions in DataFrames in Python

2015-04-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7188. Resolution: Fixed Fix Version/s: 1.4.0 Support math functions in DataFrames in Python

[jira] [Updated] (SPARK-7225) CombineLimits optimizer does not work

2015-04-29 Thread Zhongshuai Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhongshuai Pei updated SPARK-7225: -- Summary: CombineLimits optimizer does not work (was: CombineLimits optimizer does not works)

[jira] [Updated] (SPARK-7225) CombineLimits optimizer does not works

2015-04-29 Thread Zhongshuai Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhongshuai Pei updated SPARK-7225: -- Summary: CombineLimits optimizer does not works (was: CombineLimits in Optimizer does not

[jira] [Updated] (SPARK-7222) Added mathematical derivation in comment and compressed the model to LinearRegression with ElasticNet

2015-04-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7222: --- Issue Type: Improvement (was: Documentation) Added mathematical derivation in comment and compressed the

[jira] [Commented] (SPARK-7217) Add configuration to disable stopping of SparkContext when StreamingContext.stop()

2015-04-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14519593#comment-14519593 ] Sean Owen commented on SPARK-7217: -- FWIW I'd expect the current behavior since things

[jira] [Updated] (SPARK-7236) AkkaUtils askWithReply sleeps indefinitely when a timeout exception is thrown

2015-04-29 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-7236: Attachment: SparkLongSleepAfterTimeout.scala Attaching some code to reproduce this issue.

[jira] [Commented] (SPARK-6989) Spark 1.3 REPL for Scala 2.11 (2.11.2) fails to start, emitting various arcane compiler errors

2015-04-29 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14519730#comment-14519730 ] Michael Allman commented on SPARK-6989: --- Thank you for looking into this. I've been

[jira] [Commented] (SPARK-5945) Spark should not retry a stage infinitely on a FetchFailedException

2015-04-29 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14519650#comment-14519650 ] Kay Ousterhout commented on SPARK-5945: --- I wanted to hardcode to 4 (totally agree

[jira] [Commented] (SPARK-7189) History server will always reload the same file even when no log file is updated

2015-04-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14519600#comment-14519600 ] Sean Owen commented on SPARK-7189: -- I thought that was the point, but maybe I

[jira] [Resolved] (SPARK-7209) Adding new Manning book Spark in Action to the official Spark Webpage

2015-04-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7209. -- Resolution: Fixed Looks like Patrick just added it, yes. Adding new Manning book Spark in Action to

[jira] [Resolved] (SPARK-7223) Rename RPC askWithReply - askWithReply, sendWithReply - ask

2015-04-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7223. Resolution: Fixed Fix Version/s: 1.4.0 Rename RPC askWithReply - askWithReply,

[jira] [Updated] (SPARK-7194) Vectors factors method for sparse vectors should accept the output of zipWithIndex

2015-04-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7194: - Component/s: MLlib Priority: Minor (was: Major) Affects Version/s: 1.3.1 Go ahead

[jira] [Closed] (SPARK-7238) Upgrade protobuf-java (com.google.protobuf) version from 2.4.1 to 2.5.0

2015-04-29 Thread Favio Vázquez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Favio Vázquez closed SPARK-7238. Resolution: Won't Fix Upgrade protobuf-java (com.google.protobuf) version from 2.4.1 to 2.5.0

[jira] [Created] (SPARK-7248) Random number generators for DataFrames

2015-04-29 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7248: Summary: Random number generators for DataFrames Key: SPARK-7248 URL: https://issues.apache.org/jira/browse/SPARK-7248 Project: Spark Issue Type: New

[jira] [Created] (SPARK-7247) Add Pandas' shift method to the Dataframe API

2015-04-29 Thread Olivier Girardot (JIRA)
Olivier Girardot created SPARK-7247: --- Summary: Add Pandas' shift method to the Dataframe API Key: SPARK-7247 URL: https://issues.apache.org/jira/browse/SPARK-7247 Project: Spark Issue

[jira] [Assigned] (SPARK-7250) computeInverse for RowMatrix

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7250: --- Assignee: (was: Apache Spark) computeInverse for RowMatrix

[jira] [Commented] (SPARK-7250) computeInverse for RowMatrix

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14520053#comment-14520053 ] Apache Spark commented on SPARK-7250: - User 'SpyderRiverA' has created a pull request

[jira] [Assigned] (SPARK-7250) computeInverse for RowMatrix

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7250: --- Assignee: Apache Spark computeInverse for RowMatrix

[jira] [Commented] (SPARK-7230) Make RDD API private in SparkR for Spark 1.4

2015-04-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14520058#comment-14520058 ] Patrick Wendell commented on SPARK-7230: I think this is a good idea. We should

[jira] [Created] (SPARK-7252) Add support for creating new Hive and HBase delegation tokens

2015-04-29 Thread Hari Shreedharan (JIRA)
Hari Shreedharan created SPARK-7252: --- Summary: Add support for creating new Hive and HBase delegation tokens Key: SPARK-7252 URL: https://issues.apache.org/jira/browse/SPARK-7252 Project: Spark

[jira] [Assigned] (SPARK-7237) Many user provided closures are not actually cleaned

2015-04-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7237: --- Assignee: Andrew Or (was: Apache Spark) Many user provided closures are not actually

[jira] [Updated] (SPARK-7237) Many user provided closures are not actually cleaned

2015-04-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7237: - Description: It appears that many operations throughout Spark do not actually clean the
