[jira] [Commented] (SPARK-5817) UDTF column names didn't set properly

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14321208#comment-14321208 ] Apache Spark commented on SPARK-5817: - User 'chenghao-intel' has created a pull reques

[jira] [Updated] (SPARK-5817) UDTF column names didn't set properly

2015-02-13 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-5817: - Description: createQueryTest("insert table with generator with column name", """ CREATE TABLE ge

[jira] [Created] (SPARK-5817) UDTF column names didn't set properly

2015-02-13 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-5817: Summary: UDTF column names didn't set properly Key: SPARK-5817 URL: https://issues.apache.org/jira/browse/SPARK-5817 Project: Spark Issue Type: Bug Compon

[jira] [Resolved] (SPARK-5679) Flaky tests in InputOutputMetricsSuite: input metrics with interleaved reads and input metrics with mixed read method

2015-02-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5679. Resolution: Fixed Fix Version/s: 1.2.2 1.3.0 Assignee: Jos

[jira] [Resolved] (SPARK-5227) InputOutputMetricsSuite "input metrics when reading text file with multiple splits" test fails in branch-1.2 SBT Jenkins build w/hadoop1.0 and hadoop2.0 profiles

2015-02-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5227. Resolution: Fixed Fix Version/s: 1.2.2 1.3.0 Assignee: Jos

[jira] [Created] (SPARK-5816) Add huge backward compatibility warning in DriverWrapper

2015-02-13 Thread Andrew Or (JIRA)
Andrew Or created SPARK-5816: Summary: Add huge backward compatibility warning in DriverWrapper Key: SPARK-5816 URL: https://issues.apache.org/jira/browse/SPARK-5816 Project: Spark Issue Type: Bu

[jira] [Commented] (SPARK-5016) GaussianMixtureEM should distribute matrix inverse for large numFeatures, k

2015-02-13 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14321099#comment-14321099 ] Travis Galoppo commented on SPARK-5016: --- Hmm. I'm having trouble conceptualizing how

[jira] [Created] (SPARK-5815) Deprecate SVDPlusPlus APIs that expose DoubleMatrix from JBLAS

2015-02-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5815: Summary: Deprecate SVDPlusPlus APIs that expose DoubleMatrix from JBLAS Key: SPARK-5815 URL: https://issues.apache.org/jira/browse/SPARK-5815 Project: Spark

[jira] [Created] (SPARK-5814) Remove JBLAS from runtime dependencies

2015-02-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5814: Summary: Remove JBLAS from runtime dependencies Key: SPARK-5814 URL: https://issues.apache.org/jira/browse/SPARK-5814 Project: Spark Issue Type: Dependency u

[jira] [Resolved] (SPARK-5730) Group methods in the generated doc for spark.ml algorithms.

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5730. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4600 [https://githu

[jira] [Resolved] (SPARK-5803) Use ArrayBuilder instead of ArrayBuffer for primitive types

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5803. -- Resolution: Fixed Fix Version/s: 1.3.0 > Use ArrayBuilder instead of ArrayBuffer for prim

[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-13 Thread Chris T (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14321028#comment-14321028 ] Chris T commented on SPARK-5436: What we normally observe is that the error rate assessed

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-02-13 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320995#comment-14320995 ] Florian Verhein commented on SPARK-3821: RE: Java, that reminds me... We should pr

[jira] [Created] (SPARK-5813) Spark-ec2: Switch to OracleJDK

2015-02-13 Thread Florian Verhein (JIRA)
Florian Verhein created SPARK-5813: -- Summary: Spark-ec2: Switch to OracleJDK Key: SPARK-5813 URL: https://issues.apache.org/jira/browse/SPARK-5813 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5363) Spark 1.2 freeze without error notification

2015-02-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320987#comment-14320987 ] Davies Liu commented on SPARK-5363: --- [~TJKlein] Could you try the patch in https://githu

[jira] [Commented] (SPARK-5363) Spark 1.2 freeze without error notification

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320986#comment-14320986 ] Apache Spark commented on SPARK-5363: - User 'davies' has created a pull request for th

[jira] [Resolved] (SPARK-5806) Organize sections in mllib-clustering.md

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5806. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4598 [https://githu

[jira] [Commented] (SPARK-5016) GaussianMixtureEM should distribute matrix inverse for large numFeatures, k

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320935#comment-14320935 ] Xiangrui Meng commented on SPARK-5016: -- I think we should compute the inverse in para

[jira] [Updated] (SPARK-5812) Potential flaky test JavaAPISuite.glom

2015-02-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-5812: - Labels: flaky-test (was: ) > Potential flaky test JavaAPISuite.glom > ---

[jira] [Created] (SPARK-5812) Potential flaky test JavaAPISuite.glom

2015-02-13 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-5812: Summary: Potential flaky test JavaAPISuite.glom Key: SPARK-5812 URL: https://issues.apache.org/jira/browse/SPARK-5812 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-5779) Python broadcast does not work with Kryo serializer

2015-02-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5779: -- Affects Version/s: (was: 1.2.1) (was: 1.3.0) 1.2.0

[jira] [Commented] (SPARK-5730) Group methods in the generated doc for spark.ml algorithms.

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320908#comment-14320908 ] Apache Spark commented on SPARK-5730: - User 'mengxr' has created a pull request for th

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-02-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320905#comment-14320905 ] Nicholas Chammas commented on SPARK-3821: - If you want Java 8 alongside 7, you can

[jira] [Commented] (SPARK-5679) Flaky tests in InputOutputMetricsSuite: input metrics with interleaved reads and input metrics with mixed read method

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320904#comment-14320904 ] Apache Spark commented on SPARK-5679: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-5227) InputOutputMetricsSuite "input metrics when reading text file with multiple splits" test fails in branch-1.2 SBT Jenkins build w/hadoop1.0 and hadoop2.0 profiles

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320903#comment-14320903 ] Apache Spark commented on SPARK-5227: - User 'JoshRosen' has created a pull request for

[jira] [Closed] (SPARK-5779) Python broadcast does not work with Kryo serializer

2015-02-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-5779. - Resolution: Duplicate > Python broadcast does not work with Kryo serializer >

[jira] [Commented] (SPARK-5779) Python broadcast does not work with Kryo serializer

2015-02-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320897#comment-14320897 ] Davies Liu commented on SPARK-5779: --- Yes, I will close it. > Python broadcast does not

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-02-13 Thread Chris Love (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320874#comment-14320874 ] Chris Love commented on SPARK-3821: --- I notice that the packer built ami comes with java7

[jira] [Created] (SPARK-5811) Documentation for --packages and --repositories on Spark Shell

2015-02-13 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-5811: -- Summary: Documentation for --packages and --repositories on Spark Shell Key: SPARK-5811 URL: https://issues.apache.org/jira/browse/SPARK-5811 Project: Spark Iss

[jira] [Created] (SPARK-5810) Maven Coordinate Inclusion failing in pySpark

2015-02-13 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-5810: -- Summary: Maven Coordinate Inclusion failing in pySpark Key: SPARK-5810 URL: https://issues.apache.org/jira/browse/SPARK-5810 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-5730) Group methods in the generated doc for spark.ml algorithms.

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5730: - Assignee: Xiangrui Meng > Group methods in the generated doc for spark.ml algorithms. > --

[jira] [Resolved] (SPARK-5789) Throw a better error message if JsonRDD.parseJson encounters unrecoverable parsing errors.

2015-02-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5789. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4582 [https:/

[jira] [Resolved] (SPARK-5642) Apply column pruning on unused aggregation fields

2015-02-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5642. - Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Adrian Wang > Apply colum

[jira] [Commented] (SPARK-5806) Organize sections in mllib-clustering.md

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320812#comment-14320812 ] Apache Spark commented on SPARK-5806: - User 'mengxr' has created a pull request for th

[jira] [Created] (SPARK-5809) OutOfMemoryError in logDebug in RandomForest.scala

2015-02-13 Thread Devesh Parekh (JIRA)
Devesh Parekh created SPARK-5809: Summary: OutOfMemoryError in logDebug in RandomForest.scala Key: SPARK-5809 URL: https://issues.apache.org/jira/browse/SPARK-5809 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5779) Python broadcast does not work with Kryo serializer

2015-02-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320803#comment-14320803 ] Josh Rosen commented on SPARK-5779: --- I thought we fixed this in SPARK-4882: https://git

[jira] [Commented] (SPARK-4865) Include temporary tables in SHOW TABLES

2015-02-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320804#comment-14320804 ] Yin Huai commented on SPARK-4865: - I will start to work on it based on SPARK-3299. > Incl

[jira] [Updated] (SPARK-4865) Include temporary tables in SHOW TABLES

2015-02-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4865: Priority: Blocker (was: Critical) > Include temporary tables in SHOW TABLES > -

[jira] [Commented] (SPARK-5731) Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSuite.basic stream receiving with multiple topics and smallest starting offset

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320796#comment-14320796 ] Apache Spark commented on SPARK-5731: - User 'tdas' has created a pull request for this

[jira] [Created] (SPARK-5808) Assembly generated by sbt does not contain pyspark

2015-02-13 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-5808: - Summary: Assembly generated by sbt does not contain pyspark Key: SPARK-5808 URL: https://issues.apache.org/jira/browse/SPARK-5808 Project: Spark Issue Type

[jira] [Updated] (SPARK-5807) Parallel grid search

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-5807: - Description: Right now in CrossValidator for each fold combination and ParamGrid hyperparameter p

[jira] [Commented] (SPARK-5798) Spark shell issue

2015-02-13 Thread DeepakVohra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320763#comment-14320763 ] DeepakVohra commented on SPARK-5798: Re-tested on local OS Oracle Linux 6.5 and did no

[jira] [Commented] (SPARK-5731) Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSuite.basic stream receiving with multiple topics and smallest starting offset

2015-02-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320754#comment-14320754 ] Tathagata Das commented on SPARK-5731: -- This is very weird. the stream is receiving m

[jira] [Updated] (SPARK-5807) Parallel grid search

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-5807: - Description: Right now in CrossValidator for each fold combination and ParamGrid hyperparameter p

[jira] [Commented] (SPARK-5731) Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSuite.basic stream receiving with multiple topics and smallest starting offset

2015-02-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320749#comment-14320749 ] Tathagata Das commented on SPARK-5731: -- Let me take a pass at it. > Flaky Test: org.

[jira] [Created] (SPARK-5807) Parallel grid search

2015-02-13 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-5807: Summary: Parallel grid search Key: SPARK-5807 URL: https://issues.apache.org/jira/browse/SPARK-5807 Project: Spark Issue Type: New Feature Compone

[jira] [Commented] (SPARK-5731) Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSuite.basic stream receiving with multiple topics and smallest starting offset

2015-02-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320739#comment-14320739 ] Patrick Wendell commented on SPARK-5731: [~c...@koeninger.org] [~tdas] FYI we've d

[jira] [Updated] (SPARK-5731) Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSuite.basic stream receiving with multiple topics and smallest starting offset

2015-02-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5731: --- Labels: flaky-test (was: ) > Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSu

[jira] [Updated] (SPARK-5806) Organize sections in mllib-clustering.md

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5806: - Description: We separate code examples from algorithm descriptions. It would be better if we put t

[jira] [Updated] (SPARK-5731) Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSuite.basic stream receiving with multiple topics and smallest starting offset

2015-02-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5731: --- Priority: Blocker (was: Major) > Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStre

[jira] [Created] (SPARK-5806) Organize sections in mllib-clustering.md

2015-02-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5806: Summary: Organize sections in mllib-clustering.md Key: SPARK-5806 URL: https://issues.apache.org/jira/browse/SPARK-5806 Project: Spark Issue Type: Improvemen

[jira] [Updated] (SPARK-5805) Fix the type error in the final example given in MLlib - Clustering documentation

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5805: - Assignee: Emre Sevinç > Fix the type error in the final example given in MLlib - Clustering > doc

[jira] [Resolved] (SPARK-5805) Fix the type error in the final example given in MLlib - Clustering documentation

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5805. -- Resolution: Fixed Fix Version/s: 1.3.0 Target Version/s: 1.3.0 > Fix the type e

[jira] [Commented] (SPARK-5805) Fix the type error in the final example given in MLlib - Clustering documentation

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320680#comment-14320680 ] Apache Spark commented on SPARK-5805: - User 'emres' has created a pull request for thi

[jira] [Commented] (SPARK-5227) InputOutputMetricsSuite "input metrics when reading text file with multiple splits" test fails in branch-1.2 SBT Jenkins build w/hadoop1.0 and hadoop2.0 profiles

2015-02-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320653#comment-14320653 ] Josh Rosen commented on SPARK-5227: --- I think this might be caused by HADOOP-8490: the te

[jira] [Created] (SPARK-5805) Fix the type error in the final example given in MLlib - Clustering documentation

2015-02-13 Thread JIRA
Emre Sevinç created SPARK-5805: -- Summary: Fix the type error in the final example given in MLlib - Clustering documentation Key: SPARK-5805 URL: https://issues.apache.org/jira/browse/SPARK-5805 Project:

[jira] [Commented] (SPARK-5804) Explicitly manage cache in Crossvalidation k-fold loop

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320607#comment-14320607 ] Apache Spark commented on SPARK-5804: - User 'petro-rudenko' has created a pull request

[jira] [Created] (SPARK-5804) Explicitly manage cache in Crossvalidation k-fold loop

2015-02-13 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-5804: Summary: Explicitly manage cache in Crossvalidation k-fold loop Key: SPARK-5804 URL: https://issues.apache.org/jira/browse/SPARK-5804 Project: Spark Issue Ty

[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320586#comment-14320586 ] Manoj Kumar commented on SPARK-5436: Before I dive into the internals, a question rela

[jira] [Commented] (SPARK-5746) INSERT OVERWRITE throws FileNotFoundException when the source and destination point to the same table.

2015-02-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320562#comment-14320562 ] Yin Huai commented on SPARK-5746: - For now, we will throw an error when we find this case.

[jira] [Commented] (SPARK-5770) Use addJar() to upload a new jar file to executor, it can't be added to classloader

2015-02-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320552#comment-14320552 ] Marcelo Vanzin commented on SPARK-5770: --- It might be possible to fix the behavior, a

[jira] [Updated] (SPARK-5785) Pyspark does not support narrow dependencies

2015-02-13 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-5785: Description: joins (& cogroups etc.) are always considered to have "wide" dependencies in pyspark,

[jira] [Commented] (SPARK-5798) Spark shell issue

2015-02-13 Thread DeepakVohra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320533#comment-14320533 ] DeepakVohra commented on SPARK-5798: Thanks Sean for testing. Not all Spark/Scala co

[jira] [Commented] (SPARK-5803) Use ArrayBuilder instead of ArrayBuffer for primitive types

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320528#comment-14320528 ] Apache Spark commented on SPARK-5803: - User 'mengxr' has created a pull request for th

[jira] [Resolved] (SPARK-5345) Fix unstable test case in FsHistoryProviderSuite

2015-02-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5345. --- Resolution: Fixed It looks like this has been fixed by SPARK-5600, so I'm going to resolve this for n

[jira] [Resolved] (SPARK-5626) Spurious test failures due to NullPointerException in EasyMock test code

2015-02-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5626. --- Resolution: Fixed > Spurious test failures due to NullPointerException in EasyMock test code > ---

[jira] [Created] (SPARK-5803) Use ArrayBuilder instead of ArrayBuffer for primitive types

2015-02-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5803: Summary: Use ArrayBuilder instead of ArrayBuffer for primitive types Key: SPARK-5803 URL: https://issues.apache.org/jira/browse/SPARK-5803 Project: Spark Is

[jira] [Commented] (SPARK-5626) Spurious test failures due to NullPointerException in EasyMock test code

2015-02-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320519#comment-14320519 ] Josh Rosen commented on SPARK-5626: --- This should hopefully be fixed now that I've merged

[jira] [Commented] (SPARK-5770) Use addJar() to upload a new jar file to executor, it can't be added to classloader

2015-02-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320510#comment-14320510 ] Sean Owen commented on SPARK-5770: -- Yeah I think that's the point, that overwriting an ex

[jira] [Commented] (SPARK-5726) Hadamard Vector Product Transformer

2015-02-13 Thread Octavian Geagla (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320514#comment-14320514 ] Octavian Geagla commented on SPARK-5726: Ok, I've made the change on the PR. Than

[jira] [Commented] (SPARK-5770) Use addJar() to upload a new jar file to executor, it can't be added to classloader

2015-02-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320504#comment-14320504 ] Marcelo Vanzin commented on SPARK-5770: --- bq. but the classloader still load the old

[jira] [Commented] (SPARK-5296) Predicate Pushdown (BaseRelation) to have an interface that will accept OR filters

2015-02-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320499#comment-14320499 ] Michael Armbrust commented on SPARK-5296: - Oh, good point... We should pass down n

[jira] [Commented] (SPARK-5802) Cache scaled data in GLM

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320488#comment-14320488 ] Apache Spark commented on SPARK-5802: - User 'mengxr' has created a pull request for th

[jira] [Created] (SPARK-5802) Cache scaled data in GLM

2015-02-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5802: Summary: Cache scaled data in GLM Key: SPARK-5802 URL: https://issues.apache.org/jira/browse/SPARK-5802 Project: Spark Issue Type: Improvement Comp

[jira] [Updated] (SPARK-5529) BlockManager heartbeat expiration does not kill executor

2015-02-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5529: - Summary: BlockManager heartbeat expiration does not kill executor (was: Executor is still hold while Bloc

[jira] [Updated] (SPARK-5529) Executor is still hold while BlockManager has been removed

2015-02-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5529: - Component/s: YARN > Executor is still hold while BlockManager has been removed > -

[jira] [Updated] (SPARK-5529) BlockManager heartbeat expiration does not kill executor

2015-02-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5529: - Assignee: Hong Shen > BlockManager heartbeat expiration does not kill executor > -

[jira] [Closed] (SPARK-5735) Replace uses of EasyMock with Mockito

2015-02-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5735. Resolution: Fixed Fix Version/s: 1.3.0 > Replace uses of EasyMock with Mockito >

[jira] [Updated] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue

2015-02-13 Thread Mark Khaitman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Khaitman updated SPARK-5782: - Priority: Critical (was: Major) > Python Worker / Pyspark Daemon Memory Issue > -

[jira] [Updated] (SPARK-5801) Shuffle creates too many nested directories

2015-02-13 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-5801: -- Component/s: Shuffle > Shuffle creates too many nested directories > ---

[jira] [Resolved] (SPARK-4903) RDD remains cached after "DROP TABLE"

2015-02-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-4903. - Resolution: Fixed I am resolving it. It has been fixed by SPARK-4912 ([commit|https://github.com/apache/s

[jira] [Created] (SPARK-5801) Shuffle creates too many nested directories

2015-02-13 Thread Kay Ousterhout (JIRA)
Kay Ousterhout created SPARK-5801: - Summary: Shuffle creates too many nested directories Key: SPARK-5801 URL: https://issues.apache.org/jira/browse/SPARK-5801 Project: Spark Issue Type: Bug

[jira] [Issue Comment Deleted] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue

2015-02-13 Thread Mark Khaitman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Khaitman updated SPARK-5782: - Comment: was deleted (was: Would it make sense to instead make the _next_limit return the MIN of t

[jira] [Resolved] (SPARK-5503) Example code for Power Iteration Clustering

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5503. -- Resolution: Fixed Fix Version/s: 1.3.0 > Example code for Power Iteration Clustering > --

[jira] [Closed] (SPARK-5732) Add an option to print the spark version in spark script

2015-02-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5732. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: uncleGen > Add an option to print the spark

[jira] [Commented] (SPARK-4903) RDD remains cached after "DROP TABLE"

2015-02-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320439#comment-14320439 ] Yin Huai commented on SPARK-4903: - I believe that it has been resolved in 1.3 ([see this|

[jira] [Updated] (SPARK-5732) Add an option to print the spark version in spark script

2015-02-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5732: - Affects Version/s: 1.0.0 > Add an option to print the spark version in spark script >

[jira] [Updated] (SPARK-5723) Change the default file format to Parquet for CTAS statements.

2015-02-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5723: Priority: Blocker (was: Major) Target Version/s: 1.3.0 > Change the default file format to Parq

[jira] [Commented] (SPARK-5296) Predicate Pushdown (BaseRelation) to have an interface that will accept OR filters

2015-02-13 Thread Corey J. Nolet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320417#comment-14320417 ] Corey J. Nolet commented on SPARK-5296: --- bq. I would only pass down ORs though, as

[jira] [Comment Edited] (SPARK-5296) Predicate Pushdown (BaseRelation) to have an interface that will accept OR filters

2015-02-13 Thread Corey J. Nolet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320417#comment-14320417 ] Corey J. Nolet edited comment on SPARK-5296 at 2/13/15 5:33 PM:

[jira] [Resolved] (SPARK-5798) Spark shell issue

2015-02-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5798. -- Resolution: Cannot Reproduce I tried the 1.2.0+CDH binary release on OS X, Windows (cygwin and powershel

[jira] [Updated] (SPARK-4131) Support "Writing data into the filesystem from queries"

2015-02-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4131: Fix Version/s: (was: 1.3.0) > Support "Writing data into the filesystem from queries" >

[jira] [Updated] (SPARK-4131) Support "Writing data into the filesystem from queries"

2015-02-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4131: Target Version/s: 1.4.0 (was: 1.1.0) > Support "Writing data into the filesystem from queries" > --

[jira] [Commented] (SPARK-5800) Streaming. Change linked files according the selected language

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320369#comment-14320369 ] Apache Spark commented on SPARK-5800: - User 'gasparms' has created a pull request for

[jira] [Created] (SPARK-5800) Streaming. Change linked files according the selected language

2015-02-13 Thread JIRA
Gaspar Muñoz created SPARK-5800: Summary: Streaming. Change linked files according the selected language Key: SPARK-5800 URL: https://issues.apache.org/jira/browse/SPARK-5800 Project: Spark

[jira] [Commented] (SPARK-5800) Streaming. Change linked files according the selected language

2015-02-13 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-5800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320367#comment-14320367 ] Gaspar Muñoz commented on SPARK-5800: -- I did a PR https://github.com/apache/spark/pu

[jira] [Commented] (SPARK-5799) Compute aggregation function on specified numeric columns

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320352#comment-14320352 ] Apache Spark commented on SPARK-5799: - User 'viirya' has created a pull request for th

[jira] [Created] (SPARK-5799) Compute aggregation function on specified numeric columns

2015-02-13 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5799: -- Summary: Compute aggregation function on specified numeric columns Key: SPARK-5799 URL: https://issues.apache.org/jira/browse/SPARK-5799 Project: Spark I

[jira] [Commented] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue

2015-02-13 Thread Mark Khaitman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320309#comment-14320309 ] Mark Khaitman commented on SPARK-5782: -- Would it make sense to instead make the _next

[jira] [Created] (SPARK-5798) Spark shell issue

2015-02-13 Thread DeepakVohra (JIRA)
DeepakVohra created SPARK-5798: -- Summary: Spark shell issue Key: SPARK-5798 URL: https://issues.apache.org/jira/browse/SPARK-5798 Project: Spark Issue Type: Bug Components: Input/Outpu

  1   2   >