[jira] [Commented] (SPARK-5794) add jar should return 0

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319712#comment-14319712 ] Apache Spark commented on SPARK-5794: - User 'adrian-wang' has created a pull request f

[jira] [Created] (SPARK-5794) add jar should return 0

2015-02-12 Thread Adrian Wang (JIRA)
Adrian Wang created SPARK-5794: -- Summary: add jar should return 0 Key: SPARK-5794 URL: https://issues.apache.org/jira/browse/SPARK-5794 Project: Spark Issue Type: Bug Components: SQL

[jira] [Created] (SPARK-5793) Add explode to Column

2015-02-12 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5793: -- Summary: Add explode to Column Key: SPARK-5793 URL: https://issues.apache.org/jira/browse/SPARK-5793 Project: Spark Issue Type: Improvement Com

[jira] [Commented] (SPARK-5793) Add explode to Column

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319706#comment-14319706 ] Apache Spark commented on SPARK-5793: - User 'viirya' has created a pull request for th

[jira] [Commented] (SPARK-1955) VertexRDD can incorrectly assume index sharing

2015-02-12 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319672#comment-14319672 ] Ankur Dave commented on SPARK-1955: --- [~boyork] Thanks, it would be great if you could ta

[jira] [Commented] (SPARK-1955) VertexRDD can incorrectly assume index sharing

2015-02-12 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319661#comment-14319661 ] Brennon York commented on SPARK-1955: - [~ankurdave] if you haven't started on this I c

[jira] [Updated] (SPARK-5296) Predicate Pushdown (BaseRelation) to have an interface that will accept OR filters

2015-02-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5296: Assignee: Cheng Lian > Predicate Pushdown (BaseRelation) to have an interface that will acce

[jira] [Commented] (SPARK-5641) Allow spark_ec2.py to copy arbitrary files to cluster

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319651#comment-14319651 ] Apache Spark commented on SPARK-5641: - User 'florianverhein' has created a pull reques

[jira] [Commented] (SPARK-5296) Predicate Pushdown (BaseRelation) to have an interface that will accept OR filters

2015-02-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319642#comment-14319642 ] Michael Armbrust commented on SPARK-5296: - As I mentioned on the mailing list, I t

[jira] [Commented] (SPARK-5792) hive udfs like "get_json_object and json_tuple" doesnot work in spark 1.2.0

2015-02-12 Thread pengxu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319645#comment-14319645 ] pengxu commented on SPARK-5792: --- I've already figured out the reason, it was caused by the j

[jira] [Resolved] (SPARK-3365) Failure to save Lists to Parquet

2015-02-12 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-3365. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4581 [https://github.com/

[jira] [Updated] (SPARK-5296) Predicate Pushdown (BaseRelation) to have an interface that will accept OR filters

2015-02-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5296: Priority: Critical (was: Major) > Predicate Pushdown (BaseRelation) to have an interface th

[jira] [Updated] (SPARK-5296) Predicate Pushdown (BaseRelation) to have an interface that will accept OR filters

2015-02-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5296: Target Version/s: 1.3.0 > Predicate Pushdown (BaseRelation) to have an interface that will a

[jira] [Commented] (SPARK-5792) hive udfs like "get_json_object and json_tuple" doesnot work in spark 1.2.0

2015-02-12 Thread Yi Tian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319618#comment-14319618 ] Yi Tian commented on SPARK-5792: Can you provide more information about this issue? Like:

[jira] [Commented] (SPARK-5016) GaussianMixtureEM should distribute matrix inverse for large numFeatures, k

2015-02-12 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319603#comment-14319603 ] Manoj Kumar commented on SPARK-5016: [~mengxr] [~tgaloppo] Should I proceed in this di

[jira] [Updated] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue

2015-02-12 Thread Mark Khaitman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Khaitman updated SPARK-5782: - Description: I'm including the Shuffle component on this, as a brief scan through the code (which

[jira] [Updated] (SPARK-5641) Allow spark_ec2.py to copy arbitrary files to cluster

2015-02-12 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Verhein updated SPARK-5641: --- Description: *Updated - no longer via deploy.generic, no substitutions* Essentially, give use

[jira] [Created] (SPARK-5792) hive udfs like "get_json_object and json_tuple" doesnot work in spark 1.2.0

2015-02-12 Thread pengxu (JIRA)
pengxu created SPARK-5792: - Summary: hive udfs like "get_json_object and json_tuple" doesnot work in spark 1.2.0 Key: SPARK-5792 URL: https://issues.apache.org/jira/browse/SPARK-5792 Project: Spark

[jira] [Updated] (SPARK-5641) Allow spark_ec2.py to copy arbitrary files to cluster

2015-02-12 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Verhein updated SPARK-5641: --- Summary: Allow spark_ec2.py to copy arbitrary files to cluster (was: Allow spark_ec2.py to co

[jira] [Commented] (SPARK-5789) Throw a better error message if JsonRDD.parseJson encounters unrecoverable parsing errors.

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319512#comment-14319512 ] Apache Spark commented on SPARK-5789: - User 'yhuai' has created a pull request for thi

[jira] [Commented] (SPARK-3365) Failure to save Lists to Parquet

2015-02-12 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319505#comment-14319505 ] Cheng Lian commented on SPARK-3365: --- Actually, after rethinking about this, your solutio

[jira] [Commented] (SPARK-5310) Update SQL programming guide for 1.3

2015-02-12 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319500#comment-14319500 ] Yin Huai commented on SPARK-5310: - It will be helpful to add our parser's reserved keyword

[jira] [Commented] (SPARK-3365) Failure to save Lists to Parquet

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319493#comment-14319493 ] Apache Spark commented on SPARK-3365: - User 'tianyi' has created a pull request for th

[jira] [Commented] (SPARK-5726) Hadamard Vector Product Transformer

2015-02-12 Thread Octavian Geagla (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319475#comment-14319475 ] Octavian Geagla commented on SPARK-5726: I like the name ElementwiseProduct also,

[jira] [Commented] (SPARK-5726) Hadamard Vector Product Transformer

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319471#comment-14319471 ] Apache Spark commented on SPARK-5726: - User 'ogeagla' has created a pull request for t

[jira] [Resolved] (SPARK-3299) [SQL] Public API in SQLContext to list tables

2015-02-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3299. - Resolution: Fixed Fix Version/s: 1.3.0 > [SQL] Public API in SQLContext to list tab

[jira] [Commented] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-02-12 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319400#comment-14319400 ] Cheng Hao commented on SPARK-5791: -- Can you also attach the performance comparison result

[jira] [Closed] (SPARK-5764) Delete the cache and lock file after executor fetching the jar

2015-02-12 Thread meiyoula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] meiyoula closed SPARK-5764. --- Resolution: Not a Problem > Delete the cache and lock file after executor fetching the jar > -

[jira] [Updated] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-02-12 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-5791: --- Description: Spark SQL shows poor performance when multiple tables do join operation (was: Spark SQL shows po

[jira] [Commented] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-02-12 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319364#comment-14319364 ] Yi Zhou commented on SPARK-5791: For example: SELECT * FROM inventory inv JOIN (

[jira] [Created] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-02-12 Thread Yi Zhou (JIRA)
Yi Zhou created SPARK-5791: -- Summary: [Spark SQL] show poor performance when multiple table do join operation Key: SPARK-5791 URL: https://issues.apache.org/jira/browse/SPARK-5791 Project: Spark Is

[jira] [Updated] (SPARK-5790) VertexRDD's won't zip properly for `diff` capability

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5790: - Assignee: Brennon York > VertexRDD's won't zip properly for `diff` capability > --

[jira] [Commented] (SPARK-5790) VertexRDD's won't zip properly for `diff` capability

2015-02-12 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319316#comment-14319316 ] Brennon York commented on SPARK-5790: - FWIW this issue is a blocker for [SPARK-4600|h

[jira] [Updated] (SPARK-5790) VertexRDD's won't zip properly for `diff` capability

2015-02-12 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brennon York updated SPARK-5790: Description: For VertexRDD's with differing partition sizes one cannot run commands like `diff` as

[jira] [Created] (SPARK-5790) VertexRDD's won't zip properly for `diff` capability

2015-02-12 Thread Brennon York (JIRA)
Brennon York created SPARK-5790: --- Summary: VertexRDD's won't zip properly for `diff` capability Key: SPARK-5790 URL: https://issues.apache.org/jira/browse/SPARK-5790 Project: Spark Issue Type:

[jira] [Updated] (SPARK-3168) The ServletContextHandler of webui lacks a SessionManager

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3168: - Component/s: (was: Spark Core) Web UI Priority: Minor (was: Major) Issue

[jira] [Updated] (SPARK-5762) Shuffle write time is incorrect for sort-based shuffle

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5762: - Target Version/s: 1.3.0 (was: 1.3.0, 1.2.2) > Shuffle write time is incorrect for sort-based shuffle > --

[jira] [Updated] (SPARK-5762) Shuffle write time is incorrect for sort-based shuffle

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5762: - Fix Version/s: (was: 1.2.2) > Shuffle write time is incorrect for sort-based shuffle > ---

[jira] [Updated] (SPARK-5335) Destroying cluster in VPC with "--delete-groups" fails to remove security groups

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5335: - Assignee: Vladimir Grigor > Destroying cluster in VPC with "--delete-groups" fails to remove security > g

[jira] [Resolved] (SPARK-5335) Destroying cluster in VPC with "--delete-groups" fails to remove security groups

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5335. -- Resolution: Fixed Fix Version/s: 1.2.2 1.3.0 Issue resolved by pull request 41

[jira] [Resolved] (SPARK-5573) Support explode in DataFrame DSL

2015-02-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5573. - Resolution: Fixed Fix Version/s: 1.3.0 > Support explode in DataFrame DSL > ---

[jira] [Resolved] (SPARK-5758) Use LongType as the default type for integers in JSON schema inference.

2015-02-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5758. - Resolution: Fixed Fix Version/s: 1.3.0 > Use LongType as the default type for integ

[jira] [Created] (SPARK-5789) Throw a better error message if JsonRDD.parseJson encounters unrecoverable parsing errors.

2015-02-12 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5789: --- Summary: Throw a better error message if JsonRDD.parseJson encounters unrecoverable parsing errors. Key: SPARK-5789 URL: https://issues.apache.org/jira/browse/SPARK-5789 Projec

[jira] [Commented] (SPARK-5726) Hadamard Vector Product Transformer

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319182#comment-14319182 ] Sean Owen commented on SPARK-5726: -- You can ignore this comment, but I wonder if it would

[jira] [Closed] (SPARK-5780) The loggings of Python unittests are noisy and scaring in

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5780. Resolution: Fixed Fix Version/s: 1.3.0 Target Version/s: 1.3.0 (was: 1.4.0) > The loggings

[jira] [Updated] (SPARK-5780) The loggings of Python unittests are noisy and scaring in

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5780: - Affects Version/s: (was: 1.4.0) > The loggings of Python unittests are noisy and scaring in > ---

[jira] [Updated] (SPARK-5726) Hadamard Vector Product Transformer

2015-02-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5726: - Assignee: Octavian Geagla > Hadamard Vector Product Transformer >

[jira] [Commented] (SPARK-5726) Hadamard Vector Product Transformer

2015-02-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319163#comment-14319163 ] Xiangrui Meng commented on SPARK-5726: -- This is a nice feature. I like the name `Hada

[jira] [Closed] (SPARK-5759) ExecutorRunnable should catch YarnException while NMClient start container

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5759. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Lianhui Wang Target Version/s:

[jira] [Updated] (SPARK-5759) ExecutorRunnable should catch YarnException while NMClient start container

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5759: - Affects Version/s: 1.2.0 > ExecutorRunnable should catch YarnException while NMClient start container > --

[jira] [Closed] (SPARK-5761) Revamp StandaloneRestProtocolSuite

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5761. Resolution: Fixed Fix Version/s: 1.3.0 > Revamp StandaloneRestProtocolSuite > ---

[jira] [Closed] (SPARK-5690) Flaky test: org.apache.spark.deploy.rest.StandaloneRestSubmitSuite.simple submit until completion

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5690. Resolution: Fixed Fix Version/s: 1.3.0 Target Version/s: 1.3.0 > Flaky test: org.apache.spa

[jira] [Closed] (SPARK-5760) StandaloneRestClient/Server error behavior is incorrect

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5760. Resolution: Fixed Fix Version/s: 1.3.0 > StandaloneRestClient/Server error behavior is incorrect > --

[jira] [Closed] (SPARK-5762) Shuffle write time is incorrect for sort-based shuffle

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5762. Resolution: Fixed Fix Version/s: 1.2.2 1.3.0 Target Version/s: 1.3.0,

[jira] [Closed] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5765. Resolution: Fixed Fix Version/s: 1.2.2 1.1.2 1.3.0

[jira] [Updated] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5765: - Affects Version/s: (was: 1.2.2) (was: 1.3.0) > word split problem in run-ex

[jira] [Updated] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5765: - Affects Version/s: 1.3.0 1.2.1 > word split problem in run-example and compute-clas

[jira] [Updated] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5765: - Assignee: Venkata Ramana G > word split problem in run-example and compute-classpath > ---

[jira] [Updated] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5765: - Affects Version/s: (was: 1.2.1) 1.2.2 > word split problem in run-example and c

[jira] [Commented] (SPARK-5735) Replace uses of EasyMock with Mockito

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319140#comment-14319140 ] Apache Spark commented on SPARK-5735: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-5788) Capture exceptions in Python write thread

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319135#comment-14319135 ] Apache Spark commented on SPARK-5788: - User 'davies' has created a pull request for th

[jira] [Created] (SPARK-5788) Capture exceptions in Python write thread

2015-02-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5788: - Summary: Capture exceptions in Python write thread Key: SPARK-5788 URL: https://issues.apache.org/jira/browse/SPARK-5788 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-5787) Protect JVM from some not-important exceptions

2015-02-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5787: - Summary: Protect JVM from some not-important exceptions Key: SPARK-5787 URL: https://issues.apache.org/jira/browse/SPARK-5787 Project: Spark Issue Type: Improvemen

[jira] [Commented] (SPARK-2774) Set preferred locations for reduce tasks

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319056#comment-14319056 ] Apache Spark commented on SPARK-2774: - User 'shivaram' has created a pull request for

[jira] [Commented] (SPARK-4267) Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319018#comment-14319018 ] Apache Spark commented on SPARK-4267: - User 'srowen' has created a pull request for th

[jira] [Commented] (SPARK-5746) INSERT OVERWRITE throws FileNotFoundException when the source and destination point to the same table.

2015-02-12 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319014#comment-14319014 ] Yin Huai commented on SPARK-5746: - Here are places where we need to take care overwrite,

[jira] [Created] (SPARK-5786) Documentation of Narrow Dependencies

2015-02-12 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-5786: --- Summary: Documentation of Narrow Dependencies Key: SPARK-5786 URL: https://issues.apache.org/jira/browse/SPARK-5786 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-5785) Pyspark does not support narrow dependencies

2015-02-12 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-5785: --- Summary: Pyspark does not support narrow dependencies Key: SPARK-5785 URL: https://issues.apache.org/jira/browse/SPARK-5785 Project: Spark Issue Type: Improvem

[jira] [Commented] (SPARK-5784) Add StatsDSink to MetricsSystem

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318993#comment-14318993 ] Apache Spark commented on SPARK-5784: - User 'ryan-williams' has created a pull request

[jira] [Created] (SPARK-5784) Add StatsDSink to MetricsSystem

2015-02-12 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-5784: Summary: Add StatsDSink to MetricsSystem Key: SPARK-5784 URL: https://issues.apache.org/jira/browse/SPARK-5784 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-3570) Shuffle write time does not include time to open shuffle files

2015-02-12 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318960#comment-14318960 ] Kay Ousterhout commented on SPARK-3570: --- There are a bunch of times when files are o

[jira] [Updated] (SPARK-5783) Include filename, line number in eventlog-parsing error message

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5783: - Affects Version/s: (was: 1.2.1) 1.0.0 > Include filename, line number in eventl

[jira] [Updated] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue

2015-02-12 Thread Mark Khaitman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Khaitman updated SPARK-5782: - Description: I'm including the Shuffle component on this, as a brief scan through the code (which

[jira] [Resolved] (SPARK-1192) Around 30 parameters in Spark are used but undocumented and some are having confusing name

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1192. -- Resolution: Won't Fix PR was withdrawn; this probably deserves a rethink if it were reconsidered anyway

[jira] [Commented] (SPARK-4897) Python 3 support

2015-02-12 Thread Ryan Ovas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318935#comment-14318935 ] Ryan Ovas commented on SPARK-4897: -- I'm interested in using Spark in my startup, but ever

[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-12 Thread Chris T (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318915#comment-14318915 ] Chris T commented on SPARK-5436: I haven't been able to make headway on this. [~MechCode

[jira] [Assigned] (SPARK-5776) JIRA version not of form x.y.z breaks merge_spark_pr.py

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-5776: Assignee: Sean Owen > JIRA version not of form x.y.z breaks merge_spark_pr.py > ---

[jira] [Resolved] (SPARK-5776) JIRA version not of form x.y.z breaks merge_spark_pr.py

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5776. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4570 [https://github.com/ap

[jira] [Commented] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1431#comment-1431 ] Sean Owen commented on SPARK-5765: -- I don't think JIRAs are for splitting up work on one

[jira] [Commented] (SPARK-5783) Include filename, line number in eventlog-parsing error message

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318884#comment-14318884 ] Apache Spark commented on SPARK-5783: - User 'ryan-williams' has created a pull request

[jira] [Created] (SPARK-5783) Include filename, line number in eventlog-parsing error message

2015-02-12 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-5783: Summary: Include filename, line number in eventlog-parsing error message Key: SPARK-5783 URL: https://issues.apache.org/jira/browse/SPARK-5783 Project: Spark

[jira] [Created] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue

2015-02-12 Thread Mark Khaitman (JIRA)
Mark Khaitman created SPARK-5782: Summary: Python Worker / Pyspark Daemon Memory Issue Key: SPARK-5782 URL: https://issues.apache.org/jira/browse/SPARK-5782 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318869#comment-14318869 ] Nicholas Chammas commented on SPARK-5765: - FWIW [~srowen], the last time I had thi

[jira] [Updated] (SPARK-5522) Accelerate the History Server start

2015-02-12 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Williams updated SPARK-5522: - Summary: Accelerate the History Server start (was: Accelerate the Histroty Server start) > Accel

[jira] [Commented] (SPARK-5522) Accelerate the Histroty Server start

2015-02-12 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318857#comment-14318857 ] Ryan Williams commented on SPARK-5522: -- I think [SPARK-4558|https://issues.apache.org

[jira] [Created] (SPARK-5781) Add metadata files for JSON datasets

2015-02-12 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5781: --- Summary: Add metadata files for JSON datasets Key: SPARK-5781 URL: https://issues.apache.org/jira/browse/SPARK-5781 Project: Spark Issue Type: Improvement Co

[jira] [Commented] (SPARK-5780) The loggings of Python unittests are noisy and scaring in

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318836#comment-14318836 ] Apache Spark commented on SPARK-5780: - User 'davies' has created a pull request for th

[jira] [Created] (SPARK-5780) The loggings of Python unittests are noisy and scaring in

2015-02-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5780: - Summary: The loggings of Python unittests are noisy and scaring in Key: SPARK-5780 URL: https://issues.apache.org/jira/browse/SPARK-5780 Project: Spark Issue Type

[jira] [Created] (SPARK-5779) Python broadcast does not work with Kryo serializer

2015-02-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5779: - Summary: Python broadcast does not work with Kryo serializer Key: SPARK-5779 URL: https://issues.apache.org/jira/browse/SPARK-5779 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5778) Throw if nonexistent "spark.metrics.conf" file is provided

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318823#comment-14318823 ] Apache Spark commented on SPARK-5778: - User 'ryan-williams' has created a pull request

[jira] [Created] (SPARK-5778) Throw if nonexistent "spark.metrics.conf" file is provided

2015-02-12 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-5778: Summary: Throw if nonexistent "spark.metrics.conf" file is provided Key: SPARK-5778 URL: https://issues.apache.org/jira/browse/SPARK-5778 Project: Spark Issu

[jira] [Updated] (SPARK-5655) YARN Auxiliary Shuffle service can't access shuffle files on Hadoop cluster configured in secure mode

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5655: - Affects Version/s: (was: 1.3.0) Fix Version/s: 1.2.2 > YARN Auxiliary Shuffle service can't ac

[jira] [Commented] (SPARK-5776) JIRA version not of form x.y.z breaks merge_spark_pr.py

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318777#comment-14318777 ] Apache Spark commented on SPARK-5776: - User 'srowen' has created a pull request for th

[jira] [Created] (SPARK-5777) Completes data source filter types and remove CatalystScan

2015-02-12 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-5777: - Summary: Completes data source filter types and remove CatalystScan Key: SPARK-5777 URL: https://issues.apache.org/jira/browse/SPARK-5777 Project: Spark Issue Type

[jira] [Resolved] (SPARK-5757) Use json4s instead of DataFrame.toJSON in model export

2015-02-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5757. -- Resolution: Fixed Fix Version/s: 1.3.0 > Use json4s instead of DataFrame.toJSON in model

[jira] [Updated] (SPARK-984) SPARK_TOOLS_JAR not set if multiple tools jars exists

2015-02-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-984: - Assignee: (was: Josh Rosen) > SPARK_TOOLS_JAR not set if multiple tools jars exists > -

[jira] [Created] (SPARK-5776) JIRA version not of form x.y.z breaks merge_spark_pr.py

2015-02-12 Thread Sean Owen (JIRA)
Sean Owen created SPARK-5776: Summary: JIRA version not of form x.y.z breaks merge_spark_pr.py Key: SPARK-5776 URL: https://issues.apache.org/jira/browse/SPARK-5776 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-4180) SparkContext constructor should throw exception if another SparkContext is already running

2015-02-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4180. --- Resolution: Fixed Fix Version/s: (was: 1.2.1) 1.2.0 Target V

[jira] [Updated] (SPARK-4180) SparkContext constructor should throw exception if another SparkContext is already running

2015-02-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4180: -- Labels: (was: backport-needed) > SparkContext constructor should throw exception if another SparkConte

[jira] [Updated] (SPARK-4180) SparkContext constructor should throw exception if another SparkContext is already running

2015-02-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4180: -- Fix Version/s: 1.2.1 1.3.0 > SparkContext constructor should throw exception if anoth
