[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-12 Thread Chris T (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318915#comment-14318915 ] Chris T commented on SPARK-5436: I'm haven't been able to make headway on this.

[jira] [Commented] (SPARK-5754) Spark AM not launching on Windows

2015-02-12 Thread Inigo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318654#comment-14318654 ] Inigo commented on SPARK-5754: -- So, I did some test of what works and what doesn't: *

[jira] [Resolved] (SPARK-5757) Use json4s instead of DataFrame.toJSON in model export

2015-02-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5757. -- Resolution: Fixed Fix Version/s: 1.3.0 Use json4s instead of DataFrame.toJSON in model

[jira] [Commented] (SPARK-5780) The loggings of Python unittests are noisy and scaring in

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318836#comment-14318836 ] Apache Spark commented on SPARK-5780: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-5522) Accelerate the Histroty Server start

2015-02-12 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318857#comment-14318857 ] Ryan Williams commented on SPARK-5522: -- I think

[jira] [Created] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue

2015-02-12 Thread Mark Khaitman (JIRA)
Mark Khaitman created SPARK-5782: Summary: Python Worker / Pyspark Daemon Memory Issue Key: SPARK-5782 URL: https://issues.apache.org/jira/browse/SPARK-5782 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-5783) Include filename, line number in eventlog-parsing error message

2015-02-12 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-5783: Summary: Include filename, line number in eventlog-parsing error message Key: SPARK-5783 URL: https://issues.apache.org/jira/browse/SPARK-5783 Project: Spark

[jira] [Resolved] (SPARK-5776) JIRA version not of form x.y.z breaks merge_spark_pr.py

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5776. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4570

[jira] [Updated] (SPARK-5655) YARN Auxiliary Shuffle service can't access shuffle files on Hadoop cluster configured in secure mode

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5655: - Affects Version/s: (was: 1.3.0) Fix Version/s: 1.2.2 YARN Auxiliary Shuffle service can't

[jira] [Commented] (SPARK-5778) Throw if nonexistent spark.metrics.conf file is provided

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318823#comment-14318823 ] Apache Spark commented on SPARK-5778: - User 'ryan-williams' has created a pull request

[jira] [Created] (SPARK-5779) Python broadcast does not work with Kryo serializer

2015-02-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5779: - Summary: Python broadcast does not work with Kryo serializer Key: SPARK-5779 URL: https://issues.apache.org/jira/browse/SPARK-5779 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318869#comment-14318869 ] Nicholas Chammas commented on SPARK-5765: - FWIW [~srowen], the last time I had

[jira] [Commented] (SPARK-4856) Null empty string should not be considered as StringType at begining in Json schema inferring

2015-02-12 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318637#comment-14318637 ] Yin Huai commented on SPARK-4856: - [~chenghao] I think it is fine to use NullType for an

[jira] [Commented] (SPARK-5747) Review all Bash scripts for word splitting bugs

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318717#comment-14318717 ] Apache Spark commented on SPARK-5747: - User 'dyross' has created a pull request for

[jira] [Updated] (SPARK-4180) SparkContext constructor should throw exception if another SparkContext is already running

2015-02-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4180: -- Labels: (was: backport-needed) SparkContext constructor should throw exception if another

[jira] [Resolved] (SPARK-4180) SparkContext constructor should throw exception if another SparkContext is already running

2015-02-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4180. --- Resolution: Fixed Fix Version/s: (was: 1.2.1) 1.2.0 Target

[jira] [Commented] (SPARK-5776) JIRA version not of form x.y.z breaks merge_spark_pr.py

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318777#comment-14318777 ] Apache Spark commented on SPARK-5776: - User 'srowen' has created a pull request for

[jira] [Created] (SPARK-5781) Add metadata files for JSON datasets

2015-02-12 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5781: --- Summary: Add metadata files for JSON datasets Key: SPARK-5781 URL: https://issues.apache.org/jira/browse/SPARK-5781 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1431#comment-1431 ] Sean Owen commented on SPARK-5765: -- I don't think JIRAs are for splitting up work on one

[jira] [Updated] (SPARK-984) SPARK_TOOLS_JAR not set if multiple tools jars exists

2015-02-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-984: - Assignee: (was: Josh Rosen) SPARK_TOOLS_JAR not set if multiple tools jars exists

[jira] [Created] (SPARK-5776) JIRA version not of form x.y.z breaks merge_spark_pr.py

2015-02-12 Thread Sean Owen (JIRA)
Sean Owen created SPARK-5776: Summary: JIRA version not of form x.y.z breaks merge_spark_pr.py Key: SPARK-5776 URL: https://issues.apache.org/jira/browse/SPARK-5776 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-5776) JIRA version not of form x.y.z breaks merge_spark_pr.py

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-5776: Assignee: Sean Owen JIRA version not of form x.y.z breaks merge_spark_pr.py

[jira] [Updated] (SPARK-4180) SparkContext constructor should throw exception if another SparkContext is already running

2015-02-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4180: -- Fix Version/s: 1.2.1 1.3.0 SparkContext constructor should throw exception if

[jira] [Resolved] (SPARK-5655) YARN Auxiliary Shuffle service can't access shuffle files on Hadoop cluster configured in secure mode

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5655. -- Resolution: Fixed Fix Version/s: 1.3.0 YARN Auxiliary Shuffle service can't access shuffle

[jira] [Created] (SPARK-5778) Throw if nonexistent spark.metrics.conf file is provided

2015-02-12 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-5778: Summary: Throw if nonexistent spark.metrics.conf file is provided Key: SPARK-5778 URL: https://issues.apache.org/jira/browse/SPARK-5778 Project: Spark Issue

[jira] [Updated] (SPARK-5522) Accelerate the History Server start

2015-02-12 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Williams updated SPARK-5522: - Summary: Accelerate the History Server start (was: Accelerate the Histroty Server start)

[jira] [Resolved] (SPARK-5210) Support log rolling in EventLogger

2015-02-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5210. --- Resolution: Later Assignee: (was: Josh Rosen) I'm closing this issue for now, since my

[jira] [Updated] (SPARK-5655) YARN Auxiliary Shuffle service can't access shuffle files on Hadoop cluster configured in secure mode

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5655: - Assignee: Andrew Rowson YARN Auxiliary Shuffle service can't access shuffle files on Hadoop cluster

[jira] [Created] (SPARK-5777) Completes data source filter types and remove CatalystScan

2015-02-12 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-5777: - Summary: Completes data source filter types and remove CatalystScan Key: SPARK-5777 URL: https://issues.apache.org/jira/browse/SPARK-5777 Project: Spark Issue

[jira] [Created] (SPARK-5767) Migrate Parquet data source to the write support of data source API

2015-02-12 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-5767: - Summary: Migrate Parquet data source to the write support of data source API Key: SPARK-5767 URL: https://issues.apache.org/jira/browse/SPARK-5767 Project: Spark

[jira] [Updated] (SPARK-4819) Remove Guava's Optional from public API

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4819: - Target Version/s: 2+ Remove Guava's Optional from public API -

[jira] [Updated] (SPARK-3369) Java mapPartitions Iterator-Iterable is inconsistent with Scala's Iterator-Iterator

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3369: - Priority: Major (was: Critical) Target Version/s: 2+ (was: 1.2.0) Affects Version/s:

[jira] [Updated] (SPARK-3266) JavaDoubleRDD doesn't contain max()

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3266: - Target Version/s: 2+ (was: 1.1.1, 1.2.0) Assignee: Sean Owen JavaDoubleRDD doesn't contain

[jira] [Commented] (SPARK-5770) Use addJar() to upload a new jar file to executor, it can't be added to classloader

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318041#comment-14318041 ] Sean Owen commented on SPARK-5770: -- Can you be more specific about where you think the

[jira] [Commented] (SPARK-3365) Failure to save Lists to Parquet

2015-02-12 Thread Yi Tian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14317768#comment-14317768 ] Yi Tian commented on SPARK-3365: The reason is Spark generated wrong schema for type

[jira] [Updated] (SPARK-5763) Sort-based Groupby and Join to resolve skewed data

2015-02-12 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lianhui Wang updated SPARK-5763: Description: In SPARK-4644, it provide a way to resolve skewed data. But when we has more keys

[jira] [Updated] (SPARK-5763) Sort-based Groupby and Join to resolve skewed data

2015-02-12 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lianhui Wang updated SPARK-5763: Description: In SPARK-4644, it provide a way to resolve skewed data. But when we has more keys

[jira] [Comment Edited] (SPARK-5508) [hive context] Unable to query array once saved as parquet

2015-02-12 Thread Ayoub Benali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14301349#comment-14301349 ] Ayoub Benali edited comment on SPARK-5508 at 2/12/15 9:53 AM: --

[jira] [Updated] (SPARK-5739) Size exceeds Integer.MAX_VALUE in File Map

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5739: - Component/s: MLlib Priority: Minor (was: Major) Size exceeds Integer.MAX_VALUE in File Map

[jira] [Commented] (SPARK-5766) Slow RowMatrix multiplication

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14317961#comment-14317961 ] Sean Owen commented on SPARK-5766: -- Given that RowMatrix is a row-by-row representation,

[jira] [Updated] (SPARK-5644) Delete tmp dir when sc is stop

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5644: - Assignee: Weizhong Delete tmp dir when sc is stop -- Key:

[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-12 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318021#comment-14318021 ] Manoj Kumar commented on SPARK-5436: Hi, I would like to give this a go. [~ChrisT] are

[jira] [Commented] (SPARK-3365) Failure to save Lists to Parquet

2015-02-12 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14317949#comment-14317949 ] Cheng Lian commented on SPARK-3365: --- Hey [~tianyi], please open a PR for this. However,

[jira] [Commented] (SPARK-5768) Spark UI Shows incorrect memory under Yarn

2015-02-12 Thread Al M (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318088#comment-14318088 ] Al M commented on SPARK-5768: - So when it says *Memory Used* 3.2GB / 20GB it actually means we

[jira] [Updated] (SPARK-5768) Spark UI Shows incorrect memory under Yarn

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5768: - Component/s: (was: YARN) Web UI Issue Type: Improvement (was: Bug) It sounds

[jira] [Created] (SPARK-5769) Set params in constructor and setParams() in Python ML pipeline API

2015-02-12 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5769: Summary: Set params in constructor and setParams() in Python ML pipeline API Key: SPARK-5769 URL: https://issues.apache.org/jira/browse/SPARK-5769 Project: Spark

[jira] [Commented] (SPARK-5769) Set params in constructor and setParams() in Python ML pipeline API

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318005#comment-14318005 ] Apache Spark commented on SPARK-5769: - User 'mengxr' has created a pull request for

[jira] [Created] (SPARK-5770) Use addJar() to upload a new jar file to executor, it can't be added to classloader

2015-02-12 Thread meiyoula (JIRA)
meiyoula created SPARK-5770: --- Summary: Use addJar() to upload a new jar file to executor, it can't be added to classloader Key: SPARK-5770 URL: https://issues.apache.org/jira/browse/SPARK-5770 Project:

[jira] [Commented] (SPARK-5770) Use addJar() to upload a new jar file to executor, it can't be added to classloader

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318027#comment-14318027 ] Apache Spark commented on SPARK-5770: - User 'XuTingjun' has created a pull request for

[jira] [Created] (SPARK-5768) Spark UI Shows incorrect memory under Yarn

2015-02-12 Thread Al M (JIRA)
Al M created SPARK-5768: --- Summary: Spark UI Shows incorrect memory under Yarn Key: SPARK-5768 URL: https://issues.apache.org/jira/browse/SPARK-5768 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-4553) query for parquet table with string fields in spark sql hive get binary result

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14317976#comment-14317976 ] Apache Spark commented on SPARK-4553: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-5767) Migrate Parquet data source to the write support of data source API

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14317977#comment-14317977 ] Apache Spark commented on SPARK-5767: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-5766) Slow RowMatrix multiplication

2015-02-12 Thread Amaru Cuba Gyllensten (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318013#comment-14318013 ] Amaru Cuba Gyllensten commented on SPARK-5766: -- Yeah, I noticed it when

[jira] [Created] (SPARK-5786) Documentation of Narrow Dependencies

2015-02-12 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-5786: --- Summary: Documentation of Narrow Dependencies Key: SPARK-5786 URL: https://issues.apache.org/jira/browse/SPARK-5786 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-5759) ExecutorRunnable should catch YarnException while NMClient start container

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5759: - Affects Version/s: 1.2.0 ExecutorRunnable should catch YarnException while NMClient start container

[jira] [Created] (SPARK-5787) Protect JVM from some not-important exceptions

2015-02-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5787: - Summary: Protect JVM from some not-important exceptions Key: SPARK-5787 URL: https://issues.apache.org/jira/browse/SPARK-5787 Project: Spark Issue Type:

[jira] [Updated] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5765: - Assignee: Venkata Ramana G word split problem in run-example and compute-classpath

[jira] [Commented] (SPARK-2774) Set preferred locations for reduce tasks

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319056#comment-14319056 ] Apache Spark commented on SPARK-2774: - User 'shivaram' has created a pull request for

[jira] [Closed] (SPARK-5760) StandaloneRestClient/Server error behavior is incorrect

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5760. Resolution: Fixed Fix Version/s: 1.3.0 StandaloneRestClient/Server error behavior is incorrect

[jira] [Commented] (SPARK-5726) Hadamard Vector Product Transformer

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319182#comment-14319182 ] Sean Owen commented on SPARK-5726: -- You can ignore this comment, but I wonder if it would

[jira] [Commented] (SPARK-3570) Shuffle write time does not include time to open shuffle files

2015-02-12 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318960#comment-14318960 ] Kay Ousterhout commented on SPARK-3570: --- There are a bunch of times when files are

[jira] [Created] (SPARK-5785) Pyspark does not support narrow dependencies

2015-02-12 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-5785: --- Summary: Pyspark does not support narrow dependencies Key: SPARK-5785 URL: https://issues.apache.org/jira/browse/SPARK-5785 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5788) Capture exceptions in Python write thread

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319135#comment-14319135 ] Apache Spark commented on SPARK-5788: - User 'davies' has created a pull request for

[jira] [Updated] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5765: - Affects Version/s: 1.3.0 1.2.1 word split problem in run-example and

[jira] [Updated] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5765: - Affects Version/s: (was: 1.2.2) (was: 1.3.0) word split problem in

[jira] [Closed] (SPARK-5762) Shuffle write time is incorrect for sort-based shuffle

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5762. Resolution: Fixed Fix Version/s: 1.2.2 1.3.0 Target Version/s: 1.3.0,

[jira] [Closed] (SPARK-5780) The loggings of Python unittests are noisy and scaring in

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5780. Resolution: Fixed Fix Version/s: 1.3.0 Target Version/s: 1.3.0 (was: 1.4.0) The loggings

[jira] [Updated] (SPARK-5780) The loggings of Python unittests are noisy and scaring in

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5780: - Affects Version/s: (was: 1.4.0) The loggings of Python unittests are noisy and scaring in

[jira] [Resolved] (SPARK-1192) Around 30 parameters in Spark are used but undocumented and some are having confusing name

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1192. -- Resolution: Won't Fix PR was withdrawn; this probably deserves a rethink if it were reconsidered

[jira] [Closed] (SPARK-5690) Flaky test: org.apache.spark.deploy.rest.StandaloneRestSubmitSuite.simple submit until completion

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5690. Resolution: Fixed Fix Version/s: 1.3.0 Target Version/s: 1.3.0 Flaky test:

[jira] [Closed] (SPARK-5761) Revamp StandaloneRestProtocolSuite

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5761. Resolution: Fixed Fix Version/s: 1.3.0 Revamp StandaloneRestProtocolSuite

[jira] [Commented] (SPARK-4897) Python 3 support

2015-02-12 Thread Ryan Ovas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318935#comment-14318935 ] Ryan Ovas commented on SPARK-4897: -- I'm interested in using Spark in my startup, but

[jira] [Commented] (SPARK-5784) Add StatsDSink to MetricsSystem

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318993#comment-14318993 ] Apache Spark commented on SPARK-5784: - User 'ryan-williams' has created a pull request

[jira] [Updated] (SPARK-5726) Hadamard Vector Product Transformer

2015-02-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5726: - Assignee: Octavian Geagla Hadamard Vector Product Transformer

[jira] [Commented] (SPARK-5726) Hadamard Vector Product Transformer

2015-02-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319163#comment-14319163 ] Xiangrui Meng commented on SPARK-5726: -- This is a nice feature. I like the name

[jira] [Updated] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue

2015-02-12 Thread Mark Khaitman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Khaitman updated SPARK-5782: - Description: I'm including the Shuffle component on this, as a brief scan through the code

[jira] [Updated] (SPARK-5783) Include filename, line number in eventlog-parsing error message

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5783: - Affects Version/s: (was: 1.2.1) 1.0.0 Include filename, line number in

[jira] [Commented] (SPARK-5746) INSERT OVERWRITE throws FileNotFoundException when the source and destination point to the same table.

2015-02-12 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319014#comment-14319014 ] Yin Huai commented on SPARK-5746: - Here are places where we need to take care overwrite,

[jira] [Commented] (SPARK-5735) Replace uses of EasyMock with Mockito

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319140#comment-14319140 ] Apache Spark commented on SPARK-5735: - User 'JoshRosen' has created a pull request for

[jira] [Closed] (SPARK-5759) ExecutorRunnable should catch YarnException while NMClient start container

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5759. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Lianhui Wang Target Version/s:

[jira] [Resolved] (SPARK-5335) Destroying cluster in VPC with --delete-groups fails to remove security groups

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5335. -- Resolution: Fixed Fix Version/s: 1.2.2 1.3.0 Issue resolved by pull request

[jira] [Updated] (SPARK-5335) Destroying cluster in VPC with --delete-groups fails to remove security groups

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5335: - Assignee: Vladimir Grigor Destroying cluster in VPC with --delete-groups fails to remove security

[jira] [Updated] (SPARK-5762) Shuffle write time is incorrect for sort-based shuffle

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5762: - Target Version/s: 1.3.0 (was: 1.3.0, 1.2.2) Shuffle write time is incorrect for sort-based shuffle

[jira] [Updated] (SPARK-5762) Shuffle write time is incorrect for sort-based shuffle

2015-02-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5762: - Fix Version/s: (was: 1.2.2) Shuffle write time is incorrect for sort-based shuffle

[jira] [Commented] (SPARK-5790) VertexRDD's won't zip properly for `diff` capability

2015-02-12 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319316#comment-14319316 ] Brennon York commented on SPARK-5790: - FWIW this issue is a blocker for

[jira] [Updated] (SPARK-5790) VertexRDD's won't zip properly for `diff` capability

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5790: - Assignee: Brennon York VertexRDD's won't zip properly for `diff` capability

[jira] [Created] (SPARK-5790) VertexRDD's won't zip properly for `diff` capability

2015-02-12 Thread Brennon York (JIRA)
Brennon York created SPARK-5790: --- Summary: VertexRDD's won't zip properly for `diff` capability Key: SPARK-5790 URL: https://issues.apache.org/jira/browse/SPARK-5790 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-02-12 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319400#comment-14319400 ] Cheng Hao commented on SPARK-5791: -- Can you also attach the performance comparison result

[jira] [Updated] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-02-12 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-5791: --- Description: Spark SQL shows poor performance when multiple tables do join operation (was: Spark SQL shows

[jira] [Resolved] (SPARK-3299) [SQL] Public API in SQLContext to list tables

2015-02-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3299. - Resolution: Fixed Fix Version/s: 1.3.0 [SQL] Public API in SQLContext to list

[jira] [Updated] (SPARK-3168) The ServletContextHandler of webui lacks a SessionManager

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3168: - Component/s: (was: Spark Core) Web UI Priority: Minor (was: Major) Issue

[jira] [Created] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-02-12 Thread Yi Zhou (JIRA)
Yi Zhou created SPARK-5791: -- Summary: [Spark SQL] show poor performance when multiple table do join operation Key: SPARK-5791 URL: https://issues.apache.org/jira/browse/SPARK-5791 Project: Spark

[jira] [Commented] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-02-12 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319364#comment-14319364 ] Yi Zhou commented on SPARK-5791: For example: SELECT * FROM inventory inv JOIN (

[jira] [Closed] (SPARK-5764) Delete the cache and lock file after executor fetching the jar

2015-02-12 Thread meiyoula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] meiyoula closed SPARK-5764. --- Resolution: Not a Problem Delete the cache and lock file after executor fetching the jar

[jira] [Created] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Venkata Ramana G (JIRA)
Venkata Ramana G created SPARK-5765: --- Summary: word split problem in run-example and compute-classpath Key: SPARK-5765 URL: https://issues.apache.org/jira/browse/SPARK-5765 Project: Spark

[jira] [Created] (SPARK-5762) Shuffle write time is incorrect for sort-based shuffle

2015-02-12 Thread Kay Ousterhout (JIRA)
Kay Ousterhout created SPARK-5762: - Summary: Shuffle write time is incorrect for sort-based shuffle Key: SPARK-5762 URL: https://issues.apache.org/jira/browse/SPARK-5762 Project: Spark Issue

[jira] [Commented] (SPARK-5754) Spark AM not launching on Windows

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14317850#comment-14317850 ] Sean Owen commented on SPARK-5754: -- We just resolved

[jira] [Commented] (SPARK-5739) Size exceeds Integer.MAX_VALUE in File Map

2015-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14317782#comment-14317782 ] Sean Owen commented on SPARK-5739: -- What would that really do though except change one

[jira] [Commented] (SPARK-5762) Shuffle write time is incorrect for sort-based shuffle

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14317799#comment-14317799 ] Apache Spark commented on SPARK-5762: - User 'kayousterhout' has created a pull request

[jira] [Commented] (SPARK-5765) word split problem in run-example and compute-classpath

2015-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14317882#comment-14317882 ] Apache Spark commented on SPARK-5765: - User 'gvramana' has created a pull request for

  1   2   >