[jira] [Commented] (SPARK-10051) Support collecting data of StructType in DataFrame

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791617#comment-14791617 ] Apache Spark commented on SPARK-10051: -- User 'sun-rui' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10051) Support collecting data of StructType in DataFrame

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10051: Assignee: (was: Apache Spark) > Support collecting data of StructType in DataFrame >

[jira] [Assigned] (SPARK-10051) Support collecting data of StructType in DataFrame

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10051: Assignee: Apache Spark > Support collecting data of StructType in DataFrame >

[jira] [Commented] (SPARK-7841) Spark build should not use lib_managed for dependencies

2015-09-17 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791674#comment-14791674 ] Iulian Dragos commented on SPARK-7841: -- Yes, there are a few build scripts (including

[jira] [Commented] (SPARK-4440) Enhance the job progress API to expose more information

2015-09-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791732#comment-14791732 ] Rui Li commented on SPARK-4440: --- For Hive on Spark, we want completion time for each stage so we can compute

[jira] [Commented] (SPARK-10474) Aggregation failed with unable to acquire memory

2015-09-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791708#comment-14791708 ] Reynold Xin commented on SPARK-10474: - It seems like the problem is that although we reserve a page,

[jira] [Comment Edited] (SPARK-10614) SystemClock uses non-monotonic time in its wait logic

2015-09-17 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14768900#comment-14768900 ] Steve Loughran edited comment on SPARK-10614 at 9/17/15 10:08 AM: --

[jira] [Commented] (SPARK-10660) Doc describe error in the "Running Spark on YARN" page

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802796#comment-14802796 ] Apache Spark commented on SPARK-10660: -- User '397090770' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10660) Doc describe error in the "Running Spark on YARN" page

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10660: Assignee: (was: Apache Spark) > Doc describe error in the "Running Spark on YARN"

[jira] [Commented] (SPARK-10660) Doc describe error in the "Running Spark on YARN" page

2015-09-17 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802792#comment-14802792 ] yangping wu commented on SPARK-10660: - Hi [~srowen], Thank you for your reply! I had make a PR

[jira] [Assigned] (SPARK-10660) Doc describe error in the "Running Spark on YARN" page

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10660: Assignee: Apache Spark > Doc describe error in the "Running Spark on YARN" page >

[jira] [Updated] (SPARK-10661) The PipelineModel class inherits from Serializable twice.

2015-09-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10661: -- Target Version/s: (was: 1.5.0) Priority: Trivial (was: Minor) Fix Version/s:

[jira] [Commented] (SPARK-10661) The PipelineModel class inherits from Serializable twice.

2015-09-17 Thread Matt Hagen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802821#comment-14802821 ] Matt Hagen commented on SPARK-10661: Thanks. Still calibrating how to report doc issues. Will label

[jira] [Created] (SPARK-10663) Change test.toDF to test in Spark ML Programming Guide

2015-09-17 Thread Matt Hagen (JIRA)
Matt Hagen created SPARK-10663: -- Summary: Change test.toDF to test in Spark ML Programming Guide Key: SPARK-10663 URL: https://issues.apache.org/jira/browse/SPARK-10663 Project: Spark Issue

[jira] [Updated] (SPARK-10662) Code snippets are not properly formatted in docs

2015-09-17 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Laskowski updated SPARK-10662: Issue Type: Task (was: Bug) Summary: Code snippets are not properly formatted in

[jira] [Created] (SPARK-10661) The PipelineModel class inherits from Serializable twice.

2015-09-17 Thread Matt Hagen (JIRA)
Matt Hagen created SPARK-10661: -- Summary: The PipelineModel class inherits from Serializable twice. Key: SPARK-10661 URL: https://issues.apache.org/jira/browse/SPARK-10661 Project: Spark Issue

[jira] [Commented] (SPARK-10388) Public dataset loader interface

2015-09-17 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802747#comment-14802747 ] Kai Sasaki commented on SPARK-10388: [~mengxr] I totally agree with you. The initial version should

[jira] [Created] (SPARK-10662) Code examples are not properly formatted

2015-09-17 Thread Jacek Laskowski (JIRA)
Jacek Laskowski created SPARK-10662: --- Summary: Code examples are not properly formatted Key: SPARK-10662 URL: https://issues.apache.org/jira/browse/SPARK-10662 Project: Spark Issue Type:

[jira] [Created] (SPARK-10660) Doc describe error in the "Running Spark on YARN" page

2015-09-17 Thread yangping wu (JIRA)
yangping wu created SPARK-10660: --- Summary: Doc describe error in the "Running Spark on YARN" page Key: SPARK-10660 URL: https://issues.apache.org/jira/browse/SPARK-10660 Project: Spark Issue

[jira] [Updated] (SPARK-10660) Doc describe error in the "Running Spark on YARN" page

2015-09-17 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yangping wu updated SPARK-10660: Description: In the *Configuration* section, the *spark.yarn.driver.memoryOverhead* and

[jira] [Commented] (SPARK-10660) Doc describe error in the "Running Spark on YARN" page

2015-09-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791834#comment-14791834 ] Sean Owen commented on SPARK-10660: --- Agree with that, do you want to make a PR? > Doc describe error

[jira] [Assigned] (SPARK-10642) Crash in rdd.lookup() with "java.lang.Long cannot be cast to java.lang.Integer"

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10642: Assignee: Apache Spark > Crash in rdd.lookup() with "java.lang.Long cannot be cast to >

[jira] [Commented] (SPARK-10642) Crash in rdd.lookup() with "java.lang.Long cannot be cast to java.lang.Integer"

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791786#comment-14791786 ] Apache Spark commented on SPARK-10642: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10642) Crash in rdd.lookup() with "java.lang.Long cannot be cast to java.lang.Integer"

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10642: Assignee: (was: Apache Spark) > Crash in rdd.lookup() with "java.lang.Long cannot be

[jira] [Commented] (SPARK-10285) Add @since annotation to pyspark.ml.util

2015-09-17 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791815#comment-14791815 ] Yu Ishikawa commented on SPARK-10285: - Close this PR because those are non-public API. > Add @since

[jira] [Commented] (SPARK-10625) Spark SQL JDBC read/write is unable to handle JDBC Drivers that adds unserializable objects into connection properties

2015-09-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791830#comment-14791830 ] Sean Owen commented on SPARK-10625: --- Dumb question here, but if the driver needs these objects, and

[jira] [Commented] (SPARK-2613) CLONE - word2vec: Distributed Representation of Words

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803072#comment-14803072 ] Maximilian Michels commented on SPARK-2613: --- User 'nikste' has created a pull request for this

[jira] [Commented] (SPARK-2640) In "local[N]", free cores of the only executor should be touched by "spark.task.cpus" for every finish/start-up of tasks.

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803066#comment-14803066 ] Maximilian Michels commented on SPARK-2640: --- User 'mxm' has created a pull request for this

[jira] [Commented] (SPARK-2591) Add config property to disable incremental collection used in Thrift server

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803070#comment-14803070 ] Maximilian Michels commented on SPARK-2591: --- User 'willmiao' has created a pull request for this

[jira] [Commented] (SPARK-2566) Update ShuffleWriteMetrics as data is written

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803064#comment-14803064 ] Maximilian Michels commented on SPARK-2566: --- User 'mjsax' has created a pull request for this

[jira] [Commented] (SPARK-2622) Add Jenkins build numbers to SparkQA messages

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803073#comment-14803073 ] Maximilian Michels commented on SPARK-2622: --- User 'HuangWHWHW' has created a pull request for

[jira] [Commented] (SPARK-1851) Upgrade Avro dependency to 1.7.6 so Spark can read Avro files

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803076#comment-14803076 ] Maximilian Michels commented on SPARK-1851: --- User 'aljoscha' has created a pull request for this

[jira] [Commented] (SPARK-2410) Thrift/JDBC Server

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803075#comment-14803075 ] Maximilian Michels commented on SPARK-2410: --- User 'twalthr' has created a pull request for this

[jira] [Commented] (SPARK-2659) HiveQL: Division operator should always perform fractional division

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803067#comment-14803067 ] Maximilian Michels commented on SPARK-2659: --- User 'greghogan' has created a pull request for

[jira] [Commented] (SPARK-2637) PEP8 Compliance pull request #1540

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803065#comment-14803065 ] Maximilian Michels commented on SPARK-2637: --- User 'tillrohrmann' has created a pull request for

[jira] [Commented] (SPARK-2537) Workaround Timezone specific Hive tests

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803069#comment-14803069 ] Maximilian Michels commented on SPARK-2537: --- User 'chenliang613' has created a pull request for

[jira] [Commented] (SPARK-2653) Heap size should be the sum of driver.memory and executor.memory in local mode

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803071#comment-14803071 ] Maximilian Michels commented on SPARK-2653: --- User 'greghogan' has created a pull request for

[jira] [Commented] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803081#comment-14803081 ] Maximilian Michels commented on SPARK-2576: --- User 'jkovacs' has created a pull request for this

[jira] [Commented] (SPARK-2357) HashFilteredJoin doesn't match some equi-join query

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803080#comment-14803080 ] Maximilian Michels commented on SPARK-2357: --- User 'StephanEwen' has created a pull request for

[jira] [Commented] (SPARK-2595) The driver run garbage collection, when the executor throws OutOfMemoryError exception

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803082#comment-14803082 ] Maximilian Michels commented on SPARK-2595: --- User 'tedyu' has created a pull request for this

[jira] [Commented] (SPARK-2689) Remove use of println in ActorHelper

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803083#comment-14803083 ] Maximilian Michels commented on SPARK-2689: --- User 'fhueske' has created a pull request for this

[jira] [Commented] (SPARK-2690) Make unidoc part of our test process

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803078#comment-14803078 ] Maximilian Michels commented on SPARK-2690: --- User 'tillrohrmann' has created a pull request for

[jira] [Commented] (SPARK-2691) Allow Spark on Mesos to be launched with Docker

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803079#comment-14803079 ] Maximilian Michels commented on SPARK-2691: --- User 'felixcheung' has created a pull request for

[jira] [Resolved] (SPARK-10284) Add @since annotation to pyspark.ml.tuning

2015-09-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10284. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8694

[jira] [Created] (SPARK-10664) JDBC DataFrameWriter does not save data to Oracle 11 Database

2015-09-17 Thread Dmitriy Atorin (JIRA)
Dmitriy Atorin created SPARK-10664: -- Summary: JDBC DataFrameWriter does not save data to Oracle 11 Database Key: SPARK-10664 URL: https://issues.apache.org/jira/browse/SPARK-10664 Project: Spark

[jira] [Resolved] (SPARK-10281) Add @since annotation to pyspark.ml.clustering

2015-09-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10281. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8691

[jira] [Comment Edited] (SPARK-10635) pyspark - running on a different host

2015-09-17 Thread Ben Duffield (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802950#comment-14802950 ] Ben Duffield edited comment on SPARK-10635 at 9/17/15 2:04 PM: --- Curious as

[jira] [Commented] (SPARK-10663) Change test.toDF to test in Spark ML Programming Guide

2015-09-17 Thread Jian Feng Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802952#comment-14802952 ] Jian Feng Zhang commented on SPARK-10663: - It's correct in the Spark Website. It's same with the

[jira] [Resolved] (SPARK-10278) Add @since annotation to pyspark.mllib.tree

2015-09-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10278. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8685

[jira] [Resolved] (SPARK-10459) PythonUDF could process UnsafeRow

2015-09-17 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-10459. Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8616

[jira] [Resolved] (SPARK-10282) Add @since annotation to pyspark.ml.recommendation

2015-09-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10282. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8692

[jira] [Resolved] (SPARK-10274) Add @since annotation to pyspark.mllib.fpm

2015-09-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10274. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8665

[jira] [Resolved] (SPARK-10077) Java package doc for spark.ml.feature

2015-09-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10077. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8740

[jira] [Commented] (SPARK-2691) Allow Spark on Mesos to be launched with Docker

2015-09-17 Thread Martin Tapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803095#comment-14803095 ] Martin Tapp commented on SPARK-2691: This pull request seems unrelated (Python broken links). > Allow

[jira] [Resolved] (SPARK-10283) Add @since annotation to pyspark.ml.regression

2015-09-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10283. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8693

[jira] [Commented] (SPARK-2622) Add Jenkins build numbers to SparkQA messages

2015-09-17 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803169#comment-14803169 ] Nicholas Chammas commented on SPARK-2622: - [~mxm] - I noticed you have been posting this kind of

[jira] [Commented] (SPARK-10620) Look into whether accumulator mechanism can replace TaskMetrics

2015-09-17 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803184#comment-14803184 ] Imran Rashid commented on SPARK-10620: -- I think you've done a good job of summarizing the key issues

[jira] [Commented] (SPARK-7263) Add new shuffle manager which stores shuffle blocks in Parquet

2015-09-17 Thread Matt Massie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803185#comment-14803185 ] Matt Massie commented on SPARK-7263: The [Parquet shuffle

[jira] [Comment Edited] (SPARK-7263) Add new shuffle manager which stores shuffle blocks in Parquet

2015-09-17 Thread Matt Massie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803185#comment-14803185 ] Matt Massie edited comment on SPARK-7263 at 9/17/15 4:37 PM: - The [Parquet

[jira] [Resolved] (SPARK-10279) Add @since annotation to pyspark.mllib.util

2015-09-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10279. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8689

[jira] [Updated] (SPARK-7263) Add new shuffle manager which stores shuffle blocks in Parquet

2015-09-17 Thread Matt Massie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Massie updated SPARK-7263: --- Component/s: (was: Block Manager) Shuffle > Add new shuffle manager which stores

[jira] [Commented] (SPARK-10474) Aggregation failed with unable to acquire memory

2015-09-17 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802912#comment-14802912 ] Cheng Hao commented on SPARK-10474: --- The root reason for this failure, is because of the

[jira] [Comment Edited] (SPARK-10474) Aggregation failed with unable to acquire memory

2015-09-17 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802912#comment-14802912 ] Cheng Hao edited comment on SPARK-10474 at 9/17/15 1:48 PM: The root reason

[jira] [Commented] (SPARK-10474) Aggregation failed with unable to acquire memory

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802906#comment-14802906 ] Apache Spark commented on SPARK-10474: -- User 'chenghao-intel' has created a pull request for this

[jira] [Assigned] (SPARK-10474) Aggregation failed with unable to acquire memory

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10474: Assignee: Apache Spark > Aggregation failed with unable to acquire memory >

[jira] [Commented] (SPARK-10226) Error occured in SparkSQL when using !=

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802940#comment-14802940 ] Maximilian Michels commented on SPARK-10226: User 'small-wang' has created a pull request for

[jira] [Commented] (SPARK-10635) pyspark - running on a different host

2015-09-17 Thread Ben Duffield (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802950#comment-14802950 ] Ben Duffield commented on SPARK-10635: -- Curious as to why you believe this to be hard to support?

[jira] [Commented] (SPARK-6028) Provide an alternative RPC implementation based on the network transport module

2015-09-17 Thread Jacek Lewandowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802968#comment-14802968 ] Jacek Lewandowski commented on SPARK-6028: -- Hey - what's the estimated date of delivery of this

[jira] [Assigned] (SPARK-10474) Aggregation failed with unable to acquire memory

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10474: Assignee: (was: Apache Spark) > Aggregation failed with unable to acquire memory >

[jira] [Commented] (SPARK-10289) A direct write API for testing Parquet compatibility

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802938#comment-14802938 ] Maximilian Michels commented on SPARK-10289: User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-8887) Explicitly define which data types can be used as dynamic partition columns

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802947#comment-14802947 ] Maximilian Michels commented on SPARK-8887: --- User 'yjshen' has created a pull request for this

[jira] [Commented] (SPARK-10226) Error occured in SparkSQL when using !=

2015-09-17 Thread Maximilian Michels (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802939#comment-14802939 ] Maximilian Michels commented on SPARK-10226: User 'small-wang' has created a pull request for

[jira] [Updated] (SPARK-10662) Code snippets are not properly formatted in docs

2015-09-17 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Laskowski updated SPARK-10662: Attachment: spark-docs-backticks-tables.png > Code snippets are not properly formatted in

[jira] [Updated] (SPARK-10643) Support HDFS urls in spark-submit

2015-09-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10643: -- Component/s: Spark Submit > Support HDFS urls in spark-submit > - > >

[jira] [Commented] (SPARK-10635) pyspark - running on a different host

2015-09-17 Thread Patrick Woody (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802958#comment-14802958 ] Patrick Woody commented on SPARK-10635: --- For a bit of motivation - we have a long running

[jira] [Commented] (SPARK-6028) Provide an alternative RPC implementation based on the network transport module

2015-09-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14802993#comment-14802993 ] Shixiong Zhu commented on SPARK-6028: - I'm working on it. It will be delivered in 1.6.0. > Provide an

[jira] [Assigned] (SPARK-10666) Use properties from ActiveJob associated with a Stage

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10666: Assignee: Mark Hamstra (was: Apache Spark) > Use properties from ActiveJob associated

[jira] [Updated] (SPARK-10646) Bivariate Statistics: Pearson's Chi-Squared Test for categorical vs. categorical

2015-09-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10646: -- Assignee: Jihong MA > Bivariate Statistics: Pearson's Chi-Squared Test for categorical

[jira] [Updated] (SPARK-10623) NoSuchElementException thrown when ORC predicate push-down is turned on

2015-09-17 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10623: --- Summary: NoSuchElementException thrown when ORC predicate push-down is turned on (was: turning on

[jira] [Updated] (SPARK-10545) HiveMetastoreTypes.toMetastoreType should handle interval type

2015-09-17 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10545: - Priority: Minor (was: Major) > HiveMetastoreTypes.toMetastoreType should handle interval type >

[jira] [Commented] (SPARK-10545) HiveMetastoreTypes.toMetastoreType should handle interval type

2015-09-17 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803233#comment-14803233 ] Yin Huai commented on SPARK-10545: -- Seems Hive 1.2.1's parser does not allow interval as a column type

[jira] [Resolved] (SPARK-10657) Remove legacy SCP-based Jenkins log archiving code

2015-09-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-10657. Resolution: Fixed Fix Version/s: 1.2.3 1.3.2 1.4.2

[jira] [Commented] (SPARK-10565) New /api/v1/[path] APIs don't contain as much information as original /json API

2015-09-17 Thread Kevin Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803224#comment-14803224 ] Kevin Chen commented on SPARK-10565: To summarize what has been discussed up until now in a separate

[jira] [Commented] (SPARK-2537) Workaround Timezone specific Hive tests

2015-09-17 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803223#comment-14803223 ] Yin Huai commented on SPARK-2537: - [~mxm] Seems you are trying

[jira] [Assigned] (SPARK-10666) Use properties from ActiveJob associated with a Stage

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10666: Assignee: Apache Spark (was: Mark Hamstra) > Use properties from ActiveJob associated

[jira] [Commented] (SPARK-10632) Cannot save DataFrame with User Defined Types

2015-09-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803272#comment-14803272 ] Joseph K. Bradley commented on SPARK-10632: --- I tried this with the current master, and it

[jira] [Updated] (SPARK-10662) Code snippets are not properly formatted in docs

2015-09-17 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Laskowski updated SPARK-10662: Issue Type: Bug (was: Task) > Code snippets are not properly formatted in docs >

[jira] [Resolved] (SPARK-10650) Spark docs include test and other extra classes

2015-09-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-10650. -- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved

[jira] [Commented] (SPARK-10664) JDBC DataFrameWriter does not save data to Oracle 11 Database

2015-09-17 Thread Suresh Thalamati (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803331#comment-14803331 ] Suresh Thalamati commented on SPARK-10664: -- Table exists case should be fixed as part of

[jira] [Updated] (SPARK-10623) NoSuchElementException thrown when ORC predicate push-down is turned on

2015-09-17 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10623: --- Description: Turning on predicate pushdown for ORC datasources results in a

[jira] [Commented] (SPARK-10623) NoSuchElementException thrown when ORC predicate push-down is turned on

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803418#comment-14803418 ] Apache Spark commented on SPARK-10623: -- User 'liancheng' has created a pull request for this issue:

[jira] [Resolved] (SPARK-10642) Crash in rdd.lookup() with "java.lang.Long cannot be cast to java.lang.Integer"

2015-09-17 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-10642. Resolution: Fixed Fix Version/s: 1.2.3 1.3.2 1.4.2

[jira] [Updated] (SPARK-10545) HiveMetastoreTypes.toMetastoreType should handle interval type

2015-09-17 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10545: - Target Version/s: (was: 1.6.0, 1.5.1) > HiveMetastoreTypes.toMetastoreType should handle interval type

[jira] [Commented] (SPARK-10666) Use properties from ActiveJob associated with a Stage

2015-09-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803244#comment-14803244 ] Apache Spark commented on SPARK-10666: -- User 'markhamstra' has created a pull request for this

[jira] [Created] (SPARK-10666) Use properties from ActiveJob associated with a Stage

2015-09-17 Thread Mark Hamstra (JIRA)
Mark Hamstra created SPARK-10666: Summary: Use properties from ActiveJob associated with a Stage Key: SPARK-10666 URL: https://issues.apache.org/jira/browse/SPARK-10666 Project: Spark Issue

[jira] [Commented] (SPARK-6880) Spark Shutdowns with NoSuchElementException when running parallel collect on cachedRDD

2015-09-17 Thread Mark Hamstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14803261#comment-14803261 ] Mark Hamstra commented on SPARK-6880: - see SPARK-10666 > Spark Shutdowns with NoSuchElementException

[jira] [Resolved] (SPARK-10531) AppId is set as AppName in status rest api

2015-09-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-10531. Resolution: Fixed Assignee: Jeff Zhang Fix Version/s: 1.6.0 > AppId is set

[jira] [Resolved] (SPARK-10394) Make GBTParams use shared "stepSize"

2015-09-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-10394. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8552

[jira] [Created] (SPARK-10670) Link to each language's API in codetabs in ML docs: spark.ml

2015-09-17 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10670: - Summary: Link to each language's API in codetabs in ML docs: spark.ml Key: SPARK-10670 URL: https://issues.apache.org/jira/browse/SPARK-10670 Project:

[jira] [Created] (SPARK-10665) Connect the local iterators with the planner

2015-09-17 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-10665: --- Summary: Connect the local iterators with the planner Key: SPARK-10665 URL: https://issues.apache.org/jira/browse/SPARK-10665 Project: Spark Issue Type:

  1   2   3   >