[jira] [Updated] (SPARK-7706) Allow setting YARN_CONF_DIR from spark argument

2015-05-18 Thread Shaik Idris Ali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shaik Idris Ali updated SPARK-7706: --- Labels: oozie yarn (was: ) Allow setting YARN_CONF_DIR from spark argument

[jira] [Assigned] (SPARK-7704) Updating Programming Guides per SPARK-4397

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7704: --- Assignee: (was: Apache Spark) Updating Programming Guides per SPARK-4397

[jira] [Created] (SPARK-7706) Allow setting YARN_CONF_DIR from spark argument

2015-05-18 Thread Shaik Idris Ali (JIRA)
Shaik Idris Ali created SPARK-7706: -- Summary: Allow setting YARN_CONF_DIR from spark argument Key: SPARK-7706 URL: https://issues.apache.org/jira/browse/SPARK-7706 Project: Spark Issue

[jira] [Commented] (SPARK-7706) Allow setting YARN_CONF_DIR from spark argument

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548106#comment-14548106 ] Sean Owen commented on SPARK-7706: -- Can't you just use {{YARN_CONF_DIR=... command ...}}?

[jira] [Resolved] (SPARK-6657) Fix Python doc build warnings

2015-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6657. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6221

[jira] [Commented] (SPARK-7699) Config spark.dynamicAllocation.initialExecutors has no effect

2015-05-18 Thread meiyoula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547956#comment-14547956 ] meiyoula commented on SPARK-7699: - If so, I think the

[jira] [Commented] (SPARK-7705) Cleanup of .sparkStaging directory fails if application is killed

2015-05-18 Thread Wilfred Spiegelenburg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547954#comment-14547954 ] Wilfred Spiegelenburg commented on SPARK-7705: -- I think the limitation that

[jira] [Commented] (SPARK-7565) Broken maps in jsonRDD

2015-05-18 Thread Paul Colomiets (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547957#comment-14547957 ] Paul Colomiets commented on SPARK-7565: --- The pull request is pretty trivial. I've

[jira] [Updated] (SPARK-7705) Cleanup of .sparkStaging directory fails if application is killed

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7705: - Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) Minor since this just amounts to

[jira] [Reopened] (SPARK-5265) Submitting applications on Standalone cluster controlled by Zookeeper forces to know active master

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-5265: -- Submitting applications on Standalone cluster controlled by Zookeeper forces to know active master

[jira] [Assigned] (SPARK-7704) Updating Programming Guides per SPARK-4397

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7704: --- Assignee: Apache Spark Updating Programming Guides per SPARK-4397

[jira] [Created] (SPARK-7705) Cleanup of .sparkStaging directory fails if application is killed

2015-05-18 Thread Wilfred Spiegelenburg (JIRA)
Wilfred Spiegelenburg created SPARK-7705: Summary: Cleanup of .sparkStaging directory fails if application is killed Key: SPARK-7705 URL: https://issues.apache.org/jira/browse/SPARK-7705

[jira] [Commented] (SPARK-5265) Submitting applications on Standalone cluster controlled by Zookeeper forces to know active master

2015-05-18 Thread Roque Vassal'lo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547963#comment-14547963 ] Roque Vassal'lo commented on SPARK-5265: Yes, SPARK-6443 and this jira are the

[jira] [Commented] (SPARK-7699) Config spark.dynamicAllocation.initialExecutors has no effect

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547974#comment-14547974 ] Sean Owen commented on SPARK-7699: -- It would make a difference if the program immediately

[jira] [Resolved] (SPARK-5265) Submitting applications on Standalone cluster controlled by Zookeeper forces to know active master

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5265. -- Resolution: Duplicate Submitting applications on Standalone cluster controlled by Zookeeper forces

[jira] [Commented] (SPARK-7700) Spark 1.3.0 on YARN: Application failed 2 times due to AM Container

2015-05-18 Thread Kaveen Raajan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547641#comment-14547641 ] Kaveen Raajan commented on SPARK-7700: -- Hi [~srowen] I'm sure there is no space

[jira] [Updated] (SPARK-7661) Support for dynamic allocation of resources in Kinesis Spark Streaming

2015-05-18 Thread Murtaza Kanchwala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Murtaza Kanchwala updated SPARK-7661: - Summary: Support for dynamic allocation of resources in Kinesis Spark Streaming (was:

[jira] [Resolved] (SPARK-7700) Spark 1.3.0 on YARN: Application failed 2 times due to AM Container

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7700. -- Resolution: Duplicate Given you're on Windows, this sounds almost exactly like SPARK-5754 Spark 1.3.0

[jira] [Resolved] (SPARK-7299) saving Oracle-source DataFrame to Hive changes scale

2015-05-18 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7299. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Liang-Chi Hsieh saving

[jira] [Commented] (SPARK-7150) SQLContext.range()

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547710#comment-14547710 ] Apache Spark commented on SPARK-7150: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-6416) RDD.fold() requires the operator to be commutative

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547718#comment-14547718 ] Apache Spark commented on SPARK-6416: - User 'srowen' has created a pull request for

[jira] [Assigned] (SPARK-6416) RDD.fold() requires the operator to be commutative

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6416: --- Assignee: (was: Apache Spark) RDD.fold() requires the operator to be commutative

[jira] [Created] (SPARK-7699) Config spark.dynamicAllocation.initialExecutors has no effect

2015-05-18 Thread meiyoula (JIRA)
meiyoula created SPARK-7699: --- Summary: Config spark.dynamicAllocation.initialExecutors has no effect Key: SPARK-7699 URL: https://issues.apache.org/jira/browse/SPARK-7699 Project: Spark Issue

[jira] [Resolved] (SPARK-7700) Spark 1.3.0 on YARN: Application failed 2 times due to AM Container

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7700. -- Resolution: Not A Problem This sounds like a problem with your configuration, like you've put a JVM arg

[jira] [Commented] (SPARK-7661) Support for dynamic allocation of executors in Kinesis Spark Streaming

2015-05-18 Thread Murtaza Kanchwala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547642#comment-14547642 ] Murtaza Kanchwala commented on SPARK-7661: -- Yes, it works I took 4 + 4 = 8 cores,

[jira] [Assigned] (SPARK-7663) [MLLIB] feature.Word2Vec throws empty iterator error when the vocabulary size is zero

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7663: --- Assignee: Apache Spark [MLLIB] feature.Word2Vec throws empty iterator error when the

[jira] [Assigned] (SPARK-7663) [MLLIB] feature.Word2Vec throws empty iterator error when the vocabulary size is zero

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7663: --- Assignee: (was: Apache Spark) [MLLIB] feature.Word2Vec throws empty iterator error when

[jira] [Commented] (SPARK-7699) Config spark.dynamicAllocation.initialExecutors has no effect

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547665#comment-14547665 ] Sean Owen commented on SPARK-7699: -- I think that's maybe intentional? the logic is

[jira] [Reopened] (SPARK-7700) Spark 1.3.0 on YARN: Application failed 2 times due to AM Container

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-7700: -- Spark 1.3.0 on YARN: Application failed 2 times due to AM Container

[jira] [Assigned] (SPARK-7697) Column with an unsigned int should be treated as long in JDBCRDD

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7697: --- Assignee: Apache Spark Column with an unsigned int should be treated as long in JDBCRDD

[jira] [Commented] (SPARK-4852) Hive query plan deserialization failure caused by shaded hive-exec jar file when generating golden answers

2015-05-18 Thread Manku Timma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547708#comment-14547708 ] Manku Timma commented on SPARK-4852: The following diff could solve the problem. File

[jira] [Commented] (SPARK-7697) Column with an unsigned int should be treated as long in JDBCRDD

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547709#comment-14547709 ] Apache Spark commented on SPARK-7697: - User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-7697) Column with an unsigned int should be treated as long in JDBCRDD

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7697: --- Assignee: (was: Apache Spark) Column with an unsigned int should be treated as long in

[jira] [Assigned] (SPARK-6416) RDD.fold() requires the operator to be commutative

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6416: --- Assignee: Apache Spark RDD.fold() requires the operator to be commutative

[jira] [Updated] (SPARK-7443) MLlib 1.4 QA plan

2015-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7443: - Description: TODO: create JIRAs for each task and assign them accordingly. h2. API * Check API

[jira] [Created] (SPARK-7708) Incorrect task serialization with Kryo closure serializer

2015-05-18 Thread Akshat Aranya (JIRA)
Akshat Aranya created SPARK-7708: Summary: Incorrect task serialization with Kryo closure serializer Key: SPARK-7708 URL: https://issues.apache.org/jira/browse/SPARK-7708 Project: Spark

[jira] [Resolved] (SPARK-3334) Spark causes mesos-master memory leak

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3334. -- Resolution: Not A Problem Spark causes mesos-master memory leak -

[jira] [Commented] (SPARK-7706) Allow setting YARN_CONF_DIR from spark argument

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548242#comment-14548242 ] Sean Owen commented on SPARK-7706: -- I am not sure Spark is designed to be invoked this

[jira] [Commented] (SPARK-7706) Allow setting YARN_CONF_DIR from spark argument

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548243#comment-14548243 ] Sean Owen commented on SPARK-7706: -- I am not sure Spark is designed to be invoked this

[jira] [Issue Comment Deleted] (SPARK-7706) Allow setting YARN_CONF_DIR from spark argument

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7706: - Comment: was deleted (was: I am not sure Spark is designed to be invoked this way. You may need to

[jira] [Commented] (SPARK-7706) Allow setting YARN_CONF_DIR from spark argument

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548322#comment-14548322 ] Sean Owen commented on SPARK-7706: -- YARN_CONF_DIR is a YARN env variable right? not

[jira] [Commented] (SPARK-7706) Allow setting YARN_CONF_DIR from spark argument

2015-05-18 Thread Shaik Idris Ali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548418#comment-14548418 ] Shaik Idris Ali commented on SPARK-7706: I think the cleaner way in

[jira] [Commented] (SPARK-7540) PMML correctness check

2015-05-18 Thread Shuo Xiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548326#comment-14548326 ] Shuo Xiang commented on SPARK-7540: --- Schema and required field verification done using

[jira] [Updated] (SPARK-7693) Remove import scala.concurrent.ExecutionContext.Implicits.global

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7693: - Assignee: Shixiong Zhu Remove import scala.concurrent.ExecutionContext.Implicits.global

[jira] [Resolved] (SPARK-7272) User guide update for PMML model export

2015-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7272. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6219

[jira] [Assigned] (SPARK-7458) Check 1.3- 1.4 MLlib API compliance using java-compliance-checker

2015-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-7458: Assignee: Xiangrui Meng Check 1.3- 1.4 MLlib API compliance using java-compliance-checker

[jira] [Created] (SPARK-7707) User guide and example code for Statistics.kernelDensity

2015-05-18 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7707: Summary: User guide and example code for Statistics.kernelDensity Key: SPARK-7707 URL: https://issues.apache.org/jira/browse/SPARK-7707 Project: Spark Issue

[jira] [Commented] (SPARK-7706) Allow setting YARN_CONF_DIR from spark argument

2015-05-18 Thread Shaik Idris Ali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548306#comment-14548306 ] Shaik Idris Ali commented on SPARK-7706: We might have multiple types of

[jira] [Created] (SPARK-7709) spark-submit option to quit after submitting in cluster mode

2015-05-18 Thread Shay Rojansky (JIRA)
Shay Rojansky created SPARK-7709: Summary: spark-submit option to quit after submitting in cluster mode Key: SPARK-7709 URL: https://issues.apache.org/jira/browse/SPARK-7709 Project: Spark

[jira] [Commented] (SPARK-7708) Incorrect task serialization with Kryo closure serializer

2015-05-18 Thread Akshat Aranya (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548282#comment-14548282 ] Akshat Aranya commented on SPARK-7708: -- This happens because the TaskDescription

[jira] [Updated] (SPARK-7443) MLlib 1.4 QA plan

2015-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7443: - Description: TODO: create JIRAs for each task and assign them accordingly. h2. API * Check API

[jira] [Commented] (SPARK-7706) Allow setting YARN_CONF_DIR from spark argument

2015-05-18 Thread Shaik Idris Ali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548203#comment-14548203 ] Shaik Idris Ali commented on SPARK-7706: Hi, [~srowen], Thanks for the quick

[jira] [Commented] (SPARK-3334) Spark causes mesos-master memory leak

2015-05-18 Thread Iven Hsu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548220#comment-14548220 ] Iven Hsu commented on SPARK-3334: - I'm not using Spark currently, you can close it as you

[jira] [Commented] (SPARK-7110) when use saveAsNewAPIHadoopFile, sometimes it throws Delegation Token can be issued only with kerberos or web authentication

2015-05-18 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548248#comment-14548248 ] Thomas Graves commented on SPARK-7110: -- Are you using spark1.1.0 as reported in the

[jira] [Assigned] (SPARK-3251) Clarify learning interfaces

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3251: --- Assignee: (was: Apache Spark) Clarify learning interfaces

[jira] [Assigned] (SPARK-3251) Clarify learning interfaces

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3251: --- Assignee: Apache Spark Clarify learning interfaces

[jira] [Resolved] (SPARK-4962) Put TaskScheduler.start back in SparkContext to shorten cluster resources occupation period

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4962. -- Resolution: Won't Fix Put TaskScheduler.start back in SparkContext to shorten cluster resources

[jira] [Assigned] (SPARK-4962) Put TaskScheduler.start back in SparkContext to shorten cluster resources occupation period

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4962: --- Assignee: (was: Apache Spark) Put TaskScheduler.start back in SparkContext to shorten

[jira] [Assigned] (SPARK-4962) Put TaskScheduler.start back in SparkContext to shorten cluster resources occupation period

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4962: --- Assignee: Apache Spark Put TaskScheduler.start back in SparkContext to shorten cluster

[jira] [Created] (SPARK-7710) User guide and example code for math/stat functions in DataFrames

2015-05-18 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7710: Summary: User guide and example code for math/stat functions in DataFrames Key: SPARK-7710 URL: https://issues.apache.org/jira/browse/SPARK-7710 Project: Spark

[jira] [Assigned] (SPARK-5316) DAGScheduler may make shuffleToMapStage leak if getParentStages failes

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5316: --- Assignee: (was: Apache Spark) DAGScheduler may make shuffleToMapStage leak if

[jira] [Assigned] (SPARK-5316) DAGScheduler may make shuffleToMapStage leak if getParentStages failes

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5316: --- Assignee: Apache Spark DAGScheduler may make shuffleToMapStage leak if getParentStages

[jira] [Resolved] (SPARK-6888) Make DriverQuirks editable

2015-05-18 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6888. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-05-18 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548433#comment-14548433 ] Josh Rosen commented on SPARK-4105: --- I just noticed something interesting, although

[jira] [Assigned] (SPARK-4991) Worker should reconnect to Master when Master actor restart

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4991: --- Assignee: Apache Spark Worker should reconnect to Master when Master actor restart

[jira] [Assigned] (SPARK-4991) Worker should reconnect to Master when Master actor restart

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4991: --- Assignee: (was: Apache Spark) Worker should reconnect to Master when Master actor

[jira] [Assigned] (SPARK-3219) K-Means clusterer should support Bregman distance functions

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3219: --- Assignee: Apache Spark (was: Derrick Burns) K-Means clusterer should support Bregman

[jira] [Assigned] (SPARK-3261) KMeans clusterer can return duplicate cluster centers

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3261: --- Assignee: Apache Spark (was: Derrick Burns) KMeans clusterer can return duplicate cluster

[jira] [Assigned] (SPARK-3218) K-Means clusterer can fail on degenerate data

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3218: --- Assignee: Derrick Burns (was: Apache Spark) K-Means clusterer can fail on degenerate data

[jira] [Assigned] (SPARK-3261) KMeans clusterer can return duplicate cluster centers

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3261: --- Assignee: Derrick Burns (was: Apache Spark) KMeans clusterer can return duplicate cluster

[jira] [Assigned] (SPARK-3219) K-Means clusterer should support Bregman distance functions

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3219: --- Assignee: Derrick Burns (was: Apache Spark) K-Means clusterer should support Bregman

[jira] [Assigned] (SPARK-3218) K-Means clusterer can fail on degenerate data

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3218: --- Assignee: Apache Spark (was: Derrick Burns) K-Means clusterer can fail on degenerate data

[jira] [Resolved] (SPARK-4094) checkpoint should still be available after rdd actions

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4094. -- Resolution: Won't Fix checkpoint should still be available after rdd actions

[jira] [Assigned] (SPARK-4630) Dynamically determine optimal number of partitions

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4630: --- Assignee: Apache Spark (was: Kostas Sakellis) Dynamically determine optimal number of

[jira] [Assigned] (SPARK-4630) Dynamically determine optimal number of partitions

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4630: --- Assignee: Kostas Sakellis (was: Apache Spark) Dynamically determine optimal number of

[jira] [Created] (SPARK-7711) startTime() is missing

2015-05-18 Thread Sam Steingold (JIRA)
Sam Steingold created SPARK-7711: Summary: startTime() is missing Key: SPARK-7711 URL: https://issues.apache.org/jira/browse/SPARK-7711 Project: Spark Issue Type: Bug Components:

[jira] [Created] (SPARK-7712) Native Spark Window Functions Performance Improvements

2015-05-18 Thread Herman van Hovell tot Westerflier (JIRA)
Herman van Hovell tot Westerflier created SPARK-7712: Summary: Native Spark Window Functions Performance Improvements Key: SPARK-7712 URL: https://issues.apache.org/jira/browse/SPARK-7712

[jira] [Resolved] (SPARK-2883) Spark Support for ORCFile format

2015-05-18 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2883. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6194

[jira] [Closed] (SPARK-7327) DataFrame show() method doesn't like empty dataframes

2015-05-18 Thread Olivier Girardot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olivier Girardot closed SPARK-7327. --- Resolution: Cannot Reproduce I can't seem to reproduce the issue and I did not attach any

[jira] [Updated] (SPARK-7711) startTime() is missing

2015-05-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7711: - Priority: Minor (was: Major) startTime() is missing -- Key:

[jira] [Commented] (SPARK-7690) MulticlassClassificationEvaluator for tuning Multiclass Classifiers

2015-05-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548577#comment-14548577 ] Joseph K. Bradley commented on SPARK-7690: -- +1 We should also check for

[jira] [Assigned] (SPARK-6785) DateUtils can not handle date before 1970/01/01 correctly

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6785: --- Assignee: (was: Apache Spark) DateUtils can not handle date before 1970/01/01 correctly

[jira] [Commented] (SPARK-6785) DateUtils can not handle date before 1970/01/01 correctly

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548584#comment-14548584 ] Apache Spark commented on SPARK-6785: - User 'ckadner' has created a pull request for

[jira] [Assigned] (SPARK-7696) Aggregate function's result should be nullable only if the input expression is nullable

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7696: --- Assignee: Apache Spark Aggregate function's result should be nullable only if the input

[jira] [Commented] (SPARK-7696) Aggregate function's result should be nullable only if the input expression is nullable

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548617#comment-14548617 ] Apache Spark commented on SPARK-7696: - User 'ogirardot' has created a pull request for

[jira] [Assigned] (SPARK-7696) Aggregate function's result should be nullable only if the input expression is nullable

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7696: --- Assignee: (was: Apache Spark) Aggregate function's result should be nullable only if

[jira] [Resolved] (SPARK-7570) Ignore _temporary folders during partition discovery

2015-05-18 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7570. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6091

[jira] [Resolved] (SPARK-7380) Python: Transformer/Estimator should be copyable

2015-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7380. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6088

[jira] [Resolved] (SPARK-7631) treenode argString should not print children

2015-05-18 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7631. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6144

[jira] [Resolved] (SPARK-7269) Incorrect aggregation analysis

2015-05-18 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7269. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6173

[jira] [Assigned] (SPARK-6785) DateUtils can not handle date before 1970/01/01 correctly

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6785: --- Assignee: Apache Spark DateUtils can not handle date before 1970/01/01 correctly

[jira] [Commented] (SPARK-7497) test_count_by_value_and_window is flaky

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548630#comment-14548630 ] Apache Spark commented on SPARK-7497: - User 'davies' has created a pull request for

[jira] [Assigned] (SPARK-7497) test_count_by_value_and_window is flaky

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7497: --- Assignee: Davies Liu (was: Apache Spark) test_count_by_value_and_window is flaky

[jira] [Assigned] (SPARK-7497) test_count_by_value_and_window is flaky

2015-05-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7497: --- Assignee: Apache Spark (was: Davies Liu) test_count_by_value_and_window is flaky

[jira] [Resolved] (SPARK-7673) DataSourceStrategy's buildPartitionedTableScan always list list file status for all data files

2015-05-18 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-7673. - Resolution: Fixed Fix Version/s: 1.4.0 Issue has been addressed by

[jira] [Created] (SPARK-7713) Use shared broadcast hadoop conf for partitioned table scan.

2015-05-18 Thread Yin Huai (JIRA)
Yin Huai created SPARK-7713: --- Summary: Use shared broadcast hadoop conf for partitioned table scan. Key: SPARK-7713 URL: https://issues.apache.org/jira/browse/SPARK-7713 Project: Spark Issue

[jira] [Resolved] (SPARK-6216) Check Python version in worker before run PySpark job

2015-05-18 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6216. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6203

[jira] [Resolved] (SPARK-3267) Deadlock between ScalaReflectionLock and Data type initialization

2015-05-18 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3267. - Resolution: Cannot Reproduce Deadlock between ScalaReflectionLock and Data type

[jira] [Resolved] (SPARK-4523) Improve handling of serialized schema information

2015-05-18 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4523. - Resolution: Won't Fix We haven't changed this for a few release now, and it seem unlikely

[jira] [Resolved] (SPARK-6241) hiveql ANALYZE TABLE doesn't work for external tables

2015-05-18 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6241. - Resolution: Won't Fix Datasource tables have their own mechanism for reporting statistics

  1   2   3   >