spark git commit: [SPARK-6036][CORE] avoid race condition between eventlogListener and akka actor system

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 0375a413b - 8cd1692c9 [SPARK-6036][CORE] avoid race condition between eventlogListener and akka actor system For detail description, pls refer to [SPARK-6036](https://issues.apache.org/jira/browse/SPARK-6036). Author: Zhang, Liye

spark git commit: [SPARK-5495][UI] Add app and driver kill function in master web UI

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 12135e905 - 67595eb8f [SPARK-5495][UI] Add app and driver kill function in master web UI Add application kill function in master web UI for standalone mode. Details can be seen in

spark git commit: SPARK-2168 [Spark core] Use relative URIs for the app links in the History Server.

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.3 6200f0709 - 485b91934 SPARK-2168 [Spark core] Use relative URIs for the app links in the History Server. As agreed in PR #1160 adding test to verify if history server generates relative links to applications. Author: Lukasz

spark git commit: SPARK-2168 [Spark core] Use relative URIs for the app links in the History Server.

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 67595eb8f - 4a8a0a8ec SPARK-2168 [Spark core] Use relative URIs for the app links in the History Server. As agreed in PR #1160 adding test to verify if history server generates relative links to applications. Author: Lukasz Jastrzebski

spark git commit: fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 7c99a014f - 0375a413b fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode jira case spark-6033 https://issues.apache.org/jira/browse/SPARK-6033 In standalone deploy mode, the cleanup will only remove the stopped

spark git commit: fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.3 485b91934 - b8db84c5b fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode jira case spark-6033 https://issues.apache.org/jira/browse/SPARK-6033 In standalone deploy mode, the cleanup will only remove the

spark git commit: fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.1 2785210fa - 814934da6 fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode jira case spark-6033 https://issues.apache.org/jira/browse/SPARK-6033 In standalone deploy mode, the cleanup will only remove the

spark git commit: fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.0 14e042b65 - e751f8f26 fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode jira case spark-6033 https://issues.apache.org/jira/browse/SPARK-6033 In standalone deploy mode, the cleanup will only remove the

spark git commit: fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.2 d0bf938ec - d4ce702c4 fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode jira case spark-6033 https://issues.apache.org/jira/browse/SPARK-6033 In standalone deploy mode, the cleanup will only remove the

spark git commit: [SPARK-5771][UI][hotfix] Change Requested Cores into * if default cores is not set

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 5e5ad6558 - 12135e905 [SPARK-5771][UI][hotfix] Change Requested Cores into * if default cores is not set cc andrewor14, srowen. Author: jerryshao saisai.s...@intel.com Closes #4800 from jerryshao/SPARK-5771 and squashes the following

spark git commit: SPARK-4300 [CORE] Race condition during SparkWorker shutdown

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.2 2d83442f2 - 64e0cbc73 SPARK-4300 [CORE] Race condition during SparkWorker shutdown Close appender saving stdout/stderr before destroying process to avoid exception on reading closed input stream. (This also removes a redundant

spark git commit: [SPARK-6023][SQL] ParquetConversions fails to replace the destination MetastoreRelation of an InsertIntoTable node to ParquetRelation2

2015-02-26 Thread lian
Repository: spark Updated Branches: refs/heads/branch-1.3 a51d9dbeb - e0f5fb0ad [SPARK-6023][SQL] ParquetConversions fails to replace the destination MetastoreRelation of an InsertIntoTable node to ParquetRelation2 JIRA: https://issues.apache.org/jira/browse/SPARK-6023 Author: Yin Huai

spark git commit: [SPARK-6023][SQL] ParquetConversions fails to replace the destination MetastoreRelation of an InsertIntoTable node to ParquetRelation2

2015-02-26 Thread lian
Repository: spark Updated Branches: refs/heads/master 51a6f9097 - f02394d06 [SPARK-6023][SQL] ParquetConversions fails to replace the destination MetastoreRelation of an InsertIntoTable node to ParquetRelation2 JIRA: https://issues.apache.org/jira/browse/SPARK-6023 Author: Yin Huai

spark git commit: [SPARK-6016][SQL] Cannot read the parquet table after overwriting the existing table when spark.sql.parquet.cacheMetadata=true

2015-02-26 Thread lian
Repository: spark Updated Branches: refs/heads/master f02394d06 - 192e42a29 [SPARK-6016][SQL] Cannot read the parquet table after overwriting the existing table when spark.sql.parquet.cacheMetadata=true Please see JIRA (https://issues.apache.org/jira/browse/SPARK-6016) for details of the

spark git commit: [SPARK-5801] [core] Avoid creating nested directories.

2015-02-26 Thread srowen
Repository: spark Updated Branches: refs/heads/master 192e42a29 - df3d559b3 [SPARK-5801] [core] Avoid creating nested directories. Cache the value of the local root dirs to use for storing local data, so that the same directories are reused. Also, to avoid an extra level of nesting, use a

spark git commit: [SPARK-6004][MLlib] Pick the best model when training GradientBoostedTrees with validation

2015-02-26 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 235865754 - cfff397f0 [SPARK-6004][MLlib] Pick the best model when training GradientBoostedTrees with validation Since the validation error does not change monotonically, in practice, it should be proper to pick the best model when

spark git commit: [SPARK-5363] Fix bug in PythonRDD: remove() inside iterator is not safe

2015-02-26 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master cfff397f0 - 7fa960e65 [SPARK-5363] Fix bug in PythonRDD: remove() inside iterator is not safe Removing elements from a mutable HashSet while iterating over it can cause the iteration to incorrectly skip over entries that were not removed.

spark git commit: [SPARK-5363] Fix bug in PythonRDD: remove() inside iterator is not safe

2015-02-26 Thread joshrosen
Repository: spark Updated Branches: refs/heads/branch-1.3 dafb3d210 - 5d309ad6c [SPARK-5363] Fix bug in PythonRDD: remove() inside iterator is not safe Removing elements from a mutable HashSet while iterating over it can cause the iteration to incorrectly skip over entries that were not

spark git commit: [SPARK-5363] Fix bug in PythonRDD: remove() inside iterator is not safe

2015-02-26 Thread joshrosen
Repository: spark Updated Branches: refs/heads/branch-1.2 015895ab5 - cc7313d09 [SPARK-5363] Fix bug in PythonRDD: remove() inside iterator is not safe Removing elements from a mutable HashSet while iterating over it can cause the iteration to incorrectly skip over entries that were not

spark git commit: [SPARK-6015] fix links to source code in Python API docs

2015-02-26 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.2 00112baf9 - 015895ab5 [SPARK-6015] fix links to source code in Python API docs Author: Davies Liu dav...@databricks.com Closes #4772 from davies/source_link and squashes the following commits: 389f0c6 [Davies Liu] fix link to source

spark git commit: [SPARK-6007][SQL] Add numRows param in DataFrame.show()

2015-02-26 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.3 b5c5e93d7 - 7c779d8d5 [SPARK-6007][SQL] Add numRows param in DataFrame.show() It is useful to let the user decide the number of rows to show in DataFrame.show Author: Jacky Li jacky.li...@huawei.com Closes #4767 from jackylk/show and

spark git commit: [SPARK-6007][SQL] Add numRows param in DataFrame.show()

2015-02-26 Thread rxin
Repository: spark Updated Branches: refs/heads/master df3d559b3 - 235865754 [SPARK-6007][SQL] Add numRows param in DataFrame.show() It is useful to let the user decide the number of rows to show in DataFrame.show Author: Jacky Li jacky.li...@huawei.com Closes #4767 from jackylk/show and

spark git commit: SPARK-4704 [CORE] SparkSubmitDriverBootstrap doesn't flush output

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.2 cc7313d09 - 602d5c1fc SPARK-4704 [CORE] SparkSubmitDriverBootstrap doesn't flush output Join on output threads to make sure any lingering output from process reaches stdout, stderr before exiting CC andrewor14 since I believe he

spark git commit: SPARK-4704 [CORE] SparkSubmitDriverBootstrap doesn't flush output

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 7fa960e65 - cd5c8d7bb SPARK-4704 [CORE] SparkSubmitDriverBootstrap doesn't flush output Join on output threads to make sure any lingering output from process reaches stdout, stderr before exiting CC andrewor14 since I believe he created

spark git commit: Modify default value description for spark.scheduler.minRegisteredResourcesRatio on docs.

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.2 602d5c1fc - 94faf4c49 Modify default value description for spark.scheduler.minRegisteredResourcesRatio on docs. The configuration is not supported in mesos mode now. See https://github.com/apache/spark/pull/1462 Author: Li Zhihui

spark git commit: Modify default value description for spark.scheduler.minRegisteredResourcesRatio on docs.

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master cd5c8d7bb - 10094a523 Modify default value description for spark.scheduler.minRegisteredResourcesRatio on docs. The configuration is not supported in mesos mode now. See https://github.com/apache/spark/pull/1462 Author: Li Zhihui

spark git commit: [SPARK-3562]Periodic cleanup event logs

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 10094a523 - 8942b522d [SPARK-3562]Periodic cleanup event logs Author: xukun 00228947 xukun...@huawei.com Closes #4214 from viper-kun/cleaneventlog and squashes the following commits: 7a5b9c5 [xukun 00228947] fix issue 31674ee [xukun

spark git commit: [SPARK-6027][SPARK-5546] Fixed --jar and --packages not working for KafkaUtils and improved error message

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 8942b522d - aa63f633d [SPARK-6027][SPARK-5546] Fixed --jar and --packages not working for KafkaUtils and improved error message The problem with SPARK-6027 in short is that JARs like the kafka-assembly.jar does not work in python as the

spark git commit: [SPARK-6027][SPARK-5546] Fixed --jar and --packages not working for KafkaUtils and improved error message

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.3 62652dc5b - 731a997db [SPARK-6027][SPARK-5546] Fixed --jar and --packages not working for KafkaUtils and improved error message The problem with SPARK-6027 in short is that JARs like the kafka-assembly.jar does not work in python as

spark git commit: [SPARK-6018] [YARN] NoSuchMethodError in Spark app is swallowed by YARN AM

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master aa63f633d - 5f3238b3b [SPARK-6018] [YARN] NoSuchMethodError in Spark app is swallowed by YARN AM Author: Cheolsoo Park cheols...@netflix.com Closes #4773 from piaozhexiu/SPARK-6018 and squashes the following commits: 2a919d5 [Cheolsoo

spark git commit: [SPARK-6018] [YARN] NoSuchMethodError in Spark app is swallowed by YARN AM

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.3 731a997db - fe7967483 [SPARK-6018] [YARN] NoSuchMethodError in Spark app is swallowed by YARN AM Author: Cheolsoo Park cheols...@netflix.com Closes #4773 from piaozhexiu/SPARK-6018 and squashes the following commits: 2a919d5

spark git commit: [SPARK-6018] [YARN] NoSuchMethodError in Spark app is swallowed by YARN AM

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.2 94faf4c49 - e21475d16 [SPARK-6018] [YARN] NoSuchMethodError in Spark app is swallowed by YARN AM Author: Cheolsoo Park cheols...@netflix.com Closes #4773 from piaozhexiu/SPARK-6018 and squashes the following commits: 2a919d5

spark git commit: [SPARK-5914] to run spark-submit requiring only user perm on windows

2015-02-26 Thread srowen
Repository: spark Updated Branches: refs/heads/master e43139f40 - 51a6f9097 [SPARK-5914] to run spark-submit requiring only user perm on windows Because windows on-default does not grant read permission to jars except to admin, spark-submit would fail with ClassNotFound exception if user

spark git commit: Add a note for context termination for History server on Yarn

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.3 fe7967483 - 297c3ef82 Add a note for context termination for History server on Yarn The history server on Yarn only shows completed jobs. This adds a note concerning the needed explicit context termination at the end of a spark job

spark git commit: [SPARK-5951][YARN] Remove unreachable driver memory properties in yarn client mode

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master c871e2dae - b38dec2ff [SPARK-5951][YARN] Remove unreachable driver memory properties in yarn client mode Remove unreachable driver memory properties in yarn client mode Author: mohit.goyal mohit.go...@guavus.com Closes #4730 from

spark git commit: [SPARK-5951][YARN] Remove unreachable driver memory properties in yarn client mode

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.3 297c3ef82 - 5b426cb1f [SPARK-5951][YARN] Remove unreachable driver memory properties in yarn client mode Remove unreachable driver memory properties in yarn client mode Author: mohit.goyal mohit.go...@guavus.com Closes #4730 from

spark git commit: SPARK-4300 [CORE] Race condition during SparkWorker shutdown

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 5f3238b3b - 3fb53c029 SPARK-4300 [CORE] Race condition during SparkWorker shutdown Close appender saving stdout/stderr before destroying process to avoid exception on reading closed input stream. (This also removes a redundant `waitFor()`

spark git commit: Add a note for context termination for History server on Yarn

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.0 f74bccbe3 - 14e042b65 Add a note for context termination for History server on Yarn The history server on Yarn only shows completed jobs. This adds a note concerning the needed explicit context termination at the end of a spark job

spark git commit: Add a note for context termination for History server on Yarn

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.1 36f3c499f - 2785210fa Add a note for context termination for History server on Yarn The history server on Yarn only shows completed jobs. This adds a note concerning the needed explicit context termination at the end of a spark job

spark git commit: Add a note for context termination for History server on Yarn

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.2 64e0cbc73 - 58b3aa692 Add a note for context termination for History server on Yarn The history server on Yarn only shows completed jobs. This adds a note concerning the needed explicit context termination at the end of a spark job

spark git commit: Add a note for context termination for History server on Yarn

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 3fb53c029 - c871e2dae Add a note for context termination for History server on Yarn The history server on Yarn only shows completed jobs. This adds a note concerning the needed explicit context termination at the end of a spark job which

spark git commit: SPARK-6045 RecordWriter should be checked against null in PairRDDFunctio...

2015-02-26 Thread srowen
Repository: spark Updated Branches: refs/heads/master b38dec2ff - e60ad2f4c SPARK-6045 RecordWriter should be checked against null in PairRDDFunctio... ...ns#saveAsNewAPIHadoopDataset Author: tedyu yuzhih...@gmail.com Closes #4794 from tedyu/master and squashes the following commits:

spark git commit: [SPARK-6037][SQL] Avoiding duplicate Parquet schema merging

2015-02-26 Thread lian
Repository: spark Updated Branches: refs/heads/master 18f209843 - 4ad5153f5 [SPARK-6037][SQL] Avoiding duplicate Parquet schema merging `FilteringParquetRowInputFormat` manually merges Parquet schemas before computing splits. However, it is duplicate because the schemas are already merged

spark git commit: [SPARK-6037][SQL] Avoiding duplicate Parquet schema merging

2015-02-26 Thread lian
Repository: spark Updated Branches: refs/heads/branch-1.3 b83a93e08 - 25a109e42 [SPARK-6037][SQL] Avoiding duplicate Parquet schema merging `FilteringParquetRowInputFormat` manually merges Parquet schemas before computing splits. However, it is duplicate because the schemas are already

spark git commit: SPARK-4579 [WEBUI] Scheduling Delay appears negative

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master e60ad2f4c - fbc469473 SPARK-4579 [WEBUI] Scheduling Delay appears negative Ensure scheduler delay handles unfinished task case, and ensure delay is never negative even due to rounding Author: Sean Owen so...@cloudera.com Closes #4796

spark git commit: [SPARK-5529][CORE]Add expireDeadHosts in HeartbeatReceiver

2015-02-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master fbc469473 - 18f209843 [SPARK-5529][CORE]Add expireDeadHosts in HeartbeatReceiver If a blockManager has not send heartBeat more than 120s, BlockManagerMasterActor will remove it. But coarseGrainedSchedulerBackend can only remove executor

spark git commit: [SPARK-6024][SQL] When a data source table has too many columns, it's schema cannot be stored in metastore.

2015-02-26 Thread rxin
Repository: spark Updated Branches: refs/heads/master 4ad5153f5 - 5e5ad6558 [SPARK-6024][SQL] When a data source table has too many columns, it's schema cannot be stored in metastore. JIRA: https://issues.apache.org/jira/browse/SPARK-6024 Author: Yin Huai yh...@databricks.com Closes #4795

spark git commit: [SPARK-6024][SQL] When a data source table has too many columns, it's schema cannot be stored in metastore.

2015-02-26 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.3 25a109e42 - 6200f0709 [SPARK-6024][SQL] When a data source table has too many columns, it's schema cannot be stored in metastore. JIRA: https://issues.apache.org/jira/browse/SPARK-6024 Author: Yin Huai yh...@databricks.com Closes