git commit: [SPARK-3827] Very long RDD names are not rendered properly in web UI

2014-10-07 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master 70e824f75 -> d65fd554b [SPARK-3827] Very long RDD names are not rendered properly in web UI With Spark SQL we generate very long RDD names. These names are not properly rendered in the web UI. This PR fixes the rendering issue.

git commit: [SPARK-3827] Very long RDD names are not rendered properly in web UI

2014-10-07 Thread joshrosen
Repository: spark Updated Branches: refs/heads/branch-1.1 964e3aa48 -> 82ab4a796 [SPARK-3827] Very long RDD names are not rendered properly in web UI With Spark SQL we generate very long RDD names. These names are not properly rendered in the web UI. This PR fixes the rendering issue.

git commit: [SPARK-3808] PySpark fails to start in Windows

2014-10-07 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.1 82ab4a796 -> 3a7875d9d [SPARK-3808] PySpark fails to start in Windows Fixed a syntax error in the *.cmd script. Author: Masayoshi TSUZUKI tsudu...@oss.nttdata.co.jp Closes #2669 from tsudukim/feature/SPARK-3808 and squashes the following

git commit: [SPARK-3762] clear reference of SparkEnv after stop

2014-10-07 Thread matei
Repository: spark Updated Branches: refs/heads/master 12e2551ea -> 655032965 [SPARK-3762] clear reference of SparkEnv after stop SparkEnv is cached in a ThreadLocal object, so after stopping a SparkContext and creating a new one, the old SparkEnv is still used by some threads, which triggers many problems,
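The stale-reference problem described in SPARK-3762 can be illustrated with a minimal Python sketch. It is not Spark code: `Env`, `create_env`, `get_env`, `clear_cached_env`, and `run_demo` are hypothetical names, `threading.local` stands in for a Java `ThreadLocal`, and the worker clearing its own cache is only one assumed way to drop the reference (the snippet above does not show how the actual patch does it).

```python
import threading


class Env:
    """Hypothetical stand-in for a per-application environment object."""

    def __init__(self, name):
        self.name = name
        self.stopped = False


_cache = threading.local()  # per-thread cache, playing the role of a ThreadLocal
_latest = None              # environment created most recently by any thread


def create_env(name):
    global _latest
    _latest = Env(name)
    return _latest


def get_env():
    # Each thread remembers the first environment it saw.
    if getattr(_cache, "env", None) is None:
        _cache.env = _latest
    return _cache.env


def clear_cached_env():
    # Drop this thread's cached reference so the next lookup sees the latest env.
    _cache.env = None


def run_demo(clear_after_stop):
    """Return the env names a long-lived worker sees before and after a restart."""
    seen = []
    first_read = threading.Event()
    restarted = threading.Event()

    def worker():
        seen.append(get_env().name)  # caches the old env in this thread
        first_read.set()
        restarted.wait()
        if clear_after_stop:
            clear_cached_env()
        seen.append(get_env().name)

    create_env("old")
    thread = threading.Thread(target=worker)
    thread.start()
    first_read.wait()
    _latest.stopped = True  # "stop" the old environment
    create_env("new")       # create its replacement
    restarted.set()
    thread.join()
    return seen
```

Without clearing, the worker keeps using the stopped environment; with clearing, its next lookup picks up the replacement.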

git commit: [SPARK-3825] Log more detail when unrolling a block fails

2014-10-07 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master bc87cc410 -> 553737c6e [SPARK-3825] Log more detail when unrolling a block fails Before: ``` 14/10/06 16:45:42 WARN CacheManager: Not enough space to cache partition rdd_0_2 in memory! Free memory is 481861527 bytes. ``` After: ``` 14/10/07

git commit: [SPARK-3731] [PySpark] fix memory leak in PythonRDD

2014-10-07 Thread joshrosen
Repository: spark Updated Branches: refs/heads/branch-1.1 267c7be3b -> 553183024 [SPARK-3731] [PySpark] fix memory leak in PythonRDD The parent.getOrCompute() of PythonRDD is executed in a separate thread, so it should release the memory reserved for shuffle and unrolling finally. Author:
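The general pattern behind SPARK-3731, releasing per-thread reservations when a helper thread finishes, can be sketched as follows. This is an illustration under assumptions, not the PythonRDD code: `MemoryPool` and `compute_in_thread` are hypothetical names, and the byte count is arbitrary.

```python
import threading


class MemoryPool:
    """Hypothetical per-thread memory accounting."""

    def __init__(self):
        self._reserved = {}
        self._lock = threading.Lock()

    def reserve(self, nbytes):
        tid = threading.get_ident()
        with self._lock:
            self._reserved[tid] = self._reserved.get(tid, 0) + nbytes

    def release_all_for_this_thread(self):
        with self._lock:
            self._reserved.pop(threading.get_ident(), None)

    def total_reserved(self):
        with self._lock:
            return sum(self._reserved.values())


def compute_in_thread(pool, work):
    """Run work() in a separate thread and return any exception it raised."""
    errors = []

    def run():
        try:
            pool.reserve(1024)  # memory reserved on behalf of this thread
            work()
        except Exception as exc:
            errors.append(exc)  # recorded for the caller instead of re-raised
        finally:
            # Runs whether work() succeeded or failed, so nothing stays reserved.
            pool.release_all_for_this_thread()

    thread = threading.Thread(target=run)
    thread.start()
    thread.join()
    return errors[0] if errors else None
```

Because the release sits in `finally`, the pool returns to zero even when the computation fails partway through.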

git commit: [SPARK-3777] Display Executor ID for Tasks in Stage page

2014-10-07 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 553737c6e -> 446063eca [SPARK-3777] Display Executor ID for Tasks in Stage page Now the Stage page only displays Executor(host) for tasks. However, more than one Executor may be running on the same host. Currently, when some task

git commit: [SPARK-3790][MLlib] CosineSimilarity Example

2014-10-07 Thread meng
Repository: spark Updated Branches: refs/heads/master 446063eca -> 3d7b36e0d [SPARK-3790][MLlib] CosineSimilarity Example Provide example for `RowMatrix.columnSimilarity()` Author: Reza Zadeh riz...@gmail.com Closes #2622 from rezazadeh/dimsumexample and squashes the following commits:
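For readers unfamiliar with the quantity the example in SPARK-3790 computes, here is a brute-force sketch of pairwise cosine similarity between the columns of a small matrix. It is plain Python, not the MLlib API, and it does no sampling; `column_cosine_similarities` is a hypothetical helper name.

```python
import math


def column_cosine_similarities(rows):
    """Return {(i, j): cosine similarity} for every column pair with i < j."""
    n_cols = len(rows[0])
    # Euclidean norm of each column.
    norms = [math.sqrt(sum(row[j] ** 2 for row in rows)) for j in range(n_cols)]
    sims = {}
    for i in range(n_cols):
        for j in range(i + 1, n_cols):
            if norms[i] == 0 or norms[j] == 0:
                continue  # similarity is undefined for an all-zero column
            dot = sum(row[i] * row[j] for row in rows)
            sims[(i, j)] = dot / (norms[i] * norms[j])
    return sims


# Two rows, three columns.
sims = column_cosine_similarities([[1.0, 0.0, 1.0],
                                   [0.0, 1.0, 1.0]])
```

Columns 0 and 1 share no nonzero rows, so their similarity is 0; each of them overlaps column 2 in one of its two nonzero rows.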

git commit: [SPARK-3832][MLlib] Upgrade Breeze dependency to 0.10

2014-10-07 Thread meng
Repository: spark Updated Branches: refs/heads/master 098c7344e -> b32bb72e8 [SPARK-3832][MLlib] Upgrade Breeze dependency to 0.10 In Breeze 0.10, the L1regParam can be configured through an anonymous function in OWLQN, and each component can be penalized differently. This is required for

git commit: [SPARK-3398] [EC2] Have spark-ec2 intelligently wait for specific cluster states

2014-10-07 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master b32bb72e8 -> 5912ca671 [SPARK-3398] [EC2] Have spark-ec2 intelligently wait for specific cluster states Instead of waiting arbitrary amounts of time for the cluster to reach a specific state, this patch lets `spark-ec2` explicitly wait for
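The idea in SPARK-3398, polling until a target state is reached rather than sleeping for a fixed time, can be sketched generically. This is not the `spark-ec2` implementation: `wait_for_state`, its parameters, and the state names are assumptions made for illustration.

```python
import time


def wait_for_state(get_state, target, poll_interval=5.0, timeout=600.0,
                   sleep=time.sleep, clock=time.monotonic):
    """Poll get_state() until it returns target; return the number of polls.

    Raises TimeoutError if target is not reached within timeout seconds.
    """
    deadline = clock() + timeout
    polls = 0
    while True:
        polls += 1
        if get_state() == target:
            return polls
        if clock() >= deadline:
            raise TimeoutError(f"state {target!r} not reached after {polls} polls")
        sleep(poll_interval)


# Usage with a fake cluster that becomes ready on the third check.
states = iter(["pending", "pending", "running"])
polls = wait_for_state(lambda: next(states), "running", sleep=lambda seconds: None)
```

Passing `sleep` and `clock` as parameters keeps the loop testable without real delays.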

git commit: [SPARK-3829] Make Spark logo image on the header of HistoryPage as a link to HistoryPage's page #1

2014-10-07 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.1 e8afb733a -> a1f833f75 [SPARK-3829] Make Spark logo image on the header of HistoryPage as a link to HistoryPage's page #1 There is a Spark logo on the header of HistoryPage. We can have too many HistoryPages if we run 20+ applications.

git commit: [SPARK-3412] [PySpark] Replace Epydoc with Sphinx to generate Python API docs

2014-10-07 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master b69c9fb6f -> 798ed22c2 [SPARK-3412] [PySpark] Replace Epydoc with Sphinx to generate Python API docs Retire Epydoc, use Sphinx to generate API docs. Refine Sphinx docs, also convert some docstrings into Sphinx style. It looks like: ![api

git commit: [SPARK-3836] [REPL] Spark REPL optionally propagate internal exceptions

2014-10-07 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 798ed22c2 -> c7818434f [SPARK-3836] [REPL] Spark REPL optionally propagate internal exceptions Optionally have the repl throw exceptions generated by interpreted code, instead of swallowing the exception and returning it as text output.
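The behavior SPARK-3836 describes, an interpreter that either swallows an exception and reports it as text or lets it propagate to the caller, can be sketched in a few lines of Python. This is an analogy, not the Spark REPL code: `interpret` and its `propagate_exceptions` flag are hypothetical names.

```python
def interpret(source, propagate_exceptions=False):
    """Execute source; report failures as text unless propagation is requested."""
    try:
        exec(source, {})
        return "ok"
    except Exception as exc:
        if propagate_exceptions:
            raise  # let the embedding program handle the original exception
        return f"error: {type(exc).__name__}: {exc}"
```

With the flag off, a caller only sees a string and has to parse it to detect failure; with it on, a program embedding the interpreter can catch the exception directly.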