git commit: [CORE] Bugfix: LogErr format in DAGScheduler.scala

2014-09-29 Thread rxin
Repository: spark Updated Branches: refs/heads/master 1651cc117 -> 657bdff41 [CORE] Bugfix: LogErr format in DAGScheduler.scala Author: Zhang, Liye <liye.zh...@intel.com> Closes #2572 from liyezhang556520/DAGLogErr and squashes the following commits: 5be2491 [Zhang, Liye] Bugfix: LogErr format

git commit: [CORE] Bugfix: LogErr format in DAGScheduler.scala

2014-09-29 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.1 004b6fa77 -> 7d88471e8 [CORE] Bugfix: LogErr format in DAGScheduler.scala Author: Zhang, Liye <liye.zh...@intel.com> Closes #2572 from liyezhang556520/DAGLogErr and squashes the following commits: 5be2491 [Zhang, Liye] Bugfix: LogErr

git commit: [EC2] Sort long, manually-inputted dictionaries

2014-09-29 Thread rxin
Repository: spark Updated Branches: refs/heads/master 657bdff41 -> aedd251c5 [EC2] Sort long, manually-inputted dictionaries Similar to the work done in #2571, this PR just sorts the remaining manually-inputted dicts in the EC2 script so they are easier to maintain. Author: Nicholas Chammas

git commit: [MLlib] [SPARK-2885] DIMSUM: All-pairs similarity

2014-09-29 Thread meng
Repository: spark Updated Branches: refs/heads/master aedd251c5 -> 587a0cd7e [MLlib] [SPARK-2885] DIMSUM: All-pairs similarity # All-pairs similarity via DIMSUM Compute all pairs of similar vectors using brute force approach, and also DIMSUM sampling approach. Laying down some notation: we

git commit: [SPARK-3032][Shuffle] Fix key comparison integer overflow introduced sorting exception

2014-09-29 Thread matei
Repository: spark Updated Branches: refs/heads/branch-1.1 7d88471e8 -> df5a62f51 [SPARK-3032][Shuffle] Fix key comparison integer overflow introduced sorting exception Previous key comparison in `ExternalSorter` will get wrong sorting result or exception when key comparison overflows,
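
The class of bug named in this commit title can be illustrated with a minimal Java sketch. It is not the `ExternalSorter` code; the class name `OverflowCompare` and both comparator names are hypothetical. It assumes a comparator that subtracts two ints, which wraps around when the true difference does not fit in an int, and contrasts it with `Integer.compare`.

```java
import java.util.Comparator;

// Hypothetical illustration; not taken from Spark's ExternalSorter.
class OverflowCompare {
    // Broken: a - b wraps around when the true difference does not fit in an int.
    static final Comparator<Integer> BY_SUBTRACTION = (a, b) -> a - b;

    // Safe: Integer.compare returns the sign of the ordering without subtracting.
    static final Comparator<Integer> BY_COMPARE = Integer::compare;

    public static void main(String[] args) {
        int small = Integer.MIN_VALUE;
        int large = 1;
        // MIN_VALUE - 1 wraps to MAX_VALUE, so subtraction claims small > large.
        System.out.println(BY_SUBTRACTION.compare(small, large) > 0); // true (wrong order)
        System.out.println(BY_COMPARE.compare(small, large) < 0);     // true (correct order)
    }
}
```

A comparator that is inconsistent in this way can produce a wrong order, and some sort implementations throw an exception when they detect the inconsistency.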

git commit: Add more debug message for ManagedBuffer

2014-09-29 Thread adav
Repository: spark Updated Branches: refs/heads/master dab1b0ae2 -> e43c72fe0 Add more debug message for ManagedBuffer This is to help debug the error reported at http://apache-spark-user-list.1001560.n3.nabble.com/SQL-queries-fail-in-1-2-0-SNAPSHOT-td15327.html Author: Reynold Xin

git commit: [graphX] GraphOps: random pick vertex bug

2014-09-29 Thread ankurdave
Repository: spark Updated Branches: refs/heads/master 0bbe7faef -> 51229ff7f [graphX] GraphOps: random pick vertex bug When `numVertices > 50`, probability is set to 0. This would cause infinite loop. Author: yingjieMiao <ying...@42go.com> Closes #2553 from yingjieMiao/graphx and squashes the
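
One plausible way for a sampling probability to become 0 once the vertex count exceeds 50 is integer division. The Java sketch below rests on that assumption; it is not the GraphX code, and the class name `PickProbability` and its method names are hypothetical. If a loop keeps sampling until some vertex is picked, a probability of 0 means it never terminates.

```java
// Hypothetical illustration; not taken from GraphX's GraphOps.
class PickProbability {
    // Integer division: 50 / numVertices truncates to 0 once numVertices > 50.
    static double truncated(long numVertices) {
        return 50 / numVertices;
    }

    // Floating-point division keeps the fractional probability.
    static double fractional(long numVertices) {
        return 50.0 / numVertices;
    }

    public static void main(String[] args) {
        System.out.println(truncated(100));  // 0.0
        System.out.println(fractional(100)); // 0.5
    }
}
```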