spark git commit: [SPARK-11453][SQL] append data to partitioned table will messes up the result

2015-11-08 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 97b7080cf -> d8b50f702 [SPARK-11453][SQL] append data to partitioned table will messes up the result The reason is that: 1. For partitioned hive table, we will move the partitioned columns after data columns. (e.g. `` partition by `a` wil

spark git commit: [SPARK-11453][SQL] append data to partitioned table will messes up the result

2015-11-08 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-1.6 fddf0c413 -> 7eaf48eeb [SPARK-11453][SQL] append data to partitioned table will messes up the result The reason is that: 1. For partitioned hive table, we will move the partitioned columns after data columns. (e.g. `` partition by `a`

spark git commit: [SPARK-11564][SQL] Dataset Java API audit

2015-11-08 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.6 6ade67e5f -> fddf0c413 [SPARK-11564][SQL] Dataset Java API audit A few changes: 1. Removed fold, since it can be confusing for distributed collections. 2. Created specific interfaces for each Dataset function (e.g. MapFunction, Reduce

spark git commit: [SPARK-11564][SQL] Dataset Java API audit

2015-11-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master b2d195e13 -> 97b7080cf [SPARK-11564][SQL] Dataset Java API audit A few changes: 1. Removed fold, since it can be confusing for distributed collections. 2. Created specific interfaces for each Dataset function (e.g. MapFunction, ReduceFunc

spark git commit: [SPARK-11554][SQL] add map/flatMap to GroupedDataset

2015-11-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.6 27161f59e -> 6ade67e5f [SPARK-11554][SQL] add map/flatMap to GroupedDataset Author: Wenchen Fan Closes #9521 from cloud-fan/map. (cherry picked from commit b2d195e137fad88d567974659fa7023ff4da96cd) Signed-off-by: Michael Armbrust

spark git commit: [SPARK-11554][SQL] add map/flatMap to GroupedDataset

2015-11-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 26739059b -> b2d195e13 [SPARK-11554][SQL] add map/flatMap to GroupedDataset Author: Wenchen Fan Closes #9521 from cloud-fan/map. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/s

spark git commit: [SPARK-10046][SQL] Hive warehouse dir not set in current directory when not …

2015-11-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master 30c8ba71a -> 26739059b [SPARK-10046][SQL] Hive warehouse dir not set in current directory when not … Doc change to align with HiveConf default in terms of where to create `warehouse` directory. Author: xin Wu Closes #9365 from xwu0226

spark git commit: [SPARK-10046][SQL] Hive warehouse dir not set in current directory when not …

2015-11-08 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 ea786a196 -> 78a5cf198 [SPARK-10046][SQL] Hive warehouse dir not set in current directory when not … Doc change to align with HiveConf default in terms of where to create `warehouse` directory. Author: xin Wu Closes #9365 from xwu

spark git commit: [SPARK-10046][SQL] Hive warehouse dir not set in current directory when not …

2015-11-08 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.6 41b2bb1c3 -> 27161f59e [SPARK-10046][SQL] Hive warehouse dir not set in current directory when not … Doc change to align with HiveConf default in terms of where to create `warehouse` directory. Author: xin Wu Closes #9365 from xwu

spark git commit: [SPARK-11451][SQL] Support single distinct count on multiple columns.

2015-11-08 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-1.6 7b3736098 -> 41b2bb1c3 [SPARK-11451][SQL] Support single distinct count on multiple columns. This PR adds support for multiple column in a single count distinct aggregate to the new aggregation path. cc yhuai Author: Herman van Hovel

spark git commit: [SPARK-11451][SQL] Support single distinct count on multiple columns.

2015-11-08 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 5c4e6d7ec -> 30c8ba71a [SPARK-11451][SQL] Support single distinct count on multiple columns. This PR adds support for multiple column in a single count distinct aggregate to the new aggregation path. cc yhuai Author: Herman van Hovell

spark git commit: [DOC][SQL] Remove redundant out-of-place python snippet

2015-11-08 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 0bc813b45 -> 7b3736098 [DOC][SQL] Remove redundant out-of-place python snippet This snippet seems to be mistakenly introduced at two places in #5348. Author: Rohit Agarwal Closes #9540 from mindprince/patch-1. (cherry picked from co

spark git commit: [DOC][SQL] Remove redundant out-of-place python snippet

2015-11-08 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 3917f6153 -> ea786a196 [DOC][SQL] Remove redundant out-of-place python snippet This snippet seems to be mistakenly introduced at two places in #5348. Author: Rohit Agarwal Closes #9540 from mindprince/patch-1. (cherry picked from co

spark git commit: [DOC][SQL] Remove redundant out-of-place python snippet

2015-11-08 Thread srowen
Repository: spark Updated Branches: refs/heads/master d98190210 -> 5c4e6d7ec [DOC][SQL] Remove redundant out-of-place python snippet This snippet seems to be mistakenly introduced at two places in #5348. Author: Rohit Agarwal Closes #9540 from mindprince/patch-1. Project: http://git-wip-u

spark git commit: [SPARK-11476][DOCS] Incorrect function referred to in MLib Random data generation documentation

2015-11-08 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 8fb6696cd -> 3917f6153 [SPARK-11476][DOCS] Incorrect function referred to in MLib Random data generation documentation Fix Python example to use normalRDD as advertised Author: Sean Owen Closes #9529 from srowen/SPARK-11476. (cherr

spark git commit: [SPARK-11476][DOCS] Incorrect function referred to in MLib Random data generation documentation

2015-11-08 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 696d4a52d -> 0bc813b45 [SPARK-11476][DOCS] Incorrect function referred to in MLib Random data generation documentation Fix Python example to use normalRDD as advertised Author: Sean Owen Closes #9529 from srowen/SPARK-11476. (cherr

spark git commit: [SPARK-11476][DOCS] Incorrect function referred to in MLib Random data generation documentation

2015-11-08 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4b69a42ed -> d98190210 [SPARK-11476][DOCS] Incorrect function referred to in MLib Random data generation documentation Fix Python example to use normalRDD as advertised Author: Sean Owen Closes #9529 from srowen/SPARK-11476. Project: