spark git commit: [SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python.

2015-03-31 Thread rxin
Repository: spark Updated Branches: refs/heads/master f07e71406 - b80a030e9 [SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python. To maintain consistency with the Scala API. Author: Reynold Xin r...@databricks.com Closes #5284 from rxin/df-na-alias and squashes the

spark git commit: [SPARK-6625][SQL] Add common string filters to data sources.

2015-03-31 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.3 67c885e3c - a97d4e6bf [SPARK-6625][SQL] Add common string filters to data sources. Filters such as startsWith, endsWith, contains will be very useful for data sources that provide search functionality, e.g. Succinct, Elastic Search,

spark git commit: [SPARK-6625][SQL] Add common string filters to data sources.

2015-03-31 Thread rxin
Repository: spark Updated Branches: refs/heads/master 56775571c - f07e71406 [SPARK-6625][SQL] Add common string filters to data sources. Filters such as startsWith, endsWith, contains will be very useful for data sources that provide search functionality, e.g. Succinct, Elastic Search, Solr.
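
To illustrate how a data source might apply the pushed-down string filters described above, here is a minimal sketch in plain Python. It is not Spark's API: the three classes are hypothetical stand-ins modeled on the filter names in the commit, and `matches`, `scan`, and the sample rows are invented for illustration.

```python
from dataclasses import dataclass


# Hypothetical stand-ins for the string filters named in the commit.
# The real filter classes belong to Spark's Scala data sources API.
@dataclass(frozen=True)
class StringStartsWith:
    attribute: str
    value: str


@dataclass(frozen=True)
class StringEndsWith:
    attribute: str
    value: str


@dataclass(frozen=True)
class StringContains:
    attribute: str
    value: str


def matches(flt, row):
    """Evaluate one filter against a row represented as a dict."""
    text = row.get(flt.attribute)
    if text is None:
        # Assumption: a missing or null value never satisfies a string filter.
        return False
    if isinstance(flt, StringStartsWith):
        return text.startswith(flt.value)
    if isinstance(flt, StringEndsWith):
        return text.endswith(flt.value)
    if isinstance(flt, StringContains):
        return flt.value in text
    raise TypeError(f"unsupported filter: {flt!r}")


def scan(rows, filters):
    """Return the rows that satisfy every filter (filters are ANDed)."""
    return [row for row in rows if all(matches(f, row) for f in filters)]


rows = [{"name": "spark-sql"}, {"name": "spark-mllib"}, {"name": "hadoop-sql"}]
prefix_hits = scan(rows, [StringStartsWith("name", "spark")])
suffix_hits = scan(rows, [StringEndsWith("name", "sql")])
substring_hits = scan(rows, [StringContains("name", "k-m")])
```

A source with native search support would translate each filter into its own query syntax instead of scanning rows; the sketch only shows the semantics each filter is expected to carry.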

spark git commit: [SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python.

2015-03-31 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.3 a97d4e6bf - cf651a46e [SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python. To maintain consistency with the Scala API. Author: Reynold Xin r...@databricks.com Closes #5284 from rxin/df-na-alias and squashes the

spark git commit: [SPARK-6618][SQL] HiveMetastoreCatalog.lookupRelation should use fine-grained lock

2015-03-31 Thread lian
Repository: spark Updated Branches: refs/heads/master b80a030e9 - 314afd0e2 [SPARK-6618][SQL] HiveMetastoreCatalog.lookupRelation should use fine-grained lock JIRA: https://issues.apache.org/jira/browse/SPARK-6618 Author: Yin Huai yh...@databricks.com Closes #5281 from

spark git commit: [SPARK-6618][SQL] HiveMetastoreCatalog.lookupRelation should use fine-grained lock

2015-03-31 Thread lian
Repository: spark Updated Branches: refs/heads/branch-1.3 cf651a46e - fd600cec0 [SPARK-6618][SQL] HiveMetastoreCatalog.lookupRelation should use fine-grained lock JIRA: https://issues.apache.org/jira/browse/SPARK-6618 Author: Yin Huai yh...@databricks.com Closes #5281 from

spark git commit: [SPARK-6542][SQL] add CreateStruct

2015-03-31 Thread lian
Repository: spark Updated Branches: refs/heads/master 314afd0e2 - a05835b89 [SPARK-6542][SQL] add CreateStruct Similar to `CreateArray`, we can add `CreateStruct` to create nested columns. marmbrus Author: Xiangrui Meng m...@databricks.com Closes #5195 from mengxr/SPARK-6542 and squashes

spark git commit: [Doc] Improve Python DataFrame documentation

2015-03-31 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.3 c4c982a65 - e527b3590 [Doc] Improve Python DataFrame documentation Author: Reynold Xin r...@databricks.com Closes #5287 from rxin/pyspark-df-doc-cleanup-context and squashes the following commits: 1841b60 [Reynold Xin] Lint. f2007f1

spark git commit: [Doc] Improve Python DataFrame documentation

2015-03-31 Thread rxin
Repository: spark Updated Branches: refs/heads/master 37326079d - 305abe1e5 [Doc] Improve Python DataFrame documentation Author: Reynold Xin r...@databricks.com Closes #5287 from rxin/pyspark-df-doc-cleanup-context and squashes the following commits: 1841b60 [Reynold Xin] Lint. f2007f1

spark git commit: [SPARK-6145][SQL] fix ORDER BY on nested fields

2015-03-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 778c87686 - 045228f38 [SPARK-6145][SQL] fix ORDER BY on nested fields This PR is based on work by cloud-fan in #4904, but with two differences: - We isolate the logic for Sort's special handling into `ResolveSortReferences` - We

spark git commit: [SPARK-6255] [MLLIB] Support multiclass classification in Python API

2015-03-31 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 46de6c05e - b5bd75d90 [SPARK-6255] [MLLIB] Support multiclass classification in Python API Python API parity check for classification and multiclass classification support, major disparities need to be added for Python:

spark git commit: [SPARK-5371][SQL] Propagate types after function conversion, before futher resolution

2015-03-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 045228f38 - 5a957fe0d [SPARK-5371][SQL] Propagate types after function conversion, before futher resolution Before it was possible for a query to flip back and forth from a resolved state, allowing resolution to propagate up before

spark git commit: [SPARK-6555] [SQL] Overrides equals() and hashCode() for MetastoreRelation

2015-03-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 fd600cec0 - 9ebefb1f1 [SPARK-6555] [SQL] Overrides equals() and hashCode() for MetastoreRelation Also removes temporary workarounds made in #5183 and #5251.

spark git commit: [SPARK-4894][mllib] Added Bernoulli option to NaiveBayes model in mllib

2015-03-31 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master a05835b89 - d01a6d8c3 [SPARK-4894][mllib] Added Bernoulli option to NaiveBayes model in mllib Added optional model type parameter for NaiveBayes training. Can be either Multinomial or Bernoulli. When Bernoulli is given the Bernoulli

spark git commit: [SPARK-6555] [SQL] Overrides equals() and hashCode() for MetastoreRelation

2015-03-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d01a6d8c3 - a7992ffaf [SPARK-6555] [SQL] Overrides equals() and hashCode() for MetastoreRelation Also removes temporary workarounds made in #5183 and #5251.

spark git commit: [SPARK-6633][SQL] Should be Contains instead of EndsWith when constructing sources.StringContains

2015-03-31 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.3 5a957fe0d - d85164637 [SPARK-6633][SQL] Should be Contains instead of EndsWith when constructing sources.StringContains Author: Liang-Chi Hsieh vii...@gmail.com Closes #5299 from viirya/stringcontains and squashes the following

spark git commit: [SPARK-6614] OutputCommitCoordinator should clear authorized committer only after authorized committer fails, not after any failure

2015-03-31 Thread joshrosen
Repository: spark Updated Branches: refs/heads/branch-1.3 d85164637 - c4c982a65 [SPARK-6614] OutputCommitCoordinator should clear authorized committer only after authorized committer fails, not after any failure In OutputCommitCoordinator, there is some logic to clear the authorized

spark git commit: [SPARK-6633][SQL] Should be Contains instead of EndsWith when constructing sources.StringContains

2015-03-31 Thread rxin
Repository: spark Updated Branches: refs/heads/master beebb7ffc - 2036bc599 [SPARK-6633][SQL] Should be Contains instead of EndsWith when constructing sources.StringContains Author: Liang-Chi Hsieh vii...@gmail.com Closes #5299 from viirya/stringcontains and squashes the following commits:

spark git commit: [SPARK-6614] OutputCommitCoordinator should clear authorized committer only after authorized committer fails, not after any failure

2015-03-31 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master 0e00f12d3 - 37326079d [SPARK-6614] OutputCommitCoordinator should clear authorized committer only after authorized committer fails, not after any failure In OutputCommitCoordinator, there is some logic to clear the authorized committer's
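
The rule in the commit title can be sketched in a few lines of plain Python. This is an illustrative model, not Spark's implementation: the class `CommitCoordinatorSketch`, its method names, and the "first asker wins" policy are assumptions made for the example.

```python
class CommitCoordinatorSketch:
    """Toy model of the rule: clear the authorized committer only when
    the authorized attempt itself fails. All names are hypothetical."""

    def __init__(self):
        # Maps partition id -> attempt id currently authorized to commit.
        self._authorized = {}

    def can_commit(self, partition, attempt):
        # Assumption: the first attempt to ask is authorized, and every
        # later request is denied while that authorization is held.
        if partition not in self._authorized:
            self._authorized[partition] = attempt
            return True
        return False

    def task_failed(self, partition, attempt):
        # Release the authorization only if the failed attempt holds it.
        # Clearing on any failure would let a second attempt commit while
        # the authorized one is still running.
        if self._authorized.get(partition) == attempt:
            del self._authorized[partition]


coordinator = CommitCoordinatorSketch()
first = coordinator.can_commit(0, attempt=1)    # attempt 1 is authorized
second = coordinator.can_commit(0, attempt=2)   # denied, attempt 1 holds it
coordinator.task_failed(0, attempt=2)           # unrelated failure, no change
third = coordinator.can_commit(0, attempt=3)    # still denied
coordinator.task_failed(0, attempt=1)           # authorized attempt fails
fourth = coordinator.can_commit(0, attempt=3)   # authorization is free again
```

The guard in `task_failed` is the point of the example: without the equality check, the failure of attempt 2 would have released the authorization and allowed attempt 3 to commit alongside attempt 1.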

spark git commit: [SPARK-5692] [MLlib] Word2Vec save/load

2015-03-31 Thread meng
Repository: spark Updated Branches: refs/heads/master 2036bc599 - 0e00f12d3 [SPARK-5692] [MLlib] Word2Vec save/load Word2Vec model now supports saving and loading. a] The Metadata stored in JSON format consists of version, classname, vectorSize and numWords b] The data stored in Parquet