GitHub user yupbank opened a pull request:

    https://github.com/apache/spark/pull/17365

    [SPARK-19962] [MLlib] [WIP] add DictVectorizer to ml.feature

    ## What changes were proposed in this pull request?
    add a new estimator`DictVectorizer` and transformer `DictVectorizerModel` 
for dataframe.
    (Please fill in changes proposed in this fix)
    
    ## How was this patch tested?
    
    (Please explain how this patch was tested. E.g. unit tests, integration 
tests, manual tests)
    (If this patch involves UI changes, please attach a screenshot; otherwise, 
remove this)
    
    Please review http://spark.apache.org/contributing.html before opening a 
pull request.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/yupbank/spark 
dict-vectorizer-for-df-SPARK-19962

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/17365.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #17365
    
----
commit acc41b06cc6534b196cf0a32c8d6c3949f1aad24
Author: Peng Yu <[email protected]>
Date:   2017-03-17T02:16:36Z

    add template

commit ae05f132f3c9b4011c38a24f1d1593bd51d49327
Author: Peng Yu <[email protected]>
Date:   2017-03-17T20:49:26Z

    improve abit

commit b670331451e6405ae3ad3413db9360be5b7db7d1
Author: Peng Yu <[email protected]>
Date:   2017-03-17T21:18:09Z

    xxxx

commit dfd4eb743e39964a4f489ec37686af2457322c51
Author: Peng Yu <[email protected]>
Date:   2017-03-18T00:12:40Z

    finish fit

commit 8098ea745a4ea5e5944fd171d4ed02051b050b57
Author: Peng Yu <[email protected]>
Date:   2017-03-18T04:02:39Z

    add transform

commit e5b58a2cae49ae3130568fe21118d4b197645707
Author: Peng Yu <[email protected]>
Date:   2017-03-19T02:50:57Z

    update stuff

commit b0cc7caa7b4f781b09ce93a790fa2c2ee6f62f2c
Author: Peng Yu <[email protected]>
Date:   2017-03-20T20:53:25Z

    add more test case

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to