[ 
https://issues.apache.org/jira/browse/HIVEMALL-22?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15724920#comment-15724920
 ] 

ASF GitHub Bot commented on HIVEMALL-22:
----------------------------------------

GitHub user myui opened a pull request:

    https://github.com/apache/incubator-hivemall/pull/15

    [WIP] Support Feature Selection UDFs

    This PR introduces two feature selection schemes: `Chi-Square test` and 
`Signal Noise Ratio`.
    
    This PR is based on [a pending 
PR](https://github.com/myui/hivemall/pull/385) by @amaya382 that is sent before 
Hivemall entered Apache Incubator.
    
    See [JIRA](https://issues.apache.org/jira/browse/HIVEMALL-22
    ) for tracking the status of this issue.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/myui/incubator-hivemall JIRA-22/pr-385

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-hivemall/pull/15.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #15
    
----
commit 2dc176a760b553214624e98f885a719ee196cc4e
Author: amaya <[email protected]>
Date:   2016-09-16T06:46:44Z

    add HiveUtils.isNumberListOI

commit 56adf2d4e8b2591c31b846b8980016d3dafdbacc
Author: amaya <[email protected]>
Date:   2016-09-16T06:48:33Z

    add HiveUtils.asDoubleOI

commit 6f9b4fa0acebf604882240ccd5507d9df45bab2d
Author: amaya <[email protected]>
Date:   2016-09-16T06:52:54Z

    add transpose_and_dot

commit d3009be59bcf314b373038e3db8903a041396931
Author: amaya <[email protected]>
Date:   2016-09-16T07:00:58Z

    add chi2 and chi2_test

commit d8f1005bb9fbf769b117290582bed18d7607a94a
Author: amaya <[email protected]>
Date:   2016-09-20T03:01:46Z

    mod number format

commit d0e97e6ff71b2072ec5235cc3ac169162d59da59
Author: amaya <[email protected]>
Date:   2016-09-20T03:02:28Z

    add HiveUtils.isNumberListListOI

commit 7b07e4a6e1f700ba0a6e5b68659a040a3d89aa2f
Author: amaya <[email protected]>
Date:   2016-09-20T03:03:44Z

    change interface of chi2

commit e9d1a94f29f31e2910a54add7c2625825d715318
Author: amaya <[email protected]>
Date:   2016-09-20T07:55:57Z

    add array_top_k_indices

commit 1ab9b0974ca4203c00175469b7b75d5b65209547
Author: amaya <[email protected]>
Date:   2016-09-20T07:56:15Z

    add subarray_by_indices

commit ad81b3aa5a0bbb7c248d127ba44608578c01ae00
Author: amaya <[email protected]>
Date:   2016-09-20T08:05:55Z

    add license and format

commit be1ea37a0f5048cde4284107c04e109f0f526b42
Author: amaya <[email protected]>
Date:   2016-09-20T09:00:49Z

    add ddl definitions

commit 89c81aacf5b13f6e125723cb5c703333574c10ae
Author: amaya <[email protected]>
Date:   2016-09-21T01:56:59Z

    change to select_k_best

commit 6dc234490dc25f563b22e5659c378e6ebcf8dcdb
Author: amaya <[email protected]>
Date:   2016-09-21T02:41:59Z

    standardize to chi2

commit a16a3fde844ba381dee7eb1e9608ddc2dcfb96fc
Author: amaya <[email protected]>
Date:   2016-09-21T04:10:18Z

    refine chi2

commit abbf5492b95dd69e347580c59ac044a78627c547
Author: amaya <[email protected]>
Date:   2016-09-21T04:11:00Z

    refine transpose_and_dot

commit b8cf39684496f2511e59294041d443b9438394a9
Author: amaya <[email protected]>
Date:   2016-09-21T06:02:12Z

    fix chi2

commit a882c5f9f8067b911254dfc43d268de06a5490f9
Author: amaya <[email protected]>
Date:   2016-09-21T07:00:36Z

    mod chi2 function name

commit 5088ef36367df1cd51ae62f1c044933676975e2e
Author: amaya <[email protected]>
Date:   2016-09-21T07:22:09Z

    add tests

commit 22a608ee1c7239b2953183b5341f80c58b1e7045
Author: amaya <[email protected]>
Date:   2016-09-26T08:07:55Z

    add snr

commit a1f8f958c99f3cde9e48b6d80d364004f6d98cc2
Author: amaya <[email protected]>
Date:   2016-09-27T06:58:33Z

    integrate chi2 and SNR into hivemall.spark

commit aa7d5299739349b49ef4f50cc2c1969f5cb8a78f
Author: amaya <[email protected]>
Date:   2016-09-27T07:02:02Z

    Merge 'master' into 'feature/feature_selection'

commit 1347de985ea6f8028c9d381f8827882ad39ad3a7
Author: amaya <[email protected]>
Date:   2016-09-28T05:22:05Z

    refine feature selection in spark integration

commit 8e2842cf8c272642feaa76bf95e8fa463b0322dc
Author: amaya <[email protected]>
Date:   2016-09-28T05:24:19Z

    refine tests

commit 4cfa4e5ac15a6535b187c23616c205696a1cd13b
Author: amaya <[email protected]>
Date:   2016-09-28T09:26:01Z

    mod SNR for corner cases

commit 80be81ecf92cd4675dcdfaa5f456d84d484d6c44
Author: amaya <[email protected]>
Date:   2016-09-28T11:01:08Z

    minor fix

commit 8d9f0d4c00758324029d342eb4b892e046ca4a49
Author: amaya <[email protected]>
Date:   2016-09-29T02:02:14Z

    minor fix

commit ce4a48980e33b9f16c74a62fcea6878f28b9c08b
Author: amaya <[email protected]>
Date:   2016-09-30T08:05:20Z

    change method of testing for spark

commit 67ba9631af3c231b7abd145134d17237b6aca0a5
Author: myui <[email protected]>
Date:   2016-11-21T09:19:45Z

    Merge branch 'feature/feature_selection' of
    https://github.com/amaya382/hivemall into feature_selection
    
    # Conflicts:
    #   core/src/main/java/hivemall/utils/hadoop/HiveUtils.java
    #   core/src/main/java/hivemall/utils/math/StatsUtils.java
    #   
spark/spark-1.6/src/main/scala/org/apache/spark/sql/hive/GroupedDataEx.scala
    #   
spark/spark-1.6/src/test/scala/org/apache/spark/sql/hive/HivemallOpsSuite.scala
    #   
spark/spark-2.0/src/main/scala/org/apache/spark/sql/hive/HivemallGroupedDataset.scala
    #   
spark/spark-2.0/src/test/scala/org/apache/spark/sql/hive/HivemallOpsSuite.scala

commit e44a413e5fd4270af53895fceec27ccff3d63a73
Author: myui <[email protected]>
Date:   2016-11-21T10:02:27Z

    Updated license headers

commit 6549ef5104883a9529dfd9fc52b2b24843076fbb
Author: amaya <[email protected]>
Date:   2016-11-23T12:16:10Z

    Add feature selection gitbook (#386)

----


> Review and merge pending Pull Requests before entering Incubator
> ----------------------------------------------------------------
>
>                 Key: HIVEMALL-22
>                 URL: https://issues.apache.org/jira/browse/HIVEMALL-22
>             Project: Hivemall
>          Issue Type: New Feature
>            Reporter: Makoto Yui
>            Assignee: Makoto Yui
>
> Need to review and merge pending Pull Requests in
> https://github.com/myui/hivemall/pulls
> * Feature Selection
> https://github.com/myui/hivemall/pull/385
> * SST change point detection
> https://github.com/myui/hivemall/pull/356
> * Checkstyle
> https://github.com/myui/hivemall/pull/343
> * System Test
> https://github.com/myui/hivemall/pull/336
> * Kernelized Passive Aggressive
> https://github.com/myui/hivemall/pull/304
> * Separate Optimizer
> https://github.com/myui/hivemall/pull/285



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to