[
https://issues.apache.org/jira/browse/HIVEMALL-22?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15724920#comment-15724920
]
ASF GitHub Bot commented on HIVEMALL-22:
----------------------------------------
GitHub user myui opened a pull request:
https://github.com/apache/incubator-hivemall/pull/15
[WIP] Support Feature Selection UDFs
This PR introduces two feature selection schemes: `Chi-Square test` and
`Signal Noise Ratio`.
This PR is based on [a pending
PR](https://github.com/myui/hivemall/pull/385) by @amaya382 that is sent before
Hivemall entered Apache Incubator.
See [JIRA](https://issues.apache.org/jira/browse/HIVEMALL-22
) for tracking the status of this issue.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/myui/incubator-hivemall JIRA-22/pr-385
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-hivemall/pull/15.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #15
----
commit 2dc176a760b553214624e98f885a719ee196cc4e
Author: amaya <[email protected]>
Date: 2016-09-16T06:46:44Z
add HiveUtils.isNumberListOI
commit 56adf2d4e8b2591c31b846b8980016d3dafdbacc
Author: amaya <[email protected]>
Date: 2016-09-16T06:48:33Z
add HiveUtils.asDoubleOI
commit 6f9b4fa0acebf604882240ccd5507d9df45bab2d
Author: amaya <[email protected]>
Date: 2016-09-16T06:52:54Z
add transpose_and_dot
commit d3009be59bcf314b373038e3db8903a041396931
Author: amaya <[email protected]>
Date: 2016-09-16T07:00:58Z
add chi2 and chi2_test
commit d8f1005bb9fbf769b117290582bed18d7607a94a
Author: amaya <[email protected]>
Date: 2016-09-20T03:01:46Z
mod number format
commit d0e97e6ff71b2072ec5235cc3ac169162d59da59
Author: amaya <[email protected]>
Date: 2016-09-20T03:02:28Z
add HiveUtils.isNumberListListOI
commit 7b07e4a6e1f700ba0a6e5b68659a040a3d89aa2f
Author: amaya <[email protected]>
Date: 2016-09-20T03:03:44Z
change interface of chi2
commit e9d1a94f29f31e2910a54add7c2625825d715318
Author: amaya <[email protected]>
Date: 2016-09-20T07:55:57Z
add array_top_k_indices
commit 1ab9b0974ca4203c00175469b7b75d5b65209547
Author: amaya <[email protected]>
Date: 2016-09-20T07:56:15Z
add subarray_by_indices
commit ad81b3aa5a0bbb7c248d127ba44608578c01ae00
Author: amaya <[email protected]>
Date: 2016-09-20T08:05:55Z
add license and format
commit be1ea37a0f5048cde4284107c04e109f0f526b42
Author: amaya <[email protected]>
Date: 2016-09-20T09:00:49Z
add ddl definitions
commit 89c81aacf5b13f6e125723cb5c703333574c10ae
Author: amaya <[email protected]>
Date: 2016-09-21T01:56:59Z
change to select_k_best
commit 6dc234490dc25f563b22e5659c378e6ebcf8dcdb
Author: amaya <[email protected]>
Date: 2016-09-21T02:41:59Z
standardize to chi2
commit a16a3fde844ba381dee7eb1e9608ddc2dcfb96fc
Author: amaya <[email protected]>
Date: 2016-09-21T04:10:18Z
refine chi2
commit abbf5492b95dd69e347580c59ac044a78627c547
Author: amaya <[email protected]>
Date: 2016-09-21T04:11:00Z
refine transpose_and_dot
commit b8cf39684496f2511e59294041d443b9438394a9
Author: amaya <[email protected]>
Date: 2016-09-21T06:02:12Z
fix chi2
commit a882c5f9f8067b911254dfc43d268de06a5490f9
Author: amaya <[email protected]>
Date: 2016-09-21T07:00:36Z
mod chi2 function name
commit 5088ef36367df1cd51ae62f1c044933676975e2e
Author: amaya <[email protected]>
Date: 2016-09-21T07:22:09Z
add tests
commit 22a608ee1c7239b2953183b5341f80c58b1e7045
Author: amaya <[email protected]>
Date: 2016-09-26T08:07:55Z
add snr
commit a1f8f958c99f3cde9e48b6d80d364004f6d98cc2
Author: amaya <[email protected]>
Date: 2016-09-27T06:58:33Z
integrate chi2 and SNR into hivemall.spark
commit aa7d5299739349b49ef4f50cc2c1969f5cb8a78f
Author: amaya <[email protected]>
Date: 2016-09-27T07:02:02Z
Merge 'master' into 'feature/feature_selection'
commit 1347de985ea6f8028c9d381f8827882ad39ad3a7
Author: amaya <[email protected]>
Date: 2016-09-28T05:22:05Z
refine feature selection in spark integration
commit 8e2842cf8c272642feaa76bf95e8fa463b0322dc
Author: amaya <[email protected]>
Date: 2016-09-28T05:24:19Z
refine tests
commit 4cfa4e5ac15a6535b187c23616c205696a1cd13b
Author: amaya <[email protected]>
Date: 2016-09-28T09:26:01Z
mod SNR for corner cases
commit 80be81ecf92cd4675dcdfaa5f456d84d484d6c44
Author: amaya <[email protected]>
Date: 2016-09-28T11:01:08Z
minor fix
commit 8d9f0d4c00758324029d342eb4b892e046ca4a49
Author: amaya <[email protected]>
Date: 2016-09-29T02:02:14Z
minor fix
commit ce4a48980e33b9f16c74a62fcea6878f28b9c08b
Author: amaya <[email protected]>
Date: 2016-09-30T08:05:20Z
change method of testing for spark
commit 67ba9631af3c231b7abd145134d17237b6aca0a5
Author: myui <[email protected]>
Date: 2016-11-21T09:19:45Z
Merge branch 'feature/feature_selection' of
https://github.com/amaya382/hivemall into feature_selection
# Conflicts:
# core/src/main/java/hivemall/utils/hadoop/HiveUtils.java
# core/src/main/java/hivemall/utils/math/StatsUtils.java
#
spark/spark-1.6/src/main/scala/org/apache/spark/sql/hive/GroupedDataEx.scala
#
spark/spark-1.6/src/test/scala/org/apache/spark/sql/hive/HivemallOpsSuite.scala
#
spark/spark-2.0/src/main/scala/org/apache/spark/sql/hive/HivemallGroupedDataset.scala
#
spark/spark-2.0/src/test/scala/org/apache/spark/sql/hive/HivemallOpsSuite.scala
commit e44a413e5fd4270af53895fceec27ccff3d63a73
Author: myui <[email protected]>
Date: 2016-11-21T10:02:27Z
Updated license headers
commit 6549ef5104883a9529dfd9fc52b2b24843076fbb
Author: amaya <[email protected]>
Date: 2016-11-23T12:16:10Z
Add feature selection gitbook (#386)
----
> Review and merge pending Pull Requests before entering Incubator
> ----------------------------------------------------------------
>
> Key: HIVEMALL-22
> URL: https://issues.apache.org/jira/browse/HIVEMALL-22
> Project: Hivemall
> Issue Type: New Feature
> Reporter: Makoto Yui
> Assignee: Makoto Yui
>
> Need to review and merge pending Pull Requests in
> https://github.com/myui/hivemall/pulls
> * Feature Selection
> https://github.com/myui/hivemall/pull/385
> * SST change point detection
> https://github.com/myui/hivemall/pull/356
> * Checkstyle
> https://github.com/myui/hivemall/pull/343
> * System Test
> https://github.com/myui/hivemall/pull/336
> * Kernelized Passive Aggressive
> https://github.com/myui/hivemall/pull/304
> * Separate Optimizer
> https://github.com/myui/hivemall/pull/285
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)