GitHub user jaxony opened a pull request:
https://github.com/apache/incubator-hivemall/pull/163
[HIVEMALL-196] Support BM25 scoring
## What changes were proposed in this pull request?
Adding scoring function Okapi BM25 as a UDF
## What type of PR is it?
Feature
## What is the Jira issue?
https://issues.apache.org/jira/projects/HIVEMALL/issues/HIVEMALL-196
## How was this patch tested?
TODO
## How to use this feature?
TODO
## Checklist
(Please remove this section if not needed; check `x` for YES, blank for NO)
- [x] Did you apply source code formatter, i.e., `./bin/format_code.sh`,
for your commit?
- [ ] Did you run system tests on Hive (or Spark)?
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/jaxony/incubator-hivemall feature/bm25
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-hivemall/pull/163.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #163
----
commit 86dc72c2035c4d322aff03bb34d86006cf2f73a2
Author: Jackson Huang <huang.j@...>
Date: 2018-09-06T07:12:58Z
Implemented getOptions()
commit 075a204315d26180ce2f7ec3bef04eb6c8f635fe
Author: Jackson Huang <huang.j@...>
Date: 2018-09-06T08:21:13Z
Okapi BM25: Implemented initialize and evaluate methods, reformatted
descriptions
commit 2e0b271fb6faf7e6700d7f57bd649e19ed98bab1
Author: Jackson Huang <huang.j@...>
Date: 2018-09-06T08:22:56Z
Remove unused imports
commit 372b1aebf022fee2bfab361ecf01d31fd8472438
Author: Jackson Huang <huang.j@...>
Date: 2018-09-06T08:29:21Z
Remove unnecessary import and deprecation warning
commit 0cd1888cacaeb77a88946e94f0ffbf61ac9108a0
Author: Jackson Huang <huang.j@...>
Date: 2018-09-06T08:44:38Z
Run update_ddls.sh, update_func_md.sh and formate_code.sh
----
---