[
https://issues.apache.org/jira/browse/OPENNLP-88?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13987615#comment-13987615
]
Joern Kottmann commented on OPENNLP-88:
---------------------------------------
We recently received a language model contribution which is placed inside the
OpenNLP sandbox. It would be nice to have a bloom filter implementation too. I
suggest you come over to the dev list and start a discussion about it. We can
then figure out with everybody involved what could/should be done.
The language model code itself will probably be moved over to opennlp-tools
after the 1.6.0 release.
> use bloom filters where appropriate (e.g. language models)
> ----------------------------------------------------------
>
> Key: OPENNLP-88
> URL: https://issues.apache.org/jira/browse/OPENNLP-88
> Project: OpenNLP
> Issue Type: Improvement
> Reporter: Jason Baldridge
> Priority: Minor
>
> Bloom filters can be used to drastically reduce the memory requirements for
> certain kinds of models. For example, see the following papers on using bloom
> filters to create highly compact language models:
> http://homepages.inf.ed.ac.uk/miles/papers/emnlp07.pdf
> http://homepages.inf.ed.ac.uk/miles/papers/acl07.pdf
> It would be good to find/create an implementation of bloom filters for such
> uses in OpenNLP.
--
This message was sent by Atlassian JIRA
(v6.2#6252)