GitHub user manalishah opened a pull request:
https://github.com/apache/tika/pull/108
Tika 1913 - MIT Information Extraction integrated with Tika
This pull request adds another NamedEntityRecognizer, one that uses the
open-source trained models and functions of MIT-nlp to perform information
extraction. I have provided it as a runtime binding to MITIE.
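To illustrate what a runtime binding usually means in Java, here is a minimal sketch. It is not the code in this pull request. The class `RuntimeBindingSketch`, the method `tryLoad`, and the class name `com.example.OptionalRecognizer` are all hypothetical names chosen for illustration. The idea is that the optional library is looked up reflectively by name, so the calling code compiles and runs even when that library is not on the classpath.

```java
import java.util.Optional;

public class RuntimeBindingSketch {

    /**
     * Tries to instantiate a class by name through its public no-arg constructor.
     * Returns an empty Optional when the class is not on the classpath, cannot be
     * linked (for example, a missing native library), or cannot be constructed.
     */
    public static Optional<Object> tryLoad(String className) {
        try {
            Class<?> clazz = Class.forName(className);
            return Optional.of(clazz.getDeclaredConstructor().newInstance());
        } catch (ReflectiveOperationException | LinkageError e) {
            // The optional dependency is unavailable; the caller decides how to degrade.
            return Optional.empty();
        }
    }

    public static void main(String[] args) {
        // Hypothetical class name; pass a real one as the first argument to try it.
        String name = args.length > 0 ? args[0] : "com.example.OptionalRecognizer";
        Optional<Object> impl = tryLoad(name);
        if (impl.isPresent()) {
            System.out.println("Loaded " + impl.get().getClass().getName());
        } else {
            System.out.println(name + " is not available; the recognizer would stay disabled");
        }
    }
}
```

With this pattern, a missing dependency becomes a condition the caller can check instead of a compile-time or startup failure.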
To test it, you can download mitie-resources
(https://github.com/manalishah/mitie-resources) and follow the steps there,
or build MITIE from source from mit-nlp/MITIE
(https://github.com/mit-nlp/MITIE).
@chrismattmann
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/manalishah/tika TIKA-1913
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/tika/pull/108.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #108
----
commit 5f859fbb7684a907db14959a4e49dfa644ba93d6
Author: manali <[email protected]>
Date: 2016-04-12T05:05:30Z
mitie ner parser added
commit 2210c81ac8111f45f054bee5552a3f4f020eb000
Author: manali <[email protected]>
Date: 2016-04-12T06:10:46Z
runtime binding to mitie
commit b5e246f9a71a3c1ffcdf4a82ad50fc5b03cf83ae
Author: manali <[email protected]>
Date: 2016-04-13T00:33:28Z
code cleanup
commit f39c087ea36c074acd3cca8de702b2860bee7096
Author: manali <[email protected]>
Date: 2016-04-21T19:14:05Z
removed logs
commit 80b27e6d843d221de87591d34632e546c0d23b1a
Author: manali <[email protected]>
Date: 2016-04-21T19:15:26Z
Merge remote-tracking branch 'upstream/master' into TIKA-1913
----