[
https://issues.apache.org/jira/browse/TIKA-1787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann resolved TIKA-1787.
-------------------------------------
Resolution: Fixed
Great work [~thammegowda] and [~Yueheng]!
Thamme - please take your docs below and add the to the wiki page. Thanks!
{noformat}
[mattmann-0420740:~/tmp/tika1.12] mattmann% svn commit -m "Fix for TIKA-1787:
Include Stanford Name Entity Recognition in Tika contributed by Thamme Gowda N
and Yueheng He this closes #61 this closes #62"
Sending .gitignore
Sending CHANGES.txt
Sending tika-parsers/pom.xml
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner
Adding
tika-parsers/src/main/java/org/apache/tika/parser/ner/NERecogniser.java
Adding
tika-parsers/src/main/java/org/apache/tika/parser/ner/NamedEntityParser.java
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp
Adding
tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp/CoreNLPNERecogniser.java
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp
Adding
tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNLPNERecogniser.java
Adding
tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNLPNameFinder.java
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner/regex
Adding
tika-parsers/src/main/java/org/apache/tika/parser/ner/regex/RegexNERecogniser.java
Adding tika-parsers/src/main/resources/org/apache/tika/parser/ner
Adding tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex
Adding
tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex/ner-regex.txt
Adding tika-parsers/src/test/java/org/apache/tika/parser/ner
Adding
tika-parsers/src/test/java/org/apache/tika/parser/ner/NamedEntityParserTest.java
Adding tika-parsers/src/test/java/org/apache/tika/parser/ner/regex
Adding
tika-parsers/src/test/java/org/apache/tika/parser/ner/regex/RegexNERecogniserTest.java
Adding tika-parsers/src/test/resources/org/apache/tika/parser
Adding tika-parsers/src/test/resources/org/apache/tika/parser/ner
Adding
tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp
Adding
tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/ModelGetter.groovy
Adding
tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/get-models.sh
Adding tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex
Adding
tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex/ner-regex.txt
Adding
tika-parsers/src/test/resources/org/apache/tika/parser/ner/tika-config.xml
Transmitting file data ................
Committed revision 1714835.
[mattmann-0420740:~/tmp/tika1.12] mattmann%
{noformat}
> Include Stanford Name Entity Recognition in Tika
> ------------------------------------------------
>
> Key: TIKA-1787
> URL: https://issues.apache.org/jira/browse/TIKA-1787
> Project: Tika
> Issue Type: Improvement
> Components: mime, parser
> Affects Versions: 1.12
> Environment: Java 1.8, Mac OSX 10.11
> Reporter: Yueheng He
> Assignee: Chris A. Mattmann
> Labels: features, newbie, test
> Fix For: 1.12
>
> Original Estimate: 168h
> Remaining Estimate: 168h
>
> Using the Stanford Name Entity Recognition, Tika will be able to extract name
> entities like PERSON, ORGANIZATION, LOCATION, etc from the given text. The
> extracted name entities will be added to the metadata
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)