[ 
https://issues.apache.org/jira/browse/TIKA-1787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris A. Mattmann resolved TIKA-1787.
-------------------------------------
    Resolution: Fixed

Great work [~thammegowda] and [~Yueheng]!

Thamme - please take your docs below and add the to the wiki page. Thanks!

{noformat}
[mattmann-0420740:~/tmp/tika1.12] mattmann% svn commit -m "Fix for TIKA-1787: 
Include Stanford Name Entity Recognition in Tika contributed by Thamme Gowda N 
and Yueheng He this closes #61 this closes #62"
Sending        .gitignore
Sending        CHANGES.txt
Sending        tika-parsers/pom.xml
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner
Adding         
tika-parsers/src/main/java/org/apache/tika/parser/ner/NERecogniser.java
Adding         
tika-parsers/src/main/java/org/apache/tika/parser/ner/NamedEntityParser.java
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp
Adding         
tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp/CoreNLPNERecogniser.java
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp
Adding         
tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNLPNERecogniser.java
Adding         
tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNLPNameFinder.java
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner/regex
Adding         
tika-parsers/src/main/java/org/apache/tika/parser/ner/regex/RegexNERecogniser.java
Adding         tika-parsers/src/main/resources/org/apache/tika/parser/ner
Adding         tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex
Adding         
tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex/ner-regex.txt
Adding         tika-parsers/src/test/java/org/apache/tika/parser/ner
Adding         
tika-parsers/src/test/java/org/apache/tika/parser/ner/NamedEntityParserTest.java
Adding         tika-parsers/src/test/java/org/apache/tika/parser/ner/regex
Adding         
tika-parsers/src/test/java/org/apache/tika/parser/ner/regex/RegexNERecogniserTest.java
Adding         tika-parsers/src/test/resources/org/apache/tika/parser
Adding         tika-parsers/src/test/resources/org/apache/tika/parser/ner
Adding         
tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp
Adding         
tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/ModelGetter.groovy
Adding         
tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/get-models.sh
Adding         tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex
Adding         
tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex/ner-regex.txt
Adding         
tika-parsers/src/test/resources/org/apache/tika/parser/ner/tika-config.xml
Transmitting file data ................
Committed revision 1714835.
[mattmann-0420740:~/tmp/tika1.12] mattmann% 
{noformat}

> Include Stanford Name Entity Recognition in Tika
> ------------------------------------------------
>
>                 Key: TIKA-1787
>                 URL: https://issues.apache.org/jira/browse/TIKA-1787
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime, parser
>    Affects Versions: 1.12
>         Environment: Java 1.8, Mac OSX 10.11
>            Reporter: Yueheng He
>            Assignee: Chris A. Mattmann
>              Labels: features, newbie, test
>             Fix For: 1.12
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> Using the Stanford Name Entity Recognition, Tika will be able to extract name 
> entities like PERSON, ORGANIZATION, LOCATION, etc from the given text. The 
> extracted name entities will be added to the metadata



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to