Hey Folks, With the commit of TIKA-1787/GH-61 in trunk we now have full integration of Named Entity Recognition with Stanford NER/NLP and Apache OpenNLP. Will also look to see if we can integrate NLTK too. This is a *big deal* since NER is something we’ve always wanted to pull into Tika.
Woot! Cheers, Chris ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: [email protected] WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
