OpenNLP is a great Java package with working named entity recognition. I've
had a lot of success with it. That said, the Stanford NLP tools are always
highly acclaimed, though I had trouble figuring them out the last time I
tried them.


On Jul 2, 2010, at 11:12 AM, Claudio Martella <[email protected]> 
wrote:

> I can recommend GATE and ANNIE. GATE is a framework for text mining.
> ANNIE is a pipeline of GATE components for extracting named
> entities such as names of people, locations, companies, etc. You can use
> GATE/ANNIE programmatically through their Java API.
> 
> http://gate.ac.uk/
> 
> 
> Alex McLintock wrote:
>> I'm quite interested in OpenCalais - a Thomson Reuters initiative. It
>> is a web service to take your free text and identify important terms
>> in it like people, businesses, places, and so on. If you are the
>> document owner you can submit your document to their web site and get
>> back important tags saying what this document is about. I'd like to
>> tag this sort of data and feed it into a Lucene style index so that it
>> can be used in searches AND in focussed/topical crawls.
>> 
>> Now, here comes the problem. When we crawl the web we don't own the
>> documents we are crawling so we don't really have permission to use
>> Reuters' servers to do this analysis. (Maybe we could cut a deal
>> though if we were a big enough company).
>> 
>> So has anyone else looked at alternatives to OpenCalais that take
>> free text and try to understand what it is about? I've been looking
>> for software to do this but nothing seems suitable.
>> 
>> Alex
>> 
>> 
> 
> 
> -- 
> Claudio Martella
> Digital Technologies
> Unit Research & Development - Analyst
> 
> TIS innovation park
> Via Siemens 19 | Siemensstr. 19
> 39100 Bolzano | 39100 Bozen
> Tel. +39 0471 068 123
> Fax  +39 0471 068 129
> [email protected] http://www.tis.bz.it
