Re: OpenNLP Annotations Proposal

Jörn Kottmann Wed, 22 Jun 2011 04:14:30 -0700

On 6/22/11 10:45 AM, Olivier Grisel wrote:

I will (soon?) include a couple of new scripts in pignlproc to extract
occurrence contexts of any kind of entities occurring as wikilinks in
Wikipedia dumps to load those in a Solr index. I will let you know
when that happens.


We definitely need some code to parse the wikipedia articles.
How do you transform the wiki text to plain text in pignlproc?

Could we take a similar approach for the annotation project, or maybe
even share the code which does it?

Jörn

Re: OpenNLP Annotations Proposal

Reply via email to