You can buy MUC 6 and 7 data from LDC. They
cost a few hundred dollars.
There is parsing support for them built into OpenNLP.
We cannot share the data here because that of course would
violate the copyright.
Anyway OntoNotes might be better suited for your needs and
only costs 30 or 50 USD.
Jörn
On 05/22/2012 09:37 PM, Lance Norskog wrote:
Where are the source files?
On Tue, May 22, 2012 at 12:17 AM, Jörn Kottmann<[email protected]> wrote:
On 05/22/2012 07:50 AM, Lance Norskog wrote:
What are the sources of the training data for the models on sourceforge?
In particular, for the English language NER models?
That is trained on hand corrected and extended MUC 6/7 training data.
Jörn