Hi, Yngve,

The best way to create a POSDictionary is using the API. You should create
a subclass of POSDictionary and use the method addTags(String word,
String... tags) to populate it.
Your class should be in the package opennlp.tools.postag, because the
addTags method is package-private. Use the serialize method to save it to a
file.

Java Doc:
http://opennlp.apache.org/documentation/1.5.2-incubating/apidocs/opennlp-tools/index.html?opennlp/tools/postag/POSDictionary.html
Source Code:
http://svn.apache.org/viewvc/opennlp/trunk/opennlp-tools/src/main/java/opennlp/tools/postag/POSDictionary.java?view=co

Regards,
William


On Tue, May 15, 2012 at 7:21 AM, Yngve Ødegård <[email protected]>wrote:

> I am going to create my own training data for the Part-of-speech tagger and
> would like to use a Tag Dictionary file in the training. But I cannot find
> any documentation on how the Tag Dictionary file format should be (except
> that it is XML).
>
> Does anybody have an example of how the Tag Dictionary should look like?
>
> Thanks,
> Yngve.
>

Reply via email to