Hi Yngve,

The best way to create a POSDictionary is through the API. Create a subclass of POSDictionary and populate it with the method addTags(String word, String... tags). Your class has to be in the package opennlp.tools.postag, because addTags is package-private. Then use the serialize method to save the dictionary to a file.
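Since you asked what the file looks like, here is a rough sketch that uses only the JDK and writes the layout I believe serialize produces: a <dictionary> root with a case_sensitive attribute, and one <entry> per word holding its space-separated tags in a tags attribute and the word in a <token> child. Those element and attribute names are an assumption from memory, so please check them against the POSDictionary source linked below. TagDictionarySketch and toXml are made-up names for illustration and are not part of OpenNLP.

```java
import java.io.StringWriter;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamWriter;

// Hypothetical helper, not part of OpenNLP.
public class TagDictionarySketch {

    // Renders word -> tags pairs in the ASSUMED tag dictionary layout:
    // <dictionary case_sensitive="..."><entry tags="T1 T2"><token>word</token></entry></dictionary>
    public static String toXml(Map<String, String[]> entries, boolean caseSensitive)
            throws XMLStreamException {
        StringWriter out = new StringWriter();
        XMLStreamWriter xml = XMLOutputFactory.newInstance().createXMLStreamWriter(out);
        xml.writeStartDocument();
        xml.writeStartElement("dictionary");
        xml.writeAttribute("case_sensitive", String.valueOf(caseSensitive));
        for (Map.Entry<String, String[]> e : entries.entrySet()) {
            xml.writeStartElement("entry");
            // All tags allowed for the word, separated by single spaces (assumed).
            xml.writeAttribute("tags", String.join(" ", e.getValue()));
            xml.writeStartElement("token");
            xml.writeCharacters(e.getKey());
            xml.writeEndElement(); // token
            xml.writeEndElement(); // entry
        }
        xml.writeEndElement(); // dictionary
        xml.writeEndDocument();
        xml.close();
        return out.toString();
    }

    public static void main(String[] args) throws XMLStreamException {
        Map<String, String[]> entries = new LinkedHashMap<>();
        entries.put("run", new String[] {"NN", "VB"});
        entries.put("the", new String[] {"DT"});
        System.out.println(toXml(entries, false));
    }
}
```

This is only meant to show the shape of the file. For a real dictionary I would still go through addTags and serialize, so the output is guaranteed to match what the tagger reads back.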
Javadoc:
http://opennlp.apache.org/documentation/1.5.2-incubating/apidocs/opennlp-tools/index.html?opennlp/tools/postag/POSDictionary.html

Source code:
http://svn.apache.org/viewvc/opennlp/trunk/opennlp-tools/src/main/java/opennlp/tools/postag/POSDictionary.java?view=co

Regards,
William

On Tue, May 15, 2012 at 7:21 AM, Yngve Ødegård <[email protected]> wrote:

> I am going to create my own training data for the Part-of-speech tagger and
> would like to use a Tag Dictionary file in the training. But I cannot find
> any documentation on how the Tag Dictionary file format should be (except
> that it is XML).
>
> Does anybody have an example of how the Tag Dictionary should look like?
>
> Thanks,
> Yngve.
