Thanks William!

I have now installed Eclipse on my computer and I am trying to do the
training using the API.

I am still not entirely sure how to code it so I would really appreciate it
if somebody has a code example of how they trained the tagger using a Tag
Dictionary.

Regards,
Yngve.

On Wed, May 16, 2012 at 12:00 AM, William Colen <[email protected]>wrote:

> Hi, Yngve,
>
> The best way to create a POSDictionary is using the API. You should create
> a subclass of POSDictionary and use the method addTags(String word,
> String... tags) to populate it.
> Your class should be in the package opennlp.tools.postag, because the
> addTags method is package-private. Use the serialize method to save it to a
> file.
>
> Java Doc:
>
> http://opennlp.apache.org/documentation/1.5.2-incubating/apidocs/opennlp-tools/index.html?opennlp/tools/postag/POSDictionary.html
> Source Code:
>
> http://svn.apache.org/viewvc/opennlp/trunk/opennlp-tools/src/main/java/opennlp/tools/postag/POSDictionary.java?view=co
>
> Regards,
> William
>
>
> On Tue, May 15, 2012 at 7:21 AM, Yngve Ødegård <[email protected]
> >wrote:
>
> > I am going to create my own training data for the Part-of-speech tagger
> and
> > would like to use a Tag Dictionary file in the training. But I cannot
> find
> > any documentation on how the Tag Dictionary file format should be (except
> > that it is XML).
> >
> > Does anybody have an example of how the Tag Dictionary should look like?
> >
> > Thanks,
> > Yngve.
> >
>

Reply via email to