Damiano Porta created OPENNLP-1092:
--------------------------------------
Summary: PosTagger serialization in namefinder model
Key: OPENNLP-1092
URL: https://issues.apache.org/jira/browse/OPENNLP-1092
Project: OpenNLP
Issue Type: Bug
Components: Name Finder
Affects Versions: 1.8.0, 1.8.1
Environment: Ubuntu 16.04 - Intel Core i7 6700k - Openjdk version
1.8.0_131
Reporter: Damiano Porta
I am getting an error during the serialization of the post tagger inside a name
finder model.
The error is: *java.lang.IllegalStateException: Missing serializer for
postagger.bin*
I am having this problem via API and via cmd NameFinderTrainer tool.
The command is:
*opennlp TokenNameFinderTrainer -data /home/damiano/corpus.train -lang it
-model /home/damiano/model.bin -featuregen /home/damiano/test.xml
-sequenceCodec BIO -resources
/home/damiano/lavoro/java/Parser/src/main/resources/*
{code}
The output is:
Writing name finder model ... Compressed 885605 parameters to 94030
3451 outcome patterns
Exception in thread "main" java.lang.IllegalStateException: Missing serializer
for postagger.bin
at opennlp.tools.util.model.BaseModel.serialize(BaseModel.java:592)
at opennlp.tools.cmdline.CmdLineUtil.writeModel(CmdLineUtil.java:182)
at
opennlp.tools.cmdline.namefind.TokenNameFinderTrainerTool.run(TokenNameFinderTrainerTool.java:188)
at opennlp.tools.cmdline.CLI.main(CLI.java:244)
{code}
My generators.xml is:
{code:xml}
<?xml version="1.0" encoding="UTF-8"?>
<generators>
<cache>
<generators>
<window prevLength="4" nextLength="2">
<tokenclass />
</window>
<window prevLength="4" nextLength="2">
<token />
</window>
<!-- Pos Tagger -->
<window prevLength="4" nextLength="2">
<tokenpos model="postagger.bin" />
</window>
<definition />
<prevmap />
<bigram />
<sentence begin="true" end="false" />
</generators>
</cache>
</generators>
{code}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)