My Lucene OpenNLP patch (LUCENE-2899) contains small test data sets just
to create unit tests. The patch runs MaxEnt on this test data and then
uses the .bin files to run simple unit tests. These datasets are
completely bogus, they only exist to demonstrate a complete round trip.
The chunker output changed slightly from 1.5.2-incubation to 1.5.3. Was
this expected? Was there some change in MaxEnt that caused generated
models to change? If there was, that's fine, as long as someone expected
this. But it does mean that the old models on Sourceforge may be
slightly wrong.
Lance
- Chunker behavior changed in 1.5.3? Lance Norskog
-