Re: incremental training: cmodel (and maybe lmodel)

Caius Sun, 30 Aug 2009 11:58:09 -0700

> If you want to continue training an existing model, you need to use
> linerec_cpreload; this is used for book adaptation, for example.  It
> works quite well (see the publications).
>


After some steps of trial and error, I did this:

float8buffer_datafile=mydefault.f8b linerec_cpreload=default.model
ocropus trainseg mydefault.model <my receipt as book dir>

cmodel=mydefault.model ocropus lines2fsts <my receipt as book dir>

But the result is identical to the result I get using the original
default.model. I don't know if it should, but ocropus never accesses
the "mydefault.f8b" file the trainseg operation produced when tracking
with strace utility.

Prior to trainseg, I did correct the transcriptions in .gt.txt files
in the bookdir and trainseg mostly did accept the input (like 24 out
of 30 lines except for those that contained Finnish umlaut a's).

Did I miss something?

Side note: Character segmentation in this case seems almost outright
flawless, only one weird error: couple of small dots within letter E
are segmented as separate character.

If there's anything worth providing more details, let me know.

Thanks!

Caius

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"ocropus" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/ocropus?hl=en
-~----------~----~----~----~------~----~------~--~---

Re: incremental training: cmodel (and maybe lmodel)

Reply via email to