> If you want to continue training an existing model, you need to use
> linerec_cpreload; this is used for book adaptation, for example. It
> works quite well (see the publications).
After some trial and error, I did this:

    float8buffer_datafile=mydefault.f8b linerec_cpreload=default.model ocropus trainseg mydefault.model <my receipt as book dir>
    cmodel=mydefault.model ocropus lines2fsts <my receipt as book dir>

But the result is identical to the result I get using the original default.model. I don't know whether it should be, but according to strace, ocropus never accesses the "mydefault.f8b" file that the trainseg operation produced.

Before running trainseg, I corrected the transcriptions in the .gt.txt files in the book dir, and trainseg accepted most of the input (about 24 out of 30 lines; the exceptions were the lines that contained Finnish umlaut a's).

Did I miss something?

Side note: character segmentation in this case is almost flawless, with only one odd error: a couple of small dots inside the letter E are segmented as separate characters.

If more details on anything would help, let me know.

Thanks!
Caius
