Using the 'lines' files to train ocropus (Ubuntu 9.04) fails for every file, telling me either that the cseg file contains one extra character than the transcript:
0000 0001: transcript doesn't agree with cseg (transcript 11, cseg 12) FIXME or the cseg doesn't exist [info] 0000 0005: no such cseg I've had a look at the files and the character count is correct for the cseg, not the transcript but the transcript does in fact have all the required characters in the file. In the case of missing cseg files, they are indeed not there. I used the file at http://ocropus.googlegroups.com/web/lines.tgz as per the instructions. Any help would be much appreciated, but go easy - I'm such a newbie I'm not even sure if newbie is the right term... --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "ocropus" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/ocropus?hl=en -~----------~----~----~----~------~----~------~--~---
