Hi, I managed to install and perform some tests on Fedora 9 with press articles. I followed the instructions from the wiki to build a training data set to get better results. ocropus extracted about one hundred png corresponding to lines and I've populated a transcriptions text file as described : transcriptions lines look like this p0001_l0105.png Mais il se débrouille toujours pour
But I can't figure out how to feed ocropus with the data ! Do I have to make a script using transcription file as input to generate all the txt files according to png or can I launch an ocropus script to digest all this ? How to tell ocropus to learn ? Thanks --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "ocropus" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [EMAIL PROTECTED] For more options, visit this group at http://groups.google.com/group/ocropus?hl=en -~----------~----~----~----~------~----~------~--~---
