OCRopus doesn't need line-accurate transcriptions. It's fine if you just take each page and type in its text in reading order. The training tools will figure out how to align that with the input.
In some cases you may need additional or different labeling tools: - You can use ocropus-cedit and related tools inside OCRopus - We're helping another group write a proposal for web-based correction tools (as part of a larger project). - The TextGrid project (that we're part of) is also developing tools based on Eclipse. - We're also developing some tools for using Mechanical Turk for transcriptions and verification. Tom -- You received this message because you are subscribed to the Google Groups "ocropus" group. To view this discussion on the web visit https://groups.google.com/d/msg/ocropus/-/ei_UA8KZ1T4J. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/ocropus?hl=en.
