> > I'm trying to overlay hOCR output on top a source image that was used to > produce the OCR output. > > It works pretty well when a bounding box coordinates are included into the > <span> element. But some <span>s shows up inside <h3> elements. Those > elements don't include the bonding box information.
Please submit this as a bug report. > > > Is there a way output the bounding box of those lines that are inside <h3>? > > A related question - what triggers the output of <h3> element? Where this > logic is defined? Detection of H3, P, and similar layout elements is done in Lua in order to make it easy to modify (experience shows that that's what people want to modify a lot in OCR systems). The logic is defined in ocropus/ocroscript/lib/headings.lua (you can find this sort of thing easily with "grep -ri h3 scripts") Tom --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "ocropus" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [EMAIL PROTECTED] For more options, visit this group at http://groups.google.com/group/ocropus?hl=en -~----------~----~----~----~------~----~------~--~---
