Better late than never, but I found this tool that will do what you want.
You just need to rename your hOCR or HTML output file (which one you get
depends on your version of Tesseract) so that it has an .xml extension.
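
In case it is useful, here is a minimal Python sketch of the rename step, assuming the output file is called something like page.hocr or page.html (that name is only an example). It copies the file instead of renaming it, so the original stays in place. The helper names copy_as_xml and list_bboxes are made up for this example and are not part of Tesseract or of the tool. The second helper relies on hOCR storing each box in a title attribute as "bbox x0 y0 x1 y1", and just lists the boxes as a quick check that the file has any.

```python
import re
import shutil
from pathlib import Path

# hOCR keeps each box in a title attribute, e.g. title="bbox 10 20 30 40; ..."
BBOX_RE = re.compile(r"bbox (\d+) (\d+) (\d+) (\d+)")


def copy_as_xml(src):
    """Copy an .hocr or .html file next to itself with an .xml extension."""
    src = Path(src)
    dst = src.with_suffix(".xml")
    shutil.copyfile(src, dst)  # the original file is left untouched
    return dst


def list_bboxes(path):
    """Return every (x0, y0, x1, y1) box found in the file, in file order."""
    text = Path(path).read_text(encoding="utf-8")
    return [tuple(int(n) for n in m.groups()) for m in BBOX_RE.finditer(text)]
```

Calling copy_as_xml("page.hocr") should give you page.xml in the same folder, and list_bboxes("page.xml") should return a non-empty list if Tesseract found anything on the page.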
On Sunday, October 6, 2013 at 3:26:58 PM UTC-4, matthew christy wrote:
> Does anyone know about a tool that already exists that allows you to see
> all the bounding boxes identified in Tesseract's hOCR output on one page?