Better late than never, but found this tool that will do what you want.

You just need to rename your hocr or html file (depending on version of 
tesseract) to xml.

On Sunday, October 6, 2013 at 3:26:58 PM UTC-4, matthew christy wrote:
> Does anyone know about a tool that already exists that allows you to see 
> all the bounding boxes identified in Tesseract's hOCR output on one page?
> Thanks,
> Matt

You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
To post to this group, send email to
Visit this group at
To view this discussion on the web visit
For more options, visit

Reply via email to