I have a few thousand scanned pages of text that are saved as TIFF images on my computer. I have also bundled these images in PDF documents.
I need to number each line of text in these images, but the formatting must be preserved. It's very important that the formatting is preserved as these are legal documents. Is there any way that I can use Tesseract to detect each line of text in my images and number each line?

