tesseract -c include_page_breaks=1 -c page_separator="[PAGE SEPRATOR]" input.tiff output
https://groups.google.com/forum/#!topic/tesseract-dev/VsgJ9R-cTQ0 On Saturday, March 12, 2016 at 12:44:12 PM UTC-6, [email protected] wrote: > > If I OCR a multipage tiff file using Tesseract it comes out as one single > page .txt file. Is there a way to maintain the page breaks? > Thanks. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/253f7913-fe16-4a06-8e0c-81b74a6ff788%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

