You have a very old version of Tesseract. The page_separator parameter was implemented after the 3.02 release, so 3.02 neither writes a form feed between pages nor offers that option.
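
Once you are on a release that supports page_separator, the pages can be recovered from the single .txt file by splitting on that separator. Below is a minimal sketch in Python, not anything shipped with Tesseract. It assumes the separator is the form feed character (ASCII 12) and that it is written after each page; the function name split_pages and the sample text are made up for illustration.

```python
FORM_FEED = "\x0c"  # ASCII 12, the form feed character discussed in this thread


def split_pages(text, separator=FORM_FEED):
    """Split OCR output into a list of per-page strings.

    Assumption: the OCR output contains `separator` between (or after) pages.
    """
    pages = text.split(separator)
    # If the separator follows the last page, split() leaves an empty
    # trailing chunk; drop it so the count matches the number of pages.
    if pages and pages[-1].strip() == "":
        pages.pop()
    return pages


# Hypothetical two-page output, each page terminated by a form feed.
sample = "first page text\n" + FORM_FEED + "second page text\n" + FORM_FEED
for number, page in enumerate(split_pages(sample), start=1):
    print(number, repr(page))
```

If the list comes back with a single entry for a multipage TIFF, the output contains no separator at all, which is what you would expect from 3.02.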
Zdenko

On Sat, Mar 12, 2016 at 10:22 PM, <[email protected]> wrote:

> Thanks Zdenko. I'm still stuck. I OCR'd an 81 page tiff file and I've
> searched my output txt file for the form feed character (asc 12) and didn't
> find one. I have windows version of tesseract 3.02. Also I don't see a
> parameter for page_separator in the command-line options. Do you know what
> I'm doing wrong?
>
> On Saturday, March 12, 2016 at 1:44:12 PM UTC-5, [email protected] wrote:
>>
>> If I OCR a multipage tiff file using Tesseract it comes out as one single
>> page .txt file. Is there a way to maintain the page breaks?
>> Thanks.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8yr8%3Do54ApQcR8K6dFOw6kX_rf36Mc64PvegvK3zMXU9g%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

