We are using 3.5.x On Wednesday, November 15, 2017 at 5:09:42 PM UTC+2, Quan Nguyen wrote: > > Try the latest version, 3.4.2. > > On Wednesday, November 15, 2017 at 1:43:08 AM UTC-6, Nikolai Velkov wrote: >> >> So is there a fix for that ? >> >> On Monday, November 13, 2017 at 4:47:56 PM UTC+2, Quan Nguyen wrote: >>> >>> The GhostScript-based PDF module in Tess4J sets the limit to 999 since >>> it was thought that the users would never attempt to go beyond that since >>> loading only a few hundreds of 300-DPI full-size image pages into memory >>> would already cause out-of-memory exceptions. >>> >>> On Friday, November 10, 2017 at 6:47:31 AM UTC-6, Nikolai Velkov wrote: >>>> >>>> We're using tesseract 3.0.5 to ocr pdf files and when ocring a pdf file >>>> with 1000+ pages, tesseract goes to page 999 and then stops ocring. No >>>> error or anything (using it with java and tess4j btw). It's also not about >>>> the size since i tested it with a pdf file of 1000+ pages with only the >>>> letter 'A' on each page. The file is about 2.3 mbs. Is there any >>>> configuration that specifies a max amount of pages to ocr ? >>>> >>>
-- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/1c45babd-75b9-46a8-ab0a-2b8014d1b0cb%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

