[tesseract-ocr] Re: Tesseract max pages while ocring?

Quan Nguyen Wed, 15 Nov 2017 07:10:08 -0800

Try the latest version, 3.4.2.

On Wednesday, November 15, 2017 at 1:43:08 AM UTC-6, Nikolai Velkov wrote:
>
> So is there a fix for that ?
>
> On Monday, November 13, 2017 at 4:47:56 PM UTC+2, Quan Nguyen wrote:
>>
>> The GhostScript-based PDF module in Tess4J sets the limit to 999 since it 
>> was thought that the users would never attempt to go beyond that since 
>> loading only a few hundreds of 300-DPI full-size image pages into memory 
>> would already cause out-of-memory exceptions.
>>
>> On Friday, November 10, 2017 at 6:47:31 AM UTC-6, Nikolai Velkov wrote:
>>>
>>> We're using tesseract 3.0.5 to ocr pdf files and when ocring a pdf file 
>>> with 1000+ pages, tesseract goes to page 999 and then stops ocring. No 
>>> error or anything (using it with java and tess4j btw). It's also not about 
>>> the size since i tested it with a pdf file of 1000+ pages with only the 
>>> letter 'A' on each page. The file is about 2.3 mbs. Is there any 
>>> configuration that specifies a max amount of pages to ocr ?
>>>
>>


-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/2af8c3cd-ca59-499b-a31c-84e7d513a9fc%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[tesseract-ocr] Re: Tesseract max pages while ocring?

Reply via email to