I think Quan is referring to tess4j version -

see https://sourceforge.net/projects/tess4j/files/tess4j/3.4.2/

Version 3.4.2 (14 November 2017) - Update Lept4J to 1.6.2 - Update
GhostScript to 9.22 - Improve handling of PDF files in multi-threaded
environment - Lift limits on number of pages in PDF - Use TESSDATA_PREFIX
environment variable by default, if defined


ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Thu, Nov 16, 2017 at 3:11 PM, Nikolai Velkov <[email protected]> wrote:

> We are using 3.5.x
>
> On Wednesday, November 15, 2017 at 5:09:42 PM UTC+2, Quan Nguyen wrote:
>>
>> Try the latest version, 3.4.2.
>>
>> On Wednesday, November 15, 2017 at 1:43:08 AM UTC-6, Nikolai Velkov wrote:
>>>
>>> So is there a fix for that ?
>>>
>>> On Monday, November 13, 2017 at 4:47:56 PM UTC+2, Quan Nguyen wrote:
>>>>
>>>> The GhostScript-based PDF module in Tess4J sets the limit to 999 since
>>>> it was thought that the users would never attempt to go beyond that since
>>>> loading only a few hundreds of 300-DPI full-size image pages into memory
>>>> would already cause out-of-memory exceptions.
>>>>
>>>> On Friday, November 10, 2017 at 6:47:31 AM UTC-6, Nikolai Velkov wrote:
>>>>>
>>>>> We're using tesseract 3.0.5 to ocr pdf files and when ocring a pdf
>>>>> file with 1000+ pages, tesseract goes to page 999 and then stops ocring. 
>>>>> No
>>>>> error or anything (using it with java and tess4j btw). It's also not about
>>>>> the size since i tested it with a pdf file of 1000+ pages with only the
>>>>> letter 'A' on each page. The file is about 2.3 mbs. Is there any
>>>>> configuration that specifies a max amount of pages to ocr ?
>>>>>
>>>> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/ms
> gid/tesseract-ocr/1c45babd-75b9-46a8-ab0a-2b8014d1b0cb%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/1c45babd-75b9-46a8-ab0a-2b8014d1b0cb%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduW_VTzoe06mOtqHiGF2%2Boj6aE60g%2B4LiJVmGr75FPT6tw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to