Im trying to make a setup where i can give a list of entities (that hold 
all the necessary information to do OCR on .tiff files). For this i use 
Spring and i use Threadpoolexecutor to execute my tasks in parallel.

Enviroment: win10, Java, Spring Framework
Executor: FixedThreadpool
tess4j version: 4.3.1

Error messages: (there are multiple ones because it gives sometimes 
different error or just works, so here are my findings)

splitter_.orig_pix():Error:Assert failed:in file ..\..\src\ccmain\
tesseractclass.cpp, line 674
This is the most common one, I can replicate this


!w_it.cycled_list():Error:Assert failed:in file ..\..\src\ccstruct\pageres.
cpp, line 1351
 I couldnt replicate this


HIGHlol1 
LOWlol 
Page 1 
Page 1 
Detected 224 diacritics 
Didn't fail 
OCR is done let's move! 
tmp\lol1.pdf -> C:\Users\kh\Desktop\workstuff\samples\test_out\lol1.pdf: The 
process cannot access the file because it is being used by another process. 
C:\Users\kh\Desktop\workstuff\samples\test_out\lol1.pdf [Fatal Error] :1:167
: The markup in the document following the root element must be well-formed. 
C:\Users\kh\Desktop\workstuff\samples\test_out\lol1.pdf
Here is the context for this output. HIGHlol1 means high priority and the 
file is named lol1(.tiff). Page 1 and Detected 224 diacritics are standard 
tesseract outputs as far as i know. Didn't fail means it did not throw any 
tesseract exceptions (never got one btw.). OCR is done lets move! means i 
managed to tell database that we finished OCR on file. After that, the 
program fails to move pdf file from tmp folder, which is the intended 
folder for creating pdf via tess4j. After this I dont know what the error 
means, but it closes the application, meaning it wont even try to do ocr on 
the second .tiff file called lol.

To sum it up these are the errors i get when i try to execute tasks 
(specific tasks that execute tess4j's createDocuments) parallel.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/aa9ad58c-fcf3-44be-a390-e8a8817c9bee%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to