I am able to process smaller files say up to 160 kb

When I hit 220 to 250 kb, I am having issues and getting the following 
error:

Exception in thread "main" java.lang.Error: Invalid memory access
    at com.sun.jna.Native.invokePointer(Native Method)
    at com.sun.jna.Function.invokePointer(Function.java:470)
    at com.sun.jna.Function.invoke(Function.java:404)
    at com.sun.jna.Function.invoke(Function.java:315)
    at com.sun.jna.Library$Handler.invoke(Library.java:212)
    at com.sun.proxy.$Proxy0.TessBaseAPIGetUTF8Text(Unknown Source)
    at net.sourceforge.tess4j.Tesseract.getOCRText(Unknown Source)
    at net.sourceforge.tess4j.Tesseract.doOCR(Unknown Source)
    at net.sourceforge.tess4j.Tesseract.doOCR(Unknown Source)
    at com.nationwide.robot.MinP98TextReader.main(MinP98TextReader.java:34)
split_pt >0 && split_pt < word->chopped_word->NumBlobs():Error:Assert 
failed:in file ..\..\ccmain\tfacepp.cpp, line 186

Using the following code

Tesseract instance = new Tesseract();
             String result = instance.doOCR(outputFile );

Now if I set a rectangle in the doOCR I am able to process that rectangle 
but not the complete image of course limited to the rectangle.  I stepped 
the rectangle around the file to determine if there is an issue and cannot 
find a problem with the processing of the image, just a problem with 
processing the entire image.

so if I process

new Rectangle(0, 0,1200, 1000)  It works

If I exceed to new Rectangle(0, 0,1400, 1000), it fails with the above 
error.


Is there some buffer size I need to adjust to allow for processing the 
larger image or higher text  content images?

Or is there a strategy I can employ other than making lots of rectangles :)


-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/33287949-3fcb-404a-ba18-f33f6141376a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to