[tesseract-ocr] Tesseract config for simple single words text and questions about learning

2018-04-28 Thread Lorenzo Blz
Hi, I'm using tesseract to recognize small fragments of text like this (actual images I'm using): Numbers are fixed length (7 digits) and letters are always 2 uppercase characters. I'm using a whitelist (a different one depending on whether the fragment is text or digits; I know this in advance). And it
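
A minimal sketch of the setup described in this post, assuming the `tesseract` command-line tool (4.x option syntax) and Python for the glue code. The helper names `build_args` and `is_valid` are hypothetical and not from the thread; only `--psm` and `tessedit_char_whitelist` are Tesseract options. Whether the whitelist is honoured can depend on the engine and version in use, so the known format (7 digits, or 2 uppercase letters) is also checked after recognition.

```python
import re
import string

# Expected formats taken from the post: 7 digits, or 2 uppercase letters.
DIGITS_PATTERN = re.compile(r"[0-9]{7}")
LETTERS_PATTERN = re.compile(r"[A-Z]{2}")


def build_args(image_path, kind):
    """Build a tesseract command line for one fragment (hypothetical helper).

    kind is "digits" or "letters"; the caller knows this in advance.
    """
    whitelist = string.digits if kind == "digits" else string.ascii_uppercase
    return [
        "tesseract", image_path, "stdout",
        "--psm", "8",  # page segmentation mode 8: treat the image as a single word
        "-c", f"tessedit_char_whitelist={whitelist}",
    ]


def is_valid(text, kind):
    """Check recognized text against the known fixed format."""
    pattern = DIGITS_PATTERN if kind == "digits" else LETTERS_PATTERN
    return pattern.fullmatch(text.strip()) is not None
```

The argument list can be passed to `subprocess.run(..., capture_output=True, text=True)`, and any result that fails `is_valid` can be flagged for a retry or manual review.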

Re: [tesseract-ocr] tesseract 4 beta: openCL useage

2018-04-28 Thread Janpieter Sollie
Oops, I forgot the attachment. Here it is :-) I believe it will help you further to decide, but what it CAN do: - find white lines - map a zone to a certain character probability - train itself. It does NOT decide whether it is a certain character or not; this needs to be decided on the host, not

Re: [tesseract-ocr] tesseract 4 beta: openCL useage

2018-04-28 Thread ShreeDevi Kumar
@zdenko This discussion may be better suited to the tesseract-dev forum, or do you want to track it as an issue on GitHub? ShreeDevi

Re: [tesseract-ocr] tesseract 4 beta: openCL useage

2018-04-28 Thread Janpieter Sollie
Would it be a problem for you if I rewrite the OpenCL engine completely, and you people help me link the tesseract kernel -> OpenCL engine parts? In the attachment, I already have a list of features I'd like to port to OpenCL. As this uses the GPU in a heavy way, I will implement