> Why? Would it be more cost effective to just have it double/triple-keyed 
and compare the transcriptions? 

Indeed. took me 30 seconds to find it (and that was just from typing in the 
first line and deciding if it was a 1 or an l ....)

https://www.google.com/search?q=IyEvYmluL2Jhc2gKCiMgQ29uZ3Jhd

https://shogo82148.github.io/


But there may also be much more efficient ways to crack this particular nut 
than using OCR.


Such as a little searching on the web:

https://github.com/energelpen/UNIQLO_Akamai_T-Shirt_Bash

 
Thanks for the reply. Yes, I did find these existing transcriptions. I was 
more interested in understanding how to get this to work with an OCR 
pipeline.
 

> You don't say what pre-processing you did. Did you remove all the orange? 
Anything else?

I provide the command line I used and the image to reproduce the issue. As 
shown there, no pre-processing yet.


> That's a basic shell quoting issue that the documentation for your shell 
should cover.

Thanks. I thought so, I was just flagging that I knew this wasn't yet 
covered in te command line I provided.


Are there better settings to use for such a use case?


> Maybe? But there may also be much more efficient ways to crack this 
particular nut than using OCR.

That's what I am interested in. If not OCR or human transcription, what 
would you suggest?


Best wishes,
Tom


 

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion visit 
https://groups.google.com/d/msgid/tesseract-ocr/f50f4cfb-5a55-4d74-8a7b-c7570fe52b9an%40googlegroups.com.

Reply via email to