Re: [tesseract-ocr] Re: Best settings to OCR an image of some cyphered text (base64)

Tom Vercauteren Mon, 14 Jul 2025 00:09:34 -0700

> Why? Would it be more cost effective to just have it double/triple-keyed 
and compare the transcriptions?

Indeed. took me 30 seconds to find it (and that was just from typing in the
first line and deciding if it was a 1 or an l ....)

https://www.google.com/search?q=IyEvYmluL2Jhc2gKCiMgQ29uZ3Jhd

https://shogo82148.github.io/

But there may also be much more efficient ways to crack this particular nut
than using OCR.

Such as a little searching on the web:

https://github.com/energelpen/UNIQLO_Akamai_T-Shirt_Bash

Thanks for the reply. Yes, I did find these existing transcriptions. I was
more interested in understanding how to get this to work with an OCR
pipeline.

> You don't say what pre-processing you did. Did you remove all the orange?
Anything else?

I provide the command line I used and the image to reproduce the issue. As
shown there, no pre-processing yet.

> That's a basic shell quoting issue that the documentation for your shell
should cover.

Thanks. I thought so, I was just flagging that I knew this wasn't yet
covered in te command line I provided.

Are there better settings to use for such a use case?

> Maybe? But there may also be much more efficient ways to crack this
particular nut than using OCR.

That's what I am interested in. If not OCR or human transcription, what
would you suggest?

Best wishes,
Tom

--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To view this discussion visit
https://groups.google.com/d/msgid/tesseract-ocr/f50f4cfb-5a55-4d74-8a7b-c7570fe52b9an%40googlegroups.com.

Re: [tesseract-ocr] Re: Best settings to OCR an image of some cyphered text (base64)

Reply via email to