On Mon, Apr 24, 2017 at 3:58 PM, Dan McDonald <[email protected]> wrote:

> Lets say I scan in a bunch of paper at 300dpi or whatever.  I'd like to
> turn it into ASCII.  Does such an OCR beast exists that's portable to, or
> already running on, illumos?  Also, something that works in an LX zone is
> acceptable if there's nothing native.
>

I used tesseract a few years ago. Current source appears to be here:

https://github.com/tesseract-ocr/tesseract

That was on Solaris 10, so it can't have been that hard to build. Although
I'm not sure which version we were using.

So as a quick test (this is on my todo list anyway) the leptonica
dependency built with a simple replacement of __SOLARIS__ with
__sun__ in src/sarray1.c and tesseract 3.05 itself built just fine. Having
downloaded the english traineddata file it worked reasonably well.

-- 
-Peter Tribble
http://www.petertribble.co.uk/ - http://ptribble.blogspot.com/



-------------------------------------------
illumos-discuss
Archives: https://www.listbox.com/member/archive/182180/=now
RSS Feed: https://www.listbox.com/member/archive/rss/182180/21175430-2e6923be
Modify Your Subscription: 
https://www.listbox.com/member/?member_id=21175430&id_secret=21175430-6a77cda4
Powered by Listbox: http://www.listbox.com

Reply via email to