On Mon, Apr 24, 2017 at 3:58 PM, Dan McDonald <[email protected]> wrote:
> Lets say I scan in a bunch of paper at 300dpi or whatever. I'd like to > turn it into ASCII. Does such an OCR beast exists that's portable to, or > already running on, illumos? Also, something that works in an LX zone is > acceptable if there's nothing native. > I used tesseract a few years ago. Current source appears to be here: https://github.com/tesseract-ocr/tesseract That was on Solaris 10, so it can't have been that hard to build. Although I'm not sure which version we were using. So as a quick test (this is on my todo list anyway) the leptonica dependency built with a simple replacement of __SOLARIS__ with __sun__ in src/sarray1.c and tesseract 3.05 itself built just fine. Having downloaded the english traineddata file it worked reasonably well. -- -Peter Tribble http://www.petertribble.co.uk/ - http://ptribble.blogspot.com/ ------------------------------------------- illumos-discuss Archives: https://www.listbox.com/member/archive/182180/=now RSS Feed: https://www.listbox.com/member/archive/rss/182180/21175430-2e6923be Modify Your Subscription: https://www.listbox.com/member/?member_id=21175430&id_secret=21175430-6a77cda4 Powered by Listbox: http://www.listbox.com
