John Carter wrote:
On Fri, 16 Jan 2009, Craig Falconer wrote:

David Lowe wrote, On 16/01/09 15:46:
Then you can never share this, because it would be redistribution of copyright material :-\

Actually I suspect even an attack lawyer may have a hard time
identifying what is copyrighted in a text file of (word,yyyy-mm-dd) pairs.

A very brief attempt (2 seconds) with gocr didn't spit out anything
readable. I suspect one actually needs to (Gasp! Schlock! Horror!)
read the man page and tweak options.

gocr gave reasonable results when converting opera subtitles with dvd::rip. I have no idea what options they used, and every DVD needed its own learning phase, where you could tell it that on THIS DVD, this bunch of pixels is 'm', but that bunch of pixels is 'rn'.

But it showed that gocr could be useful, and it might be worth pursuing.


Stephen Irons

=======================================================================
This email, including any attachments, is only for the intended
addressee.  It is subject to copyright, is confidential and may be
the subject of legal or other privilege, none of which is waived or
lost by reason of this transmission.
If the receiver is not the intended addressee, please accept our
apologies, notify us by return, delete all copies and perform no
other act on the email.
Unfortunately, we cannot warrant that the email has not been
altered or corrupted during transmission.
=======================================================================

Reply via email to