[opensuse] OCR and batches

2007-03-18 Thread Anders Norrbring
Does anybody know of a way to scan several thousands pictures on disk with an OCR application to look for a specific text, and then list the images where that text was found? -- Anders Norrbring Norrbring Consulting -- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail:

Re: [opensuse] OCR and batches

2007-03-18 Thread Kai Ponte
On Sunday 18 March 2007 02:57:49 am Anders Norrbring wrote: Does anybody know of a way to scan several thousands pictures on disk with an OCR application to look for a specific text, and then list the images where that text was found? I've been doing apps like that for the better part of

Re: [opensuse] OCR and batches

2007-03-18 Thread Adam Tauno Williams
Does anybody know of a way to scan several thousands pictures on disk with an OCR application to look for a specific text, and then list the images where that text was found? I've been doing apps like that for the better part of twelve years. However, I've yet to see an OCR app in Linux.

Re: [opensuse] OCR and batches

2007-03-18 Thread Anders Norrbring
Kai Ponte skrev: On Sunday 18 March 2007 02:57:49 am Anders Norrbring wrote: Does anybody know of a way to scan several thousands pictures on disk with an OCR application to look for a specific text, and then list the images where that text was found? I've been doing apps like that for the

Re: [opensuse] OCR and batches

2007-03-18 Thread Anders Norrbring
Adam Tauno Williams skrev: Does anybody know of a way to scan several thousands pictures on disk with an OCR application to look for a specific text, and then list the images where that text was found? I've been doing apps like that for the better part of twelve years. However, I've yet to see

Re: [opensuse] OCR and batches

2007-03-18 Thread [EMAIL PROTECTED]
I've been investigating the dejavu (djvu) format to find out if it could do what you mentioned. I asked the guys at Lizardtech (www.lizardtech.com) if they had some off-the-shelf product. They don't but they could do some custom job. Djvu has what is called a hidden text layer. Since it is

Re: [opensuse] OCR and batches

2007-03-18 Thread Carlos E. R.
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 The Sunday 2007-03-18 at 17:55 -, [EMAIL PROTECTED] wrote: also xml markup. Linux has text-based djvu utilities. I think the tools to manipulate the hidden text layer are all Windows-based. Not so: djvused(1) A

Re: [opensuse] OCR and batches

2007-03-18 Thread Anders Norrbring
Carlos E. R. skrev: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 The Sunday 2007-03-18 at 17:55 -, [EMAIL PROTECTED] wrote: also xml markup. Linux has text-based djvu utilities. I think the tools to manipulate the hidden text layer are all Windows-based. Not so: djvused(1)

Re: [opensuse] OCR and batches

2007-03-18 Thread Kai Ponte
On Sunday 18 March 2007 08:07:40 am Anders Norrbring wrote: Thanks, I'm not locked to Linux for this adventure, but the images are stored on a Linux system but can be accessed from Windows. Maybe I should explain more what I'm about to do, it's not really a full text scan.. The images are

Re: [opensuse] OCR and batches

2007-03-18 Thread David Brodbeck
Adam Tauno Williams wrote: I've only ever used gOCR [http://jocr.sourceforge.net/] it worked well enough to develop keyword indexes for images. It does not work well enough (and I've never seen an OCR package that does, commercial or otherwise) to turn images into readable text. The only