Re: OCR questions

2007-07-24 Thread Jörg-Volker Peetz
Rodolfo Medina wrote: Rodolfo Medina wrote: I tried gocr and the result was quite miserable. Then I tried with MS Windows and it was almost perfect. Somewhere in the web I read that OCR software under Linux is very poor at the moment and that it's better to use MS Windows for that:

Re: OCR questions

2007-07-22 Thread Rodolfo Medina
Rodolfo Medina wrote: I tried gocr and the result was quite miserable. Then I tried with MS Windows and it was almost perfect. Somewhere in the web I read that OCR software under Linux is very poor at the moment and that it's better to use MS Windows for that: unfortunately my test seems

OCR questions (was: How to acquire text so to edit it?)

2007-07-21 Thread Rodolfo Medina
Rodolfo Medina [EMAIL PROTECTED] wrote: Excuse the basic question: I wish to scan a printed text so to have it in an editable text file. How can I do that with `sane' and `scanimage'? On Fri, Jun 08, 2007 at 08:57:03AM -0400, Celejar wrote: Scanners scan to image formats. To get

Re: OCR questions (was: How to acquire text so to edit it?)

2007-07-21 Thread Bob Proulx
Rodolfo Medina wrote: Somewhere in the web I read that OCR software under Linux is very poor at the moment and that it's better to use MS Windows for that: unfortunately my test seems to confirm that. What do you Debian listers think? I think you should check out these articles.

Re: OCR questions (was: How to acquire text so to edit it?)

2007-07-21 Thread Andrew Sackville-West
On Sat, Jul 21, 2007 at 08:10:27PM +0200, Bob Proulx wrote: Rodolfo Medina wrote: Somewhere in the web I read that OCR software under Linux is very poor at the moment and that it's better to use MS Windows for that: unfortunately my test seems to confirm that. What do you Debian listers

Re: OCR questions

2007-07-21 Thread Rodolfo Medina
Rodolfo Medina wrote: I tried gocr and the result was quite miserable. Then I tried with MS Windows and it was almost perfect. Somewhere in the web I read that OCR software under Linux is very poor at the moment and that it's better to use MS Windows for that: unfortunately my test seems

Re: OCR questions

2007-07-21 Thread Florian Kulzer
On Sat, Jul 21, 2007 at 22:25:43 +0200, Rodolfo Medina wrote: [...] I installed tesseract with configure, make, make install, then tried to run it but got the following error message: Unable to load unicharset file /usr/local/share/tessdata/eng.unicharset . In the README file there is:

Re: OCR questions

2007-07-21 Thread Osamu Aoki
On Sat, Jul 21, 2007 at 10:53:09PM +0200, Florian Kulzer wrote: On Sat, Jul 21, 2007 at 22:25:43 +0200, Rodolfo Medina wrote: Why not use the Debian package? It is called tesseract-ocr. Yes. But it is old 1.02 version and has FTBFS bug. If anyone here is interesed to help maintain update with

Re: OCR questions

2007-07-21 Thread Nelson Castillo
On 7/21/07, Osamu Aoki [EMAIL PROTECTED] wrote: On Sat, Jul 21, 2007 at 10:53:09PM +0200, Florian Kulzer wrote: On Sat, Jul 21, 2007 at 22:25:43 +0200, Rodolfo Medina wrote: Why not use the Debian package? It is called tesseract-ocr. Yes. But it is old 1.02 version and has FTBFS bug. Yes,

Re: OCR questions

2007-07-21 Thread Wayne Topa
Nelson Castillo([EMAIL PROTECTED]) is reported to have said: On 7/21/07, Osamu Aoki [EMAIL PROTECTED] wrote: On Sat, Jul 21, 2007 at 10:53:09PM +0200, Florian Kulzer wrote: On Sat, Jul 21, 2007 at 22:25:43 +0200, Rodolfo Medina wrote: Why not use the Debian package? It is called

Re: OCR questions

2007-07-21 Thread Nelson Castillo
On 7/21/07, Wayne Topa [EMAIL PROTECTED] wrote: Nelson Castillo([EMAIL PROTECTED]) is reported to have said: On 7/21/07, Osamu Aoki [EMAIL PROTECTED] wrote: On Sat, Jul 21, 2007 at 10:53:09PM +0200, Florian Kulzer wrote: On Sat, Jul 21, 2007 at 22:25:43 +0200, Rodolfo Medina wrote: Why not

Re: OCR questions

2007-07-21 Thread Nelson Castillo
If you install as stated above with aptitude, tesseract-ocr-data is automatically installed unless you change default behavior of aptitude. FTBFS is just package issue. This package should work. Otherwise, please file bug report. Osamu, thanks a lot. The package works well. Sorry -- if I was

Re: OCR questions

2007-07-21 Thread Osamu Aoki
On Sat, Jul 21, 2007 at 07:54:40PM -0500, Nelson Castillo wrote: On 7/21/07, Wayne Topa [EMAIL PROTECTED] wrote: Nelson Castillo([EMAIL PROTECTED]) is reported to have said: On 7/21/07, Osamu Aoki [EMAIL PROTECTED] wrote: On Sat, Jul 21, 2007 at 10:53:09PM +0200, Florian Kulzer wrote: On