Re: Suggestions for tesseract

2022-01-21 Thread Siard
On Thu, 20 Jan 2022, Curt wrote: > On 2022-01-20, Siard wrote: > > Bob Bernstein wrote: > > > Executing 'apt-cache search tesseract' brings up a multitude of > > > packages. > > > > > > My need is simple enough, I think: I like to scan (using an

Re: Suggestions for tesseract

2022-01-20 Thread Curt
On 2022-01-20, Siard wrote: > Bob Bernstein wrote: >> Executing 'apt-cache search tesseract' brings up a multitude of >> packages. >> >> My need is simple enough, I think: I like to scan (using an >> Epson scanner) pages of printed books -- almost one hundred

Re: Suggestions for tesseract

2022-01-20 Thread Siard
Bob Bernstein wrote: > Executing 'apt-cache search tesseract' brings up a multitude of > packages. > > My need is simple enough, I think: I like to scan (using an > Epson scanner) pages of printed books -- almost one hundred per > cent text -- and then use OCR to produce page

Suggestions for tesseract

2022-01-20 Thread Bob Bernstein
Executing 'apt-cache search tesseract' brings up a multitude of packages. My need is simple enough, I think: I like to scan (using an Epson scanner) pages of printed books -- almost one hundred per cent text -- and then use OCR to produce pages from which I can copy 'n paste snippets of text

Re: xsane & tesseract

2017-08-26 Thread Siard
Joe Pfeiffer wrote: > I scanned the document to ppm files, sent them to tesseract, put the > output of tesseract into a .txt file, and cleaned up from there. You could try gimagereader, a frontend for tesseract, making this process somewhat easier. Among others, it uses a spell checker, so

Re: xsane & tesseract

2017-08-25 Thread Joe Pfeiffer
Doug <dmcgarr...@optonline.net> writes: > On 08/25/2017 08:31 PM, Stephen Grant Brown wrote: > > Hi All, > How do I setup xsane to use the tesseract OCR engine? > I see gocr under preferences->setup->ocr. > Yours Sincerely > Stephen Grant Brown. > >

Re: xsane & tesseract

2017-08-25 Thread Doug
On 08/25/2017 08:31 PM, Stephen Grant Brown wrote: Hi All, How do I setup xsane to use the tesseract OCR engine? I see gocr under preferences->setup->ocr. Yours Sincerely Stephen Grant Brown. Unless it has been vastly improved, you might as well copy the document by hand! Finding and

xsane & tesseract

2017-08-25 Thread Stephen Grant Brown
Hi All, How do I setup xsane to use the tesseract OCR engine? I see gocr under preferences->setup->ocr. Yours Sincerely Stephen Grant Brown.

fihier manquant pour tesseract

2012-01-15 Thread Bernard Schoenacker
Bonjour, j'ai une erreur avec tesseract du fait que j'ai un fichier de conf absent : Unable to load unicharset file /usr/share/tesseract-ocr/tessdata/eng.unicharset et paf le chien depuis 2007 c'est pour moi : http://code.google.com/p/tesseract-ocr

Re: fihier manquant pour tesseract

2012-01-15 Thread Jean-Damien Durand
Bonjour, Unable to load unicharset file /usr/share/tesseract-ocr/tessdata/eng.unicharset et paf le chien depuis 2007 c'est pour moi : http://code.google.com/p/tesseract-ocr/wiki/ReadMe Installer tesseract-ocr-eng ? C.f. http://bugs.debian.org/cgi-bin/bugreport.cgi?bug

Re: fihier manquant pour tesseract

2012-01-15 Thread moi-meme
Le Sun, 15 Jan 2012 12:10:02 +0100, Bernard Schoenacker a écrit : j'ai une erreur avec tesseract du fait que j'ai un fichier de conf absent : Unable to load unicharset file /usr/share/tesseract-ocr/tessdata/eng.unicharset je confirme Jean-Damien : --[moi@cdiscount

Re: fihier manquant pour tesseract

2012-01-15 Thread Bernard Schoenacker
Le 15 Jan 2012 17:29:25 GMT, moi-meme chie...@free.fr a écrit : Le Sun, 15 Jan 2012 12:10:02 +0100, Bernard Schoenacker a écrit : j'ai une erreur avec tesseract du fait que j'ai un fichier de conf absent : Unable to load unicharset file /usr/share/tesseract-ocr/tessdata

Re: fihier manquant pour tesseract

2012-01-15 Thread moi-meme
Le Sun, 15 Jan 2012 19:00:02 +0100, Bernard Schoenacker a écrit : merci pour m'avoir indiqué d'installer le paquet tesseract des locales : en le tout est rentré dans l'ordre ... il faut charger le package apt-file. ça permet de voir ce que contient un package (même non

Re: Tesseract...

2011-07-15 Thread Camaleón
and stable, then all the packages are mixed - You do not know from the dir.s architecture which package belongs to which repo in the case. Hum... not in Debian and many other distributions, look: http://ftp.de.debian.org/debian/pool/main/t/tesseract/ Individual .deb files display package architecture

Re: Tesseract...

2011-07-15 Thread Sthu Deus
Thank You for Your time and answer, Camaleón: Did you try to compile tesseract for your Debian version? You can do that on the virtual machine, just to see how it goes :-? OK, I'll give it a try. Who needs old files when new arrive? :) Well, it can be years of work that now cannot render

Re: Tesseract...

2011-07-15 Thread Camaleón
On Fri, 15 Jul 2011 22:00:18 +0700, Sthu Deus wrote: Who needs old files when new arrive? :) Well, it can be years of work that now cannot render with the new version... you will get very angry birds (oops... sorry, I mean users, angry users) if you update the package to the last version that

Re: Tesseract...

2011-07-14 Thread Sthu Deus
packages into a Debian installation. Have you considered in compiling the apckage from Tesseract site? I thought to install Ubuntu in KVM and go on in case no luck w/ tesseract 3 in Debian. Ah, I've seen that you already posted into backports mailing list. Yes, you may ask to Tesseract Debian

Re: Tesseract...

2011-07-08 Thread Camaleón
El 2011-07-07 a las 13:50 -0700, sthu deus escribió: (resending to the list) On 07/07/2011, Camaleón noela...@gmail.com wrote: On Thu, 07 Jul 2011 17:02:40 +0700, Sthu Deus wrote: Here are: https://launchpad.net/~nutznboltz/+archive/tesseract/+sourcepub/1729019/+listing-archive-extra

Tesseract...

2011-07-07 Thread Sthu Deus
Good time of the day. Here are: https://launchpad.net/~nutznboltz/+archive/tesseract/+sourcepub/1729019/+listing-archive-extra plenty of missing in Debian 6 language packages w/ the updated program itself. Can't it be easily backported to D6 ones it is in Ubuntu? I would like to do it myself

Re: Tesseract...

2011-07-07 Thread Hugo Vanwoerkom
Sthu Deus wrote: Good time of the day. Here are: https://launchpad.net/~nutznboltz/+archive/tesseract/+sourcepub/1729019/+listing-archive-extra plenty of missing in Debian 6 language packages w/ the updated program itself. Can't it be easily backported to D6 ones it is in Ubuntu? I would

Re: Tesseract...

2011-07-07 Thread Camaleón
On Thu, 07 Jul 2011 17:02:40 +0700, Sthu Deus wrote: Here are: https://launchpad.net/~nutznboltz/+archive/tesseract/+sourcepub/1729019/+listing-archive-extra plenty of missing in Debian 6 language packages w/ the updated program itself. Can't it be easily backported to D6 ones

Re: tesseract: ocr that works

2008-12-29 Thread Rainer Kluge
Hugo Vanwoerkom schrieb: Hi, Recently there was a post mentioning tesseract. Turns out that is an award winning opensource OCR that works! Hugo I use it with the gscan2pdf frontend and it works perfectly (at least for documents in german language) -- To UNSUBSCRIBE, email to debian

Re: tesseract: ocr that works

2008-12-28 Thread Anthony Campbell
On 21 Dec 2008, Hugo Vanwoerkom wrote: Hi, Recently there was a post mentioning tesseract. Turns out that is an award winning opensource OCR that works! I tried it out: 1. apt-get install tesseract-ocr 2. apt-get install tesseract-ocr-eng 3. use xsane to scan a page at dpi 300 and save

Re: tesseract: ocr that works

2008-12-28 Thread andmalc
On Dec 28, 5:10 am, Anthony Campbell a...@acampbell.org.uk wrote: On 21 Dec 2008, Hugo Vanwoerkom wrote: [snip] Yes, tesseract does work well. Here, xsane gives depth 24, but conversion to depth 8 is neither possible nor necessary. Following the docs, I did There is an option at the top

Re: tesseract: ocr that works

2008-12-28 Thread Anthony Campbell
On 28 Dec 2008, andmalc wrote: On Dec 28, 5:10 am, Anthony Campbell a...@acampbell.org.uk wrote: On 21 Dec 2008, Hugo Vanwoerkom wrote: [snip] Yes, tesseract does work well. Here, xsane gives depth 24, but conversion to depth 8 is neither possible nor necessary. Following the docs, I

tesseract: ocr that works

2008-12-27 Thread Hugo Vanwoerkom
Hi, Recently there was a post mentioning tesseract. Turns out that is an award winning opensource OCR that works! I tried it out: 1. apt-get install tesseract-ocr 2. apt-get install tesseract-ocr-eng 3. use xsane to scan a page at dpi 300 and save as .tif 4. run: convert foo.tif -depth 8 foo1

Re: tesseract: ocr that works

2008-12-27 Thread Dotan Cohen
. This was on Fedora, so maybe it was in fact tesseract. -- Dotan Cohen http://what-is-what.com http://gibberish.co.il א-ב-ג-ד-ה-ו-ז-ח-ט-י-ך-כ-ל-ם-מ-ן-נ-ס-ע-ף-פ-ץ-צ-ק-ר-ש-ת ا-ب-ت-ث-ج-ح-خ-د-ذ-ر-ز-س-ش-ص-ض-ط-ظ-ع-غ-ف-ق-ك-ل-م-ن-ه‍-و-ي А-Б-В-Г-Д-Е-Ё-Ж-З-И-Й-К-Л-М-Н-О-П-Р-С-Т-У-Ф-Х-Ц-Ч-Ш-Щ-Ъ-Ы-Ь-Э-Ю-Я а-б-в-г-д-е-ё-ж

Re: tesseract: ocr that works

2008-12-27 Thread Bryan Bishop
DPI made things _worse_ not better, possibly because of noise. This was on Fedora, so maybe it was in fact tesseract. Back when I first got access to the university scientific publication network, I started to get hungry for an OCR tool to do bibliographies and references, here's the result