Re: proofing searchable pdf files

2014-11-13 Thread Jörg-Volker Peetz
There's a newer package gimagereader — graphical GTK+ front-end to tesseract-ocr https://packages.debian.org/unstable/main/gimagereader . Can that help? -- Regards, jvp. -- To UNSUBSCRIBE, email to debian-user-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact

Re: proofing searchable pdf files

2014-11-04 Thread Scott Ferguson
On 04/11/14 12:17, Gary Roach wrote: On 11/01/2014 06:35 PM, Scott Ferguson wrote: On 31/10/14 11:47, Gary Roach wrote: Hi all, Problem: I am working on an archiving project and wish to archive documents to searchable pdf files but can't seem to figure out how to proof read and correct the

Re: proofing searchable pdf files

2014-11-03 Thread Gary Roach
On 11/01/2014 06:35 PM, Scott Ferguson wrote: On 31/10/14 11:47, Gary Roach wrote: Hi all, Problem: I am working on an archiving project and wish to archive documents to searchable pdf files but can't seem to figure out how to proof read and correct the text overlay. Any suggestions. I'm not

Re: proofing searchable pdf files

2014-11-02 Thread Jörg-Volker Peetz
There's a open source tool named OCRmyPDF which claims to do what you're trying to do: see https://github.com/fritz-hh/OCRmyPDF As far as I understand, it makes use of standard GNU/Linux software and produces a searchable pdf file (which implies in my understanding that the text is extractable). I

Re: proofing searchable pdf files

2014-11-01 Thread Gary Roach
On 10/31/2014 04:15 PM, Doug wrote: On 10/31/2014 06:31 PM, Gary Roach wrote: On 10/30/2014 05:47 PM, Gary Roach wrote: Hi all, This is part of a medium sized, low budget archiving project that will process serveral thousand documents, all done by low tech volunteers. So I really need

Re: proofing searchable pdf files

2014-11-01 Thread Scott Ferguson
On 31/10/14 11:47, Gary Roach wrote: Hi all, Problem: I am working on an archiving project and wish to archive documents to searchable pdf files but can't seem to figure out how to proof read and correct the text overlay. Any suggestions. I'm not sure what you mean by text *overlay*... but,

Re: proofing searchable pdf files

2014-10-31 Thread Gary Roach
On 10/30/2014 05:47 PM, Gary Roach wrote: Hi all, Problem: I am working on an archiving project and wish to archive documents to searchable pdf files but can't seem to figure out how to proof read and correct the text overlay. Any suggestions. Tesseract seems to do a really great job

Re: proofing searchable pdf files

2014-10-31 Thread Doug
On 10/31/2014 06:31 PM, Gary Roach wrote: On 10/30/2014 05:47 PM, Gary Roach wrote: Hi all, This is part of a medium sized, low budget archiving project that will process serveral thousand documents, all done by low tech volunteers. So I really need methods that are straight forward or can

proofing searchable pdf files

2014-10-30 Thread Gary Roach
Hi all, Problem: I am working on an archiving project and wish to archive documents to searchable pdf files but can't seem to figure out how to proof read and correct the text overlay. Any suggestions. System: Debian Wheezy Intel i5-750 processor HP Officejet Pro

Re: proofing searchable pdf files

2014-10-30 Thread Doug
On 10/30/2014 08:47 PM, Gary Roach wrote: Hi all, Problem: I am working on an archiving project and wish to archive documents to searchable pdf files but can't seem to figure out how to proof read and correct the text overlay. Any suggestions. System: Debian Wheezy Intel

Re: proofing searchable pdf files

2014-10-30 Thread Gary Dale
On 30/10/14 08:47 PM, Gary Roach wrote: Hi all, Problem: I am working on an archiving project and wish to archive documents to searchable pdf files but can't seem to figure out how to proof read and correct the text overlay. Any suggestions. System: Debian Wheezy Intel i5-750