Re: reading text out of ps/pdf

2001-01-15 Thread Tuukka Toivonen
On Sun, 14 Jan 2001, Jan Goebel wrote: you can maybe scanner/OCR software like GOCR (open source) take a look at: http://altmark.nat.uni-magdeburg.de/~jschulen/ocr/index.html Sure. You can try it. But don't expect too much. When I last time (maybe a half year ago) tested all free OCR

Re: reading text out of ps/pdf

2001-01-15 Thread Herbert Voss
Tuukka Toivonen wrote: On Sun, 14 Jan 2001, Jan Goebel wrote: you can maybe scanner/OCR software like GOCR (open source) take a look at: http://altmark.nat.uni-magdeburg.de/~jschulen/ocr/index.html Sure. You can try it. But don't expect too much. When I last time (maybe a half year

Re: reading text out of ps/pdf

2001-01-15 Thread Tuukka Toivonen
On Sun, 14 Jan 2001, Jan Goebel wrote: you can maybe scanner/OCR software like GOCR (open source) take a look at: http://altmark.nat.uni-magdeburg.de/~jschulen/ocr/index.html Sure. You can try it. But don't expect too much. When I last time (maybe a half year ago) tested all free OCR

Re: reading text out of ps/pdf

2001-01-15 Thread Herbert Voss
Tuukka Toivonen wrote: On Sun, 14 Jan 2001, Jan Goebel wrote: you can maybe scanner/OCR software like GOCR (open source) take a look at: http://altmark.nat.uni-magdeburg.de/~jschulen/ocr/index.html Sure. You can try it. But don't expect too much. When I last time (maybe a half year

Re: reading text out of ps/pdf

2001-01-15 Thread Tuukka Toivonen
On Sun, 14 Jan 2001, Jan Goebel wrote: > you can maybe scanner/OCR software like GOCR (open source) > take a look at: > http://altmark.nat.uni-magdeburg.de/~jschulen/ocr/index.html Sure. You can try it. But don't expect too much. When I last time (maybe a half year ago) tested all free OCR

Re: reading text out of ps/pdf

2001-01-15 Thread Herbert Voss
Tuukka Toivonen wrote: > > On Sun, 14 Jan 2001, Jan Goebel wrote: > > > you can maybe scanner/OCR software like GOCR (open source) > > take a look at: > > http://altmark.nat.uni-magdeburg.de/~jschulen/ocr/index.html > > Sure. You can try it. But don't expect too much. When I last time (maybe a

Re: reading text out of ps/pdf

2001-01-14 Thread Jan Goebel
Hello, you can maybe scanner/OCR software like GOCR (open source) take a look at: http://altmark.nat.uni-magdeburg.de/~jschulen/ocr/index.html good luck jan PS: @christopher: if you were sucessfull, you may give me a reply? maybe i need it sometimes, too. On Sat, 13 Jan 2001, Matej Cepl

Re: reading text out of ps/pdf

2001-01-14 Thread Matej Cepl
Christopher Jones wrote: So my question is: is there any software out there which attempts to look at bitmaps and guess what the ascii would be-- something like those programs which read books through a scanner and try to match font characters to the image. And I say this question is a reach,

Re: reading text out of ps/pdf

2001-01-14 Thread Jan Goebel
Hello, you can maybe scanner/OCR software like GOCR (open source) take a look at: http://altmark.nat.uni-magdeburg.de/~jschulen/ocr/index.html good luck jan PS: @christopher: if you were sucessfull, you may give me a reply? maybe i need it sometimes, too. On Sat, 13 Jan 2001, Matej Cepl

Re: reading text out of ps/pdf

2001-01-14 Thread Matej Cepl
Christopher Jones wrote: So my question is: is there any software out there which attempts to look at bitmaps and guess what the ascii would be-- something like those programs which read books through a scanner and try to match font characters to the image. And I say this question is a reach,

Re: reading text out of ps/pdf

2001-01-14 Thread Jan Goebel
Hello, you can maybe scanner/OCR software like GOCR (open source) take a look at: http://altmark.nat.uni-magdeburg.de/~jschulen/ocr/index.html good luck jan PS: @christopher: if you were sucessfull, you may give me a reply? maybe i need it sometimes, too. On Sat, 13 Jan 2001, Matej Cepl

Re: reading text out of ps/pdf

2001-01-14 Thread Matej Cepl
Christopher Jones wrote: > So my question is: is there any software out there which attempts to look at > bitmaps and guess what the ascii would be-- something like those programs which > read books through a scanner and try to match font characters to the image. And > I say this question is a

Re: reading text out of ps/pdf

2001-01-13 Thread R. E. de Lima-Lopes
yes there is a tool called ps2ascii, it extracts plain texts form *.ps files []s lima-lopes R.E. de Lima-Lopes [EMAIL PROTECTED] GNU/Linux Registered User # 182240 On Sat, 13 Jan 2001, Christopher Jones wrote: Date: Sat, 13 Jan 2001 11:34:48 -0600 From: Christopher

Re: reading text out of ps/pdf

2001-01-13 Thread Christopher Jones
I have that tool. But some pdf or ps files consist not of coded text but a bitmapped image. For instance, pdf and ps files which I download from journal databases are scanned images of journal pages. ps2ascii and pdftotext will not extract text from these files, since there is no ascii content to

Re: reading text out of ps/pdf

2001-01-13 Thread Herbert Voss
Christopher Jones wrote: I have that tool. But some pdf or ps files consist not of coded text but a bitmapped image. For instance, pdf and ps files which I download from journal databases are scanned images of journal pages. ps2ascii and pdftotext will not extract text from these files,

Re: reading text out of ps/pdf

2001-01-13 Thread Matej Cepl
Christopher Jones wrote: So my question is: is there any software out there which attempts to look at bitmaps and guess what the ascii would be-- something like those programs which read books through a scanner and try to match font characters to the image. And I say this question is a reach,

Re: reading text out of ps/pdf

2001-01-13 Thread R. E. de Lima-Lopes
yes there is a tool called ps2ascii, it extracts plain texts form *.ps files []s lima-lopes R.E. de Lima-Lopes [EMAIL PROTECTED] GNU/Linux Registered User # 182240 On Sat, 13 Jan 2001, Christopher Jones wrote: Date: Sat, 13 Jan 2001 11:34:48 -0600 From: Christopher

Re: reading text out of ps/pdf

2001-01-13 Thread Christopher Jones
I have that tool. But some pdf or ps files consist not of coded text but a bitmapped image. For instance, pdf and ps files which I download from journal databases are scanned images of journal pages. ps2ascii and pdftotext will not extract text from these files, since there is no ascii content to

Re: reading text out of ps/pdf

2001-01-13 Thread Herbert Voss
Christopher Jones wrote: I have that tool. But some pdf or ps files consist not of coded text but a bitmapped image. For instance, pdf and ps files which I download from journal databases are scanned images of journal pages. ps2ascii and pdftotext will not extract text from these files,

Re: reading text out of ps/pdf

2001-01-13 Thread Matej Cepl
Christopher Jones wrote: So my question is: is there any software out there which attempts to look at bitmaps and guess what the ascii would be-- something like those programs which read books through a scanner and try to match font characters to the image. And I say this question is a reach,

Re: reading text out of ps/pdf

2001-01-13 Thread R. E. de Lima-Lopes
yes there is a tool called ps2ascii, it extracts plain texts form *.ps files []s lima-lopes R.E. de Lima-Lopes [EMAIL PROTECTED] GNU/Linux Registered User # 182240 On Sat, 13 Jan 2001, Christopher Jones wrote: > Date: Sat, 13 Jan 2001 11:34:48 -0600 > From: Christopher

Re: reading text out of ps/pdf

2001-01-13 Thread Christopher Jones
I have that tool. But some pdf or ps files consist not of coded text but a bitmapped image. For instance, pdf and ps files which I download from journal databases are scanned images of journal pages. ps2ascii and pdftotext will not extract text from these files, since there is no ascii content to

Re: reading text out of ps/pdf

2001-01-13 Thread Herbert Voss
Christopher Jones wrote: > > I have that tool. But some pdf or ps files consist not of coded text but a > bitmapped image. For instance, pdf and ps files which I download from journal > databases are scanned images of journal pages. ps2ascii and pdftotext will not > extract text from these

Re: reading text out of ps/pdf

2001-01-13 Thread Matej Cepl
Christopher Jones wrote: > So my question is: is there any software out there which attempts to look at > bitmaps and guess what the ascii would be-- something like those programs which > read books through a scanner and try to match font characters to the image. And > I say this question is a