RE: extracting text from image using pdfbox

2012-10-15 Thread Kishore Babu
m] On Behalf Of Peter Murray-Rust Sent: Monday, 15 October, 2012 11:46 AM To: users@pdfbox.apache.org Subject: Re: extracting text from image using pdfbox On Mon, Oct 15, 2012 at 6:17 AM, Kishore Babu wrote: > Hi Peter, > Thank you very much for the reply. Unfortunately, the image I am > d

Re: extracting text from image using pdfbox

2012-10-14 Thread Peter Murray-Rust
On Mon, Oct 15, 2012 at 6:17 AM, Kishore Babu wrote: > Hi Peter, > Thank you very much for the reply. Unfortunately, the image I am dealing > are the scanned one. > > I will update my result if I succeed in using the mentioned line detection > algorithms. > > There is an excellent explanation of

RE: extracting text from image using pdfbox

2012-10-14 Thread Kishore Babu
y, 14 October, 2012 1:39 PM To: users@pdfbox.apache.org Subject: Re: extracting text from image using pdfbox Hi, Apache PDFBox can't help you here, I'm afraid. What you're after is OCR functionality (http://en.wikipedia.org/wiki/Optical_character_recognition) and PDFBox doesn't pro

RE: extracting text from image using pdfbox

2012-10-14 Thread Kishore Babu
bject: Re: extracting text from image using pdfbox On Fri, Oct 12, 2012 at 2:47 PM, Kishore Babu wrote: > Hi All, > > Is it possible to extract text from an image (JPEG) using pdfbox or is > there any open source java code for this? > > ** ** > > This is a very difficult p

Re: extracting text from image using pdfbox

2012-10-14 Thread Jeremias Maerki
Hi, Apache PDFBox can't help you here, I'm afraid. What you're after is OCR functionality (http://en.wikipedia.org/wiki/Optical_character_recognition) and PDFBox doesn't provide that. The only thing you can do is to extract the bitmap images using PDFBox and then attempt to decipher the text contai

Re: extracting text from image using pdfbox

2012-10-12 Thread Peter Murray-Rust
On Fri, Oct 12, 2012 at 2:47 PM, Kishore Babu wrote: > Hi All, > > Is it possible to extract text from an image (JPEG) using pdfbox or is > there any open source java code for this? > > ** ** > > This is a very difficult problem and to solve it completely requires a large amount of applie