Pooja,

Would you create an issue at issues.apache.org for this and attach an example file?

Thanks.

Daniel

On Wed, May 26, 2010 at 12:03 AM, Pooja4 G <[email protected]> wrote:
> I tried PDFBox 1.1.0, but with it PDF generation failed while we were
> checking the documents for encryption.
> Does anyone have any idea which other API we can use besides PDFBox to
> create the PDF diff file in the DMS?
> We upload documents from Adobe Professional 9.0, and when we create a
> new revision of a document, creation of the PDF diff file fails. The
> getText() method of the PDFTextStripper class returns null, as below:
>
> String PDF_text = new String();
> PDFTextStripper stripper = new PDFTextStripper();
>
> PDF_text = stripper.getText(document);
>
> So please help me in solving this.
>
> Thanks & Regards,
> Pooja Gupta
> Tata Consultancy Services
> Mailto: [email protected]
> Website: http://www.tcs.com
> ____________________________________________
> Experience certainty. IT Services
> Business Solutions
> Outsourcing
> ____________________________________________
>
> From: Andreas Lehmkuehler <[email protected]>
> To: [email protected]
> Date: 05/20/2010 09:21 PM
> Subject: Re: Extract Text from PDF
>
> Hi,
>
> Thomas Fischer wrote:
> > Hello Pooja,
> >
> > I don't have any Adobe 9.0 documents, but I know that in my tests the
> > newer versions of PDFBox perform significantly better than version 0.7.3.
> > I would suggest you try the fairly recent version 1.1.0; it works very
> > well, at least on my Adobe Acrobat 8.1 documents.
> Which can be found at [1]
>
> BR
> Andreas Lehmkühler
>
> [1] http://pdfbox.apache.org/download.html
>
> > Kind regards,
> > Thomas Fischer
> >
> > On 20.05.2010 at 14:07, Pooja4 G wrote:
> >
> > > Which versions of PDF documents are supported by PDFBox 0.7.3? We
> > > upload a document from Adobe Professional writer 9.0, and while
> > > creating the difference files to compare, we extract the text data
> > > from the PDF document using the getText() method of the
> > > PDFTextStripper class.
> > >
> > > String PDF_text = new String();
> > > PDFTextStripper stripper = new PDFTextStripper();
> > >
> > > PDF_text = stripper.getText(document);
> > >
> > > But it returns null if the document argument was created with Adobe
> > > Professional 9.0; otherwise it runs successfully.
> > > Please help, or at least let us know whether any upcoming version of
> > > PDFBox will support this.
> > >
> > > Thanks & Regards,
> > > Pooja Gupta
> > > Tata Consultancy Services
> > > =====-----=====-----=====
> > > Notice: The information contained in this e-mail
> > > message and/or attachments to it may contain
> > > confidential or privileged information. If you are
> > > not the intended recipient, any dissemination, use,
> > > review, distribution, printing or copying of the
> > > information contained in this e-mail message
> > > and/or attachments to it are strictly prohibited. If
> > > you have received this communication in error,
> > > please notify us by reply e-mail or telephone and
> > > immediately and permanently delete the message
> > > and any attachments.
> > > Thank you

