Hi, you need to upload it to a public location as the mailing list doesn't support attachments.
BR Maruan > Am 23.03.2015 um 19:18 schrieb a7med shre3y <a7med.shr...@gmail.com>: > > Dear Maruan, > > Thank you very much for the information. Please find herewith attached the > PDF to reproduce the problem. > The text to remove is: "To Be Approved". The text has a multi-byte encoding, > so I call first to encode it in order to find it then remove it. > > Best Regards, > a7mad > >> On Mon, Mar 23, 2015 at 4:13 PM, Maruan Sahyoun <sahy...@fileaffairs.de> >> wrote: >> Dear a7mad, >> >> removing text from a PDF is not an easy task as >> - text which might visually appear as a single item might consistent of >> individual parts within the PDF itself e.g. each character or groups of >> characters are place individually in different COSStrings >> - text might be drawn using graphics commands >> - text can appear within different parts of the PDF (e.g. the text might be >> content of a form field AND the annotation representing the form field >> visually) >> - you need to look up the encoding information to get form the characters in >> the PDF "string" to the ones you are looking for >> …. >> >> If you can post a specific PDF to a public location and describe in detail >> which string should have been replaced which hasn't I will be able to tell >> you why that might have happened. >> >> Maruan >> >> >> > Am 23.03.2015 um 15:03 schrieb a7med shre3y <a7med.shr...@gmail.com>: >> > >> > Hi all, >> > >> > Currently I am facing a strange problem removing text from the some PDFs. >> > My program is able to find the text and "remove it" by calling the >> > COSString.reset() method. >> > The problem is, when I open the output PDF file, I still see the text but >> > not selectable (I mean when I try to highlight it with the mouse to copy >> > it, it's not selectable!). When print the content (tokens) of the output >> > file, I DO NOT find the text at all!! >> > >> > I am currently stuck in the PDF specifications 1.5 and really running out >> > of time. >> > >> > I'd so much appreciate any help or any idea on what's going on. >> > >> > Notes: >> > 1. I use use PDFBox 1.7.1 >> > 2. This problem does not occur with all PDFs, only some PDFs cause this >> > problem. >> > >> > Thank you very much. >> > a7mad >> >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org >> For additional commands, e-mail: users-h...@pdfbox.apache.org > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org > For additional commands, e-mail: users-h...@pdfbox.apache.org