Re: How to identify underlying COSObject for a PDXObject or PDXObject for COSObject

2018-07-23 Thread Tilman Hausherr
Hi, I'm not sure if this is the answer to your question (which I haven't really understood), but please try the PrintImageLocations.java example from the source code download. Image objects do not know their own location(s). Tilman Am 23.07.2018 um 08:50 schrieb Ranjit Saripalli: Hi, I am t

Read jumbled text from multi-line highlighted text - PDFBox 1.8.14

2018-07-23 Thread CM Reddy
Hi All, We are using PDFBox 1.8.14 to manage PDF documents in our application. Implemented algorithm listed in link to read the highlighted from PDF documents. During testing the code, w

Re: How to identify underlying COSObject for a PDXObject or PDXObject for COSObject

2018-07-23 Thread Ranjit Saripalli
Hi Tilman, Thanks for you response. I can get position of the image in pdf as it is in printimagelocations example but I am trying to get the the location of COSObject. Let me try to keep my question simple, I have a COSObject with X and Y coordinate values as below. How to get the actual loca

Re: How to identify underlying COSObject for a PDXObject or PDXObject for COSObject

2018-07-23 Thread Tilman Hausherr
Hi, I don't know what that is. Maybe this is user provided information. Or maybe PDF 2.0? Can you share the PDF with this? If not, could you look at the PDF with PDFDebugger and then share a screenshot? You can include screenshots on the mailing list if they are inlined. Thunderbird can do i

Re: How to identify underlying COSObject for a PDXObject or PDXObject for COSObject

2018-07-23 Thread Marc Kaufman
A PDF XObject doesn't have a unique position. The ultimate position on the page is determined by the current transformation matrix in the content stream: The BBox entry in the form dictionary is expressed in form space, as are any coordinates used in the form XObject’s content stream, suc

Re: Read jumbled text from multi-line highlighted text - PDFBox 1.8.14

2018-07-23 Thread Tilman Hausherr
See the comments by me and mkl in the SO question. Tilman Am 23.07.2018 um 19:08 schrieb CM Reddy: Hi All, We are using PDFBox 1.8.14 to manage PDF documents in our application. Implemented algorithm listed in link

Removing Embedded Index

2018-07-23 Thread Matthew Clemente
First of all, thanks from all the feedback this mailing list has provided to previous questions; it’s been incredibly helpful. I have a pdf created with Acrobat that has an embedded index that I would like to remove. When I use the debugger, I can see it, under "Root/PieceInfo/SearchIndex”. I’ve

Re: Removing Embedded Index

2018-07-23 Thread Tilman Hausherr
Am 24.07.2018 um 02:07 schrieb Matthew Clemente: First of all, thanks from all the feedback this mailing list has provided to previous questions; it’s been incredibly helpful. I have a pdf created with Acrobat that has an embedded index that I would like to remove. When I use the debugger, I