Greetings,

Thank you for your response. I'll extract the test cases and work with you.
Here is one to start with. I tried running PDF debugger in Eclipse and noticed that tooltips are broken because the ICC profile ISOcoated_v2_300_bas.icc is missing. A quick repository search showed that this file is in the src/test directory of the pdfbox project, and hence Maven did not include it in the build. My solution was to copy the file to the appropriate location, and it started working as expected.

I can create an issue/bug if this is the right solution or, since it's simple, someone with write access to the repo can just copy the file.

Regards,
Niranjan

On Sun, Mar 23, 2025 at 4:21 AM Tilman Hausherr <thaush...@t-online.de> wrote:

> On 22.03.2025 20:51, Niranjan Rao wrote:
> > Greetings,
> >
> > Having glanced through the PDF specifications, I understand the kind of
> > challenges the PDFBox team faces. Scaling/transformation and matrix
> > manipulations can drive anyone crazy. The PDF debugger is another
> > valuable tool that I find myself using more and more; it's an extremely
> > useful tool.
> >
> > My use case is to extract text from PDF, but we're very picky about
> > what we want to read and ideally we don't want to scan the whole PDF, as
> > we can usually figure out on which page our changes will be. I found
> > there are some limitations in the current text extraction logic and
> > ended up copy/pasting the class and modifying it.
> >
> > I can fork the repository and submit my pull requests if the team is
> > willing to accept the PRs. Most of the changes, so far, are making
> > methods accessible or wrapping them in other function calls while
> > leaving the core concept the same. I'm willing to discuss my needs and
> > see if there is a better or already supported way.
> >
> > Will the PDFBox team be open to PRs and/or discussion? If so, what will
> > be the process? I'm working in a corporate environment and managed to
> > get approval that our organization will be ok with submitting the
> > changes even before broaching the subject here.
>
> Hi,
>
> We're always open to discussions. We try to avoid "making everything
> accessible" because this brings the risk that people come up with weird
> ideas with unexpected side effects, and this also prevents making some
> changes under the hood in the future. Sometimes we also add things based
> on users' requests. We're not fully on GitHub, only as a read-only mirror.
> You can still create PRs. The best would be to discuss your ideas first
> and try to break them into small parts.
>
> Tilman
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
> For additional commands, e-mail: users-h...@pdfbox.apache.org
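
P.S. For anyone who wants to confirm the missing-profile diagnosis before copying files around, a small classpath lookup may help. This is only a sketch under assumptions: the class name ResourceCheck and the helper isOnClasspath are made up for illustration, and the bare file name used as the default resource path is a placeholder, since the exact path PDF debugger loads the profile from is not stated in this thread.

```java
import java.net.URL;

public class ResourceCheck {

    // Hypothetical helper: returns true if the named resource can be
    // resolved by the current class loader, i.e. it was actually packaged
    // onto the runtime classpath (files under src/test normally are not).
    static boolean isOnClasspath(String resourcePath) {
        ClassLoader loader = Thread.currentThread().getContextClassLoader();
        if (loader == null) {
            loader = ClassLoader.getSystemClassLoader();
        }
        URL url = loader.getResource(resourcePath);
        return url != null;
    }

    public static void main(String[] args) {
        // Placeholder path (assumption): pass the real resource path as the
        // first argument once it is known.
        String path = args.length > 0 ? args[0] : "ISOcoated_v2_300_bas.icc";
        String status = isOnClasspath(path) ? "found" : "NOT found";
        System.out.println(path + ": " + status + " on classpath");
    }
}
```

Running this with the same classpath as the Eclipse launch configuration should show whether the profile is visible at runtime or only present in the test sources.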