Change requests

Niranjan Rao Sat, 22 Mar 2025 12:52:50 -0700

Greetings,

Having glanced through PDF specifications I understand kind ofchallenges PDFBox team faces. Scaling/transformation and matrixmanipulations can drive anyone crazy. PDF debugger tool is anothervaluable tool that I found using more and more, its extremely useful tool.

My use case is to extract text from PDF, but we're very picky about whatwe want to read and ideally we don't want to scan whole PDF as we canusually figure out on which page our changes will be. I found there aresome limitations based on current text extraction logic and ended upcopy/pasting the class and modifying it.

I can fork the repository and submit my pull requests if team is willingto accept the PRs. Most of the changes, so far, are making methodsaccessible or wrapping them in other function calls while leaving coreconcept same. I'm willing to discuss my need and see if there is betteror already supported way.

Will PDFBox team be open to PRs and/or discussion? If so, what will bethe process? I'm working in corporate environment and managed to getapproval that our organization will be ok about submitting the changeseven before broaching the subject here.



Regards,


Niranjan


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: users-h...@pdfbox.apache.org

Change requests

Reply via email to