The poppler-utils package has tools like: pdfseparate -- page extraction tool pdftotext -- text extraction pdftohtml -- PDF to HTML converter
Many others as well, but those might be of immediate value to you. Years ago, pre-PDF, there used to be tools like ps2ps, pstops (different tools, I think) and such, that would do all sorts of manipulations. When PDF came into existence, folks wrote similar pdf2pdf and pdftopdf tools. I *thought* the first set was called "psutils" but `apt show psutils` shows something different. Likely things changed. After all, I'm referring to the late 80s, early 90s, and I probably grabbed my copies off of Usenet. But, that poppler-utils package looks to be in the same vein. mrc

