https://www.danisch.de/blog/2023/10/31/aktennotiz-zu-pdftotext-bei-vermurksten-zeichensaetzen/
The text is in German, but what he says is that he was able to extract text from obfuscated PDFs by converting them to PostScript and then back to PDF. I didn't test this myself, but I suspect that the conversion to PostScript discards the /ToUnicode stream, and that it is rebuilt from the font itself when the file is converted back to PDF.
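For anyone who wants to try this, here is a minimal, untested sketch of the round trip in Python. The choice of tools is my assumption, not necessarily what the blog post uses: pdftops and pdftotext from Poppler, and ps2pdf from Ghostscript, all of which have to be on the PATH. The function names and temporary file names are made up for illustration.

```python
import subprocess
import tempfile
from pathlib import Path


def roundtrip_commands(src, ps, rebuilt, txt):
    """Return the command lines for PDF -> PostScript -> PDF -> text.

    Assumed tools: pdftops/pdftotext (Poppler) and ps2pdf (Ghostscript).
    """
    return [
        ["pdftops", str(src), str(ps)],        # PDF to PostScript
        ["ps2pdf", str(ps), str(rebuilt)],     # PostScript back to PDF
        ["pdftotext", str(rebuilt), str(txt)], # extract text from the rebuilt PDF
    ]


def extract_via_postscript(src):
    """Run the round trip in a temporary directory and return the extracted text."""
    src = Path(src)
    with tempfile.TemporaryDirectory() as tmp:
        tmp = Path(tmp)
        ps = tmp / "step.ps"
        rebuilt = tmp / "rebuilt.pdf"
        txt = tmp / "out.txt"
        for cmd in roundtrip_commands(src, ps, rebuilt, txt):
            # check=True raises CalledProcessError if a tool fails
            subprocess.run(cmd, check=True)
        # pdftotext writes UTF-8 by default
        return txt.read_text(encoding="utf-8", errors="replace")
```

Calling extract_via_postscript("obfuscated.pdf") would return whatever pdftotext gets out of the rebuilt file. Whether that text is readable depends on whether the mapping can really be recovered from the embedded font, which is only my guess above.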
Tilman
