On Tue, 18 May 2021 19:17:44 -0400 TomasK <[email protected]> dijo:
>I have some PDFs (contracts) from docusign and/or similar cloud service >- I can read and print them, but I cannot copy or search their content. > >The zealots have encoded every paragraph/page with some hash and >included custom fonts to make the document look and print normal. > >Does anyone know of some linuxy way to get rid of this BS and convert >the PDFs to normal unicode? I don't know if this will work, but it's the first thing I would try: Import the PDF to LibreOffice Writer, or Scribus, then re-export from them. Both of those programs have PDF import that sometimes will retain the original text, not separated into groups of a few words, as text is usually handled in a PDF. Another option would be to open in a PDF viewer, then export under various options. Also Ghostscript can view a PDF file, and can then export as pure Postscript. Not sure what that might accomplish, but give it a try. _______________________________________________ PLUG: https://pdxlinux.org PLUG mailing list [email protected] http://lists.pdxlinux.org/mailman/listinfo/plug
