On Sun, 13 Oct 2019, John Jason Jordan wrote:
I know nothing of emacs, but I would copy and paste the text from the PDF document into a plain GUI text editor (Gedit is what I normally use) and then use its search and replace function to replace them with nothing. If I need fancier features I use LibreOffice Writer.
John, Why GUI? I view the text in emacs and joe. Both show the same thing and there's no way to specify the caret-A and caret-a.
I have this problem occasionally with subtitles, and several of my subtitle editors have the ability to convert the encoding to UT-8. Just now I noticed that Gedit has the ability to set the language of a document, although the only choices it offers me are various varieties of English. I wonder what would happen if I opened the document in Gedit. Ditto for opening it in Writer or Scribus, which also have the ability to import a PDF document and keep the text as text.
The language is English. When I opened the document with xpdf it reported to the virtual terminal that it didn't understand the caracters in Adobe-Japan1. Regards, Rich _______________________________________________ PLUG mailing list [email protected] http://lists.pdxlinux.org/mailman/listinfo/plug
