Extracting text from PDF files with embedded custom fonts can be a pain. Have you checked Document Properties | Fonts to see which fonts are embedded?
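To make the problem concrete: a custom font often draws, say, a Hebrew letter for the code that normally means Latin "a", so the extracted text is really a string of legacy codes that has to be pushed through a lookup table. Below is a rough sketch of what such a remap might look like in Python. The table entries are invented purely for illustration (they are not taken from any real font), and the names `LEGACY_TO_UNICODE`, `remap` and `unmapped` are hypothetical.

```python
# Hypothetical table: these code-to-letter pairs are made up for illustration.
# A real font needs its own table, built from its glyph chart or documentation.
LEGACY_TO_UNICODE = {
    "a": "\u05d0",  # code "a" is drawn as Hebrew alef in this imaginary font
    "b": "\u05d1",  # bet
    "g": "\u05d2",  # gimel
}


def remap(text, table=LEGACY_TO_UNICODE):
    """Replace each legacy code with its Unicode character.

    Characters missing from the table are passed through unchanged.
    """
    return text.translate(str.maketrans(table))


def unmapped(text, table=LEGACY_TO_UNICODE):
    """Return the sorted non-whitespace characters the table does not cover."""
    return sorted({ch for ch in text if ch not in table and not ch.isspace()})
```

Running `unmapped()` over the extracted text first gives a list of the codes still to be identified, which is where most of the hours go when the font is undocumented.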
Also, some PDF files are encrypted to prevent copying of content. If printing is allowed, it might sometimes work to intercept the printing output stream. You might still get gibberish as a result of the embedded fonts, though.

Btw, there are several utilities available that can convert other encodings to Unicode. Unless the embedded font is properly documented, though, it's a hard slog to remap the encoding. I once tried this for an Indian language, but gave up after a few hours.

David

--
View this message in context: http://sword-dev.350566.n4.nabble.com/Re-Is-Delitzsch-Hebrew-NT-available-tp3231746p3320641.html
Sent from the SWORD Dev mailing list archive at Nabble.com.
