[Gal, Can you join the texmacs-dev list to discuss it?]
[texmacs-dev, Please Cc Gal until and unless he joins the list.]

When I produce a PDF with TeXmacs (latest from svn) and use the "read
aloud" feature of pdf-max pro on Android, it doesn't "see" the typographic
ligatures, and pronounces words wrong. For example, "offense" becomes "o
ense", dropping the "ff" ligature. I wrote to the author of the pdf-writer
library that TeXmacs uses after reading a little about the problem using a
Google search, and learning that there exists something called "tagged pdf"
which I don't know much about yet; I'm too busy to read about it as much as
I'd like to.

I hereby "put the phones together" on this.

---------- Forwarded message ---------
From: Gal Kahana <gal.kah...@hotmail.com>
Date: Tue, Apr 12, 2016 at 1:31 AM
Subject: RE: Pdf-writer, tagged pdf?
To: Karl Hegbloom <karl.hegbl...@gmail.com>


Hi,
PDF-Writer can allow adding tagged pdf content. yes.
However, for the problem at hand i think that the toUnicode map of a font
should suffice, and in fact the library creates it when generating font
representations and using the regular text characters.

Its either that what you are seeing is a bug in creating a unicode map in
case of typographic ligatures usage, or that for some reason it is not
created at all (maybe through and alternative text setup).

We need someone from TeXmacs to help us pinpoint the issue. Do you know
anyone there and can communicate this to them?

Regards,
Gal.

------------------------------
Date: Sun, 10 Apr 2016 15:54:52 -0600
Subject: Pdf-writer, tagged pdf?
From: karl.hegbl...@gmail.com
To: gal.kah...@hotmail.com


I wonder if it supports creation of "tagged pdf"? My interest is due to
your library being used by the TeXmacs editor, and when I export to pdf and
have the pdf reader app perform text to speech (read aloud) it skips
typographic ligatures and thus mispronounces words that contain them. I
think the solution involves emitting tagged pdf that has typographic
ligatures transliterated to their multi character unicode equvalent for the
voice-reader and highlight + clipboard, while still using the ligature for
display and printing.

karl.hegbl...@gmail.com
http://karlhegbloom.blogspot.com
_______________________________________________
Texmacs-dev mailing list
Texmacs-dev@gnu.org
https://lists.gnu.org/mailman/listinfo/texmacs-dev

Reply via email to