In my zotero-texmacs-integration, when I set the CSL style to ISO-690, it
outputs "ISBN" followed by a two-byte character, followed by the actual
ISBN number. I'm taking the text it sends me, which is output via an output
format specification that wraps the text as LaTeX "bbl", and transforming
it into a TeXmacs tree with: (latex->texmacs (parse-latex str_text)).

In the result, I see, in red, <varspace>. When I catch the offending
character in an Emacs terminal buffer, and run it through Guile's
string->list, I get:  (#\302 #\240); and (map char->integer (string->list
" ")) yields (194 160). As hexadecimal, those are 0xC2 0xA0. That is UTF-8
encoded NO-BREAK SPACE.

What this tells me is that TeXmacs is converting the UTF-8 Unicode
character into one of it's internal things, but instead of it being
rendered as an actual non-breakable space, I see the string <varspace> in
red letters. I see the same thing when I paste that character into a
document.

What's the matter? How can I fix this? I think that it's not seeing the
encoding as being UTF-8 when it parses the LaTeX because the heuristics
fail in this case, since it's not a whole document, but only a snippet of
LaTeX. I want to control the encoding or run an explicit UTF-8 to Cork
first.
_______________________________________________
Texmacs-dev mailing list
Texmacs-dev@gnu.org
https://lists.gnu.org/mailman/listinfo/texmacs-dev

Reply via email to