Re: [XeTeX] xetex and xelatex bug in MiKTeX 2.9
Am Fri, 8 Jul 2011 15:29:21 +1000 schrieb Vafa Khalighi: Several users has reported that xetex and xelatex included in MikTeX 2.9 (and perhaps MiKTeX 2.8) is buggy. The minimal document is: \documentclass{article} \usepackage{fontspec} \setmainfont[Script=Arabic,Mapping=parsidigits]{XB Zar}% replace it with any other Arabic/Persian font that you have \usepackage{bidi} \setRTL \begin{document} \begin{equation} 1+2=3 \end{equation} \end{document} The reported problem is that equation number does not appear in the PDF. Can anyone confirm this Yes. The equation number is printed with round braces and when a font with Script=Arabic is used round braces has interesting side effects on numbers. \documentclass{article} \font\test=Scheherazade/ICU:script=arab;language=DFLT; \begin{document}\test 123 (123 abc 123 (123abc 345 789) \end{document} gives as output 123 abc 123 (123abc 345 (78 So numbers and the braces are swallowed in some cases. and if yes, would this be fixed in MiKTeX 2.9 (and perhaps in MiKTeX 2.8). If it is only a miktex bug and if the source can be found ... But at first it should be checked that it is not a xetex bug. Miktex uses the version: This is XeTeX, Version 3.1415926-2.3-0.9997.5 (MiKTeX 2.9) -- Ulrike Fischer -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to mix math fonts?
On Thu, Jul 7, 2011 at 10:08 AM, Ulrike Fischer ne...@nililand.de wrote: Am Thu, 7 Jul 2011 12:13:21 +0200 schrieb Mojca Miklavec: Thanks a lot for the answer. Actually, the looping itself is not a problem. I was trying to modify Will Robertson's document (http://mirrors.ctan.org/macros/latex/contrib/unicode-math/unimath-symbols.pdf) I'm still trying to decipher Ulrike's answer (bearing in mind that I don't speak any latex 3 at all). You don't need to much about latex 3, only that you activate the catcodes with \ExplSyntaxOn. The more important knowledge you need is how to define a mathversion. I just tried my code with unimath-symbols. It seems to work fine. I attach the new version. The glyphs of the second font are inserted with \SYMBUF. Attention: if you want to change the definition of command you must do it in two places (in the \mathaccent section it has a different definition). While testing I found out that \Cap and Cup give different results. I would say XITS is wrong: Yes, in a charmap tool like babelmap, U+22D3 DOUBLE UNION and U+22D2 DOUBLE INTERSECTION are confused (standing upside down?), but blame STIXGeneral -- XITS just inherits the mistake. \documentclass{article} \usepackage{unicode-math} \begin{document} \setmathfont{Asana Math} $\Cap \Cup$ \setmathfont{XITS Math} $\Cap \Cup$ \end{document} -- George N. White III aa...@chebucto.ns.ca Head of St. Margarets Bay, Nova Scotia -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
[XeTeX] \hyphenation{} and combining diacritics
I'm creating some hyphenation rules for Jarai texts that I'm interlinearizing. Here's the problem: In various texts, a complex character such as LATIN SMALL LETTER A WITH BREVE might be encoded as a single code point (U+0103) or as a combination of code points (LATIN SMALL LETTER A: U+0061 plus COMBINING BREVE: U+0306). The \hyphenation{} command does not treat the two things as the same, meaning that I have to create two versions of a word if it has one accented character, four versions if it has two accented character, nine versions if it has three, etc. For example: \hyphenation{hơ-nuă hơ-nuă hơ-nuă hơ-nuă} (because O WITH HORN can be two code points or one) Is there a simple way to tell (Xe)LaTeX to treat precomposed and uncomposed characters identically without having to put in all the possibilities? -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] \hyphenation{} and combining diacritics
On Fri, 8 Jul 2011 15:00:42 -0500, Joshua and Amy josh.ruth...@gmail.com wrote: I'm creating some hyphenation rules for Jarai texts that I'm interlinearizing. Here's the problem: In various texts, a complex character such as LATIN SMALL LETTER A WITH BREVE might be encoded as a single code point (U+0103) or as a combination of code points (LATIN SMALL LETTER A: U+0061 plus COMBINING BREVE: U+0306). Can't (shouldn't!) you pass your texts through a Unicode normalization process? Otherwise search on them might not work either, depending on how smart your search tool is. Mike Maxwell -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] \hyphenation{} and combining diacritics
So, I guess I was foolish to hope that Google has figured out how to return results that have non-identical but equivalent strings? I hope it's not too off-topic for this list, but can you point me to any good resources on normalization (is there a straightforward automation for someone who doesn't do scripting? am I supposed to use decomposed characters?)? Thanks. Josh On Fri, Jul 8, 2011 at 3:11 PM, maxwell maxw...@umiacs.umd.edu wrote: On Fri, 8 Jul 2011 15:00:42 -0500, Joshua and Amy josh.ruth...@gmail.com wrote: I'm creating some hyphenation rules for Jarai texts that I'm interlinearizing. Here's the problem: In various texts, a complex character such as LATIN SMALL LETTER A WITH BREVE might be encoded as a single code point (U+0103) or as a combination of code points (LATIN SMALL LETTER A: U+0061 plus COMBINING BREVE: U+0306). Can't (shouldn't!) you pass your texts through a Unicode normalization process? Otherwise search on them might not work either, depending on how smart your search tool is. Mike Maxwell -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] \hyphenation{} and combining diacritics
Unicode normalization was discussed on this list a couple of months ago. Phil Taylor provided a small program to do the job, and other utilities were referred to. There's also a command within XeTeX that normalizes unicode before passing it to TeX's digestion. Try this in your header: % Normalize any residual Unicode combining accents, % and write out error messages, if any: \XeTeXinputnormalization=1 \tracinglostchars=1 \tracingonline=1 Dominik On 8 July 2011 22:50, Joshua and Amy josh.ruth...@gmail.com wrote: So, I guess I was foolish to hope that Google has figured out how to return results that have non-identical but equivalent strings? I hope it's not too off-topic for this list, but can you point me to any good resources on normalization (is there a straightforward automation for someone who doesn't do scripting? am I supposed to use decomposed characters?)? Thanks. Josh On Fri, Jul 8, 2011 at 3:11 PM, maxwell maxw...@umiacs.umd.edu wrote: On Fri, 8 Jul 2011 15:00:42 -0500, Joshua and Amy josh.ruth...@gmail.com wrote: I'm creating some hyphenation rules for Jarai texts that I'm interlinearizing. Here's the problem: In various texts, a complex character such as LATIN SMALL LETTER A WITH BREVE might be encoded as a single code point (U+0103) or as a combination of code points (LATIN SMALL LETTER A: U+0061 plus COMBINING BREVE: U+0306). Can't (shouldn't!) you pass your texts through a Unicode normalization process? Otherwise search on them might not work either, depending on how smart your search tool is. Mike Maxwell -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] \hyphenation{} and combining diacritics
On Fri, 8 Jul 2011 15:50:07 -0500, Joshua and Amy josh.ruth...@gmail.com wrote: So, I guess I was foolish to hope that Google has figured out how to return results that have non-identical but equivalent strings? I'm sure google has figured this out, and some programs to an automatic conversion to composed or decomposed form. But I wouldn't be surprised if some programmer's editors, for example, don't do that (for some purposes, such as search-and-replace, the difference might be important), and maybe some other programs don't either. I hope it's not too off-topic for this list, but can you point me to any good resources on normalization (is there a straightforward automation for someone who doesn't do scripting? am I supposed to use decomposed characters?)? You can use either composed or decomposed characters for most purposes, although as I say some programs do an automatic (and possibly invisible) conversion. There's a general article on this issue here: http://en.wikipedia.org/wiki/Unicode_equivalence I know of library functions in Python that do the conversion; I'm sure they exist in Perl too. But I'm not aware of a general program (like iconv) that does it. (I think there's a hack with iconv that allows it to create decomposed forms, but that is not a bidirectional conversion.) Maybe someone else on this list knows of tools that do that. (What OS are you working on?) Mike Maxwell -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] \hyphenation{} and combining diacritics
This is a better answer than mine, so disregard my noise. But I do have a question below: On Fri, 8 Jul 2011 23:13:41 +0200, Dominik Wujastyk wujas...@gmail.com wrote: ... There's also a command within XeTeX that normalizes unicode before passing it to TeX's digestion. Try this in your header: % Normalize any residual Unicode combining accents, % and write out error messages, if any: \XeTeXinputnormalization=1 \tracinglostchars=1 \tracingonline=1 I found \XeTeXinputnormalization in XeTeX documentation, but I'm not familiar with the other two commands. I guess \tracingonline=1 means to output errors to stdout (or stderr?), but where is the effect of \tracinglostchars described? In particular, what error msgs should I look for if normalization fails or a font lacks a normalized character? Mike Maxwell -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] \hyphenation{} and combining diacritics
On 8 Jul 2011, at 23:24, maxwell wrote: I found \XeTeXinputnormalization in XeTeX documentation, but I'm not familiar with the other two commands. I guess \tracingonline=1 means to output errors to stdout (or stderr?), but where is the effect of \tracinglostchars described? See The TeXbook, p301: There is also \tracinglostchars, which (if positive) causes TeX to record each time a character has been dropped because it does not appear in the current font; ... (And yes, \tracingonline=1 means you'll get those messages in the terminal, not only the .log file. That's also a standard TeX parameter, documented in The TeXbook.) JK -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] \hyphenation{} and combining diacritics
Many, thanks, all, and sorry for missing the earlier discussion. But back to my original question, is there a way to get \hyphenation to require only one form and the rest come for free? On Fri, Jul 8, 2011 at 5:37 PM, Jonathan Kew jfkth...@googlemail.comwrote: On 8 Jul 2011, at 23:24, maxwell wrote: I found \XeTeXinputnormalization in XeTeX documentation, but I'm not familiar with the other two commands. I guess \tracingonline=1 means to output errors to stdout (or stderr?), but where is the effect of \tracinglostchars described? See The TeXbook, p301: There is also \tracinglostchars, which (if positive) causes TeX to record each time a character has been dropped because it does not appear in the current font; ... (And yes, \tracingonline=1 means you'll get those messages in the terminal, not only the .log file. That's also a standard TeX parameter, documented in The TeXbook.) JK -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex