Re: [XeTeX] xetex and xelatex bug in MiKTeX 2.9

2011-07-08 Thread Ulrike Fischer
Am Fri, 8 Jul 2011 15:29:21 +1000 schrieb Vafa Khalighi:

 Several users has reported that xetex and xelatex included in MikTeX 2.9
 (and perhaps MiKTeX 2.8) is buggy. The minimal document is:
 
 \documentclass{article}
 \usepackage{fontspec}
 \setmainfont[Script=Arabic,Mapping=parsidigits]{XB Zar}% replace it with any
 other Arabic/Persian font that you have
 \usepackage{bidi}
 \setRTL
 \begin{document}
 \begin{equation}
 1+2=3
 \end{equation}
 \end{document}
 
 The reported problem is that equation number does not appear in the PDF. Can
 anyone confirm this 

Yes. The equation number is printed with round braces and when a
font with Script=Arabic is used round braces has interesting side
effects on numbers. 

\documentclass{article}
\font\test=Scheherazade/ICU:script=arab;language=DFLT;
\begin{document}\test
123 (123 abc 

123 (123abc

345  789)
\end{document}


gives as output

123 abc

123 (123abc

345 (78

So numbers and the braces are swallowed in some cases.

 and if yes, would this be fixed in MiKTeX 2.9 (and perhaps in MiKTeX 2.8).

If it is only a miktex bug and if the source can be found ... But at
first it should be checked that it is not a xetex bug. 
Miktex uses the version:
This is XeTeX, Version 3.1415926-2.3-0.9997.5 (MiKTeX 2.9)

-- 
Ulrike Fischer 



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] How to mix math fonts?

2011-07-08 Thread George N. White III
On Thu, Jul 7, 2011 at 10:08 AM, Ulrike Fischer ne...@nililand.de wrote:

 Am Thu, 7 Jul 2011 12:13:21 +0200 schrieb Mojca Miklavec:


 Thanks a lot for the answer. Actually, the looping itself is not a
 problem. I was trying to modify Will Robertson's document
 (http://mirrors.ctan.org/macros/latex/contrib/unicode-math/unimath-symbols.pdf)

 I'm still trying to decipher Ulrike's answer (bearing in mind that I
 don't speak any latex 3 at all).

 You don't need to much about latex 3, only that you activate the
 catcodes with \ExplSyntaxOn. The more important knowledge you need
 is how to define a mathversion. I just tried my code with
 unimath-symbols. It seems to work fine. I attach the new version.

 The glyphs of the second font are inserted with \SYMBUF. Attention:
 if you want to change the definition of command you must do it in
 two places (in the \mathaccent section it has a different
 definition).

 While testing I found out that \Cap and Cup give different results.
 I would say XITS is wrong:

Yes, in a charmap tool like babelmap, U+22D3 DOUBLE UNION and
U+22D2 DOUBLE INTERSECTION are confused (standing upside down?),
but blame STIXGeneral -- XITS just inherits the mistake.

 \documentclass{article}
 \usepackage{unicode-math}

 \begin{document}
 \setmathfont{Asana Math}
 $\Cap \Cup$
 \setmathfont{XITS Math}
 $\Cap \Cup$
 \end{document}


-- 
George N. White III aa...@chebucto.ns.ca
Head of St. Margarets Bay, Nova Scotia


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


[XeTeX] \hyphenation{} and combining diacritics

2011-07-08 Thread Joshua and Amy
I'm creating some hyphenation rules for Jarai texts that I'm
interlinearizing. Here's the problem: In various texts, a complex character
such as LATIN SMALL LETTER A WITH BREVE might be encoded as a single code
point (U+0103) or as a combination of code points (LATIN SMALL LETTER A:
U+0061 plus COMBINING BREVE: U+0306). The \hyphenation{} command does not
treat the two things as the same, meaning that I have to create two versions
of a word if it has one accented character, four versions if it has two
accented character, nine versions if it has three, etc. For example:

\hyphenation{hơ-nuă hơ-nuă hơ-nuă hơ-nuă}

(because O WITH HORN can be two code points or one)

Is there a simple way to tell (Xe)LaTeX to treat precomposed and uncomposed
characters identically without having to put in all the possibilities?


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] \hyphenation{} and combining diacritics

2011-07-08 Thread maxwell
On Fri, 8 Jul 2011 15:00:42 -0500, Joshua and Amy josh.ruth...@gmail.com
wrote:
 I'm creating some hyphenation rules for Jarai texts that I'm
 interlinearizing. Here's the problem: In various texts, a complex
character
 such as LATIN SMALL LETTER A WITH BREVE might be encoded as a single
code
 point (U+0103) or as a combination of code points (LATIN SMALL LETTER A:
 U+0061 plus COMBINING BREVE: U+0306). 

Can't (shouldn't!) you pass your texts through a Unicode normalization
process?  Otherwise search on them might not work either, depending on how
smart your search tool is.

   Mike Maxwell


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] \hyphenation{} and combining diacritics

2011-07-08 Thread Joshua and Amy
So, I guess I was foolish to hope that Google has figured out how to return
results that have non-identical but equivalent strings?

I hope it's not too off-topic for this list, but can you point me to any
good resources on normalization (is there a straightforward automation for
someone who doesn't do scripting? am I supposed to use decomposed
characters?)?

Thanks.

Josh

On Fri, Jul 8, 2011 at 3:11 PM, maxwell maxw...@umiacs.umd.edu wrote:

 On Fri, 8 Jul 2011 15:00:42 -0500, Joshua and Amy josh.ruth...@gmail.com
 wrote:
  I'm creating some hyphenation rules for Jarai texts that I'm
  interlinearizing. Here's the problem: In various texts, a complex
 character
  such as LATIN SMALL LETTER A WITH BREVE might be encoded as a single
 code
  point (U+0103) or as a combination of code points (LATIN SMALL LETTER A:
  U+0061 plus COMBINING BREVE: U+0306).

 Can't (shouldn't!) you pass your texts through a Unicode normalization
 process?  Otherwise search on them might not work either, depending on how
 smart your search tool is.

   Mike Maxwell


 --
 Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] \hyphenation{} and combining diacritics

2011-07-08 Thread Dominik Wujastyk
Unicode normalization was discussed on this list a couple of months ago.
Phil Taylor provided a small program to do the job, and other utilities were
referred to.  There's also a command within XeTeX  that normalizes unicode
before passing it to TeX's digestion.  Try this in your header:

% Normalize any residual Unicode combining accents,
% and write out error messages, if any:
\XeTeXinputnormalization=1
\tracinglostchars=1
\tracingonline=1

Dominik


On 8 July 2011 22:50, Joshua and Amy josh.ruth...@gmail.com wrote:

 So, I guess I was foolish to hope that Google has figured out how to return
 results that have non-identical but equivalent strings?

 I hope it's not too off-topic for this list, but can you point me to any
 good resources on normalization (is there a straightforward automation for
 someone who doesn't do scripting? am I supposed to use decomposed
 characters?)?

 Thanks.

 Josh


 On Fri, Jul 8, 2011 at 3:11 PM, maxwell maxw...@umiacs.umd.edu wrote:

 On Fri, 8 Jul 2011 15:00:42 -0500, Joshua and Amy josh.ruth...@gmail.com
 
 wrote:
  I'm creating some hyphenation rules for Jarai texts that I'm
  interlinearizing. Here's the problem: In various texts, a complex
 character
  such as LATIN SMALL LETTER A WITH BREVE might be encoded as a single
 code
  point (U+0103) or as a combination of code points (LATIN SMALL LETTER A:
  U+0061 plus COMBINING BREVE: U+0306).

 Can't (shouldn't!) you pass your texts through a Unicode normalization
 process?  Otherwise search on them might not work either, depending on how
 smart your search tool is.

   Mike Maxwell


 --
 Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex





 --
 Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] \hyphenation{} and combining diacritics

2011-07-08 Thread maxwell
On Fri, 8 Jul 2011 15:50:07 -0500, Joshua and Amy josh.ruth...@gmail.com
wrote:
 So, I guess I was foolish to hope that Google has figured out how to
return
 results that have non-identical but equivalent strings?

I'm sure google has figured this out, and some programs to an automatic
conversion to composed or decomposed form.  But I wouldn't be surprised if
some programmer's editors, for example, don't do that (for some purposes,
such as search-and-replace, the difference might be important), and maybe
some other programs don't either.

 I hope it's not too off-topic for this list, but can you point me to any
 good resources on normalization (is there a straightforward automation
for
 someone who doesn't do scripting? am I supposed to use decomposed
 characters?)?

You can use either composed or decomposed characters for most purposes,
although as I say some programs do an automatic (and possibly invisible)
conversion.  

There's a general article on this issue here:
   http://en.wikipedia.org/wiki/Unicode_equivalence
I know of library functions in Python that do the conversion; I'm sure
they exist in Perl too.  But I'm not aware of a general program (like
iconv) that does it.  (I think there's a hack with iconv that allows it to
create decomposed forms, but that is not a bidirectional conversion.) 
Maybe someone else on this list knows of tools that do that.  (What OS are
you working on?)

   Mike Maxwell


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] \hyphenation{} and combining diacritics

2011-07-08 Thread maxwell
This is a better answer than mine, so disregard my noise.  But I do have a
question below:

On Fri, 8 Jul 2011 23:13:41 +0200, Dominik Wujastyk wujas...@gmail.com
wrote:
 ...
 There's also a command within XeTeX  that normalizes unicode
 before passing it to TeX's digestion.  Try this in your header:
 
 % Normalize any residual Unicode combining accents,
 % and write out error messages, if any:
 \XeTeXinputnormalization=1
 \tracinglostchars=1
 \tracingonline=1

I found \XeTeXinputnormalization in XeTeX documentation, but I'm not
familiar with the other two commands.  I guess \tracingonline=1 means to
output errors to stdout (or stderr?), but where is the effect of
\tracinglostchars described?  In particular, what error msgs should I look
for if normalization fails or a font lacks a normalized character?

   Mike Maxwell


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] \hyphenation{} and combining diacritics

2011-07-08 Thread Jonathan Kew
On 8 Jul 2011, at 23:24, maxwell wrote:

 I found \XeTeXinputnormalization in XeTeX documentation, but I'm not
 familiar with the other two commands.  I guess \tracingonline=1 means to
 output errors to stdout (or stderr?), but where is the effect of
 \tracinglostchars described?

See The TeXbook, p301: There is also \tracinglostchars, which (if positive) 
causes TeX to record each time a character has been dropped because it does not 
appear in the current font; ...

(And yes, \tracingonline=1 means you'll get those messages in the terminal, not 
only the .log file. That's also a standard TeX parameter, documented in The 
TeXbook.)

JK




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] \hyphenation{} and combining diacritics

2011-07-08 Thread Joshua and Amy
Many, thanks, all, and sorry for missing the earlier discussion.

But back to my original question, is there a way to get \hyphenation to
require only one form and the rest come for free?

On Fri, Jul 8, 2011 at 5:37 PM, Jonathan Kew jfkth...@googlemail.comwrote:

 On 8 Jul 2011, at 23:24, maxwell wrote:

  I found \XeTeXinputnormalization in XeTeX documentation, but I'm not
  familiar with the other two commands.  I guess \tracingonline=1 means to
  output errors to stdout (or stderr?), but where is the effect of
  \tracinglostchars described?

 See The TeXbook, p301: There is also \tracinglostchars, which (if
 positive) causes TeX to record each time a character has been dropped
 because it does not appear in the current font; ...

 (And yes, \tracingonline=1 means you'll get those messages in the terminal,
 not only the .log file. That's also a standard TeX parameter, documented in
 The TeXbook.)

 JK




 --
 Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex