Raymond,

> If I have a Hebrew text displayed in Adobe Acrobat I can select part of
> it and can paste it into Word. The trouble is that while individual
> characters are correctly displayed the order is reversed.

> Thus if I have
> in Acrobat
> קודמ (meaning 'prior')
> when pasted into Word I get
> םדוק

The Windows clipboard is a "multi-channel" medium, i.e. several different data 
formats
may be supplied at the same time by the sending application.
The receiving application may choose one of these formats.

Using a clipboard debugging tool, I see that Word fills up to 18 formats, like
000D  Unicode Text  (10 Bytes)
C090  Rich Text Format  (5815 Bytes)
C10E  HTML Format   (3641 Bytes),
whereas Adobe fills only 6 formats, e.g.
000D  Unicode Text   (11 Bytes)
C090  Rich Text Format (178 Bytes)

In both cases, the Unicode Text format contains the sequence 
U+05E7, U+05D5, U+05D3, U+05DE in logical order.

When "paste" is used in Word, a high level format is preferred by default,
so I suppose the RTF format is the problem here.

Word creates an RTF sequence like
{\ltrch\fcs1 \af220\afs40\alang1033 \rtlch\fcs0   \f220\fs40\lang1037
\langnp1033\langfenp2052\insrsid13502069\charrsid6162033\'f7\'e5\'e3\'ee}}

N.B. \'f7\'e5\'e3\'ee  is the CP1255 byte sequence for the Hebrew word above.

Adobe produces this RTF sequence:
\pard\plain\ql\f0\fs20 {\fs40 \u1511 \'F7\u1493 \'E5\u1491 \'E3\u1502 \'EE}
which is the right character sequence, but seems to be misunderstood by Word.

A solution is to use the Word command "Paste contents ..." (might be necessary 
to add it with "Customize"),
and then choose "unformatted Unicode text" from the format list.

Albrecht.



Reply via email to