RE: Word reversal from Abobe to Word

Dreiheller, Albrecht Thu, 07 Feb 2013 07:42:51 -0800

Raymond,

> If I have a Hebrew text displayed in Adobe Acrobat I can select part of
> it and can paste it into Word. The trouble is that while individual
> characters are correctly displayed the order is reversed.


> Thus if I have
> in Acrobat
> קודמ (meaning 'prior')
> when pasted into Word I get
> םדוק

The Windows clipboard is a "multi-channel" medium, i.e. several different data 
formats
may be supplied at the same time by the sending application.
The receiving application may choose one of these formats.

Using a clipboard debugging tool, I see that Word fills up to 18 formats, like
000D  Unicode Text  (10 Bytes)
C090  Rich Text Format  (5815 Bytes)
C10E  HTML Format   (3641 Bytes),
whereas Adobe fills only 6 formats, e.g.
000D  Unicode Text   (11 Bytes)
C090  Rich Text Format (178 Bytes)

In both cases, the Unicode Text format contains the sequence 
U+05E7, U+05D5, U+05D3, U+05DE in logical order.

When "paste" is used in Word, a high level format is preferred by default,
so I suppose the RTF format is the problem here.

Word creates an RTF sequence like
{\ltrch\fcs1 \af220\afs40\alang1033 \rtlch\fcs0   \f220\fs40\lang1037
\langnp1033\langfenp2052\insrsid13502069\charrsid6162033\'f7\'e5\'e3\'ee}}

N.B. \'f7\'e5\'e3\'ee  is the CP1255 byte sequence for the Hebrew word above.

Adobe produces this RTF sequence:
\pard\plain\ql\f0\fs20 {\fs40 \u1511 \'F7\u1493 \'E5\u1491 \'E3\u1502 \'EE}
which is the right character sequence, but seems to be misunderstood by Word.

A solution is to use the Word command "Paste contents ..." (might be necessary 
to add it with "Customize"),
and then choose "unformatted Unicode text" from the format list.

Albrecht.

RE: Word reversal from Abobe to Word

Reply via email to