On 2008-06-18 12:39:49 +0200, Santiago Vila wrote:
> Could it be because you are using a locale which is not UTF-8 friendly?
> (the one in the line "Locale:" above).

No, I use both ISO-8859-1 and UTF-8 locales. I did the test in a
uxterm, with:

LANG=POSIX
LC_CTYPE=en_US.UTF-8
LC_NUMERIC="POSIX"
LC_TIME=en_DK
LC_COLLATE=POSIX
LC_MONETARY="POSIX"
LC_MESSAGES="POSIX"
LC_PAPER="POSIX"
LC_NAME="POSIX"
LC_ADDRESS="POSIX"
LC_TELEPHONE="POSIX"
LC_MEASUREMENT="POSIX"
LC_IDENTIFICATION="POSIX"
LC_ALL=

but reported the bug from an xterm (so, using my default ISO-8859-1
locales).

> Does this happen with version 2.8.7 in experimental?

Yes, it occurs with diff 2.8.7-0.2.

The bug also occurs with:

  LC_ALL=en_US.UTF-8 diff -y file1 file2

To summarize, with the same files, here are the number of spaces before
the pipe character in the output, depending on the locales:

Locales    ISO-8859-1    UTF-8
Line 1          2          2       (contents: ab345...)
Line 2          1          3       (contents: àb345... in UTF-8)

The behavior is correct under ISO-8859-1 locales (since 'à' encoded
in UTF-8 takes 2 bytes thus is seen as 2 characters in ISO-8859-1).
However, under UTF-8, the number of spaces should be 2 instead of 3
for line 2.

-- 
Vincent Lefèvre <[EMAIL PROTECTED]> - Web: <http://www.vinc17.org/>
100% accessible validated (X)HTML - Blog: <http://www.vinc17.org/blog/>
Work: CR INRIA - computer arithmetic / Arenaire project (LIP, ENS-Lyon)



--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to