--- Mikhael Goikhman wrote:
> On 18 Dec 2002 18:57:32 -0800, Nadim Shaikli wrote:
> > iso8859-6 is a subset of iso10646-1 -- again, iso8859-6 alone is
> > simply not usable; its visually incorrect without shaping and one
> > is not able to shape sans Form-B glyphs (there are a plethora of
> > posts regarding this topic on the 'net - I can certainly send you
> > the links if you like so as not to go on a tangent on this forum).
> 
> I know the theory well now. But you miss my point. I want everything to
> work, including iso8859-6. You can't deny that there is such charset (and
> encoding) and as you said it is not losing for Arabic. So if a user
> requested iso8859-6 fonts (without non iso8859-6 characters, of course)
> he don't want to see question marks for valid iso8859-6 characters.

I won't argue with that - but in all honesty I don't think anyone will
do that unless its for shear testing/curiosity.

> Don't worry about this, I may later fix one-byte Arabic charsets myself.
> Or not fix, if you are against supporting all existing Arabic fonts. :)

I'm not against it and in theory you are correct - it should work as
well so that fvwm doesn't deviate from "normality".  I'm a bit biased
as a great deal of us on arabeyes.org have been advocating UTF-8/ISO10646
since its solves so many of our problems; do go ahead and fix it, it
shan't affect the proper shaping doing in utf-8.

> > > We supported all iso encodings. I see no valid reason to stop to support
> > > iso8859-6. I think the problem is that once shaping is applied fribidi
> > > (or is it iconv?) can't go back to iso8859-6 and uses question marks
> > > then, so we should only apply shaping for unicode encoding of original
> > > strings.
> > 
> > I don't think its a question of support.  Fvwm is doing the right
> > thing.  I view this as "faulty/missing font" issue.  The font file
> > you were using simply doesn't have the _required_ Form-B glyphs and
> > thus Arabic can't be displayed properly.  Its like wanting to display
> > chinese without having the correct chinese glyphs and getting question
> > marks instead.
> 
> What you say is that all existing CP1256 and iso8859-6 one byte fonts
> should show question marks and never Arabic glyphs that they contain.
> I don't know, it is not hard to fix this situation.

And fixed it should be (I'm recanting my stance in favor of "theoretical
correctness" :-)

> By the way, FVWM supports CP-1256 encoding without problems, as far as
> I can see, when I set CP-1256 encoded title using:
> 
>   env LANG=ar_JO.iso8859-6 date | iconv -f iso8859-6 -t cp1256

Again theoretically - cool :-)  but I sure would not advocate anyone to
use M$'s encodings.

> I even see it correctly shaped (I think) if I use unicode font like:
> 
>   Style Arabic Font StringEncoding=CP1256:*-arabeyes-*/iso10646-1

So the above is saying take my strings (which are CP1256) and convert
them to UTF-8 (since iso10646-1 is noted) and use the noted font to
display them, right ?  If so it makes total sense - the man page should
note that 'StringEncoding' acts as the '-from' for 'iconv'.

> > I can understand the following encodings
> > UTF-8, USC-2, USC-4 and UTF-16, but don't quite understand a setting
> > akin to 'StringEncoding=iso8859-6' (unless fvwm is mapping names to
> > encodings which is what I thought it did - "convenience magic").
> 
> If you have text stored in some encoding (iso8859-6 or cp1256), you may
> find it useful to be able to convert it to something else, like utf-8
> to use with unicode fonts. FVWM allows this using StringEncoding.

OK, cool - then what I note above ("acts as the '-from'") is correct.

> Out of curiosity, do you have Arabic text files? Are they all in one or
> another unicode encoding? I read literature in several languages, but I
> should yet encounter utf-8 text. If there are one byte encodings and there
> is only one language (except for English) unicode is a waste of space. :)

Everything that I have is in UTF-8 (diskspace/memory is cheap); I'd
rather be consistent and correct :-)  If you'd like some Arabic UTF-8
samples, let me know.

> Of course, to see one byte encoded text, you should replace "set
> encoding=utf-8" in your .vimrc that I know you have.

:-) and the various emacs settings,

 (setq locale-coding-system   'utf-8)
 (set-terminal-coding-system  'utf-8)
 (set-keyboard-coding-system  'utf-8)
 (set-selection-coding-system 'utf-8)
 (prefer-coding-system        'utf-8)


Thanks & Regards,

 - Nadim (Thanks for CC'ing)


__________________________________________________
Do you Yahoo!?
Yahoo! Mail Plus - Powerful. Affordable. Sign up now.
http://mailplus.yahoo.com
--
Visit the official FVWM web page at <URL:http://www.fvwm.org/>.
To unsubscribe from the list, send "unsubscribe fvwm-workers" in the
body of a message to [EMAIL PROTECTED]
To report problems, send mail to [EMAIL PROTECTED]

Reply via email to