Hi Christian,

> Just noticing that this grows quite large. If someone finds this
> unbearable for this list, please speak up to let me know I should cut down
> the off-topic stuff on my public mails!

No problem :-) I would hope that people talk more about the
"big font" approaches - Having either a big Unicode font in
XMS or maybe a 512 char double code page in the VGA card...

Combined with, for example, a UTF-8 enabled Super-NANSI to
make the step from strings to their display, of course. The
problem would be loss of "ASCII" art block graphics in apps
which are not using Unicode. A possible workaround would be
dosver-style, to make a per-app decision who uses Unicode.



I do not think that you could trust the data for this. Even
on Linux where Unicode is quite common now, usage of BOM is
rare. People try to keep their set of apps consistent to use
either UTF-8 everywhere or Latin1 everywhere or (preferred)
use whichever the LANG etc environment variables select at
the moment when the app starts. Given that DOS has many old
unmaintained apps, you will have to accept mixing in DOS:

Some old apps will only use ASCII anyway which is the same
for real ASCII and for UTF8 but some others will assume a
codepage (often 437) to be active. The block graphics and
other chars from the non-ASCII half of any codepage differ
in encoding from UTF8 so, as said, any display or similar
driver would need some way to switch between "classic code
page mode" and "UTF8 rendering mode". It could switch on
UTF8 based on explicit request from a modern app or based
on app name for old but known compatible apps... It would
switch off UTF8 when any app exits (int 21.4c / 21.31...).

Compatible apps would be apps which only display ASCII out
of themselves and which make no serious assumptions about
one byte being equal to one character. A good example are
MORE and TYPE: If you TYPE an UTF8 text with a special CON
driver which expects and renders UTF8, it will simply work
because TYPE passes the text file 1:1 and only uses plain
ASCII for built-in messages, if any. A good counter example
are PG and EDIT: They make the byte-is-character assumption
for scrolling (in particular horizontal scrolling) and EDIT
uses block graphics chars of codepages. So you have to put
your CON driver in NON-Unicode mode while using EDIT or PG.



As a general question - I would really like to know for
WHICH APPS people want to have Unicode support. Is this
only about proper display of playlists in MPXPLAY and of
CD, USB or local accented filenames in any file manager?

Is the issue also in general command.com style activity,
probably depending on DOSLFN being present? If yes, I do
assume that the LFN API already is explicit about whether
UTF8 or rather codepage style encoding should be used?

Are text editors also a case which should support Unicode
and if yes, why do you not use for example Blocek then?

Or is the idea to have "Unicode everywhere", even in the
PrintScreen hotkey, TREE, Undelete, the volume label for
SYS / FORMAT / VOL / LABEL, tools like FIND or DEBUG...?

Eric


------------------------------------------------------------------------------
Increase Visibility of Your 3D Game App & Earn a Chance To Win $500!
Tap into the largest installed PC base & get more eyes on your game by
optimizing for Intel(R) Graphics Technology. Get started today with the
Intel(R) Software Partner Program. Five $500 cash prizes are up for grabs.
http://p.sf.net/sfu/intelisp-dev2dev
_______________________________________________
Freedos-devel mailing list
Freedos-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/freedos-devel

Reply via email to