Re: Web Form: Old Russian charcaters

2003-02-07 Thread Vladimir Ivanov
Please, have a look at the characters in the range 0460 - 0486 in the font
Arial Unicode MS shipped with Office XP. May be those are the characters you
are looking for.

 Can anyone on the Unicode list help?
 Thanks,
 Magda


  -Original Message-
 
  Date/Time:Thu Jan 16 13:11:06 EST 2003
  Contact:  [EMAIL PROTECTED]
  Report Type:  Other Question, Problem, or Feedback
 
  I am looking for a way to use old Russian charcaters that are
  no longer used in modern Russian langauage.
 
  Can you help or provide a direction in which to look?





Re: Plane 14 Tag Deprecation Issue (was Re: VS vs. P14 (was Re: Indic Devanagari Query))

2003-02-07 Thread Asmus Freytag
At 11:54 AM 2/6/03 -0800, Kenneth Whistler wrote:

My personal opinion? The whole debate about deprecation of
language tag characters is a frivolous distraction from
other technical matters of greater import, and things would
be just fine with the current state of the documentation.
But, if formal deprecation by the UTC is what it would take
to get people to stop advocating more use of the language
tags after the UTC has long determined that their use is
strongly discouraged, then so be it.


My personal opinion is that labelling them as restricted for
use with protocols requiring their use is sufficient and proper.
In the context of such protocols, the use of tag characters is
a fine mechanism. They certainly have some advantages over
ASCII-style markup (e.g. lang=...) in many situations.

Where they don't have a place is in regular 'plain' text streams.

Formal deprecation would imply to me that ANY use is discouraged,
including the use with protocols that wish to make use of them.
THAT seems to be going too far in this case.

Where we have deprecated format characters in the past it has been
precisely in situations where we wanted to discourage the use of
particular 'protocols', for example for shaping and national digit
selection.

A./




Unicode and Encoding Problems in Browsers

2003-02-07 Thread Muhammad Asif
Hi,
I actually want to enter some Arabic text in simple HTML text box . I set 
language in my Windows XP settings to Arabic. When i tried to type, there 
are certain characters that are not displayed. Instead rectangles are 
displayed. Characters are from Unicode BMP which is supposed to be 
supported by browsers. I am using IE 6.0

Also default encoding of browser is UTF-8, but if i change the encoding to 
default windows Western European it works fine and every character is got 
displayed. Problem in this is when form is submitted you did not get 
Unicode characters but their entity references in HTML.

How can i make browser work to display unicode characters with UTF-8 
encoding. So when form is submitted i get Unicode data to store in data base.

Thanks a lot for your time.

Asif 




Re: VS vs. P14 (was Re: Indic Devanagari Query)

2003-02-07 Thread Andrew C. West
John H. Jenkins wrote:

 Ah, but decorative motifs are not plain text.

Ah, but it could be.




Re: Plane 14 Tag Deprecation Issue (was Re: VS vs. P14 (was Re: Indic Devanagari Query))

2003-02-07 Thread William Overington
I feel that as the matter was put forward for Public Review then it is
reasonable for someone reading of that review to respond to the review on
the basis of what is stated as the issue in the Public Review item itself.

Kenneth Whistler now states an opinion as to what the review is about and
mentions a file PropList.txt of which I was previously unaware.

Recent discussions in the later part of 2002 in this forum about the
possibilities of using language tags only started as a direct result of the
Unicode Consortium instituting the Public Review.

The recent statement by Asmus Freytag seems fine to me.  Certainly I might
be inclined to add in a little so as to produce Plane 14 tags are reserved
for use with particular protocols requiring, or providing facilities for,
their use so that the possibility of using them to add facilities rather
than simply using them when obligated to do so is included, but that is not
a great issue: what Asmus wrote is fine.

Public Review is, in my opinion, a valuable innovation.  Two issues have so
far been resolved using the Public Review process.  Those results do seem to
indicate the value of seeking opinions by Public Review.

As I have mentioned before I have a particular interest in the use of
Unicode in relation to the implementation of my telesoftware invention using
the DVB-MHP (Digital Video Broadcasting - Multimedia Home Platform) system.
I feel that language tags may potentially be very useful for broadcasts of
multimedia packages which include Unicode text files, by direct broadcast
satellites across whole continents.  Someone on this list, I forget who, but
I am grateful for the comment, mentioned that even if formal deprecation
goes ahead then that does not stop the language tags being used as once an
item is in Unicode it is always there.  So fine, though it would be nice if
the Unicode Specification did allow for such possibilities within its
wording.  The wording stated by Asmus Freytag pleases me, as it seems a
good, well-rounded balance between avoiding causing people who make many
widely used packages needing to include software to process language tags,
whilst still formally recognizing the opportunity for language tags to be
used to advantage in appropriate special circumstances.  I feel that that is
a magnificent compromise wording which will hopefully be widely applauded.

In using Unicode on the DVB-MHP platform I am thinking of using Unicode
characters in a file and the file being processed by a Java program which
has been broadcast.  The file PropList.txt just does not enter into it for
this usage, so it is not a problem for me as to what is in that file.  My
thinking is that many, maybe most, multimedia packages being broadcast will
not use language tags and will have no facilities for decoding them.
However, I feel that it is important to keep open the possibility that some
such packages can use language tags provided that the programs which handle
them are appropriately programmed.  There will need to be a protocol.
Hopefully a protocol already available in general internationalization and
globalization work can be used directly.  If not, hopefully a special
Panplanet protocol can be devised specifically for DVB-MHP broadcasting.

On the matter of using Unicode on the DVB-MHP platform, readers might like
to have a look at the following about the U+FFFC character.

http://www.users.globalnet.co.uk/~ngo/ast03200.htm

Readers who are interested in uses of the Private Use Area might like to
have a look at the following.  They are particularly oriented towards the
DVB-MHP platform but do have wider applications both on the web and in
computing generally.

http://www.users.globalnet.co.uk/~ngo/ast03000.htm

http://www.users.globalnet.co.uk/~ngo/ast03100.htm

http://www.users.globalnet.co.uk/~ngo/ast03300.htm

The main index page of the webspace is as follows.

http://www.users.globalnet.co.uk/~ngo

William Overington

7 February 2003



















Re: Unicode and Encoding Problems in Browsers

2003-02-07 Thread Otto Stolz
Muhammad Asif wrote:


When i tried to type, there are certain characters that are not displayed.



Instead rectangles are displayed.



This is the typical behaviour if the font in use does not comprise
a particular character.

Either, there is no suitable font installed on your system,
or the WWW page to be displayed requires a particular, yet
unsuitable font.

Cf. http://www.alanwood.net/unicode/ for more info on fonts
and browsers.

Best wishes,
  Otto Stolz





Re: Unicode and Encoding Problems in Browsers

2003-02-07 Thread Shlomi Tal
I'd like to mention that this problem which Muhammad Asif brings forth is an 
extant one in my circle of work. I work as PC technician, and one complaint 
I often get in tech support calls is that the user is unable to type Hebrew 
in the Search box in the MSN Israel website (msn.co.il) under Windows XP. At 
the first time, I told the user to set the Language for Non-Unicode 
Programs (known as the System Locale in Windows 2000, which sets the 
emulated ANSI codepage), but it didn't help: the user still complained of 
seeing boxes instead of proper Hebrew letters. The encoding of MSN.co.il is 
Hebrew (Windows).

It doesn't happen under all machines. Mine at home runs XP too, but I don't 
have that problem. I suspect it's not related to Unicode/encodings stuff at 
all. The fact that it appears only under XP (and not 2000 or 98, for 
instance) leads me to believe it may have something to do with the Java VM 
(which is by default lacking in XP and updates browser components when 
installed).

I hope that is of some enlightenment.

ST

_
MSN 8 helps eliminate e-mail viruses. Get 2 months FREE*. 
http://join.msn.com/?page=features/virus




Handwritten EURO sign

2003-02-07 Thread Marion Gunn
I wonder if any Unicoders have seen the handwritten EURO sign which differs
substantially from the usual computer-generated kind?

The one on the banknotes (lefthanded Crescent Moon with double bar) is
quite unlike one used around here (rounded reversed digit THREE with double
bar).

Any ideas? Reminded to post this by seeing the latter in very large
handwriting yesterday, at the pay-in desk to popular football grounds close
to where I live.
mg



--
Marion Gunn * EGT (Estab.1991) * http://www.egt.ie *
fiosruithe/enquiries: [EMAIL PROTECTED] * [EMAIL PROTECTED] *






Re: Handwritten EURO sign

2003-02-07 Thread Michael Everson
At 14:42 + 2003-02-07, Marion Gunn wrote:

I wonder if any Unicoders have seen the handwritten EURO sign which differs
substantially from the usual computer-generated kind?


I have seen a C with an equals sign inside it not touching the C.


The one on the banknotes (lefthanded Crescent Moon with double bar) is
quite unlike one used around here (rounded reversed digit THREE with double
bar).


That sounds like a script capital E with a double bar. You may find 
the discussion at 
http://www.evertype.com/standards/euro/euroglyph.html of interest.
--
Michael Everson * * Everson Typography *  * http://www.evertype.com



RE: Handwritten EURO sign

2003-02-07 Thread Marco Cimarosti
Marion Gunn wrote:
 I wonder if any Unicoders have seen the handwritten EURO sign 
 which differs substantially from the usual computer-generated
 kind?

In Italy, it is becoming common to see a sort of left parenthesis crossed by
a small Z.

Notice that this is very similar to a common handwritten forms of the lira
symbol (£): a vertical line crossed by a small Z.

_ Marco




Re: Handwritten EURO sign

2003-02-07 Thread Eric Muller
The latest issue of Baseline (www.baselinemagazine.com) has an article 
on the Euro. I did not read it, so I don't know if it speaks of 
handwritten forms.

Sign of the times: the euro currency symbol  by Conor Mangat.

Eric.





Re: Plane 14 Tag Deprecation Issue (was Re: VS vs. P14 (was Re: IndicDevanagari Query))

2003-02-07 Thread Markus Scherer
William Overington wrote:

Kenneth Whistler now states an opinion as to what the review is about and
mentions a file PropList.txt of which I was previously unaware.


Kenneth Whistler referred to a file that is part of the publicicly and freely provided Unicode 
Character Database, showing various important code point property assignments. I believe that he 
assumed that reviewers of public issues would be familiar with, or else familiarize themselves 
sufficiently with, the Unicode Standard to arrive at an informed opinion.

...
As I have mentioned before I have a particular interest in the use of
Unicode in relation to the implementation of my telesoftware invention using
the DVB-MHP (Digital Video Broadcasting - Multimedia Home Platform) system.
I feel that language tags may potentially be very useful for broadcasts of
multimedia packages which include Unicode text files, by direct broadcast
satellites across whole continents. ...


Language tags are likely to be useful for any multimedia system. I would expect there to be 
sufficient bandwidth in a DVB system to carry either out-of-band (meta) data for plain text, or else 
to send richtext of some kind (e.g., XHTML).

markus

--
Opinions expressed here may not reflect my company's positions unless otherwise noted.




Re: CJK test data

2003-02-07 Thread Markus Scherer
Michael (michka) Kaplan wrote:

GB18030 does not define a specific standard for sorting (as far as I know, neither does GB13000). It
is an encoding standard.


GB 18030 certainly does not define sorting. It defines a CCS/CES based on a mapping table to/from 
Unicode/ISO 10646.

GB 13000 is, as far as I know, just the Chinese adoption of ISO 10646. As such, it is likely to also 
not define sorting because the relevant ISO standard is 14651 (=UCA).

For general test data for determining support of GB 18030 I suggest to contact the Chinese 
government and its standards agency. They have defined a certification procedure, and I assume that 
the data and procedure are available. I have no direct contacts for this myself.

markus

--
Opinions expressed here may not reflect my company's positions unless otherwise noted.




RE: CJK test data

2003-02-07 Thread Erik.Ostermueller
Markus wrote:
   For general test data for determining support of GB 
   18030 I suggest to contact the Chinese 
   government and its standards agency. They have defined 
   a certification procedure, and I assume that 
   the data and procedure are available. I have no direct 
   contacts for this myself.

Here is contact info from an 18030 article by Tom Emerson.
http://lisa.org/archive_domain/newsletters/2002/2.3/emerson.html
Hmmm.  No url.  No email address.  This will be interesting.

Standard Conformity Testing Center for Information Products 
#1 Andingmen Dong Da Jie 
Beijing, China 
Tel: 84029573 or 84029792 
Fax: 64007681




Re: VS vs. P14 (was Re: Indic Devanagari Query)

2003-02-07 Thread Asmus Freytag
At 01:52 AM 2/7/03 -0800, Andrew C. West wrote:

 Ah, but decorative motifs are not plain text.

Ah, but it could be.


Ah, but it wouldn't be Unicode.

A(h)./