Re: Web Form: Old Russian charcaters
Please, have a look at the characters in the range 0460 - 0486 in the font Arial Unicode MS shipped with Office XP. May be those are the characters you are looking for. Can anyone on the Unicode list help? Thanks, Magda -Original Message- Date/Time:Thu Jan 16 13:11:06 EST 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback I am looking for a way to use old Russian charcaters that are no longer used in modern Russian langauage. Can you help or provide a direction in which to look?
Re: Plane 14 Tag Deprecation Issue (was Re: VS vs. P14 (was Re: Indic Devanagari Query))
At 11:54 AM 2/6/03 -0800, Kenneth Whistler wrote: My personal opinion? The whole debate about deprecation of language tag characters is a frivolous distraction from other technical matters of greater import, and things would be just fine with the current state of the documentation. But, if formal deprecation by the UTC is what it would take to get people to stop advocating more use of the language tags after the UTC has long determined that their use is strongly discouraged, then so be it. My personal opinion is that labelling them as restricted for use with protocols requiring their use is sufficient and proper. In the context of such protocols, the use of tag characters is a fine mechanism. They certainly have some advantages over ASCII-style markup (e.g. lang=...) in many situations. Where they don't have a place is in regular 'plain' text streams. Formal deprecation would imply to me that ANY use is discouraged, including the use with protocols that wish to make use of them. THAT seems to be going too far in this case. Where we have deprecated format characters in the past it has been precisely in situations where we wanted to discourage the use of particular 'protocols', for example for shaping and national digit selection. A./
Unicode and Encoding Problems in Browsers
Hi, I actually want to enter some Arabic text in simple HTML text box . I set language in my Windows XP settings to Arabic. When i tried to type, there are certain characters that are not displayed. Instead rectangles are displayed. Characters are from Unicode BMP which is supposed to be supported by browsers. I am using IE 6.0 Also default encoding of browser is UTF-8, but if i change the encoding to default windows Western European it works fine and every character is got displayed. Problem in this is when form is submitted you did not get Unicode characters but their entity references in HTML. How can i make browser work to display unicode characters with UTF-8 encoding. So when form is submitted i get Unicode data to store in data base. Thanks a lot for your time. Asif
Re: VS vs. P14 (was Re: Indic Devanagari Query)
John H. Jenkins wrote: Ah, but decorative motifs are not plain text. Ah, but it could be.
Re: Plane 14 Tag Deprecation Issue (was Re: VS vs. P14 (was Re: Indic Devanagari Query))
I feel that as the matter was put forward for Public Review then it is reasonable for someone reading of that review to respond to the review on the basis of what is stated as the issue in the Public Review item itself. Kenneth Whistler now states an opinion as to what the review is about and mentions a file PropList.txt of which I was previously unaware. Recent discussions in the later part of 2002 in this forum about the possibilities of using language tags only started as a direct result of the Unicode Consortium instituting the Public Review. The recent statement by Asmus Freytag seems fine to me. Certainly I might be inclined to add in a little so as to produce Plane 14 tags are reserved for use with particular protocols requiring, or providing facilities for, their use so that the possibility of using them to add facilities rather than simply using them when obligated to do so is included, but that is not a great issue: what Asmus wrote is fine. Public Review is, in my opinion, a valuable innovation. Two issues have so far been resolved using the Public Review process. Those results do seem to indicate the value of seeking opinions by Public Review. As I have mentioned before I have a particular interest in the use of Unicode in relation to the implementation of my telesoftware invention using the DVB-MHP (Digital Video Broadcasting - Multimedia Home Platform) system. I feel that language tags may potentially be very useful for broadcasts of multimedia packages which include Unicode text files, by direct broadcast satellites across whole continents. Someone on this list, I forget who, but I am grateful for the comment, mentioned that even if formal deprecation goes ahead then that does not stop the language tags being used as once an item is in Unicode it is always there. So fine, though it would be nice if the Unicode Specification did allow for such possibilities within its wording. The wording stated by Asmus Freytag pleases me, as it seems a good, well-rounded balance between avoiding causing people who make many widely used packages needing to include software to process language tags, whilst still formally recognizing the opportunity for language tags to be used to advantage in appropriate special circumstances. I feel that that is a magnificent compromise wording which will hopefully be widely applauded. In using Unicode on the DVB-MHP platform I am thinking of using Unicode characters in a file and the file being processed by a Java program which has been broadcast. The file PropList.txt just does not enter into it for this usage, so it is not a problem for me as to what is in that file. My thinking is that many, maybe most, multimedia packages being broadcast will not use language tags and will have no facilities for decoding them. However, I feel that it is important to keep open the possibility that some such packages can use language tags provided that the programs which handle them are appropriately programmed. There will need to be a protocol. Hopefully a protocol already available in general internationalization and globalization work can be used directly. If not, hopefully a special Panplanet protocol can be devised specifically for DVB-MHP broadcasting. On the matter of using Unicode on the DVB-MHP platform, readers might like to have a look at the following about the U+FFFC character. http://www.users.globalnet.co.uk/~ngo/ast03200.htm Readers who are interested in uses of the Private Use Area might like to have a look at the following. They are particularly oriented towards the DVB-MHP platform but do have wider applications both on the web and in computing generally. http://www.users.globalnet.co.uk/~ngo/ast03000.htm http://www.users.globalnet.co.uk/~ngo/ast03100.htm http://www.users.globalnet.co.uk/~ngo/ast03300.htm The main index page of the webspace is as follows. http://www.users.globalnet.co.uk/~ngo William Overington 7 February 2003
Re: Unicode and Encoding Problems in Browsers
Muhammad Asif wrote: When i tried to type, there are certain characters that are not displayed. Instead rectangles are displayed. This is the typical behaviour if the font in use does not comprise a particular character. Either, there is no suitable font installed on your system, or the WWW page to be displayed requires a particular, yet unsuitable font. Cf. http://www.alanwood.net/unicode/ for more info on fonts and browsers. Best wishes, Otto Stolz
Re: Unicode and Encoding Problems in Browsers
I'd like to mention that this problem which Muhammad Asif brings forth is an extant one in my circle of work. I work as PC technician, and one complaint I often get in tech support calls is that the user is unable to type Hebrew in the Search box in the MSN Israel website (msn.co.il) under Windows XP. At the first time, I told the user to set the Language for Non-Unicode Programs (known as the System Locale in Windows 2000, which sets the emulated ANSI codepage), but it didn't help: the user still complained of seeing boxes instead of proper Hebrew letters. The encoding of MSN.co.il is Hebrew (Windows). It doesn't happen under all machines. Mine at home runs XP too, but I don't have that problem. I suspect it's not related to Unicode/encodings stuff at all. The fact that it appears only under XP (and not 2000 or 98, for instance) leads me to believe it may have something to do with the Java VM (which is by default lacking in XP and updates browser components when installed). I hope that is of some enlightenment. ST _ MSN 8 helps eliminate e-mail viruses. Get 2 months FREE*. http://join.msn.com/?page=features/virus
Handwritten EURO sign
I wonder if any Unicoders have seen the handwritten EURO sign which differs substantially from the usual computer-generated kind? The one on the banknotes (lefthanded Crescent Moon with double bar) is quite unlike one used around here (rounded reversed digit THREE with double bar). Any ideas? Reminded to post this by seeing the latter in very large handwriting yesterday, at the pay-in desk to popular football grounds close to where I live. mg -- Marion Gunn * EGT (Estab.1991) * http://www.egt.ie * fiosruithe/enquiries: [EMAIL PROTECTED] * [EMAIL PROTECTED] *
Re: Handwritten EURO sign
At 14:42 + 2003-02-07, Marion Gunn wrote: I wonder if any Unicoders have seen the handwritten EURO sign which differs substantially from the usual computer-generated kind? I have seen a C with an equals sign inside it not touching the C. The one on the banknotes (lefthanded Crescent Moon with double bar) is quite unlike one used around here (rounded reversed digit THREE with double bar). That sounds like a script capital E with a double bar. You may find the discussion at http://www.evertype.com/standards/euro/euroglyph.html of interest. -- Michael Everson * * Everson Typography * * http://www.evertype.com
RE: Handwritten EURO sign
Marion Gunn wrote: I wonder if any Unicoders have seen the handwritten EURO sign which differs substantially from the usual computer-generated kind? In Italy, it is becoming common to see a sort of left parenthesis crossed by a small Z. Notice that this is very similar to a common handwritten forms of the lira symbol (£): a vertical line crossed by a small Z. _ Marco
Re: Handwritten EURO sign
The latest issue of Baseline (www.baselinemagazine.com) has an article on the Euro. I did not read it, so I don't know if it speaks of handwritten forms. Sign of the times: the euro currency symbol by Conor Mangat. Eric.
Re: Plane 14 Tag Deprecation Issue (was Re: VS vs. P14 (was Re: IndicDevanagari Query))
William Overington wrote: Kenneth Whistler now states an opinion as to what the review is about and mentions a file PropList.txt of which I was previously unaware. Kenneth Whistler referred to a file that is part of the publicicly and freely provided Unicode Character Database, showing various important code point property assignments. I believe that he assumed that reviewers of public issues would be familiar with, or else familiarize themselves sufficiently with, the Unicode Standard to arrive at an informed opinion. ... As I have mentioned before I have a particular interest in the use of Unicode in relation to the implementation of my telesoftware invention using the DVB-MHP (Digital Video Broadcasting - Multimedia Home Platform) system. I feel that language tags may potentially be very useful for broadcasts of multimedia packages which include Unicode text files, by direct broadcast satellites across whole continents. ... Language tags are likely to be useful for any multimedia system. I would expect there to be sufficient bandwidth in a DVB system to carry either out-of-band (meta) data for plain text, or else to send richtext of some kind (e.g., XHTML). markus -- Opinions expressed here may not reflect my company's positions unless otherwise noted.
Re: CJK test data
Michael (michka) Kaplan wrote: GB18030 does not define a specific standard for sorting (as far as I know, neither does GB13000). It is an encoding standard. GB 18030 certainly does not define sorting. It defines a CCS/CES based on a mapping table to/from Unicode/ISO 10646. GB 13000 is, as far as I know, just the Chinese adoption of ISO 10646. As such, it is likely to also not define sorting because the relevant ISO standard is 14651 (=UCA). For general test data for determining support of GB 18030 I suggest to contact the Chinese government and its standards agency. They have defined a certification procedure, and I assume that the data and procedure are available. I have no direct contacts for this myself. markus -- Opinions expressed here may not reflect my company's positions unless otherwise noted.
RE: CJK test data
Markus wrote: For general test data for determining support of GB 18030 I suggest to contact the Chinese government and its standards agency. They have defined a certification procedure, and I assume that the data and procedure are available. I have no direct contacts for this myself. Here is contact info from an 18030 article by Tom Emerson. http://lisa.org/archive_domain/newsletters/2002/2.3/emerson.html Hmmm. No url. No email address. This will be interesting. Standard Conformity Testing Center for Information Products #1 Andingmen Dong Da Jie Beijing, China Tel: 84029573 or 84029792 Fax: 64007681
Re: VS vs. P14 (was Re: Indic Devanagari Query)
At 01:52 AM 2/7/03 -0800, Andrew C. West wrote: Ah, but decorative motifs are not plain text. Ah, but it could be. Ah, but it wouldn't be Unicode. A(h)./