Re: OED

2002-05-02 Thread Patrick T. Rourke

> >>  I wrote to Oxford today to complain that there should be a
> >>  Macintosh version. Please do likewise, if you're a Mac user
> >>  and think that we deserve better!
> >
>Already did so, in fact, just hours after I posted, I got a response (via
>snail mail) telling me they had no intention of doing so, as Mac users are
a
>"niche market."  (Of course, the "niche" happens to be a much bigger subset
>of their core market than of e.g. the business market, but what can we
say?)

Ok, looks like the phrase was "have only a minority share," a little less
inflammatory than "niche market," but the "only" was suggestive.  Apologies
for misleading anyone with the quotes; I must have conflated it with
something else.

Patrick Rourke
[EMAIL PROTECTED]








Greek Extended Characters

2002-05-01 Thread Patrick T. Rourke

> Internet Explorer 5.5, running under Windows 95 --
> a non-Unicode system except for the UniScribe support
> provided by IE -- can display not only Latin Y with grave
> and with acute but also Greek Upsilon with varia and
> with oxia.

Yes, but . . . they don't look very good as combining characters, at least
with Arial Unicode MS (and of course there are problems which the MS people
on this list know about and I'll bet are working to fix which prevent the
use of combining characters in Palatino Linotype altogether).  Somewhere I
have a whole table of what looks good and bad in each operating system (just
for polytonic Greek), with several combinations of fonts and applications,
and from what I remember, using certain fonts and certain applications, OS
X, XFree86 4 + *nix, and various Windows versions can all *display*
combining characters, with varying degrees of sophistication.  But they
don't look terribly good except in certain fonts and in certain apps.  "Not
terribly good" should be read as "often effectively unreadable" - if an
acute accent *overlaps* a smooth aspirate (they should not quite touch, and
the accent should be to the right of the aspirate) for instance, it is
sometimes hard to distinguish that combination from the combination of a
grave accent with a rough aspirate (that are wrongly overlapping).
Obviously not the fault of the standard, and I'm sure David understands
that, too (DP, feel free to take me to task on anything you disagree with),
but the fault of implementers, many of whom simply don't realize that there
are substantial communities of users in the West who need good Unicode
support as much as users in the East do.

> However, it might be worthwhile to submit the gist of this
> letter to the UTC with a request to document that the 'missing'
> combinations are expected to occur, and to alert font vendors
> intending to support classical Greek to make sure that their fonts
> supply these glyphs.

If no one objects, I'll forward this discussion en mass to another list for
classicists (at the Stoa Consortium) that might help with this, and has only
a few overlaps with the subscription to this list.  All the combinations
that might be expected to occur would be hard to document, because there are
epigraphic, paleolographical, and papyrus texts with very strange
combinations.  But they aren't all obscure: one "missing" combination, at
least, is in Homer (I don't remember if the original poster mentioned that
one or not).

Please object before 5:15 pm EDST 1 May 2002 if you don't want your comments
forwarded.

Thanks,

Patrick Rourke
[EMAIL PROTECTED]








Re: OED

2002-05-01 Thread Patrick T. Rourke

>I for one will not buy the OED on CD until it's in a format I can use at
>home on my Mac (OS X) - what I do at home is best done in OS X (though I
>wish more applications had extended Greek support), what I do at the is
>office best done in Windows XP.

> I wrote to Oxford today to complain that there should be a
> Macintosh version. Please do likewise, if you're a Mac user
> and think that we deserve better!

Already did so, in fact, just hours after I posted, I got a response (via
snail mail) telling me they had no intention of doing so, as Mac users are a
"niche market."  (Of course, the "niche" happens to be a much bigger subset
of their core market than of e.g. the business market, but what can we say?)

I had tried to convince them to at least look into some kind of Unix
interface, since that at least would satisfy OS X users as well as Linux
users; but to no avail.  If you're interested (or anyone else using a Mac is
interested), I could retype the letter into an email and send to you
offlist.

Patrick Rourke
[EMAIL PROTECTED]






Re: Private Use Agreements and Unapproved Characters

2002-03-13 Thread Patrick T. Rourke

Just to complete my thanks (now that I've received the digest), thanks 
too to Michael Everson for his comments, and John Hudson for the 
typographer's viewpoint on this suggestion.

On the other subject that has been zipping about under this heading:

I asked about ConScript only because it was the only existing registry 
of the type which I could find, and it seems like a good idea for a 
largish (hundreds or thousands) community of users.

Thanks again,
Patrick Rourke





Re: Private Use Agreements and Unapproved Characters

2002-03-13 Thread Patrick T. Rourke

Thanks to everyone who has commented, especially John Cowan, Doug Ewell, 
and David Starner (I'm on the digest, and so apologize if I haven't 
thanked someone who has provided substantial comments). Thanks too to 
Mr. Overington, though with Mr. Kaplan I agree that this is a bit too 
much work to avoid the minor issue of overlapping PUA uses for my 
purposes; I was hoping merely to find an existing registry which might 
have some overlap with the user community I'm concerned with. I'm 
replying mainly to Mr. Ewell's comments, which are the kind of 
counter-arguments I was hoping to be able to consider.

Sorry to be coy, but since I'm writing a proposal (not a Unicode 
proposal) to the authors of a couple of Unicode proposals for such a 
registry, and since the proposals which would be included in the 
registry are ones I did not have any hand in writing, I think it would 
be better for me to avoid too much precision until I've got the approval 
of the proposal writers (who would also be among the most important of 
my targeted users).

> There's no reason it has to be that way.  Proposed glyphs are posted on
> the Unicode Web site months in advance of their "go live" date, even
> before the beta period, largely for this reason.  I'm sure Unicode-aware
> type designers like John Hudson don't wait until a version of Unicode is
> formally released before they start designing glyphs.

True, but many scholarly communities are small enough that their needs 
might not be of interest to type designers with a wider targeted 
audience (like Mr. Hudson), and so depend largely upon small 
typographers, even amateurs to provide their type.  In such cases, it 
would seem to me that a registry such as the one I'm suggesting would 
help to drive the transition.  At any rate, I've already had two type 
designers who've done type for the community show interest in such a 
registry.

> One important point to remember is that any use or proposed use of the
> PUA, such as ConScript, is strictly up to private organizations, not the
> Unicode Consortium.  To be sure, ConScript is the domain of two guys who
> are quite influential in Unicode, but they do not maintain ConScript in
> any official capacity as representatives of Unicode.

Fully aware of this.  I'm thinking that this would be an improvement 
over the status quo, which is as David Starner suggested, the use of 
informal private encodings or escaped entities.

> I would think you could simply use the version number of the Unicode
> Standard.  For example, the use of Tagalog would have been conformant to
> this proposed PUA registry until Unicode version 3.2, at which time it
> would have to be removed from the registry because of its introduction
> into Unicode.

This had not occurred to me!  The only thing that would militate against 
this would be if additional characters were identified which had not yet 
been proposed and were proposed at a later date; that would require a 
new version number which would not be a Unicode point number, and so 
might be distinguished using a letter, etc.  (I don't foresee this 
happening, but it's better to be safe than sorry, no?).

> Conformance to this registry, especially over a period of time, is up to
> the user community.  The presence of a standard is no guarantee that it
> will be followed, or even noticed.

Excellent, this is the problem I was most concerned with.  The target 
users for the registry would be a small number of electronic scholarly 
publishers in the community.  The license for the fonts would strongly 
recommend that content providers using registry-based fonts would have 
to convert their character data to the Unicode-approved codepoints 
within say six months of release, and for the target publishers this 
wouldn't be a problem.  If the distribution sites for the released fonts 
all included prominent links to the registry site, and the registry site 
provided information on the progress of the characters in the encoding 
process, this would I hope drive the adoption of later versions.

So those outside the target user group would at least be made aware of 
the process by the license, and a mechanism would be in place to prevent 
the dead hand of the older versions of the registry from being quite so 
strong.

> Suppose Old Persian Cuneiform is encoded in Patrick's PUA registry next
> week, and that encoding achieves some popularity.  Then suppose at some
> later date it is encoded in Unicode, say version 4.1.  This will
> necessarily cause the encoding in Patrick's registry to be withdrawn, or
> at least deprecated.

I was thinking deprecated for two versions or two years, whichever was 
longer, and then ultimately withdrawn.

> How many people will switch immediately to the
> sanctioned Unicode encoding?  How quickly will existing software and
> data be converted?  Probably not right away, and the chances for a
> timely conversion are less if the private-use encoding is particularly
> successful, whether

Polytonic Greek

2001-04-03 Thread Patrick T. Rourke

>   2. Keyboards.  Are there any Unicode keyboards/
>  input methods for polytonic Greek?

I don't know about Mac.  But for Windows, Ralph Hancock's Antioch provides a
Word 2000 keyboard for Windows, and Keyman and Multikey can provide Windows
keyboards with Extended Greek capabilities (I think I have one for Multikey,
and another listmember has been working on one for Keyman); also, I've used
the SC UniPad editor (with a quick and dirty keyboard map I started and
which is being further developed by James Naughton, intended to be
semi-compatible with the betacode representation of polytonic Greek) with
some success.  I'll troll around some more with questions about Mac.

Your other question I've answered offlist with a link to a draft document
I've been working on, which is not ready for prime time.  There are several
easy to get TrueType fonts with such support, including Vusillus Old Face
Italic (Ralph Hancock), Cardo (David J. Perry), Athena Unicode (Jeffrey
Rusten), Code2000 (and I imagine Code2001 - I've been out of town for two
weeks and haven't seen James Kass's new font mentioned in previous threads),
Titus Cyberbit Basic (the Titus project only provides an exe-zipped version
however - if I understood a correspondent correctly, this is because
"Macintosh does not support Unicode." Well, there's a semantic issue there,
which the list was kind enough to clear up for me), and two Windows-only TTF
fonts (Windows only not technologically, but by license, or so I've assumed,
because they are bundled with Windows products) Palatino Linotype (comes
with Windows 2000) and Arial Unicode MS (comes with Office 2000).   All but
the Windows fonts are either freeware or shareware; there's a paidware
version of the Vusillus font that includes roman style for about US$50 (part
of the already mentioned Antioch package), and the price of Code2000 is very
reasonable.  The best *looking* fonts, though, are probably Minion Pro
(which is definitely paidware, at the low 3 figures in US$, I believe) and
the two Windows fonts (I happen to like the Palatino Linotype font, though I
don't quite understand why the combining diacriticals aren't working in
IE5 - rather than zero space characters, one gets an artifact character, and
a particularly obnoxious one, too).

All the shareware/freeware fonts I've mentioned I've tested myself under
their evaluation terms, both on Windows 9x (98, SE, and ME) and RedHat 7
(with XFree86 4.0 and Mozilla M17), and most also with RedHat 6.2.  (I've
uninstalled the shareware fonts.) Minion Pro I have not tested at all; the
two Windows fonts have been tested according to their licenses.

I'd be interested to hear just how much luck you have with extended Greek in
Acrobat, though.  I haven't experimented with it that I remember (I don't
like PDF except for print applications).  Just remember to stick with one of
the normalization forms (I use NFC, but then I do mostly web documents).

Patrick Rourke
(I'm on digest, so I apologize if someone else has answered this already)
[EMAIL PROTECTED]






Re: Square and lozenge notes -- Musical Notation 3.1 -- Mensural notation

2001-03-06 Thread Patrick T. Rourke

Quick tangential correction to that table that Patrick Andries supplied a
link to: it seems to imply that the Greek accents were musical notation;
they were not.  For ancient Greek musical notation see M.L. West, *Ancient
Greek Music*, pp. 254-276, especially the table on p. 256.

Patrick Rourke

- Original Message -
From: Patrick Andries




Sources : Dictionnaire de musique, Larousse & Encyclopaedia Universalis
(scanned copy can be sent)
http://www.nmc.vt.edu/staff/Ed/music/glossary/appendix/notation/Noteshapes.h
tml (only provided for the neat table I'm not able to reproduced)




Re: Javascript Chart

2001-03-06 Thread Patrick T. Rourke

In Windows ME, works in IE5.5 but not in Netscape 4.7 or Mozilla 2001021204.
So I doubt it would work in say Linux (I haven't tested it, but might be
able to later on).

In IE5.5/ME, it works for me not only with different fontName values, but
even with a list (e.g.,

var fontName="Arial Unicode MS,Lucida Sans Unicode"

).  Of course, the list simply uses the first font that's available,
regardless of whether the font contains any number of the characters being
mapped (so if one has both fonts, and puts Lucida Sans Unicode first, code
points that LSU doesn't provide glyphs for are displayed as artifact
characters (mostly boxes)).

Good job.  Next time, though, I'd suggest using CSS rather than the
deprecated  element.

Patrick Rourke
[EMAIL PROTECTED]


> I wrote a small HTML document that implements a quick-and-dirty chart for
> Unicode, and I thought that someone on the Unicode list could find it
> useful.
>
> The whole 17 planes may be reached, and you can see the three standard
> encoding forms (not the schemes!) of each character.
>
> The document is formed by two HTML files and can be used off-line. Of
> course, whether your browser shows the correct glyphs or just boxes
depends
> on the availability of fonts and Unicode support in your browser.
>
> I only tested it with Internet Explorer 4.0 on Windows NT (sp6), and I
have
> font Arial Unicode MS installed.
>
> I don't absolutely know (nor give warranties about) whether and how it
works
> on a different environment.
>
> If anyone is adventurous enough to wish to try it, find the two files
> attached. You should put both in the same directory, and open

> in your browser.
>
> Hint: red text is clickable and has some effect; black text is static.
>
> If you wish to read the sources, both files start with an explanatory
> comment, but the rest is totally uncommented (yet).
>
> Have fun (I hope).
> Marco
>
>
>
>
>




Re: Help with Greek special casing

2001-02-25 Thread Patrick T. Rourke

Don't know what the Unicode rules are, but the answer is no.  The final
sigma form is not used if the sigma is in a medial position in the word but
at the end of the line (e.g., when it occurs at the point of hyphenation in
a hyphenated word at line end).  Also, there is no reason why a consonant
other than rho should be followed by a combining diacritical mark, except
say an underdot for use in papyrological or epigraphical texts.

The upper case sigma is the same regardless of position; there is no
differentiation between upper case final sigma and upper case initial/medial
sigma.

If a font uses the lunate sigma for the initial/medial form, it must use it
for the final form as well, and vice versa.

Hope that helps,
Patrick Rourke
[EMAIL PROTECTED]

- Original Message -
From: "Carl W. Brown" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Sent: Sunday, February 25, 2001 10:30 PM
Subject: Help with Greek special casing


> From UTR 21.
>
> "If a mapping is marked by FINAL, use it when the character is not
followed
> by a cased character.
>
> It is final when followed by a hyphen or combining diacritical mark?  Can
> you have a final sigma in the middle of a word?
>
> Carl
>
>




Re: Inverted breve in Greek?

2001-02-22 Thread Patrick T. Rourke

In fact, this is the (barely) preferred form of the circumflex, or
perispomeni, for ancient Greek.  Of four of the five major publishers of
ancient Greek texts (I don't have any Budes available to check), Cambridge
UP and Teubner use the tilde-like form in their type; Oxford UP and Harvard
UP use the rounded form.  Of other publishers whose books I have readily
available, Macmillan, Oklahoma, American Book Company, and Aris and Phillips
use the rounded form, as do a couple of older Cambridge UP books; a book
from a prewar Berlin printer (A. Ebering) uses the tilde-like form; (the
following use Greek type in books which are primarily in Latin type)
California the rounded form, Methuen both, Manchester UP and Yale UP, the
tilde; Belgian Royal Academy, the rounded form.  For the dates: one Harvard
UP book I checked was published in 1994, and an Oxford book in 1988.  One
Cambridge book was published in 1985; the Teubners I checked stretch from
the 1880s to the 1970s.   One cannot even talk about a "continental" or an
"American" preference.

Coding as u+0342, as Mr. Pietsch recommended, allows the printer to assign
to the character whichever form it wishes, rather than coding as a tilde or
an inverted breve, which forces the issue as to presentation.

Patrick Rourke

- Original Message -
From: "Seán Ó Séaghdha" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Sent: Thursday, February 22, 2001 2:16 AM
Subject: Inverted breve in Greek?


Since there seem to be some people here who know about something about Greek
diacritics, I'm hoping someone here will be able to help me.  I know very
little about Greek, as will probably become clear.  I'm making a Unicode
version of an ASCII representation of an etymological dictionary, which
contains examples in many languages.  Mostly the sequences used to represent
non-ASCII characters are straightforward, but there's one I'm not sure
about.  It looks like an inverted breve and appears over Greek vowels,
appearing above any accents.

My question is - is this just a breve (and should I just encode it as 0311
COMBINING INVERTED BREVE) or is it an out of date version of something else
(e.g. 0342 COMBINING GREEK PERISPOMENI)?  This book was originally published
in 1896.

Examples (using inverted breve)

\u1f60\u0311\u03bc\u03cc\u03c2
ὠ̑μός

\u03c6\u03bb\u03bf\u03b9\u0311\u03c3\u03b2\u03bf\u03c2
φλοι̑σβος

\u03b2\u03bf\u03c5\u0311\u03c2
βου̑ς

\u03c6\u03b1\u0311\u03c1\u03bf\u03c2
φα̑ρος

\u03b2\u03c1\u03b9\u0311
βρι̑

\u03c6\u03c5\u0311\u03bb\u03bf\u03bd
φυ̑λον

`~:.,.:'^`~:.,.:'^`~:.,.:'^`~:.,.:'^`~:.,.:'^`~:.,.:'^`~:.,.:'^`~:.,.:'^`~
 S e á n   Ó   S é a g h d h a   [EMAIL PROTECTED]

Atcota brothchán bithnert. [Tugann brachán bithneart.]
Seanrá Sean-Ghaeilge.







Re: Macintosh OS8.6, OS9

2001-02-05 Thread Patrick T. Rourke

Thanks to everyone who responded, especially Mr. Hagedorn; as it is
precisely the Extended Greek, Basic Greek, and Combining diacriticals blocks
that interest me, this was very important information.

Patrick Rourke

> Netscape, Internet Explorer and Icab (another browser for the MacOs)
> use UTF8 (Internet Explorer and Icab via the TEC)
>
> In macOs 9.1 with an apllication which supports the Multilingual Text
> Engine you can even use a windows true type font like cyberbit
> without converting it to Macintosh format.





Re: Greek questions, on- and off-topic

2001-01-23 Thread Patrick T. Rourke

Here's a listing of the Unicode names (which are the modern Greek names, I
believe) for diacriticals in the Extended Greek range and the analogous
English *common* names of the Greek accents:

acute = oxia
grave = varia
circumflex = perispomeni
iota subscript = ypogegrammeni
smooth breathing = psili
rough breathing = dasia
diaresis = dialytika

"Tonos" is the Greek word for accent.  The letters with "tonos" in the basic
Greek block are called that because all accented Greek characters in modern
Greek script use the same accent - and that is the acute.

The following diacriticals are not used in typeset Greek text, but only in
dictionaries and other books where learners need to be given the length of
alpha, iota, and epsilon (omicron and epsilon are of course always short;
and omega and eta are of course always long, so one would never need the
macron or breve over the other four vowels, even in dictionary listings):

macron = macron
vrachy = breve

The basic Greek block also includes letters that are not used in Classical
Attic (Stigma, Digamma, Qoppa, Sampi, Yot), except that some are used as
numerals, and a number of characters that are only used in Coptic
(post-hieroglyphic Egyptian: Dei, Shei, Fei, Khei, Hori, Shima) and are
derived from the demotic Egyptian script.  Also do not use the "symbol"
versions of Greek letters.

Ano teleia is the semicolon (a raised dot). I imagine that the capitals with
diaresis are there for text that's in all capitals but is accented.

Note that ancient, biblical and Byzantine Greek all use the polytonic
version of the script, and modern Greek uses the monotonic (in effect, only
uses the acute accent).

I've been working for some time on an online resource for using Unicode with
ancient Greek, but it's not yet in finished form.

It is VERY important to follow Marco Cimarosti's suggestion to look at the
normalization forms chart.

Patrick Rourke
[EMAIL PROTECTED]


- Original Message -
From: "Marco Cimarosti" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Sent: Tuesday, January 23, 2001 3:10 AM
Subject: RE: Greek questions, on- and off-topic


> > My Greek textbook has acute, grave, and circumflex (called by
> > those names),
> > but I'm not sure what these correspond to in the Greek and
> > Greek Extended
> > blocks (there seem to be many more diacriticals than those).
> > Is there an on-line guide somewhere?
>
> There are in fact other diacritics used in Greek in addition to the three
> accents:
>
> - Dieresis or dialytica (also used in modern spelling)
> - Spiritus asper (romanized with an "h") and spiritus levis
> - Subscript iota (to show an unpronounced etymological "i")
> - Macron and breve (only used in grammar books and dictionaries)
> - Apostrophe (admitting it can be called a diacritic)
> - and something else that I am forgetting, probably...
>
> To know which Unicode code points should be used for these diacritics, the
> handiest thing is to look up the canonical decompositions in the
> UnicodeData.txt database, both in the basic Greek block (U+03xx) and in
the
> extended block (U+1Fxx). The canonical decomposition field is the data
just
> after the 5th semicolon on each line.
>
> _ Marco
>
>




Re: Transcriptions of "Unicode"

2001-01-15 Thread Patrick T. Rourke

He didn't actually say it: someone joked at a dinner or fundraiser that Dan
Quayle had felt guilty that he hadn't studied his Latin upon his visit to
Latin America, and the press picked it up as though it were a true report of
Quayle's own words.

What it says of the man that millions of people believed him capable of
saying it, I leave others to decide.  I suspect that he was nominated
because he reminded GHWB of someone.

Patrick Rourke
[EMAIL PROTECTED]

- Original Message -
From: "Tex Texin" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Sent: Monday, January 15, 2001 5:27 PM
Subject: Re: Transcriptions of "Unicode"


> Wasn't it Dan Quayle who said they speak Latin in Latin America?
>
>
> [EMAIL PROTECTED] wrote:
> >
> > > > > 1. When I learned Latin in the U.S. in the 1960s, we were taught a
> > > > > reconstructed Roman pronunciation.
> > >
> > > Latin is still spoken in Rome, at the Vatican.
> > >
> > > So there is a Roman pronunciation even today... (;
> > >
> > > Just kidding... although what I say is true...
> > >
> > > Alain
> >
> > How about a weekly radio news broadcast in Latin?
> >
> > http://www.yle.fi/ylenykko/nuntii.html (in Finnish :-)
> > http://www.yle.fi/fbc/latini/ (in Latin)
> > http://www.yle.fi/fbc/latini/summary.html (in English)
>
> --
> According to Murphy, nothing goes according to Hoyle.
> --
> Tex Texin  Director, International Business
> mailto:[EMAIL PROTECTED]  +1-781-280-4271 Fax:+1-781-280-4655
> Progress Software Corp.14 Oak Park, Bedford, MA 01730
>
> http://www.Progress.com#1 Embedded Database
>
> Globalization Program
> http://www.Progress.com/partners/globalization.htm
> --
-
>