Re: Dealing with Georgian capitalization in programming languages

2018-10-09 Thread Ken Whistler via Unicode
Martin, On 10/9/2018 12:47 AM, Martin J. Dürst via Unicode wrote: - Using the 'capitalize' method to (try to) get the titlecase   property of a MTAVRULI character. (There's no other way   currently in Ruby to get the titlecase property.) There may be others. If you have some ideas, I'd

Aw: Re: Dealing with Georgian capitalization in programming languages

2018-10-09 Thread Marius Spix via Unicode
The capital ẞ (U+1E9E) has been officially approved by the Council for the German Language since July 2018. However, there is no word starting with ß, that means the character is only relevant for full-capitalized words. It may only stand alone in spaced type, when there is no available italic

Re: Dealing with Georgian capitalization in programming languages

2018-10-09 Thread Martin J. Dürst via Unicode
Hello Ken, others, On 2018/10/03 06:43, Ken Whistler wrote: But it seems to me that the problem you are citing can be avoided if you simply rethink what your "capitalize" means. It really should be conceived of as first lowercasing the *entire* string, and then titlecasing the *eligible*

Re: Dealing with Georgian capitalization in programming languages

2018-10-04 Thread Martin J. Dürst via Unicode
Ken, Markus, Many thanks for your ideas, which I noted at https://bugs.ruby-lang.org/issues/14839. Regards, Martin. On 2018/10/03 06:43, Ken Whistler wrote: On 10/2/2018 12:45 AM, Martin J. Dürst via Unicode wrote: My questions here are: - Has this been considered when Georgian Mtavruli

Re: Dealing with Georgian capitalization in programming languages

2018-10-02 Thread Ken Whistler via Unicode
On 10/2/2018 12:45 AM, Martin J. Dürst via Unicode wrote: capitalize: uppercase (or title-case) the first character of the string, lowercase the rest When I say "cause problems", I mean producing mixed-case output. I originally thought that 'capitalize' would be fine. It is fine for

Re: Dealing with Georgian capitalization in programming languages

2018-10-02 Thread Philippe Verdy via Unicode
I see no easy way to convert ALL UPPERCASE text with consistant casing as there's no rule, except by using dictionnary lookups. In reality data should be input using default casing (as in dictionnary entries), independantly of their position in sentences, paragraphs or titles, and the contextual

Re: Dealing with Georgian capitalization in programming languages

2018-10-02 Thread Markus Scherer via Unicode
On Tue, Oct 2, 2018 at 12:50 AM Martin J. Dürst via Unicode < unicode@unicode.org> wrote: > ... The only > operation that can cause problems is 'capitalize'. > > When I say "cause problems", I mean producing mixed-case output. I > originally thought that 'capitalize' would be fine. It is fine for