> On 21 May 2017, at 18:41, Claude Warren <cla...@xenei.com> wrote:
> 
> Seems like you have done a lot of investigation so let meask.  Can you
> develop a mechanism that iz extensible to support the various localls and
> then just implement the simple version?  This would provide a framework so
> others could implement the ones the desire as needs arise.

Nice idea, I’ll see what’s possible.


> 
> On 21 May 2017 16:48, "Benedikt Ritter" <brit...@apache.org> wrote:
> 
>> Hi,
>> 
>>> Am 21.05.2017 um 08:06 schrieb Duncan Jones <dun...@wortharead.com>:
>>> 
>>> Hi everyone,
>>> 
>>> I’ve found some time to continue breaking WordUtils into separate
>> classes (eschewing the “big collection of static methods” approach).
>> However, as I read more about case handling in Unicode, I realise how
>> simplistic the WordUtils methods are and how complex a full solution would
>> need to be.
>>> 
>>> Section 5.18 of the Unicode specification [1] describes these
>> complexities. The mains ones that bother me are:
>>> 
>>> 1. Title case conversions vary widely between different locales and
>> languages. I’m not clear whether any locale is satisfied by the current
>> simplistic implementation in WordUtils.capitalize(str). Supporting this
>> correctly would be a serious challenge.
>>> 
>>> 2. All types of case conversion may vary depending upon context/locale.
>> There are examples provided in [1] where the outcome is different in a
>> Turkish locale or if the letter in question is followed by another or not.
>>> 
>>> Does anyone have a suggestion for how to move forward with this work? I
>> see three options: 1] Admit defeat and avoid the case conversion mess
>> entirely. 2] Mimic the existing functionality, but document the
>> limitations. 3] Attempt to deliver a locale-dependent version, perhaps
>> still with limitations (or for certain languages).
>>> 
>>> I’m leaning towards 2, perhaps even calling the classes “SimpleX…”.
>> 
>> Sounds good to me.
>> 
>>> 
>>> Thanks,
>>> Duncan
>>> 
>>> 
>>> [1] http://www.unicode.org/versions/Unicode9.0.0/ch05.pdf
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
>>> For additional commands, e-mail: dev-h...@commons.apache.org
>>> 
>> 
>> 
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
>> For additional commands, e-mail: dev-h...@commons.apache.org
>> 
>> 


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
For additional commands, e-mail: dev-h...@commons.apache.org

Reply via email to