Re: UCA and Russian letter Ё

Leo Broukhis Sun, 30 Dec 2012 18:00:08 -0800

On Wed, Dec 26, 2012 at 11:18 AM, Whistler, Ken <[email protected]> wrote:
> Leo asked:
>
>> My question was narrower: assuming that the strings being compared are
>> words, could it be supported without any markup?
>
> ... where "it" refers to conditional weighting based on the (identified) word 
> boundary. And the answer to that is no, unless the word boundary was 
> explicitly indicated with some kind of a markup character, and then the 
> sequence of that markup character plus the target character of interest (in 
> this case Russian Yo) was given a tailored contraction in the weight table 
> which weighted it differently from any Russian Yo not in that particular 
> contraction sequence.


I see your point: if something can be trivially emulated with a markup
character, there is no need to augment the algorithm (emulating
backward accents with markup is possible but much more cumbersome).
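
To make sure I understand the emulation, here is a minimal sketch in
Python. Everything in it is an assumption for illustration: the marker
(a private-use code point, U+E000), the toy primary weights, and the
names MARK, WEIGHTS, CONTRACTIONS and sort_key are all made up here and
are not taken from DUCET, CLDR, or any real tailoring. It only shows the
mechanism: the marker followed by Yo is a contraction with its own
weight, while a Yo anywhere else keeps the ordinary weight.

```python
# Toy sketch: the marker and all weights are illustrative assumptions,
# not values from DUCET, CLDR, or any real tailoring.
MARK = "\uE000"  # hypothetical word-boundary markup character

ALPHABET = "абвгдежзийклмнопрстуфхцчшщъыьэюя"  # lowercase only, ё added below
WEIGHTS = {ch: 2 * i for i, ch in enumerate(ALPHABET)}
WEIGHTS["ё"] = WEIGHTS["е"]  # a bare ё weighs the same as е in this toy table

# Tailored contraction: marker + ё gets its own weight, between е and ж.
CONTRACTIONS = {MARK + "ё": WEIGHTS["е"] + 1}


def sort_key(text):
    """Build a primary-level key, trying the two-character contraction first."""
    key = []
    i = 0
    while i < len(text):
        pair = text[i:i + 2]
        if pair in CONTRACTIONS:
            key.append(CONTRACTIONS[pair])
            i += 2
        elif text[i] == MARK:
            i += 1  # a marker outside a contraction carries no weight
        else:
            key.append(WEIGHTS[text[i]])  # toy table: lowercase Russian only
            i += 1
    return tuple(key)


words = ["жук", "ёж", "ель"]
plain = sorted(words, key=sort_key)
marked = sorted(words, key=lambda w: sort_key(MARK + w))
```

With these toy weights, plain comes out as ёж, ель, жук, and marked
comes out as ель, ёж, жук. A Yo in the middle of a word is unaffected,
since the marker is not adjacent to it.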

What characters should be used for such markup, if need be?

Thanks,
Leo
