Kathey Marsden wrote:
Can anyone think of an example where two strings will be equal with TERRITORY_BASED collation but not with UCS_BASIC?


I found a case.

In a French database with TERRITORY_BASED collation, a pre-composed accented character such as "\u00C0" (A-grave) and its combining-accent equivalent "A\u0300" (A followed by a combining grave) should match for both = and LIKE. They do not match with UCS_BASIC. We insert both into a table and search with = and LIKE.

This also makes a good case to work with for DERBY-2967, because \u00C0 is one character and A\u0300 is two. I checked in a test case to CollationTest.
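
For anyone who wants to see the effect outside the database, here is a minimal sketch using java.text.Collator directly. It is not the test case that was checked in to CollationTest. The class and method names are made up for illustration, and setting the strength to TERTIARY and the decomposition mode to CANONICAL_DECOMPOSITION explicitly is an assumption made so the result does not depend on the collator's defaults. String.equals stands in for a plain code-point comparison, which is roughly what UCS_BASIC does.

```java
import java.text.Collator;
import java.util.Locale;

// Hypothetical demo class; not part of Derby or its test suite.
class AccentCollationSketch {
    // Pre-composed A-grave: a single char.
    static final String PRECOMPOSED = "\u00C0";
    // 'A' followed by a combining grave accent: two chars.
    static final String COMBINING = "A\u0300";

    // Plain code-point comparison, standing in for UCS_BASIC behaviour.
    static boolean equalByCodePoint(String a, String b) {
        return a.equals(b);
    }

    // Locale-sensitive comparison with a French collator.
    // Strength and decomposition are set explicitly (an assumption for this
    // sketch) rather than relying on whatever the defaults happen to be.
    static boolean equalByFrenchCollator(String a, String b) {
        Collator collator = Collator.getInstance(Locale.FRENCH);
        collator.setStrength(Collator.TERTIARY);
        collator.setDecomposition(Collator.CANONICAL_DECOMPOSITION);
        return collator.compare(a, b) == 0;
    }

    public static void main(String[] args) {
        System.out.println("lengths: " + PRECOMPOSED.length() + " and " + COMBINING.length());
        System.out.println("code-point equal: " + equalByCodePoint(PRECOMPOSED, COMBINING));
        System.out.println("collator equal:   " + equalByFrenchCollator(PRECOMPOSED, COMBINING));
    }
}
```

The two strings differ in length as well as in content, which is why the same pair is awkward for LIKE: a pattern that matches one character has to line up with either one or two chars depending on which form is stored.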

I was also wrong about the default being IDENTICAL strength. It seems to be TERTIARY.

Kathey

