https://bugzilla.wikimedia.org/show_bug.cgi?id=50936
--- Comment #5 from Subhashish Panigrahi <[email protected]> --- (In reply to comment #4) > (In reply to comment #2) > > http://or.wikipedia.org/s/5om is for ସମୟ and has unicode code points: > > U+0B38 U+0B2E U+0B5F > > > > http://or.wikipedia.org/s/gj1 is ସମୟ > > U+0B38 U+0B2E U+200C U+0B5F > > > > As you can see both titles look same but differs in data with an extra > > U+200C > > > > U+200C is ZERO WIDTH NON-JOINER an invisible character having different > > functionality in different scripts. > > Thanks. > > > > > I am not sure whether 200C has valid usage in or. If this is unwanted, you > > need > > to consider it as a spelling mistake. > > Then how can we know whether or not 200C has valid usage? I couldn't find any > 200C in Odia Unicode chart. We can ignore if this is rare, so far 3/4 cases. > We > could wait and see if we find more such cases. I guess U+200C would be required. When I type s+m+Y it resulted ସମ୍ୟ whereas s+m+_ (Shift dash "-")+ Y it resulted ସମୟ using typing tool Lekhani. In the latter case Shift - ("_") produces U+200C. Is there any other way to avoid this problem instead of blocking this as I feel for some spellings it would be needed. -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. _______________________________________________ Wikibugs-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
