https://bugzilla.wikimedia.org/show_bug.cgi?id=46773

       Web browser: ---
            Bug ID: 46773
           Summary: Word boundry parameter \b not working with Unicode
                    devanagari words
           Product: MediaWiki extensions
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: AbuseFilter
          Assignee: [email protected]
          Reporter: [email protected]
                CC: [email protected], [email protected]
    Classification: Unclassified
   Mobile Platform: ---

At Marathi language wikipedia usually I am useing
"contains_any(added_lines,"तू")"parameter to filter  a given word,since say i
want to stop use of word "तू" .

To avoid false positives due to prefixes and suffixes to the word; we want to
use parameter \b as word boundry on either side of the word or both side of the
word as per reuirement.   

We wish we should be able to use 

*contains_any(added_lines,"तू\b"),should work, so that we do not get a false
positive on word "तूप" and many similler words.

*contains_any(added_lines,"\bतू"),should work, so that we do not get a false
positive on word "धातू" and many similler words.   

*contains_any(added_lines,"\bतू\b"),should work,so that we do not get a false
positive on word "दुकानातून" and many similler words.

The related edit(abuse) filter on Marathi language wikipedia is
http://mr.wikipedia.org/wiki/विशेष:दुरूपयोग_गाळणी/10 

For words where prefixes and suffixes are less we are using ! parameter but
this parameter is not sufficient in words where too many suffixes or prefixes
are possible.

If  parameter \b can work or any other good option for word boundry it will be
usefull to many devanagari script using wikis like Hindi and many other.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are watching all bug changes.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to