On Tue, Aug 14, 2012 at 1:19 PM, Chris Hostetter <[email protected]> wrote: 0D ( - ) FULLWIDTH HYPHEN-MINUS > > ...so seemingly, according to the word boundary docs, there should be an > option to treat those individual characters as "MidLetter" characters w/o > requiring the user to change them to \u2027 in a CharFilter >
I don't agree with that logic at all. Why doesnt java.text.Breakiterator have such an option then? Because people impl the default algorithm for general purposes. Those tailorings are not 'mandatory'. -- lucidworks.com --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
