Re: [QScintilla] Search Whole Word matches Only on regular expression symbol

Baz Walter Sat, 12 Oct 2013 15:30:20 -0700

On 12/10/13 11:52, Phil Thompson wrote:

> So I need to call SCI_SETWORDCHARS when a lexer is set using the value
> returned by the lexer's wordCharacters() method.
>
> Is this likely to cause any unforeseen problems?

As usual with Scintilla, the main source of potential problems is
single-byte vs multi-byte encodings. For latin-1, any byte in the range
0-255 can be set as a word character. But for utf-8, only the ascii
range is relevant - all unicode characters above 127 are always treated
as word characters, regardless of what has been set using
SCI_SETWORDCHARS.

However, Scintilla's default set of word characters (i.e. those set via
SCI_SETCHARSDEFAULT) includes the standard alphanumerics and underscore,
*plus* all the characters in the range 128-255 (regardless of the
code-page setting).

So, assuming the current lexer wordCharacters functions only ever return
ascii, there is some potential for changes in behaviour if QScintilla is
being used in *latin-1* mode (utf-8 mode should be unaffected).
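
To make the latin-1 vs utf-8 difference concrete, here is a rough Python
model of the behaviour described above. It does not call Scintilla at
all; the function names and the ASCII_WORD string are made up for
illustration, and it assumes a lexer whose wordCharacters() returns only
ascii alphanumerics and underscore.

```python
import string

# Assumed value of a lexer's wordCharacters(): ascii only (hypothetical).
ASCII_WORD = string.ascii_letters + string.digits + "_"


def default_word_bytes():
    """Default set as described above: alphanumerics, underscore, 128-255."""
    return {ord(c) for c in ASCII_WORD} | set(range(128, 256))


def lexer_word_bytes(word_characters):
    """Set that would be in effect after passing the string to SCI_SETWORDCHARS."""
    return {ord(c) for c in word_characters}


def is_word_latin1(ch, word_bytes):
    """Single-byte mode: the one encoded byte is looked up directly."""
    return ch.encode("latin-1")[0] in word_bytes


def is_word_utf8(ch, word_bytes):
    """utf-8 mode: anything above 127 counts as a word character regardless."""
    code_point = ord(ch)
    return code_point > 127 or code_point in word_bytes


before = default_word_bytes()
after = lexer_word_bytes(ASCII_WORD)

# Bytes that stop being word characters once the lexer's set is applied.
lost = before - after
```

In this model, `lost` is exactly the range 128-255, so a character such
as "é" stops being a word character in latin-1 mode but is still one in
utf-8 mode.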

The only other potential issue I can think of at the moment, is that
setting the word characters automatically resets the whitespace and
punctuation characters to their default values.
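
The reset can be sketched with a small Python model. This is a
simplified stand-in for Scintilla's character classification, not its
actual code: the CharClassModel class and its method names are
hypothetical, and the hyphen-as-whitespace customisation is just an
example.

```python
import string

SPACE, NEWLINE, PUNCT, WORD = "space", "newline", "punctuation", "word"


class CharClassModel:
    """Toy model of per-byte character classes (hypothetical, not Scintilla's API)."""

    def __init__(self):
        self.classes = {}
        self._set_defaults(include_word_class=True)

    def _set_defaults(self, include_word_class):
        for b in range(256):
            ch = chr(b)
            if ch in "\r\n":
                cls = NEWLINE
            elif b < 0x20 or ch == " ":
                cls = SPACE
            elif include_word_class and (b >= 0x80 or ch.isalnum() or ch == "_"):
                cls = WORD
            else:
                cls = PUNCT
            self.classes[b] = cls

    def set_word_chars(self, chars):
        # Models the side effect described above: everything goes back to
        # the defaults first, then the given bytes are marked as word chars.
        self._set_defaults(include_word_class=False)
        for b in chars:
            self.classes[b] = WORD

    def set_whitespace_chars(self, chars):
        for b in chars:
            self.classes[b] = SPACE


word_chars = (string.ascii_letters + string.digits + "_").encode("ascii")
hyphen = ord("-")

model = CharClassModel()
model.set_whitespace_chars(b" \t-")      # example customisation
custom_before = model.classes[hyphen]    # "space"

model.set_word_chars(word_chars)         # e.g. triggered by setting a lexer
custom_after = model.classes[hyphen]     # back to "punctuation"

model.set_whitespace_chars(b" \t-")      # re-apply after the word chars
custom_restored = model.classes[hyphen]  # "space" again
```

So any custom whitespace or punctuation settings would need to be
applied again after the word characters have been set.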


--
Regards
Baz Walter
