On Mon, 28 Mar 2016 09:58:23 -0400
Joe Quinn wrote:

> On 3/28/2016 9:55 AM, RW wrote:

> >    Subject =~ /\$\b/  
> There's no word boundary between the $ and the ' ' because they're
> both in \W.

Thanks, I'd forgotten what the definition of a boundary was.
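
For reference, the definition can be checked with a minimal sketch. It uses Python's re module as a stand-in for Perl (an assumption made purely for illustration, and has_dollar_boundary is a made-up helper name). In both engines \b matches only where a \w character sits next to a \W character or the edge of the string.

```python
import re

# Same pattern as the rule quoted above: a literal "$" followed by \b.
DOLLAR_BOUNDARY = re.compile(r"\$\b")


def has_dollar_boundary(subject: str) -> bool:
    """Return True if the subject contains "$" immediately followed by \\b."""
    return DOLLAR_BOUNDARY.search(subject) is not None


# "$" and " " are both \W, so there is no boundary between them.
print(has_dollar_boundary("Save $ today"))    # False
# "$" is \W and "5" is \w, so a boundary sits between them.
print(has_dollar_boundary("Save $50 today"))  # True
# "$" at the very end: \W next to the string edge is not a boundary.
print(has_dollar_boundary("Save $"))          # False
```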


I presume that, until SpamAssassin gets full Unicode support,
non-ASCII characters are seen as one or more \W characters.
So:

   "  Ångström  "

would have boundaries at the points marked by "|":

  " Å|ngstr|ö|m| "

i.e. split into several words, with no boundary before the Å.
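
The presumption above can be modelled with a short sketch. It does not run SpamAssassin. It assumes the text is UTF-8 and that the regex engine sees raw bytes, and it uses Python's re module, where a bytes pattern restricts \w to ASCII, as a stand-in for that behaviour. The name mark_boundaries is hypothetical.

```python
import re


def mark_boundaries(text: str, byte_semantics: bool = True) -> str:
    """Return text with "|" inserted at every position where \\b matches."""
    if byte_semantics:
        # Bytes pattern: \w is ASCII-only, so every byte of a multi-byte
        # UTF-8 character counts as \W.
        data, pattern, bar = text.encode("utf-8"), rb"\b", b"|"
    else:
        # Str pattern: \w is Unicode-aware, so accented letters count as \w.
        data, pattern, bar = text, r"\b", "|"

    pieces, prev = [], 0
    for match in re.finditer(pattern, data):
        pieces.append(data[prev:match.start()])
        prev = match.start()
    pieces.append(data[prev:])

    marked = bar.join(pieces)
    return marked.decode("utf-8") if byte_semantics else marked


print(repr(mark_boundaries(" Ångström ")))
# ' Å|ngstr|ö|m| '
print(repr(mark_boundaries(" Ångström ", byte_semantics=False)))
# ' |Ångström| '
```

Under the byte-level assumption the output matches the marks shown above. With Unicode-aware matching the whole word is treated as one unit, with a boundary on each side.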
