[Zope-CMF] Fixing STX for non-ascii

Charlie Clark Wed, 13 Jun 2007 07:22:40 -0700

Hi,

since my patch to support ReST for Documents (& Newsitems - thanks toJens for this), I've gone back to looking at what was my originalproblem: STX choking with non-ascii text. From my tests thisafternoon this looks surprisingly easy to fix. This is a samplemethod from zope.structuredtext.document.py (which is used for Zope2.10 and up but looks little more than a simple reimplementation ofthe StructuredText package)


    def doc_strong(self,
                   s,

expr = re.compile('\*\*([\w%s\s]+?)\*\*' %(strongem_punc), re.UNICODE).search # works with non-ASCII# expr = re.compile(r'\*\*([%s%s%s\s]+?)\*\*' %(letters, digits, strongem_punc)).search # fails with non-ASCII#expr = re.compile(r'\s*\*\*([ \n\r%s0-9.:/;,\'\"\?\-\_\/\=\-\>\<\(\)]+)\*\*(?!\*|-)' % letters).search, # oldexpr, inconsistent punc, failed to cross newlines.

        ):

        r=expr(s)
        if r:
            start, end = r.span(1)

return (stng.StructuredTextStrong(s[start:end]),start-2, end+2)

It seems simply adding the re.UNICODE flag and using \w rather thanstring.letters + string.digits is sufficient. However, given myrelative inexperience with regexes this could simply be naïvety on mypart.

If this does indeed work does a patch need submitting for both Zope2.1x and Zope 3?


Charlie
--
Charlie Clark
Helmholtzstr. 20
Düsseldorf
D- 40215
Tel: +49-211-938-5360
GSM: +49-178-782-6226



_______________________________________________
Zope-CMF maillist  -  [email protected]
http://mail.zope.org/mailman/listinfo/zope-cmf

See http://collector.zope.org/CMF for bug reports and feature requests

[Zope-CMF] Fixing STX for non-ascii

Reply via email to