-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 7/25/12 7:13 PM, Tim Hatch wrote: > Hi Pygments users, > > There's been a bug in Pygments for a while now, with the way it > handled non-BMP[1] characters in the XQuery lexer. I don't think > many people used this lexer, and there were no testcases for this > particular issue, so I don't think anybody noticed. ... > I'd like to fix it properly before the next release goes out. > Attached is a patch I'd like to propose, which adds a new function > 'unirange' which will construct the appropriate regex to match a > non-BMP range against the internal representation of a string.
This code, slightly modified, is now in http://code.timhatch.com/hg/pygments-tim The tests still pass for me; please verify for you, and object loudly if you see any problems. I also pushed the escaping for unistring into the generation, since that seemed more natural now that it has to be there for the surrogate changes. Tim -----BEGIN PGP SIGNATURE----- Version: GnuPG/MacGPG2 v2.0.17 (Darwin) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iEYEARECAAYFAlA7KL0ACgkQgwVGtvGz4EeUBACeOA0M8lWxSH+YuRdiP6bnNCsB 9mkAoI+nafa513wkDNNJQp8l9vKj1Ry+ =VENX -----END PGP SIGNATURE----- -- You received this message because you are subscribed to the Google Groups "pocoo-libs" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/pocoo-libs?hl=en.
