Re: [Python-Dev] cc: "Martin v. Löwis"

2007-08-08 Thread Martin v. Löwis
>> I would likely close such a report as "works for me" (after testing >> it does - it did when I last ran it, which was before the release >> of Python 2.5). > > I think that you will find that you are using a non-standard > environment and set of Python sources. Please trust me that I didn't. S

Re: [Python-Dev] Regular expressions, Unicode etc.

2007-08-08 Thread Martin v. Löwis
>> In the mediate term, locale-based testing will go away/be not >> implementable (in particular, Py3k won't have a byte-oriented >> character string type, so we can't use isprint). In general, >> isprint is unsuitable since it doesn't support multi-byte >> character sets. > > Well, iswprint isn't

Re: [Python-Dev] Regular expressions, Unicode etc.

2007-08-08 Thread Nick Maclaren
=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?= <[EMAIL PROTECTED]> wrote: > > >> Before discussing the escape, I'd like to see a specification of > >> it first - what characters precisely would classify as "printing"? > > > > For basic ASCII and locale-based testing, whatever isprint() says. > > Just

Re: [Python-Dev] Regular expressions, Unicode etc.

2007-08-08 Thread Martin v. Löwis
>> Before discussing the escape, I'd like to see a specification of >> it first - what characters precisely would classify as "printing"? > > For basic ASCII and locale-based testing, whatever isprint() says. > Just as for isalpha(). In the mediate term, locale-based testing will go away/be not i

Re: [Python-Dev] Regular expressions, Unicode etc.

2007-08-08 Thread Mike Klaas
In 8-Aug-07, at 12:47 PM, Nick Maclaren wrote: > >>> The other approach, which is to stick to true regular expressions, >>> and wholly or partially convert to DFAs, has already been rendered >>> impossible by even the limited Perl/PCRE extensions that Python >>> has adopted. >> >> Impossible? Sur

[Python-Dev] Please help verify SF data dump imported into (future) new tracker

2007-08-08 Thread Brett Cannon
We are getting very close to moving over to the new tracker (hopefully by the end of the month; no firm date yet, though, as we are still planning things out)! Part of the transition is taking a data dump provided by SourceForge and loading it into our Roundup instance. But we need to make some e

Re: [Python-Dev] Regular expressions, Unicode etc.

2007-08-08 Thread Nick Maclaren
I am not on "Python 3000", so am restricting. Mike Klaas <[EMAIL PROTECTED]> wrote: > > > I have needed to push my stack to teach REs (don't ask), and am > > taking a look at the RE code. I may be able to extend it to support > > RFE 694374 and (more importantly) atomic groups and possessive > >

[Python-Dev] cc: "Martin v. Löwis"

2007-08-08 Thread Nick Maclaren
Re: [Python-Dev] Regular expressions, Unicode etc. =?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?= <[EMAIL PROTECTED]> wrote: > > I recommend you use the 4.1 version of the database; this should > work out of the box, with no change to the build environment at > all. I tried that, of course. See below

Re: [Python-Dev] Regular expressions, Unicode etc.

2007-08-08 Thread Nick Maclaren
[ I would appreciate not getting private copies as well. ] =?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?= <[EMAIL PROTECTED]> wrote: > > Before discussing the escape, I'd like to see a specification of > it first - what characters precisely would classify as "printing"? For basic ASCII and locale-bas

Re: [Python-Dev] Regular expressions, Unicode etc.

2007-08-08 Thread Mike Klaas
On 8-Aug-07, at 2:28 AM, Nick Maclaren wrote: > I have needed to push my stack to teach REs (don't ask), and am > taking a look at the RE code. I may be able to extend it to support > RFE 694374 and (more importantly) atomic groups and possessive > quantifiers. While I regard such things as revo

Re: [Python-Dev] Regular expressions, Unicode etc.

2007-08-08 Thread Martin v. Löwis
> Further to the above, I found the Unicode sources, have rebuilt > the files, but it involved some fairly serious hacking to the > building mechanism and I have had to disable the Unicode 3.2 > support. And, of course, that means that 4 of the tests fail. > > This area needs addressing, not leas

Re: [Python-Dev] Regular expressions, Unicode etc.

2007-08-08 Thread Martin v. Löwis
> My second one is about Unicode. I really, but REALLY regard it as > a serious defect that there is no escape for printing characters. > Any code that checks arbitrary text is likely to need them - yes, > I know why Perl and hence PCRE doesn't have that, but let's skip > that. That is easy to ad

Re: [Python-Dev] Regular expressions, Unicode etc.

2007-08-08 Thread Georg Brandl
Nick Maclaren schrieb: > Further to the above, I found the Unicode sources, have rebuilt > the files, but it involved some fairly serious hacking to the > building mechanism and I have had to disable the Unicode 3.2 > support. And, of course, that means that 4 of the tests fail. > > This area nee

Re: [Python-Dev] Regular expressions, Unicode etc.

2007-08-08 Thread Nick Maclaren
Further to the above, I found the Unicode sources, have rebuilt the files, but it involved some fairly serious hacking to the building mechanism and I have had to disable the Unicode 3.2 support. And, of course, that means that 4 of the tests fail. This area needs addressing, not least because Py

[Python-Dev] Regular expressions, Unicode etc.

2007-08-08 Thread Nick Maclaren
I have needed to push my stack to teach REs (don't ask), and am taking a look at the RE code. I may be able to extend it to support RFE 694374 and (more importantly) atomic groups and possessive quantifiers. While I regard such things as revolting beyond belief, they make a HELL of a difference t