[issue2636] Adding a new regex module (compatible with re)

2014-11-09 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs added the comment: If I recall, I started this thread with a plan to update re itself with implementations of various features listed in the top post. If you look at the list of files uploaded by me there are some complete patches for re to add various features like Atomic

[issue1693050] \w not helpful for non-Roman scripts

2013-05-29 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs added the comment: Thanks, Matthew, and sorry to put you through more work; I just wanted to verify exactly which Unicode code points (UTF-16, I take it) were being used, to check whether the Unicode standard expects them to be treated as unique words or as single letters within a word. Sanskrit

[issue1693050] \w not helpful for non-Roman scripts

2013-05-28 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs added the comment: Matthew, I think that is considered a single word in Sanskrit or Thai so Python 3.x is correct. In this case you've written the Sanskrit word for Hindi.

[issue1693050] \w not helpful for non-Roman scripts

2013-05-28 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs added the comment: Maybe you could show us the byte-for-byte hex of the string you're testing so we can examine if it's really a code point intending word boundary or just a code point for the sake of beginning a new character

[issue17980] CVE-2013-2099 ssl.match_hostname() trips over crafted wildcard names

2013-05-16 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs timeho...@users.sourceforge.net: -- nosy: +timehorse ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue17980

[issue2636] Adding a new regex module (compatible with re)

2011-09-03 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs timeho...@users.sourceforge.net added the comment: Although V1, V2 is less wordy, technically the current behavior is version 2.2.2, so logically this should be re.VERSION222 vs. re.VERSION3 vs. re.VERSIONn, with corresponding (?V222), (?V3) and future (?Vn). But that said

[issue2636] Adding a new regex module (compatible with re)

2011-09-01 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs timeho...@users.sourceforge.net added the comment: On 1 September 2011 16:12, Matthew Barnett rep...@bugs.python.org wrote: Matthew Barnett pyt...@mrabarnett.plus.com added the comment: I think I need a show of hands. For my part, I recommend literal flags, i.e

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2010-12-23 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs timeho...@users.sourceforge.net added the comment: +1 on VC

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2010-07-26 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs timeho...@users.sourceforge.net added the comment: What about a regex flag? Like regex.W or (?w)?

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2010-07-06 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs timeho...@users.sourceforge.net added the comment: My only additional opinion is that re is very much used in deployed Python applications and was written not just for correctness but also speed. As such, regex should be benchmarked fairly to show that it is commensurately

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2009-08-12 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs timeho...@users.sourceforge.net added the comment: /lurk Re: timings Thanks for the info, John. First of all, I really like those tests; could you please submit a patch or other document so that we can combine them into the Python test suite? The Python test suite

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2009-08-12 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs timeho...@users.sourceforge.net added the comment: Mea culpa et mes apologies, The '-s' options in John's expressions are indeed executed only once -- they are one-time setup lines. The final quoted expression is what's run multiple times. In other words, improving caching
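The setup-versus-statement split described here can be reproduced with the standard timeit module. This is a minimal sketch; the pattern and the test string are placeholders for illustration, not John's actual benchmark expressions.

```python
import timeit

# Setup code: executed once per timing run, like each '-s' option on the
# command line. The pattern is a placeholder chosen for illustration.
setup = "import re; pattern = re.compile(r'\\d+')"

# Statement: executed `number` times; this is the part that is measured.
stmt = "pattern.search('abc 123')"

elapsed = timeit.timeit(stmt, setup=setup, number=10000)
print("10000 searches took %.6f s" % elapsed)
```

Because the compile happens in the setup string, the figure reflects only the repeated search, not the one-time compilation cost.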

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2009-03-10 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs timeho...@users.sourceforge.net added the comment: Okay, as I said, Atomic Grouping, etc., off a recent 2.6 is already available and I can do any cleanups requested to those already mentioned, I just don't want to start any new items at the moment. As it is, we are still

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2009-03-10 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs timeho...@users.sourceforge.net added the comment: Thanks, Antoine! Then I think for the most part any changes to Regexp will have to wait for 3.2 / 2.7. -- message_count: 71.0 - 72.0

[issue214033] re incompatibility in sre

2008-10-13 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: The duplicate zero-or-one repeat operator bug originally described in this issue no longer exists in Python 2.6. However, Trent Mick brings up a fair point in that expressions of the form (x*)? generate an error (issue 1456280) when

[issue1456280] Traceback error when compiling Regex

2008-10-13 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: This is another version of the redundant repeat issue defined in issues 2537 and 1633953 and although not described by the original report for issue 214033, the comments further down that issue also describe a similar situation

[issue1633953] re.compile((.*$){1,4}, re.MULTILINE) fails

2008-10-11 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: On first blush, this issue sounds quite similar to issue 2537, but I have been looking at different scenarios and found that there is a subtle difference because, grammatically: (?m)(?:.*$)(.*$) is the same as: (?m)(.*$){2} Yet
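The two forms compared above can be tried side by side. The following is a sketch only: the sample text is an assumption, and whether the counted-repeat form compiles depends on the Python version, so that compile is guarded rather than assumed.

```python
import re

text = "abc"  # assumed sample input, not taken from the original report

# Explicit form: one non-capturing copy followed by one capturing copy.
explicit = re.compile(r"(?m)(?:.*$)(.*$)")
m = explicit.match(text)
print("explicit:", m.group(0), repr(m.group(1)))

# Counted-repeat form; this is the one the issue reports as failing.
try:
    repeated = re.compile(r"(?m)(.*$){2}")
except re.error as exc:
    repeated = None
    print("repeat form rejected:", exc)
else:
    r = repeated.match(text)
    print("repeated:", r.group(0) if r else None)
```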

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-29 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Good work, Matthew. Now, another Bazaar hint, IMHO, is one of my favourite commands: switch. I generally develop all in one directory, rather than getting a new directory for each branch. One does have to be VERY careful to type bzr

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-29 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Matthew, I've traced down the patch failures in my merges and now each of the 4 versions of code on Launchpad should compile, although the first 2 do not pass all the negative look-behind tests, while your later 2 do. Any chance you could

[issue3255] [proposal] alternative for re.sub

2008-09-28 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Implementing Issue 3482 should solve this problem, and I will try to add it to issue 2636 so that it is captured in the general Regexp 2.7 redesign. -- nosy: +timehorse versions: +Python 2.7
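Issue 3482 asks for a flags argument on re.split, re.sub and re.subn. As a hedged illustration of what that looks like in Python versions that accept the keyword (2.7 and 3.1 onwards, as far as I know), with sample strings that are purely illustrative:

```python
import re

# Case-insensitive substitution without pre-compiling the pattern.
fixed = re.sub(r"python", "Python", "python PYTHON", flags=re.IGNORECASE)
print(fixed)

# Case-insensitive split on the letter x.
parts = re.split(r"x", "aXbxc", flags=re.IGNORECASE)
print(parts)
```

Passing flags by keyword matters here: the fourth positional argument of re.sub is count, not flags.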

[issue2650] re.escape should not escape underscore

2008-09-28 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- versions: +Python 2.7, Python 3.1 -Python 2.6, Python 3.0

[issue2650] re.escape should not escape underscore

2008-09-28 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue1721518] Small case which hangs

2008-09-28 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse versions: +Python 2.7 -Python 2.4

[issue1721518] Small case which hangs

2008-09-28 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Tested on 2.6rc2 and slow but successful. Issue 1662851 may be related.

[issue1693050] \w not helpful for non-Roman scripts

2008-09-28 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse versions: +Python 2.7 -Python 2.4

[issue2537] re.compile(r'((x|y+)*)*') should fail

2008-09-28 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse versions: +Python 2.7 -Python 2.6

[issue1633953] re.compile((.*$){1,4}, re.MULTILINE) fails

2008-09-28 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse versions: +Python 2.7 -Python 2.5

[issue1282] re module needs to support bytes / memoryview well

2008-09-28 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue214033] re incompatibility in sre

2008-09-28 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue1708652] Exact matching

2008-09-28 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse versions: +Python 2.7

[issue1456280] Traceback error when compiling Regex

2008-09-28 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue3665] Support \u and \U escapes in regexes

2008-09-27 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue3482] re.split, re.sub and re.subn should support flags

2008-09-27 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue3482] re.split, re.sub and re.subn should support flags

2008-09-27 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- versions: +Python 2.7, Python 3.1 -Python 2.6, Python 3.0

[issue3299] invalid object destruction in re.finditer()

2008-09-27 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- versions: +Python 2.7 -Python 2.6

[issue3665] Support \u and \U escapes in regexes

2008-09-27 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- versions: +Python 2.7, Python 3.1 -Python 3.0

[issue1519638] Unmatched Group issue - workaround

2008-09-27 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue1519638] Unmatched Group issue - workaround

2008-09-27 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- versions: +Python 2.7 -Python 2.5

[issue1662581] the re module can perform poorly: O(2**n) versus O(n**2)

2008-09-27 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse versions: +Python 2.7

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-26 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Matthew, Did you upload a public SSH key to your Launchpad account? You're on MS Windows, right? I can try and do an install on an MS Windows XP box or 2 I have lying around and see how that works, but we should try and solve this vexing

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-26 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Great, Matthew!! Now, I'm still in the process of setting up branches related to your work; generally they should be created from a core and set of features implemented for example: To get from Version 2 to Version 3 of your Engine, I had

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-26 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Thanks, Matthew. My reading of that Answer is that you should be okay because you, I assume, installed the Windows-Native package rather than the cygwin that I first tested. I think the problem is specific to Cygwin as well

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-26 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Phew! Okay, all your patches have been applied as I said in a previous message, and you should now be able to check out lp:~pythonregexp2.7/python/issue2636+01+09-02+17+18+19+20+21+24+26 where you can then apply your latest known patch (rc2

[issue433029] SRE: posix classes aren't supported

2008-09-26 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: To clarify, you mean named character sets as found in Perl and Emacs, which are normally written, for example, like '[:ALPHANUM:]', right? We are working on that as Item 8 of Issue 2636: Regexp 2.7. If not, please clarify so I know what

[issue3299] invalid object destruction in re.finditer()

2008-09-26 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-25 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: I've enumerated the current list of Item Numbers at the official Launchpad page for this issue: https://launchpad.net/~pythonregexp2.7 There you will find links to each development branch associated with each item, where a broader

[issue1160] Medium size regexp crashes python

2008-09-25 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue1160] Medium size regexp crashes python

2008-09-25 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: It seems that changing the size type of the Regular Expression Byte-code is a nice quick-fix, even though it doubles the size of a pattern. It may have the added benefit that most machine architectures available today are at least partially

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-25 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Good catch, Matthew, and if you spot any other outstanding Regular Expression issues feel free to mention them here. I'll give issue 1160 an item number of 25 and think all we need to do here is change SRE_CODE to be typedefed

[issue1647489] zero-length match confuses re.finditer()

2008-09-25 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Perl gives this result for your new expression: ,undef,undef undef,undef,abc undef,,undef I think it has to do with not thinking of a string as a sequence of characters, but as a sequence of characters separated by null-space. Null-space

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-25 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Hmmm. Well, some of those are already covered:
#2636: self
#1160: Item 25
#1647489: Item 24
#3511: Item 23
#3825: Item 9-2
#433028: Item 21
#433027: Item 20
#433024: Item 19
#3262: Item 22
#3299: TBD
#3665

[issue433024] SRE: (?flag) isn't properly scoped

2008-09-24 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue433027] SRE: (?-flag) is not supported.

2008-09-24 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue433028] SRE: (?flag:...) is not supported

2008-09-24 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue3825] Major reworking of Python 2.5.2 re module

2008-09-24 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Matthew, I am really happy that you are making such progress on your engine, but can I PLEASE ask you to slow down for a moment? We have a lot of issues already listed in issue 2636 that is a catch-all for any Python 2.7 Regexp

[issue3511] Incorrect charset range handling with ignore case flag?

2008-09-24 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue3511] Incorrect charset range handling with ignore case flag?

2008-09-24 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: I think this is even more complicated when you consider that localization may be an issue. Consider Á: is this grammatically before A or after a? From a character set point of view, it is typically after a but when Locale is taken
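The character-set half of that statement is easy to check: in code point order, Á (U+00C1) sorts after both A and a. A small sketch, which deliberately does not attempt locale-aware collation because that depends on the locales installed on the machine:

```python
# "\u00c1" is the precomposed letter A with acute accent.
letters = ["a", "\u00c1", "A"]

# Plain sorted() on strings compares by code point, not by any locale.
by_code_point = sorted(letters)
print([hex(ord(ch)) for ch in by_code_point])
```

A range such as [A-z] is defined over this same code point order, which is why accented letters fall outside it.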

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-24 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Thanks for weighing in, Matthew! Yeah, I do get some flak for item 2 because originally item 3 wasn't supposed to cover named groups but on investigation it made sense that it should. I still prefer 2 over-all but the nice thing about them

[issue1647489] zero-length match confuses re.finditer()

2008-09-24 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue1647489] zero-length match confuses re.finditer()

2008-09-24 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Hmmm. This strikes me as a bug, beyond the realm of Issue 3262. The two items may be related, but the dropping of the 'a' seems like unexpected behaviour that I doubt any current code is expecting to occur. Clearly, what is going

[issue1647489] zero-length match confuses re.finditer()

2008-09-24 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Never mind inclusion in 2.6 as no-one has repeated this bug in real-world examples yet so it's going to have to wait for the Regexp 2.7 engine in issue 2636. -- versions: +Python 2.7 -Python 2.5

[issue1647489] zero-length match confuses re.finditer()

2008-09-24 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Ah, I see the problem, if ptr is not incremented, then it will keep matching the first expression, (^z*), so it would have to both 'skip' the 'a' and NOT skip the 'a'. Hmm. You're right, Matthew, this is pretty complicated. Now, for your
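The behaviour can be observed with a pattern of the following shape. Only the first alternative, (^z*), appears in the comment above; the second alternative and the input string are assumptions made for illustration. The printed result depends on the Python version, so no expected output is given.

```python
import re

# (^z*) can match the empty string at position 0; what the engine does
# with the character right after that empty match is the point at issue.
matches = [m.groups() for m in re.finditer(r"(^z*)|(\w+)", "abc")]
print(matches)
```

If the second tuple reports 'bc' rather than 'abc', the engine has skipped the 'a' after the zero-length match, which is the symptom discussed in this thread.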

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-24 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Good catch on issue 1647489 Matthew; it looks like this is where that bug fix will end up going. But, I am unsure if the solution for this issue is going to be the same as for 3262. I think the solution here is to add an internal flag

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-24 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: I've moved all the development branches to the ~pythonregexp2.7 team so that we can work collaboratively. You just need to install Bazaar, join www.launchpad.net, upload your public SSH key and then request to be added

[issue3262] re.split doesn't split with zero-width regex

2008-09-22 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: I think Mike Coleman's proposal of enabling this behaviour via flag is probably best and IMHO we should consider it under these circumstances. Intuitively, I think your interpretation of what re.split should do under zero-width conditions
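A quick way to see what a given interpreter does with a zero-width pattern is sketched below. The input string is an assumption, and the output differs between versions (as far as I know, splitting on empty matches became the default in Python 3.7, without the flag proposed here), so only an invariant is relied on.

```python
import re

text = "one two"  # assumed sample input

# \b matches only the zero-width positions at word boundaries.
parts = re.split(r"\b", text)
print(parts)

# Whether or not the split happens, no characters are consumed.
print("".join(parts) == text)
```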

[issue3262] re.split doesn't split with zero-width regex

2008-09-21 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue3654] Duplicated test name in regex test script

2008-09-21 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue516762] have a way to search backwards for re

2008-09-21 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: -- nosy: +timehorse

[issue2636] Regexp 2.7 (modifications to current re 2.2.2)

2008-09-16 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Update 16 Sep 2008: Based on the work for issue #3825, I would like to simply update the item list as follows: 1) Atomic Grouping / Possessive Qualifiers (See also Issue #433030) [Complete] 2) Match group names as attributes (e.g

[issue3825] Major reworking of Python 2.5.2 re module

2008-09-15 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Well, I implemented this months ago, but have been busy with other things so I haven't updated in a while. I noticed that the current version is missing my patches for Atomic Grouping / Possessive Qualifiers and a number of other patches I

[issue3825] Major reworking of Python 2.5.2 re module

2008-09-15 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: I have uploaded my test cases for Atomic Grouping / Possessive Qualifier, which is the common code we seem to have developed, as this may be of use to you. I also have documentation, but for now, would you mind running these tests against

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-06-19 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Thanks for weighing in Mark! Actually, your point is valid and quite fair, though I would not assume that Item 3 would be included just because Item 2 isn't. I will do my best to develop both, but I do not make the final decision

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-06-17 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Well, it's time for another update on my progress... Some good news first: Atomic Grouping is now completed, tested and documented, and as stated above, is classified as issue2636-01 and related patches. Secondly, with caveats listed

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-06-17 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Removed file: http://bugs.python.org/file10052/issue2636-09.patch

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-06-17 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Removed file: http://bugs.python.org/file10467/issue2636.diff

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-06-17 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Removed file: http://bugs.python.org/file10428/issue2636-05-only.diff

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-06-17 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Removed file: http://bugs.python.org/file10468/issue2636-05.diff

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-06-17 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Removed file: http://bugs.python.org/file10469/issue2636-07.diff

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-06-17 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Removed file: http://bugs.python.org/file10470/issue2636-07-only.diff

[issue433030] SRE: Atomic Grouping (?...) is not supported

2008-06-17 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: I have finished work on the Atomic Grouping / Possessive Qualifiers support and am including a patch to achieve this; however, http://bugs.python.org/issue2636 should be consulted for the complete list of changes in the works
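For readers unfamiliar with the feature, the effect of an atomic group can be sketched as follows. This does not use the patch from this issue; it assumes an interpreter whose re module accepts the (?>...) syntax (Python 3.11 and later, to my knowledge) and guards the compile so the example still runs elsewhere. The helper name try_compile is hypothetical.

```python
import re

def try_compile(pattern):
    """Return a compiled pattern, or None if this engine rejects the syntax."""
    try:
        return re.compile(pattern)
    except re.error:
        return None

# Ordinary group: a+ can give back one 'a' so that the trailing 'ab' matches.
greedy = re.compile(r"(?:a+)ab")
print(greedy.match("aaab") is not None)

# Atomic group: once a+ has consumed every 'a', it never gives any back.
atomic = try_compile(r"(?>a+)ab")
if atomic is None:
    print("atomic groups are not supported by this re module")
else:
    print(atomic.match("aaab"))
```

The possessive form a++ab is, as I understand it, shorthand for the same atomic behaviour applied to a single quantifier.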

[issue433030] SRE: Atomic Grouping (?...) is not supported

2008-06-17 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Removed file: http://bugs.python.org/file9897/PyLibDiffs.txt

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-06-17 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Sorry, as I stated in the last post, I generated the patches then realized that I was missing the documentation for Item 2, so I have updated the issue2636-02.patch file and am attaching that separately until the next release of the patch

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-05-29 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Added file: http://bugs.python.org/file10467/issue2636.diff

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-05-29 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Removed file: http://bugs.python.org/file10427/issue2636.diff

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-05-29 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Added file: http://bugs.python.org/file10468/issue2636-05.diff

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-05-29 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Removed file: http://bugs.python.org/file10429/issue2636-05.diff

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-05-28 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Mark scribbled: One possible solution would be a grouptuples() function that returned a tuple of 3-tuples (index, name, captured_text) with the name being None for unnamed groups. Hmm. Well, that's not a bad idea at all IMHO and would
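Mark's suggestion can be approximated today as a free function over an ordinary match object. This is a hypothetical sketch, not the API proposed in the thread: the name grouptuples comes from the comment above, but the choice to start at group 1 and to take the match as an argument are assumptions.

```python
import re

def grouptuples(match):
    """Return ((index, name, captured_text), ...) for every group in match.

    name is None for groups that were not given a name.
    """
    # groupindex maps name -> index; invert it to look names up by index.
    names = {index: name for name, index in match.re.groupindex.items()}
    return tuple(
        (index, names.get(index), match.group(index))
        for index in range(1, match.re.groups + 1)
    )

m = re.match(r"(?P<year>\d{4})-(\d{2})", "2008-05")
result = grouptuples(m)
print(result)
```

Groups that did not participate in the match would appear with None as their captured text, the same value match.group() returns for them.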

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-05-24 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Added file: http://bugs.python.org/file10428/issue2636-05-only.diff

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-05-24 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: I am finally making progress again, after a month of changing my patches from my local svn repository to bazaar hosted on launchpad.net, as stated in my last update. I also have more or less finished the probably easiest item, #5, so I

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-05-24 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Added file: http://bugs.python.org/file10429/issue2636-05.diff

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-05-24 Thread Jeffrey C. Jacobs
Changes by Jeffrey C. Jacobs [EMAIL PROTECTED]: Removed file: http://bugs.python.org/file10056/issue2636-05.patch

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-05-01 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: I am making my changes in a Bazaar branch hosted on Launchpad. It took me quite a while to get things set up more-or-less logically but there they are and I'm currently trying to re-apply my local changes up to today into the various

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-04-26 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Thank you and Merci Antoine! That is a good point. It is clearly specific to the compiler whether a switch-case will be turned into a series of conditional branches or simply creating an internal jump table with lookup. And it is true

[issue433030] SRE: Atomic Grouping (?...) is not supported

2008-04-25 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Quick update (see thread 2636 for other updates to the Regex functionality): I do have a design in my mind for how to accomplish both Atomic Grouping and Possessive Qualifiers and it would work with either the existing Engine design

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-04-24 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Thanks Jim for your thoughts! Amaury has already explained about Perl 5.10.0. I suppose it's like Macintosh version numbering, since Mac Tiger went from version 10.4.9 to 10.4.10 and 10.4.11 a few years ago. Maybe we should call Python

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-04-18 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: This simple patch adds (?P#...)-style comment support. Added file: http://bugs.python.org/file10056/issue2636-05.patch

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-04-17 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: I am very sorry to report (at least for me) that as of this moment, item 9), although not yet complete, is stable and able to pass all the existing python regexp tests. Because these tests are timed, I am using the timings from the first

[issue2636] Regexp 2.6 (modifications to current re 2.2.2)

2008-04-15 Thread Jeffrey C. Jacobs
New submission from Jeffrey C. Jacobs [EMAIL PROTECTED]: I am working on adding features to the current Regexp implementation, which is now set to 2.2.2. These features are to bring the Regexp code closer in line with Perl 5.10 as well as add a few python-specific niceties and potential speed

[issue433030] SRE: Atomic Grouping (?...) is not supported

2008-03-29 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: I'm digging into sre_parse.py at the moment and think I have all the changes I need for that now. The rest of the changes I believe are in either sre_compile.py or more likely directly in _sre.c, so I will examine those files next. I

[issue433030] SRE: (?...) is not supported

2008-03-28 Thread Jeffrey C. Jacobs
Jeffrey C. Jacobs [EMAIL PROTECTED] added the comment: Fredrik, If you're still listening, I am gonna try and tackle this one but I would like to know why you or the famous Jeffrey of the Regexp world claims that there is already code in the Regexp Engine for Atomic Grouping? Adding a hook
