New issue 2052: Regex module memory leak/crash https://bitbucket.org/pypy/pypy/issue/2052/regex-module-memory-leak-crash
magxx:

While using the [regex](https://pypi.python.org/pypi/regex/2015.05.10) module to parse a lot of data (I have ~1GB of lines), I've noticed that at a certain point memory usage balloons, growing at about 200MB/s. I strongly suspect it has something to do with regex.finditer(), because the memory leak only occurs when that line of code is present. When parsing a small 30MB file, the issue did not get a chance to arise.

The code looks like:

```
#!python
int_group_test = special_end_regex.search(string)
if int_group_test:
    new_string = string[:int_group_test.start()]
    last_splitter = [m.end() for m in dash_regex.finditer(new_string)]
```

I have tried both the 32-bit and 64-bit versions of PyPy with the same result. If the re module is used instead, the problem does not occur. I also ran the code on CPython with both regex and re, and both succeed without any such memory leak.

_______________________________________________
pypy-issue mailing list
pypy-issue@python.org
https://mail.python.org/mailman/listinfo/pypy-issue