------- You are receiving this mail because: ------- You are on the CC list for the bug.
http://bugs.exim.org/show_bug.cgi?id=1130 Summary: pcregrep doesn't copy entire lines to output when they are long ( > 25000 chars) Product: PCRE Version: 8.12 Platform: Other OS/Version: Linux Status: NEW Severity: bug Priority: medium Component: Code AssignedTo: [email protected] ReportedBy: [email protected] CC: [email protected] I create a file with a single long line with an 'a', many 'x's and a 'b'. pcregrep for a or for b produces output that differs from the origial file, which I didn't expect. This happens for 25000 'x's, but for 20000 'x's it is fine, as this shell snippet illustrates: > perl -e 'print "a", "x"x25000, "b\n"' > long > perl -e 'print "a", "x"x20000, "b\n"' > short > for ls in long short ; do for ab in a b ; do pcregrep $ab $ls > ${ls}${ab} ; > done ; done > md5sum long* short* a02b4cbbb437eaf52997832952a1d052 long 1dfac8b938bfaec4c6bd727ffae356fd longa 1d5d6df30c643aed4a626dd8ab36f2ec longb 27ee48c18be91ac0038ba8d9a3988625 short 27ee48c18be91ac0038ba8d9a3988625 shorta 27ee48c18be91ac0038ba8d9a3988625 shortb > ls -l long* short* -rw-r--r-- 1 pvm pvm 25003 2011-07-12 14:33 long -rw-r--r-- 1 pvm pvm 24576 2011-07-12 14:33 longa -rw-r--r-- 1 pvm pvm 427 2011-07-12 14:33 longb -rw-r--r-- 1 pvm pvm 20003 2011-07-12 14:33 short -rw-r--r-- 1 pvm pvm 20003 2011-07-12 14:33 shorta -rw-r--r-- 1 pvm pvm 20003 2011-07-12 14:33 shortb I expected > pcregrep a long to generate the same output as > pcregrep b long to be identical to long, just as is the case with the short* files I was pcregrep-ping through minified javascript files (that are all in one line) and noticed that the output doesn't have a terminating newline: > (cat long ; cat long) > 2long > pcregrep a 2long | wc -l 0 > grep a 2long | wc -l 2 > pcregrep --version pcregrep version 8.12 2011-01-15 on Ubuntu natty -- Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email -- ## List details at https://lists.exim.org/mailman/listinfo/pcre-dev
