ID:               41216
 Updated by:       [EMAIL PROTECTED]
 Reported By:      DPP <paul dot dovbush at gmail dot com>
-Status:           Open
+Status:           Bogus
 Bug Type:         PCRE related
 Operating System: WinXPsp2
 PHP Version:      5.2.1
 New Comment:

Sorry, but your problem does not imply a bug in PHP itself.  For a
list of more appropriate places to ask for help using PHP, please
visit http://www.php.net/support.php as this bug system is not the
appropriate forum for asking support questions.  Due to the volume
of reports we can not explain in detail here why your report is not
a bug.  The support channels will be able to provide an explanation
for you.

Thank you for your interest in PHP.




Previous Comments:
------------------------------------------------------------------------

[2007-04-27 17:33:54] DPP <paul dot dovbush at gmail dot com>

Forgot to say: file contain russian text encoded in UTF-8.
Without PCRE_UTF8 modifier regexp falls on russian letter "R".

------------------------------------------------------------------------

[2007-04-27 17:26:59] DPP <paul dot dovbush at gmail dot com>

Description:
------------
Parsing file with 10000 lines of following format:

level + delim + [EMAIL PROTECTED]@ + delim +] tag + [delim + line_value +]
terminator

level           digit
delim           space
xref_id alphanum
tag             alpha (english)
line_value      any (except terminator)
terminator      \r\n

With regexp:

$c=preg_match_all("/^\s*(\d+)\s+(@(\S+)@\s+)?(\w+)(\s+@(\S+)@\s*|.*)?$/Sm",$fp,$m,PREG_PATTERN_ORDER);

Setting PCRE_UTF8 modifier slows whole script down 30 times (from 300ms
to 9000ms).

May be more accurate regexp here will be
$c=preg_match_all("/^ *(\d+) +(@([EMAIL PROTECTED])@ +)?([^ \\n]+)(
+@([EMAIL PROTECTED])@ *| +[^\\n]*)?$/m",$fp,$m,PREG_PATTERN_ORDER);
But it changes nothing.



------------------------------------------------------------------------


-- 
Edit this bug report at http://bugs.php.net/?id=41216&edit=1

Reply via email to