Yes, those fales positives, I see.
Now that happens, some URLs are killed:
---------------------------------------------------------------------------
To extract:
http:://www.vibeuiaductor.com/auoid/njs_pt1.mp3 200 ok audio/mpeg
71089796 njs_p
t1.mp3 17.01.2012 23:47:49 2 3 Apache
00:00.501 utf-8
The extraction:
njs_pt1.mp3
---------------------------------------------------------------------------
To extract (actually not, it would not be needed):
[email protected] 2 4
00:00.000 utf-8
The extraction:
gmail.com
---------------------------------------------------------------------------
To extract:
http:://sites.google.com/site/mauriciowhysou38/unove-suns/MidnightStar-Searching
ForLove.mp3 200 ok text/html
MidnightStar-SearchingForLove.mp3 25.01.2011
19:30:47 2 1 1 GSE 00:01.983 utf-8
The extraction:
MidnightStar-SearchingForLove.mp3
---------------------------------------------------------------------------
cite:
--------------------------------------------------------------------------------
Alternatively, if you know the form of the urls you want to match, it
might be workable to write a simpler pattern from scratch - a large part of this
version seems to deal with the query part after
?.--------------------------------------------------------------------------------
I assume, there are to many different, unknown forms.
May be it is easier to extract the www. URLs in a first step and in a second
step all of the remaining URLs containing http and similar.
Thank you very much.
--
<http://forum.pspad.com/read.php?2,62001,62026>
PSPad freeware editor http://www.pspad.com