=?windows-1250?Q?Re:_Extract_all_links_/_URLs?_[62026]?=

Dirk Fri, 21 Feb 2014 07:09:59 -0800

Yes, those fales positives, I see.

Now that happens, some URLs are killed:


--------------------------------------------------------------------------- 
To extract:


http:://www.vibeuiaductor.com/auoid/njs_pt1.mp3 200     ok      audio/mpeg      
71089796        njs_p
t1.mp3  17.01.2012  23:47:49    2               3       Apache          
00:00.501       utf-8


The extraction:


njs_pt1.mp3

--------------------------------------------------------------------------- 
To extract (actually not, it would not be needed):


                [email protected]            2               4               
        00:00.000       utf-8


The extraction:


gmail.com

--------------------------------------------------------------------------- 
To extract:


http:://sites.google.com/site/mauriciowhysou38/unove-suns/MidnightStar-Searching
ForLove.mp3     200     ok      text/html               
MidnightStar-SearchingForLove.mp3       25.01.2011 
19:30:47        2       1       1       GSE             00:01.983       utf-8


The extraction:


MidnightStar-SearchingForLove.mp3

--------------------------------------------------------------------------- 

cite:
--------------------------------------------------------------------------------
Alternatively, if you know the form of the urls you want to match, it
might be workable to write a simpler pattern from scratch - a large part of this
version seems to deal with the query part after 
?.--------------------------------------------------------------------------------

I assume, there are to many different, unknown forms. 

May be it is easier to extract the www. URLs in a first step and in a second
step all of the remaining URLs containing http and similar.

Thank you very much.

-- 
<http://forum.pspad.com/read.php?2,62001,62026>
PSPad freeware editor http://www.pspad.com

=?windows-1250?Q?Re:_Extract_all_links_/_URLs?_[62026]?=

Odpovedet emailem