> The pseudocode is something like
> 
> get: 
>  grab url
>  parse HTML
>  foreach internal link, get link (remember depth!)
>  if specialtag found, push url onto foundlist
>  
> print foundlist

Take a look at WWW::Robot, which does exactly this.  It works for me,
although I'm not sure where its maintainers are.  I posted a patch to
this list recently and never got a response.
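
If you would rather roll your own, the quoted pseudocode can be sketched roughly as below. This is only an illustration in Python using the standard library, not how WWW::Robot works internally. The names PageScanner, fetch and crawl, and the parameters special_tag and max_depth, are made up for the example. "Internal link" is assumed to mean "same host as the start URL", and the sketch ignores robots.txt, errors, and non-HTML content, all of which a real robot should handle.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class PageScanner(HTMLParser):
    """Collects href targets and notes whether the special tag occurs."""

    def __init__(self, special_tag):
        super().__init__()
        self.special_tag = special_tag  # lowercase; the parser lowercases tag names
        self.links = []
        self.found = False

    def handle_starttag(self, tag, attrs):
        if tag == self.special_tag:
            self.found = True
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def fetch(url):
    """Grab a URL and return its body as text (no error handling)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(start_url, special_tag, max_depth, fetch=fetch):
    """Return the URLs, within max_depth links of start_url, that contain special_tag."""
    site = urlparse(start_url).netloc   # assumption: internal means same host
    seen = {start_url}                  # never fetch the same URL twice
    found = []
    queue = [(start_url, 0)]            # (url, depth) pairs: "remember depth!"
    while queue:
        url, depth = queue.pop(0)
        scanner = PageScanner(special_tag)
        scanner.feed(fetch(url))        # grab url, parse HTML
        if scanner.found:
            found.append(url)           # push url onto foundlist
        if depth >= max_depth:
            continue                    # too deep: do not follow links from here
        for href in scanner.links:
            link = urldefrag(urljoin(url, href)).url
            if urlparse(link).netloc == site and link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))
    return found
```

Something like print(crawl("http://www.example.org/", "blink", 2)) would then print the found list. The fetch argument is there so the crawl logic can be tried against a dictionary of canned pages before it is let loose on a real site.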

-- 
Reinier Post
