Hi Darshit,

Apologies for my previous email formatting: it got broken.

Yes, I was thinking of using the --follow-tags switch that
also was suggested in the "HTML Parsing" thread (Sep 2014),
but as far as I understand how it works in html-url.c, it
builds an intersection between the list of built-in HTML
tags and the tags the user wants to follow, so it cannot
be used to register new tags and attributes. This is why
I considered adding a new switch.


23 січня 2020, 19:39:04, від "Darshit Shah" <[email protected]>:

> Hi Lyubomyr,
> 
> Thanks for your patch!
> 
> I haven't tested it yet, but I wonder why the existing switch
> `--follow-tags=LIST` was not enough for your use case. IIRC, it was
> exactly this case that it was implemented for.
> 
> * Lyubomyr Shaydariv <[email protected]> [200123 18:35]:
> > Hi,
> > 
> > Some HTML documents use non-standard attributes that are essentially URLs 
> > that might be walked through. The attached patch allows to specify new tags 
> > and attributes to follow.
> > 
> > Example of use:
> > 
> > ./wget -nd -r -P . -A jpg \
> >     --custom-html-attrs=div/big_img,div/med_img \
> >     http://localhost/index.html
> > 
> > However:
> > The patch does not include tests or external documentation updates of any 
> > kind. It does not validate tag/attribute pairs except of simple "/" 
> > checking. 
> > I'm not a C programmer, so I'm fine if the patch is considered poor and 
> > consequently rejected.
> > Thanks.
> 
> 
> 
> 


Reply via email to