Iam trying to use the html parser as a standalone parser in my code. Iam trying to retrieve only the urls of the elements to render the page i have set the dash_p_leaf_HTML in get_urls_html It does not download img tag urls inside the href tag Consider the following snippet of html from https://www.verisign.com <TABLE border=0 cellPadding=0 cellSpacing=0 width="100%"> <TBODY> <TR vAlign=center> <TD align=middle width=102><A href="http://ad.doubleclick.net/jump/N595.verisign/B34145;sz=102x80;ord=[timestamp]?"><IMG alt="Click Here!" border=0 height=80 src="VeriSign Inc_ - www_verisign_com_files/377234-secure_servers_102x80_1_pm.gif" width=102></A></TD> The parser does not seem to recognise 377234-secure_servers_102x80_1_pm.gif inside the img tag which is embeded in href I hope i am not missing any setting or is this a bug Regards Bharath
