Iam trying to use the html parser as a  standalone parser in my code.
Iam trying to retrieve only the urls of the elements to render the page
i have set the dash_p_leaf_HTML in get_urls_html
It does not download  img tag urls inside the href tag
Consider the following snippet of html from https://www.verisign.com

 <TABLE border=0 cellPadding=0 cellSpacing=0 width="100%">
        <TBODY>
        <TR vAlign=center>
          <TD align=middle width=102><A

href="http://ad.doubleclick.net/jump/N595.verisign/B34145;sz=102x80;ord=[timestamp]?";><IMG

            alt="Click Here!" border=0 height=80
            src="VeriSign Inc_ -
www_verisign_com_files/377234-secure_servers_102x80_1_pm.gif"
            width=102></A></TD>

The parser does not seem to recognise
377234-secure_servers_102x80_1_pm.gif inside the img tag which is
embeded
in href
I hope i am not missing any setting or is this a bug

Regards
Bharath

Reply via email to