[ https://issues.apache.org/jira/browse/DROIDS-72?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Richard Frovarp updated DROIDS-72:
----------------------------------

    Attachment: read-base-uri.patch

Updates core so that the base URI can be read from an element in the source document.

> Doesn't honor base element
> --------------------------
>
>                 Key: DROIDS-72
>                 URL: https://issues.apache.org/jira/browse/DROIDS-72
>             Project: Droids
>          Issue Type: Bug
>          Components: core
>    Affects Versions: Graduating from the Incubator
>            Reporter: Richard Frovarp
>             Fix For: 0.01
>
>         Attachments: read-base-uri.patch
>
>
> The HtmlParser and LinkExtractor do not honor the base element in HTML.
> Relative links are therefore resolved against the wrong URI, which makes
> some sites impossible to crawl. LinkExtractor and HtmlParser should accept
> an element/attribute pair to look for a base URI.
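
For illustration only, here is a minimal sketch of what honoring a base element could look like. It is not the content of read-base-uri.patch and does not use the Droids API: the class BaseUriSketch and its methods findAttribute and effectiveBase are hypothetical names, and the regular-expression lookup is a stand-in for whatever the real parser does. The caller passes in the element/attribute pair (for HTML, "base" and "href"), and relative links are then resolved against the declared base instead of the document URL.

```java
import java.net.URI;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch; not part of Droids and not taken from the attached patch.
final class BaseUriSketch {

    // Returns the value of the given attribute on the first matching element,
    // or null if no such element/attribute pair is present.
    static String findAttribute(String html, String element, String attribute) {
        Pattern p = Pattern.compile(
                "<" + Pattern.quote(element) + "\\b[^>]*?\\b" + Pattern.quote(attribute)
                        + "\\s*=\\s*[\"']([^\"']*)[\"']",
                Pattern.CASE_INSENSITIVE);
        Matcher m = p.matcher(html);
        return m.find() ? m.group(1) : null;
    }

    // Falls back to the document URI when the page declares no base.
    // A relative base value is itself resolved against the document URI.
    static URI effectiveBase(URI documentUri, String html, String element, String attribute) {
        String value = findAttribute(html, element, attribute);
        if (value == null || value.trim().isEmpty()) {
            return documentUri;
        }
        return documentUri.resolve(value.trim());
    }

    public static void main(String[] args) {
        URI doc = URI.create("http://example.org/docs/index.html");
        String html = "<html><head><base href=\"http://cdn.example.org/site/\"></head>"
                + "<body><a href=\"img/logo.png\">logo</a></body></html>";

        // Ignoring the base element resolves against the document URL.
        System.out.println(doc.resolve("img/logo.png"));
        // prints http://example.org/docs/img/logo.png

        // Honoring the base element resolves against the declared base.
        URI base = effectiveBase(doc, html, "base", "href");
        System.out.println(base.resolve("img/logo.png"));
        // prints http://cdn.example.org/site/img/logo.png
    }
}
```

The two printed URIs differ, which is the symptom described in the issue: a crawler that ignores the base element requests links that do not exist on the site.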