Hello everyone, I've created a plugin for Nutch 1.0 that extends the parser.
This plugin extract several kinds of information from the document DOM. In some cases I need to extract an "href" of a certain link. The link in the DOM is still relative as it was originally written in the html document, so for example it might be a link with an href of "/music". My question is - how can I make this link have an absolute url - for example make "/music" to "http://www.example.com/music"? Thanks a lot, Eran