Hi Tony,

On Tue, Jun 18, 2013 at 11:49 AM, Tony Mullins <[email protected]> wrote:

> ...instead
> of returning html of the current page it is returning me the url of all the
> pages in seed.txt
>

I suspect that this should not be happening at all!

>
> Could you please try entering 2 or more urls in seed.txt and then
> get webPage.getContent().array()
> in your ParseFilter plugin .... then you will see instead of returning the
> html of current webPage.getBaseUrl() , it is returning the html of all urls
> of seed.txt.
>

This does not make sense, Tony. When would a call to
page.getContent().array() return you page.getBaseUrl()? If you want the
base URL, just call getBaseUrl(). If you want the page HTML, call
getContent(). Why are you involving getBaseUrl()?

