how to target certain links !! do you know how the links are made !? i mean
their format ?
you can just set a regular expression to accept only those kind of links
> Date: Mon, 5 Oct 2009 21:39:52 +0200
> From: a...@getopt.org
> To: nutch-user@lucene.apache.org
> Subject: Re: Targeting Specific Links for Crawling
>
> Eric wrote:
> > Does anyone know if it possible to target only certain links for
> > crawling dynamically during a crawl? My goal would be to write a plugin
> > for this functionality but I don't know where to start.
>
> URLFilter plugins may be what you want.
>
>
> --
> Best regards,
> Andrzej Bialecki <><
> ___. ___ ___ ___ _ _ __________________________________
> [__ || __|__/|__||\/| Information Retrieval, Semantic Web
> ___|||__|| \| || | Embedded Unix, System Integration
> http://www.sigram.com Contact: info at sigram dot com
>
_________________________________________________________________
New: Messenger sign-in on the MSN homepage
http://go.microsoft.com/?linkid=9677403