how to target certain links !! do you know how the links are made !? i mean 
their format ?
you can just set a regular expression to accept only those kind of links 



> Date: Mon, 5 Oct 2009 21:39:52 +0200
> From: a...@getopt.org
> To: nutch-user@lucene.apache.org
> Subject: Re: Targeting Specific Links for Crawling
> 
> Eric wrote:
> > Does anyone know if it possible to target only certain links for 
> > crawling dynamically during a crawl? My goal would be to write a plugin 
> > for this functionality but I don't know where to start.
> 
> URLFilter plugins may be what you want.
> 
> 
> -- 
> Best regards,
> Andrzej Bialecki     <><
>   ___. ___ ___ ___ _ _   __________________________________
> [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
> ___|||__||  \|  ||  |  Embedded Unix, System Integration
> http://www.sigram.com  Contact: info at sigram dot com
> 
                                          
_________________________________________________________________
New: Messenger sign-in on the MSN homepage
http://go.microsoft.com/?linkid=9677403

Reply via email to