Hi Jerome,
Thanks. So essentially I need to rebuild the IndexSegment class and
customise. I guess that's the beauty of open-source software. The downside
is I don't know any Java!
I'll take a look into getting that done later.
Many thanks,
Dean
----- Original Message -----
From: "Jérôme Charron" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Thursday, November 17, 2005 10:18 PM
Subject: Re: Crawling a page for links, but not indexing it
Is there anyway that I can do this from the Nutch side?
Yes ... by modifying the IndexSegment class and avoid adding to the index
the documents that match a configurable URL...
;-)
Jérôme
--
http://motrech.free.fr/
http://www.frutch.org/
-------------------------------------------------------
This SF.Net email is sponsored by the JBoss Inc. Get Certified Today
Register for a JBoss Training Course. Free Certification Exam
for All Training Attendees Through End of 2005. For more info visit:
http://ads.osdn.com/?ad_id=7628&alloc_id=16845&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general