would this delete documents?
Lucene.IndexReader.delete(new Term("path",SomeParticularObject.getPath()))
if this is the command to delete documents than create a crawler application
to crawl your site returning a list of documents/urls and then run the above
command through in a loop deleting the documents, just an idea - im not that
knowledgeable yet on nutch/lucene, hope it helps
----- Original Message -----
From: "Sébastien LE CALLONNEC" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Wednesday, September 07, 2005 10:13 PM
Subject: RE: Recrawling
Hi Jake,
I presume you're using the "crawl" command: it means you have to delete
the already existing index to crawl again...
Regards,
Sebastien
--- "Vanderdray, Jake" <[EMAIL PROTECTED]> a écrit :
I want to apologize in advance for this very basic question, but
my searches aren't turning up the answer so far. I've successfully
run
a crawl and I can search the results. I'd like to update my index by
re-crawling my site, but when I try to use the same command I used
the
first time I get an error saying that the index already exists.
What is the correct method for re-crawling a site? I'd be happy
to add the answer back into the nutch site or wiki if I can.
Thanks,
Jake.
___________________________________________________________________________
Appel audio GRATUIT partout dans le monde avec le nouveau Yahoo! Messenger
Téléchargez cette version sur http://fr.messenger.yahoo.com