would this delete documents?

Lucene.IndexReader.delete(new Term("path",SomeParticularObject.getPath()))

if this is the command to delete documents than create a crawler application to crawl your site returning a list of documents/urls and then run the above command through in a loop deleting the documents, just an idea - im not that knowledgeable yet on nutch/lucene, hope it helps


----- Original Message ----- From: "Sébastien LE CALLONNEC" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Wednesday, September 07, 2005 10:13 PM
Subject: RE: Recrawling


Hi Jake,


I presume you're using the "crawl" command: it means you have to delete
the already existing index to crawl again...

Regards,
Sebastien


--- "Vanderdray, Jake" <[EMAIL PROTECTED]> a écrit :

I want to apologize in advance for this very basic question, but
my searches aren't turning up the answer so far.  I've successfully
run
a crawl and I can search the results.  I'd like to update my index by
re-crawling my site, but when I try to use the same command I used
the
first time I get an error saying that the index already exists.

What is the correct method for re-crawling a site?  I'd be happy
to add the answer back into the nutch site or wiki if I can.

Thanks,
Jake.








___________________________________________________________________________
Appel audio GRATUIT partout dans le monde avec le nouveau Yahoo! Messenger
Téléchargez cette version sur http://fr.messenger.yahoo.com



Reply via email to