Chetan,

I assuming you're looking to update a document within a segment. The way
Nutch works, once a segment iscreated, it's not meant to be updated. So, if
you were to get the same URL again, it will be in a new segment.
Periodically, you merge multiple segments which removes the duplicates.

Most medium to large installs have their own way to deal with this situation
given their needs.

Regards,
CC
 

-----Original Message-----
From: Chetan Sahasrabudhe [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, April 20, 2005 3:27 AM
To: [email protected]
Subject: What does segments stand for ?

What does segments stand for ?
As per my understanding for every crawl nutch creates a separate segment.
If I want to update values for perticular URL in x segment, then how do I
decide what segment folder to use for applying IndexSegment ?
Regards
Chetan 


Regards 
Chetan 


Reply via email to