Chetan, I assuming you're looking to update a document within a segment. The way Nutch works, once a segment iscreated, it's not meant to be updated. So, if you were to get the same URL again, it will be in a new segment. Periodically, you merge multiple segments which removes the duplicates.
Most medium to large installs have their own way to deal with this situation given their needs. Regards, CC -----Original Message----- From: Chetan Sahasrabudhe [mailto:[EMAIL PROTECTED] Sent: Wednesday, April 20, 2005 3:27 AM To: [email protected] Subject: What does segments stand for ? What does segments stand for ? As per my understanding for every crawl nutch creates a separate segment. If I want to update values for perticular URL in x segment, then how do I decide what segment folder to use for applying IndexSegment ? Regards Chetan Regards Chetan
