Thanks for the information. I am planning to use Nutch 2.x.

On 23 January 2013 13:58, Markus Jelsma <[email protected]> wrote:
> If you use 1.x and don't merge segments, you still have older versions of
> documents. There is no active versioning in Nutch 1.x except segment naming
> and merging, if you use it.
>
> -----Original message-----
> > From: Tejas Patil <[email protected]>
> > Sent: Wed 23-Jan-2013 09:25
> > To: [email protected]
> > Subject: Re: Nutch support with regards to Deduplication and Document versioning
> >
> > Hi Anand,
> > Nutch will keep the latest content of a given URL (based on the time when
> > it was fetched). It won't store the old versions.
> >
> > Thanks,
> > Tejas
> >
> >
> > On Wed, Jan 23, 2013 at 12:12 AM, Anand Bhagwat <[email protected]> wrote:
> > >
> > > Hi,
> > > I want to know what kind of support Nutch provides with regard to
> > > de-duplication and document versioning.
> > >
> > > Thanks,
> > > Anand.

