Thanks for the information. I am planning to use Nutch 2.x.

On 23 January 2013 13:58, Markus Jelsma <[email protected]> wrote:

> If you use 1.x and don't merge segments you still have older versions of
> documents. There is no active versioning in Nutch 1x except segment naming
> and merging, if you use it.
>
> -----Original message-----
> > From:Tejas Patil <[email protected]>
> > Sent: Wed 23-Jan-2013 09:25
> > To: [email protected]
> > Subject: Re: Nutch support with regards to Deduplication and Document
> versioning
> >
> > Hi Anand,
> > Nutch will keep the latest content of a given url (based on the time when
> > it was fetched). It wont store the old versions.
> >
> > Thanks,
> > Tejas
> >
> >
> > On Wed, Jan 23, 2013 at 12:12 AM, Anand Bhagwat <[email protected]
> >wrote:
> >
> > > Hi,
> > > I want to know what kind of support does Nutch provides with regards to
> > > de-duplication and document versioning?
> > >
> > > Thanks,
> > > Anand.
> > >
> >
>

Reply via email to