The latest segments would have a modified date of when you ran generate dbsegments I don't know how to do it in script. Possibly the nutch generatecommand has a return? That's a question for someone with more knowledge than I. But I too would like to know. I don't know about: Deleted 0 content duplicates.
-----Original Message----- From: Hasan Diwan [mailto:[EMAIL PROTECTED] Sent: Monday, February 27, 2006 6:45 PM To: [email protected]; [EMAIL PROTECTED] Subject: Re: nutch-extensionpoints 0.71 Mr. Braman (or anyone else): On 27/02/06, Richard Braman <[EMAIL PROTECTED]> wrote: > > > bin/nutch fetch segments/<latest_segment> How would I determine which is the latest segment? I don't really know what your other question was. I know there are duplicate URLs in urls.txt. Why would I be getting the line below? > 060227 150626 Deleted 0 content duplicates. > Thanks again for the kind assistance. -- Cheers, Hasan Diwan <[EMAIL PROTECTED]>
