The latest segments would have a modified date of when you ran generate
dbsegments
I don't know how to do it in script.
Possibly the nutch generatecommand has a return?
That's a question for someone with more knowledge than I.  But I too
would like to know.
I don't know about:  Deleted 0 content duplicates.


-----Original Message-----
From: Hasan Diwan [mailto:[EMAIL PROTECTED] 
Sent: Monday, February 27, 2006 6:45 PM
To: [email protected]; [EMAIL PROTECTED]
Subject: Re: nutch-extensionpoints 0.71


Mr. Braman (or anyone  else):

On 27/02/06, Richard Braman <[EMAIL PROTECTED]> wrote:
>
>
> bin/nutch fetch segments/<latest_segment>


How would I determine which is the latest segment?

I don't really know what your other question was.

I know there are duplicate URLs in urls.txt. Why would I be getting the
line below?

> 060227 150626 Deleted 0 content duplicates.
>

Thanks again for the kind assistance.

--
Cheers,
Hasan Diwan <[EMAIL PROTECTED]>

Reply via email to