I just merged all of my segments into one, which was fast considering I have only around 80k documents. After I run mergesegs I am a little confused as to what to do.

I indexed the new master segment... now what?

I ran updatedb (command bin/nutch updatedb crawl/db crawl/segments/segment_name_here) and it told me my database now has 260k documents. Did I just _add_ the new segment to the old database? I'm a bit confused because there are only 80k documents inthe segment, how can there be 260k records in the database?

I also merged the indexes (ls -d crawl/segments/* | xargs bin/nutch merge crawl/index), but I'm afraid it will also retain the old records like the updatedb command seemed to do. Is this true?

So my questions are as follows: What's up with it saying I have 260k records in my index? Also, will the merge command (see above) recreate the index each time it is run, or is it just adding new indexes to old ones?

Matt

Reply via email to