Doug,

> I cannot reproduce this.
I was able to reproduce it on different systems several times.
It is important that you use at least two boxes.
Create a crawldb with about 100 000 entries.
Generate a segment from it without any limits and count the entries in the freshly generated segment. I wrote my own tool to test this using a sequence file reader; you will see that the generated segment has around 50 000 entries, not 100 000.

The problem is somehow related to the two boxes.

If you like, I can write a test that makes the problem reproducible, but it may take some time since there is just too much in the queue.

Stefan
