Is it possible for me to dedup a Lucene index on a Hadoop filsystem
against a finished Lucene index?

I build up my index with Nutch as per normal, but I would like to
inject single urls and merge the result into the final index without
having to run a full crawl.

Cheers
Rob

Reply via email to