Hey there, currently i try to debug the dedup results from nutch. There is a page with is exactly the same (compared the HTML with a diff tool) as on a differed Domain but dedup does not delete this entry.
Is this caused by the differed Domain? If so, is there a possibility to configure that? Thanks in advice Jan --

