Hi,Eugen I think that is right way.
----------- Regards, Alexey > P.P.S Why not to develop efficient technique to fight near-duplicates > and SE spam? This is absolutely necessary if build Internet search > engine based on nutch. Another "must have" is variable refetch time > for pages (this could be based on estimating average update time of > the page + taking into account page score) > -- > Best regards, > Eugen mailto:[EMAIL PROTECTED]
