Alexey,

Sorry for the delay in answering you. I will definitely share my code
with the Nutch community, but I'm currently on vacation, away from my
sources, so I will share it as soon as my vacation ends ;-)

P.S. Nutch is great, and I hope that my efforts will help to make it
better.

P.P.S. Why not develop an efficient technique to fight near-duplicates
and SE spam? This is absolutely necessary if you want to build an
Internet search engine based on Nutch. Another "must have" is a variable
refetch time for pages (this could be based on an estimate of the
average update time of the page, taking the page score into account).
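
To make the refetch idea a bit more concrete, here is a rough sketch in
Python. It is not Nutch code: the function name, the parameters and all
the constants (30-day default, 1 to 90 day bounds, score assumed to lie
in 0..1 and shortening the interval by at most half) are assumptions
chosen only for illustration.

```python
def next_fetch_interval(change_gaps_days, score,
                        default_days=30.0, min_days=1.0, max_days=90.0):
    """Return the number of days to wait before refetching a page.

    change_gaps_days: observed gaps (in days) between detected changes
                      of the page; may be empty for a new page.
    score:            page score, assumed here to be normalized to 0..1.
    All defaults are hypothetical values, not Nutch settings.
    """
    # Estimate the average update time from the observed history,
    # falling back to a default when nothing has been observed yet.
    if change_gaps_days:
        avg_update = sum(change_gaps_days) / len(change_gaps_days)
    else:
        avg_update = default_days

    # Take the page score into account: a higher score shortens the
    # interval, by at most 50% for the top score (assumed weighting).
    clamped_score = min(max(score, 0.0), 1.0)
    interval = avg_update * (1.0 - 0.5 * clamped_score)

    # Keep the result within sane bounds.
    return min(max(interval, min_days), max_days)
```

For example, a page that changed after 10 and then 20 days would get a
15-day interval at score 0 and a 7.5-day interval at score 1. How much
weight the score deserves would have to be tuned on real crawl data.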


> Hi, Jerome

> I think that the best way is to ask Eugene to share his code. I hope
> he will comply with our request... :)
> I want to believe that his answer will be positive! If not, then I
> will share my "BAD code" with you.

> -----------

> Regards

> Alexey

-- 
Best regards,
 Eugen                            mailto:[EMAIL PROTECTED]


