Thanks for your reply.
I want to know about all possible configuration for inserting new outlinks and 
inlinks in database not a specific configuration!


Regards,
MSH





On Sunday, May 17, 2015 11:10 PM, Talat Uyarer <[email protected]> wrote:



Hi Mahdieh,

For the first question. Can you share your nutchğsite.xml ?

Second question. Depends on your scroing plugin. In default Nutch has
one scroing plugin that is opic. You should understand How opic
scoring  works[0]. In DBUpdateMapper[1] step scores are distributed
and in DBUpdateReducer, collect coming score.

HTH

[0] https://wiki.apache.org/nutch/FixingOpicScoring
[1] 
https://github.com/apache/nutch/blob/2.x/src/java/org/apache/nutch/crawl/DbUpdateMapper.java#L83
[2] 
https://github.com/apache/nutch/blob/2.x/src/java/org/apache/nutch/crawl/DbUpdateReducer.java#L195


2015-05-17 9:24 GMT+03:00 mahdieh Shahverdi <[email protected]>:
> Hi All,I have two questions about outlinks and inlinks in nutch 2.3:1. I 
> searched inside nutch 2.3 code to get where it adds new outlinks and inlinks 
> extracted from a newly fetched url page in webpage table for fetching in next 
> rounds but I found nothing!2. where does nutch update scores of above 
> outlinks and inlinks? I read DBupdatejob.java but I don't understand how 
> nutch searches outlinks and inlinks of a fetched url in database for updating 
> their scores!
> Regards,MSH
>



-- 
Talat UYARER
Websitesi: http://talat.uyarer.com
Twitter: http://twitter.com/talatuyarer
Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304

Reply via email to