Indexer doesn't consider linkdb when calculating boost value
------------------------------------------------------------

         Key: NUTCH-267
         URL: http://issues.apache.org/jira/browse/NUTCH-267
     Project: Nutch
        Type: Bug

  Components: indexer  
    Versions: 0.8-dev    
    Reporter: Chris Schneider
    Priority: Minor


Before OPIC was implemented (Nutch 0.7, very early Nutch 0.8-dev), if 
indexer.boost.by.link.count was true, the indexer boost value was scaled based 
on the log of the # of inbound links:

    if (boostByLinkCount)
      res *= (float)Math.log(Math.E + linkCount);

This is no longer true (even before Andrzej implemented scoring filters). 
Instead, the boost value is just the square root (or some other scorePower) of 
the page score. Shouldn't the invertlinks command, which creates the linkdb, 
have some affect on the boost value calculated during indexing (either via the 
OPICScoringFilter or some other built-in filter)?

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira



-------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to