[ https://issues.apache.org/jira/browse/NUTCH-518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12513821 ]
Doğacan Güney commented on NUTCH-518:
-------------------------------------

This is another alternative. I am not suggesting that we use it, just putting it on the table:

* Remove the initial score argument from indexerScore and generatorSortValue.
* Change ScoringFilters.java to collect the scores from the different ScoringFilter-s.
* Calculate their geometric mean.

This approach is far more aggressive. It is like a logical AND: with a geometric mean, a page is 'important' pretty much only if *all* scoring filters decide that it is important. I really like this approach, but it won't work for people who want to give a high score to pages with certain content even if the page itself has no inlinks (for that case, addition would have worked very well).

> Fix OpicScoringFilter to respect scoring filter chaining
> --------------------------------------------------------
>
>                 Key: NUTCH-518
>                 URL: https://issues.apache.org/jira/browse/NUTCH-518
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 1.0.0
>            Reporter: Enis Soztutar
>            Assignee: Doğacan Güney
>             Fix For: 1.0.0
>
>         Attachments: opicScoring.chain.patch
>
>
> Opic Scoring returns the score that it calculates, rather than returning
> previous_score * calculated_score. This prevents using another scoring filter
> along with Opic scoring.
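
To make the trade-off in the comment concrete, here is a minimal Java sketch that compares a geometric mean with plain addition. It is not Nutch code and not the attached patch: the class and method names (ScoreCombinerSketch, geometricMean, sum) are hypothetical, and it assumes every scoring filter produces a finite, non-negative score.

```java
public class ScoreCombinerSketch {

    // Hypothetical helper: geometric mean of the scores returned by each filter.
    // Computed through logarithms so that many small scores do not underflow.
    public static double geometricMean(double[] scores) {
        if (scores.length == 0) {
            throw new IllegalArgumentException("need at least one score");
        }
        double logSum = 0.0;
        for (double s : scores) {
            if (s < 0.0) {
                throw new IllegalArgumentException("scores must be non-negative");
            }
            // Math.log(0.0) is -Infinity, so a single zero score forces the result to 0.
            logSum += Math.log(s);
        }
        return Math.exp(logSum / scores.length);
    }

    // Hypothetical helper: additive combination, for comparison.
    public static double sum(double[] scores) {
        double total = 0.0;
        for (double s : scores) {
            total += s;
        }
        return total;
    }

    public static void main(String[] args) {
        // Assumed example: a link-based filter scores 0 (no inlinks),
        // while a content-based filter scores 9.
        double[] noInlinksGoodContent = {0.0, 9.0};
        System.out.println("geometric mean: " + geometricMean(noInlinksGoodContent));
        System.out.println("sum:            " + sum(noInlinksGoodContent));
    }
}
```

With these assumed inputs the geometric mean is 0 while the sum is 9, which is the "logical AND" behaviour described above: one filter that scores a page at zero vetoes it, whereas addition lets the content-based score carry the page.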