[ 
https://issues.apache.org/jira/browse/NUTCH-518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12513821
 ] 

Doğacan Güney commented on NUTCH-518:
-------------------------------------

Here is another alternative. I am not suggesting that we use it, just putting 
it on the table:

* Remove the initial score argument from indexerScore and generatorSortValue.
* Change ScoringFilters.java to collect the scores from the individual ScoringFilter-s.
* Calculate the geometric mean of the collected scores.

This approach is far more aggressive. It behaves like a logical AND: with a 
geometric mean, a page is 'important' pretty much only if *all* scoring filters 
decide that it is important. I really like this approach, but it won't work for 
people who want to give a high score to pages with certain content even if the 
page itself has no inlinks (for that case, addition would work very well).
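
To make the AND-like behaviour concrete, here is a rough sketch of the 
combination step. It is not Nutch code: ScoreCombinerSketch, geometricMean and 
sum are hypothetical names, scores are assumed to be non-negative, and doubles 
are used only to keep the example short.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Nutch: combines per-filter scores.
final class ScoreCombinerSketch {

    /** Geometric mean of non-negative scores, computed in log space. */
    static double geometricMean(double[] scores) {
        if (scores.length == 0) {
            throw new IllegalArgumentException("at least one score is required");
        }
        double logSum = 0.0;
        for (double s : scores) {
            if (s < 0.0) {
                throw new IllegalArgumentException("scores must be non-negative");
            }
            // Math.log(0.0) is -Infinity, so a single zero score makes the result 0.
            logSum += Math.log(s);
        }
        return Math.exp(logSum / scores.length);
    }

    /** Plain addition, shown only for comparison. */
    static double sum(double[] scores) {
        return Arrays.stream(scores).sum();
    }

    public static void main(String[] args) {
        // Assumed scenario: one filter likes the content, another sees no inlinks.
        double[] contentOnly = {10.0, 0.0};
        System.out.println("geometric mean: " + geometricMean(contentOnly));
        System.out.println("sum:            " + sum(contentOnly));
    }
}
```

With one filter returning 10 and another returning 0, the geometric mean 
collapses to 0 while the sum stays at 10, which is exactly the "content but no 
inlinks" case described above.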



> Fix OpicScoringFilter to respect scoring filter chaining
> --------------------------------------------------------
>
>                 Key: NUTCH-518
>                 URL: https://issues.apache.org/jira/browse/NUTCH-518
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 1.0.0
>            Reporter: Enis Soztutar
>            Assignee: Doğacan Güney
>             Fix For: 1.0.0
>
>         Attachments: opicScoring.chain.patch
>
>
> Opic Scoring returns the score that it calculates, rather than returning 
> previous_score * calculated_score. This prevents using another scoring filter 
> along with Opic scoring. 

