Hi Dmitriy,

>-----Original Message-----
>From: Dmitriy V. Kazimirov [mailto:[email protected]]
>Sent: Saturday, June 26, 2010 9:53 PM
>To: [email protected]
>Subject: How to make nutch take distance between terms in document in
>account?
>
>Hi,
>
>Is it possible to make nutch scoring take into account distance between
>terms?
>
>i.e. if we have query president bush medvev, document where all 3 terms
>are near(how to define 'near' hear is also interesting) each are scoring
>higher than they are away from each other?
>
>If that's not possible right now - I'm correct that new QueryFilter
>should
>be implemented?how this should be made?

This is implemented, but is not being used, if I am not wrong. Please see
addSloppyPhrases in BasicQueryFilter.java. Note that SLOP (the proximity
parameter) is set to Integer.MAX_VALUE which defines 'near' as 'very far'.
I did not find any code that would change it in Nutch.

Regards,

Arkadi

>
>
>
>
>With regards, Dmitriy

Reply via email to