Allow setting of slop values for non-quote phrase queries on query-basic plugin
-------------------------------------------------------------------------------
Key: NUTCH-877
URL: https://issues.apache.org/jira/browse/NUTCH-877
Project: Nutch
Issue Type: Improvement
Components: searcher
Affects Versions: 1.2
Environment: All
Reporter: Dennis Kubes
Assignee: Dennis Kubes
Fix For: 1.2
This patch adds a configuration variable for setting the slop value on phrase
queries. The current default slop, which can't be changed through configuration,
is Integer.MAX_VALUE, so the terms of a phrase match at any distance from each
other. That doesn't seem right to me: if you are searching for a phrase, you
usually want the terms within a certain distance of each other. The current
default produces a score explanation like this:
2.9141337E-4 = weight(content:"my phrase"~2147483647 in 1029), product of:
  0.07163286 = queryWeight(content:"my phrase"~2147483647), product of:
    9.657982 = idf(content: my=13470 phrase=534)
    0.0074169594 = queryNorm
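
To make the effect of the slop value concrete, here is a minimal sketch in plain Java. It is not Nutch or Lucene code: the class SlopSketch and its methods are hypothetical, and it only approximates Lucene's sloppy phrase matching for a two-term phrase such as "my phrase", assuming the token positions are already known.

```java
// Hypothetical illustration only; not part of Nutch, Lucene, or the patch.
class SlopSketch {

    // Number of position moves needed to bring a two-term phrase into
    // exact order, given the token positions of the first and second term.
    // An exact phrase (second term right after the first) needs 0 moves.
    public static int movesNeeded(int firstPos, int secondPos) {
        return Math.abs((secondPos - 1) - firstPos);
    }

    // A two-term phrase matches when the moves needed do not exceed the slop.
    public static boolean matches(int firstPos, int secondPos, int slop) {
        return movesNeeded(firstPos, secondPos) <= slop;
    }

    public static void main(String[] args) {
        int proposedSlop = 5;                 // default proposed in this issue
        int currentSlop = Integer.MAX_VALUE;  // current hard-coded default

        // "my" at position 0, "phrase" at position 1: an exact phrase.
        System.out.println(matches(0, 1, proposedSlop));   // true

        // "my" at position 0, "phrase" 400 tokens later.
        System.out.println(matches(0, 400, proposedSlop)); // false
        System.out.println(matches(0, 400, currentSlop));  // true
    }
}
```

With a slop of Integer.MAX_VALUE the two terms match no matter how far apart they are, which is the behavior the explanation output above reflects. Under this sketch, swapping two adjacent terms costs 2 moves, so a slop of 5 still tolerates reordering and a few intervening words.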
This patch adds the query.phrase.slop configuration value to the
nutch-default.xml file. It has a default setting of 5.
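
As a rough sketch, an override of this setting could look like the fragment below. The property name and the value of 5 come from the description above; placing it in nutch-site.xml and the wording of the description element are assumptions, not text taken from the patch.

```xml
<!-- Assumed override in conf/nutch-site.xml; the description text is illustrative. -->
<property>
  <name>query.phrase.slop</name>
  <value>5</value>
  <description>Slop applied to phrase queries, limiting how far apart
  the phrase terms may be and still match.</description>
</property>
```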