Enrico Triolo wrote:
> Hi all, I'm trying to implement a search plugin to search on the
> 'subType' field added by index-more plugin. It's a very simple plugin,
> copied almost entirely from query-basic.
>
> The problem is, when I perform a query on that field I get no results
> at all. Other fields are handled by the same plugin, and I'm able to
> search over them. Moreover, performing queries with luke on the
> subType field I get the expected results.
>
> Looking at the source code I found out that when parsing a query
> string all fields are transformed lower case: so, the query
> 'subType:html' becomes 'subtype:html' (see method 'getNextToken' in
> org.apache.nutch.analysis.NutchAnalysisTokenManager).
> Could it be this the cause of the wrong result set? Is there a reason
> why fields are treated this way?

For simplicity and user-friendliness. While in Lucene we can reasonably 
expect that sophisticated users will construct sophisticated queries, 
paying attention to lower/upper-case, we need to lower the barrier for a 
general-purpose search engine frontend.

-- 
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to