Enrico Triolo wrote: > Hi all, I'm trying to implement a search plugin to search on the > 'subType' field added by index-more plugin. It's a very simple plugin, > copied almost entirely from query-basic. > > The problem is, when I perform a query on that field I get no results > at all. Other fields are handled by the same plugin, and I'm able to > search over them. Moreover, performing queries with luke on the > subType field I get the expected results. > > Looking at the source code I found out that when parsing a query > string all fields are transformed lower case: so, the query > 'subType:html' becomes 'subtype:html' (see method 'getNextToken' in > org.apache.nutch.analysis.NutchAnalysisTokenManager). > Could it be this the cause of the wrong result set? Is there a reason > why fields are treated this way?
For simplicity and user-friendliness. While in Lucene we can reasonably expect that sophisticated users will construct sophisticated queries, paying attention to lower/upper-case, we need to lower the barrier for a general-purpose search engine frontend. -- Best regards, Andrzej Bialecki <>< ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers
