[ 
https://issues.apache.org/jira/browse/NUTCH-316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Markus Jelsma closed NUTCH-316.
-------------------------------

    Resolution: Won't Fix

> Confusion about query languages
> -------------------------------
>
>                 Key: NUTCH-316
>                 URL: https://issues.apache.org/jira/browse/NUTCH-316
>             Project: Nutch
>          Issue Type: Bug
>          Components: web gui
>    Affects Versions: 0.8
>         Environment: n/a
>            Reporter: KuroSaka TeruHiko
>
> In 2006-6-16 nightly source code, src/web/jsp/search.jsp has these lines:
>   String queryLang = request.getParameter("lang");
>   if (queryLang == null) { queryLang = ""; }
>   Query query = Query.parse(queryString, queryLang, nutchConf);
> According to the observation of URLs shown in the browser, the lang parameter 
> reflects the language
> of the GUI (the language in which GUI elements are labeled) as the user 
> clicks on the two letter code 
> near the bottom of each Nutch GUI screen.
> The Java API Doc on Query is not clear about what queryLang is meant.  Is 
> this the language of
> the query (how query should be lemmatized, if supported by the analyzer, and 
> what stop word list
> should be applied), is is this the language of the documents to be searched?
> Although the two concepts above are closely related, they are not tied to the 
> GUI language at all.
> I, as Japanese user, might prefer to see all GUIs in Japanese, but I would 
> still need to
> search English documents for Englsh words.  The current implementation of 
> search.jsp seems
> to restrict search domain to the documents of the GUI language in one way (by 
> treating the
> terms to be from the GUI language), or the other (restricting the search 
> domain to the documents
> of the GI language).
> To be perfect, there should be a drop-down list from which the language of 
> query analyzer
> is selected, and a set of check boxes from which the document languages can 
> be selected,
> in addition to the existing line of two letter language codes from which the 
> GUI language is choosen.
> But that would be too clutering.  
> Google uses a separate configuration screen to let the user to choose a set 
> of languages
> of the documents to be searched.  That might be a good middle-of-the-road 
> approach.
> Because of the lack of language processing on search terms, Google does not 
> need to know
> the language of the query.  Nutch GUI might want to have a drop down list 
> from which a language
> of the query can be choosen, with the GUI language pre-selected.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to