Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The following page has been changed by KurosakaTeruhiko:
http://wiki.apache.org/nutch/MultiLingualSupport

The comment on the change is:
Minor correcton to previous comment and a new comment about query language

------------------------------------------------------------------------------
  }}}
  so that, the {{{IndexWriter}}} is called with the good {{{Analyzer}}} 
implementation.
   
-   (Note by KurosakaTeruhiko) This seems to have been implemented in Nutch 
0.8.  The following lines were found in {{{Indexer}}}, not {{{IndexSegment}}} 
which no longer exists in Nutch 0.8:
+   (Note by KurosakaTeruhiko) This seems to have been implemented in Nutch 
0.8.  The following lines are found in {{{Indexer}}}, not {{{IndexSegment}}} 
which no longer exists in Nutch 0.8:
    {{{
  final AnalyzerFactory factory = new AnalyzerFactory(job);
   .
@@ -70, +70 @@

  But the {{{lang}}} attribute in this case must be retrieved from the 
front-end using the following policy:
   1. Use an optional {{{lang}}} attribute provided by the search interface.
   2. If no such attribute is provided by the search interface, then uses the 
Browser language.
-  3. (try to identify the query language using the LanguageIdentifierPlugin)
+  3. (try to identify the query language using the 
LanguageIdentifierPlugin)(Note by KurosakaTeruhiko: This probably won't work 
well because queries are usually too short to tell a language.  "chat" can be 
English or French, for example.  What language is "Euro"?)
  
  === Code modifications ===
  The query analysis requires more code modifications than the document 
analysis.

Reply via email to