Hi,

I have recently dealt with Japanese support and wrote
how I did it on
http://nislab.human.waseda.ac.jp/blog/?page_id=7 . I
think it'll give you some idea.

Br,
Roman

--- sanjeev <[EMAIL PROTECTED]> wrote:

> 
> Hi all,
> 
> I am still waiting for some help re: the thai
> language indexing and
> searching.
> 
> Please help as i'm quite lost on this one.
> 
> Thanks and regards,
> sanjeev.
> 
> 
> sanjeev wrote:
> > 
> > Thanks for clearing up some doubts. But exactly
> how do i wrap it ?
> > Do I need to make changes in code to utilize the
> new thaitokenizer ?
> > If yes - where are the places that need
> modification ? 
> > Do I need to download a dev version and do a
> recompile ?
> > 
> > Please - if you could possibly tell me the steps -
> in brief - i would be
> > highly obliged.
> > 
> > Thanks,
> > sanjeev.
> > 
> > 
> > 
> > 
> > Jérôme Charron wrote:
> >> 
> >>> i used an existing ThaiAnalyzer which was in
> lucene packlage.
> >>> ok - i renamed the lucene.analysis.th.* to
> nutch.analysis.th.* -
> >>> compiled
> >>> and
> >>> placed all class files in a jar -
> analysis-th.jar (do i need to bundle
> >>> the
> >>> ngp file in the jar as well ?)
> >> 
> >> 1. You don't have to refactor the lucene
> analyzer. Just to wrap it like I
> >> do
> >> with french and german analyzers (they both use
> some analyzers from
> >> lucene).
> >>  2. Analyzer doesn't need ngp files... I think
> you misunderstood
> >> something:
> >> 2.1 In one side there is the language identifier
> that use NGP files to
> >> identify language of a document
> >> 2.2 In the other sided if a suitable analyzer is
> found for the identified
> >> language, it is used to analyze the document.
> >> 
> >> Regards
> >> 
> >> Jérôme
> >> 
> >> 
> >> -- 
> >> http://motrech.free.fr/
> >> http://www.frutch.org/
> >> 
> >> 
> > 
> > 
> 
> -- 
> View this message in context:
>
http://www.nabble.com/implement-thai-language-indexing-and-search-tf2641172.html#a7827701
> Sent from the Nutch - Dev mailing list archive at
> Nabble.com.
> 
> 


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 

Reply via email to