implement thai lanaguage analyzer during nutch crawl process

2006-11-26 Thread sanjeev
the plugin.includes property in nutch-site.xml but nothing is happening - the log still shows not including language identifier - :-( what's wrong ? Please help. Thanks in advance. -- View this message in context: http://www.nabble.com/implement-thai-lanaguage-analyzer-during-nutch-crawl-process

RE: implement thai lanaguage analyzer in nutch

2006-11-14 Thread sanjeev
Message- From: sanjeev [mailto:[EMAIL PROTECTED] Sent: 2006-11-08 19:28 To: nutch-dev@lucene.apache.org Subject: Re: implement thai lanaguage analyzer in nutch I need a Thai Analyzer for Nutch. I want the crawler to be intelligent enough to split thai words correctly since thai don't

RE: implement thai lanaguage analyzer in nutch

2006-11-10 Thread Teruhiko Kurosaka
the search term is one Unicode character. -kuro -Original Message- From: sanjeev [mailto:[EMAIL PROTECTED] Sent: 2006-11-08 19:28 To: nutch-dev@lucene.apache.org Subject: Re: implement thai lanaguage analyzer in nutch I need a Thai Analyzer for Nutch. I want the crawler

Re: implement thai lanaguage analyzer in nutch

2006-11-08 Thread Arun Kaundal
is the same as for any other language. But yes - even I would appreciate any information to resolve this problem. regards, sanjeev. -- View this message in context: http://www.nabble.com/implement-thai-lanaguage-analyzer-in-nutch-tf2587282.html#a7233864 Sent from the Nutch - Dev mailing list archive

Re: implement thai lanaguage analyzer in nutch

2006-11-08 Thread sanjeev
this implemented ASAP and I can't wait. cheers, sanjeev. -- View this message in context: http://www.nabble.com/implement-thai-lanaguage-analyzer-in-nutch-tf2587282.html#a7236321 Sent from the Nutch - Dev mailing list archive at Nabble.com.

RE: implement thai lanaguage analyzer in nutch

2006-11-08 Thread Teruhiko Kurosaka
Sanjay, I don't think you should follow the Chinese example and extend the CJK range. This was needed because Chinese and Japanese don't use space to separate words. I believe Thai uses spaces, right? If so, you should extend LETTER range to include Thai character rather than CJK. Another place

Re: implement thai lanaguage analyzer in nutch

2006-11-08 Thread ogjunk-nutch
ThaiWordFilter.java Otis - Original Message From: Teruhiko Kurosaka [EMAIL PROTECTED] To: sanjeev [EMAIL PROTECTED]; nutch-dev@lucene.apache.org Sent: Wednesday, November 8, 2006 2:16:38 PM Subject: RE: implement thai lanaguage analyzer in nutch Sanjay, I don't think you should follow the Chinese

Re: implement thai lanaguage analyzer in nutch

2006-11-08 Thread sanjeev
: Wednesday, November 8, 2006 2:16:38 PM Subject: RE: implement thai lanaguage analyzer in nutch Sanjay, I don't think you should follow the Chinese example and extend the CJK range. This was needed because Chinese and Japanese don't use space to separate words. I believe Thai uses spaces

Re: implement thai lanaguage analyzer in nutch

2006-11-08 Thread sanjeev
PM Subject: RE: implement thai lanaguage analyzer in nutch Sanjay, I don't think you should follow the Chinese example and extend the CJK range. This was needed because Chinese and Japanese don't use space to separate words. I believe Thai uses spaces, right? If so, you should extend

Re: implement thai lanaguage analyzer in nutch

2006-11-08 Thread sanjeev
have to get this implemented ASAP and I can't wait. cheers, sanjeev. -- View this message in context: http://www.nabble.com/implement-thai-lanaguage-analyzer-in-nutch-tf2587282.html#a7236321 Sent from the Nutch - Dev mailing list archive at Nabble.com. -- View this message

Re: implement thai lanaguage analyzer in nutch

2006-11-07 Thread kauu
I think you should learn JavaCC, then understand the analasis.jj; then the Thai problem will be resolved soon. Just try it. On 11/7/06, sanjeev [EMAIL PROTECTED] wrote: Hello, After playing around with Nutch for a few months I was trying to implement the Thai language analyzer for Nutch

Re: implement thai lanaguage analyzer in nutch

2006-11-07 Thread sanjeev
/implement-thai-lanaguage-analyzer-in-nutch-tf2587282.html#a7232439 Sent from the Nutch - Dev mailing list archive at Nabble.com.

Re: implement thai lanaguage analyzer in nutch

2006-11-07 Thread Arun Kaundal
I need more information please. Thanks a bunch. -- View this message in context: http://www.nabble.com/implement-thai-lanaguage-analyzer-in-nutch-tf2587282.html#a7232439 Sent from the Nutch - Dev mailing list archive at Nabble.com.