the plugin.includes property in nutch-site.xml but nothing is happening -
the log still shows not including language identifier :-( What's wrong?
Please help.
Thanks in advance.
--
View this message in context:
http://www.nabble.com/implement-thai-lanaguage-analyzer-during-nutch-crawl-process
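
A note on the plugin.includes question above. As far as I know, Nutch treats the plugin.includes value as a regular expression and loads a plugin only when its whole id matches; anything else is logged as "not including". The sketch below shows that matching rule in isolation. The two property values are made up for illustration and are not taken from any real nutch-site.xml, and the class and method names are hypothetical.

```java
import java.util.regex.Pattern;

public class PluginIncludesCheck {

    // True only when the ENTIRE plugin id matches the plugin.includes expression.
    public static boolean isIncluded(String pluginIncludes, String pluginId) {
        return Pattern.compile(pluginIncludes).matcher(pluginId).matches();
    }

    public static void main(String[] args) {
        // Hypothetical plugin.includes values, for illustration only.
        String without = "protocol-http|urlfilter-regex|parse-(text|html)|index-basic";
        String with = without + "|language-identifier";

        System.out.println(isIncluded(without, "language-identifier")); // false
        System.out.println(isIncluded(with, "language-identifier"));    // true
        System.out.println(isIncluded(with, "parse-html"));             // true
    }
}
```

If the id in the log line still does not match after the edit, it may be worth checking that the edited nutch-site.xml is the copy actually on the classpath of the running crawl.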
-Original Message-
From: sanjeev [mailto:[EMAIL PROTECTED]
Sent: 2006-11-08 19:28
To: nutch-dev@lucene.apache.org
Subject: Re: implement thai lanaguage analyzer in nutch
I need a Thai analyzer for Nutch. I want the crawler to be intelligent enough
to split Thai words correctly since thai don't
the search term is one Unicode character.
-kuro
is the same as for any other language.
But yes, I too would appreciate any information to resolve this problem.
regards,
sanjeev.
--
View this message in context:
http://www.nabble.com/implement-thai-lanaguage-analyzer-in-nutch-tf2587282.html#a7233864
Sent from the Nutch - Dev mailing list archive at Nabble.com.
Sanjay,
I don't think you should follow the Chinese example and extend the CJK
range.
This was needed because Chinese and Japanese don't use spaces to separate
words. I believe Thai uses spaces, right? If so, you should extend the
LETTER range to include Thai characters rather than CJK.
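
To make the "Thai characters" part of that suggestion concrete: the Unicode Thai block runs from U+0E00 to U+0E7F, so that is the range a LETTER definition would need to cover. The sketch below only checks membership in that block from plain Java; it is not the grammar change itself, and the class and method names are hypothetical.

```java
public class ThaiRange {

    // True when the code point lies in the Unicode Thai block (U+0E00..U+0E7F).
    public static boolean isThai(int codePoint) {
        return Character.UnicodeBlock.of(codePoint) == Character.UnicodeBlock.THAI;
    }

    // True when at least one code point of the string is in the Thai block.
    public static boolean containsThai(String s) {
        return s.codePoints().anyMatch(ThaiRange::isThai);
    }

    public static void main(String[] args) {
        System.out.println(isThai(0x0E01));        // THAI CHARACTER KO KAI
        System.out.println(isThai('a'));           // Latin letter
        System.out.println(isThai(0x4E2D));        // a CJK ideograph
        System.out.println(containsThai("nutch \u0E44\u0E17\u0E22"));
    }
}
```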
Another place
ThaiWordFilter.java
Otis
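
On the ThaiWordFilter.java pointer: as far as I recall, that Lucene filter leans on the JDK's java.text.BreakIterator with a Thai locale to find word boundaries, which matters because Thai text is normally written without spaces between words, as the original post says. Below is a minimal sketch of that idea using only the JDK; it is not the Lucene code, the class and method names are hypothetical, and the quality of the segmentation depends on the Thai locale data shipped with the JDK in use.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class ThaiWordSplit {

    // Splits text at the word boundaries reported by the JDK's Thai BreakIterator,
    // dropping pieces that are only whitespace.
    public static List<String> words(String text) {
        BreakIterator it = BreakIterator.getWordInstance(Locale.forLanguageTag("th"));
        it.setText(text);
        List<String> out = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String piece = text.substring(start, end).trim();
            if (!piece.isEmpty()) {
                out.add(piece);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // "\u0E20\u0E32\u0E29\u0E32\u0E44\u0E17\u0E22" is the Thai for "Thai language".
        System.out.println(words("\u0E20\u0E32\u0E29\u0E32\u0E44\u0E17\u0E22 nutch"));
    }
}
```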
- Original Message
From: Teruhiko Kurosaka [EMAIL PROTECTED]
To: sanjeev [EMAIL PROTECTED]; nutch-dev@lucene.apache.org
Sent: Wednesday, November 8, 2006 2:16:38 PM
Subject: RE: implement thai lanaguage analyzer in nutch
Sanjay,
I don't think you should follow the Chinese example and extend the CJK
range.
This was needed because Chinese and Japanese don't use spaces to separate
words. I believe Thai uses spaces, right? If so, you should extend
have to get this implemented ASAP and I can't wait.
cheers,
sanjeev.
--
View this message in context:
http://www.nabble.com/implement-thai-lanaguage-analyzer-in-nutch-tf2587282.html#a7236321
Sent from the Nutch - Dev mailing list archive at Nabble.com.
I think you should learn JavaCC, then understand the analysis.jj;
then the Thai problem will be resolved soon.
Just try it.
On 11/7/06, sanjeev [EMAIL PROTECTED] wrote:
Hello,
After playing around with Nutch for a few months I was trying to implement
the Thai language analyzer for Nutch
I need more information please.
Thanks a bunch.
--
View this message in context:
http://www.nabble.com/implement-thai-lanaguage-analyzer-in-nutch-tf2587282.html#a7232439
Sent from the Nutch - Dev mailing list archive at Nabble.com.