RE: Tokenizer and Filter Factory to index Chinese characters

2015-07-07 Thread Markus Jelsma
Yes, but it is a small change :) M. -Original message- From:Zheng Lin Edwin Yeo edwinye...@gmail.com Sent: Tuesday 7th July 2015 4:50 To: solr-user@lucene.apache.org Subject: Re: Tokenizer and Filter Factory to index Chinese characters So we have to recompile the analysers

Re: Tokenizer and Filter Factory to index Chinese characters

2015-07-06 Thread Zheng Lin Edwin Yeo
...@gmail.com Sent: Thursday 25th June 2015 11:38 To: solr-user@lucene.apache.org Subject: Re: Tokenizer and Filter Factory to index Chinese characters Hi, The result doesn't seem that good either. But you're not using the HMMChineseTokenizerFactory? The output below is from

RE: Tokenizer and Filter Factory to index Chinese characters

2015-07-06 Thread Markus Jelsma
-Original message- From: Zheng Lin Edwin Yeo edwinye...@gmail.com Sent: Thursday 25th June 2015 11:38 To: solr-user@lucene.apache.org Subject: Re: Tokenizer and Filter Factory to index Chinese characters Hi, The result doesn't seem that good either

Re: Tokenizer and Filter Factory to index Chinese characters

2015-07-06 Thread davidphilip cherian
=solr.CJKBigramFilterFactory/ -Original message- From:Zheng Lin Edwin Yeo edwinye...@gmail.com Sent: Thursday 25th June 2015 11:24 To: solr-user@lucene.apache.org Subject: Re: Tokenizer and Filter Factory to index Chinese characters Thank you. I've tried

Re: Tokenizer and Filter Factory to index Chinese characters

2015-07-06 Thread Zheng Lin Edwin Yeo
- From:Zheng Lin Edwin Yeo edwinye...@gmail.com Sent: Monday 6th July 2015 12:31 To: solr-user@lucene.apache.org Subject: Re: Tokenizer and Filter Factory to index Chinese characters Yes, I tried that also, but I faced some compatibility issues with Solr 5.2.1, as the libs that I found

Re: Tokenizer and Filter Factory to index Chinese characters

2015-07-06 Thread Zheng Lin Edwin Yeo
have enough time to spend: https://github.com/cslinmiso/paoding-analysis -Original message- From:Zheng Lin Edwin Yeo edwinye...@gmail.com Sent: Thursday 25th June 2015 11:38 To: solr-user@lucene.apache.org Subject: Re: Tokenizer and Filter Factory to index Chinese characters Hi

RE: Tokenizer and Filter Factory to index Chinese characters

2015-06-25 Thread Markus Jelsma
Sent: Thursday 25th June 2015 11:24 To: solr-user@lucene.apache.org Subject: Re: Tokenizer and Filter Factory to index Chinese characters Thank you. I've tried that, but when I do a search, it's returning many more highlighted results than it is supposed to. For example

Tokenizer and Filter Factory to index Chinese characters

2015-06-25 Thread Zheng Lin Edwin Yeo
Hi, Does anyone know the correct replacement for these two tokenizer and filter factories for indexing Chinese in Solr? - SmartChineseSentenceTokenizerFactory - SmartChineseWordTokenFilterFactory I understand that these two factories are already deprecated in Solr 5.1, but I

Re: Tokenizer and Filter Factory to index Chinese characters

2015-06-25 Thread Zheng Lin Edwin Yeo
: Thursday 25th June 2015 11:02 To: solr-user@lucene.apache.org Subject: Tokenizer and Filter Factory to index Chinese characters Hi, Does anyone know the correct replacement for these two tokenizer and filter factories for indexing Chinese in Solr

RE: Tokenizer and Filter Factory to index Chinese characters

2015-06-25 Thread Markus Jelsma
=solr.CJKBigramFilterFactory/ -Original message- From:Zheng Lin Edwin Yeo edwinye...@gmail.com Sent: Thursday 25th June 2015 11:24 To: solr-user@lucene.apache.org Subject: Re: Tokenizer and Filter Factory to index Chinese characters Thank you. I've tried that, but when I do a search, it's

RE: Tokenizer and Filter Factory to index Chinese characters

2015-06-25 Thread Markus Jelsma
To: solr-user@lucene.apache.org Subject: Tokenizer and Filter Factory to index Chinese characters Hi, Does anyone know the correct replacement for these two tokenizer and filter factories for indexing Chinese in Solr? - SmartChineseSentenceTokenizerFactory - SmartChineseWordTokenFilterFactory

Re: Tokenizer and Filter Factory to index Chinese characters

2015-06-25 Thread Zheng Lin Edwin Yeo
: Tokenizer and Filter Factory to index Chinese characters Thank you. I've tried that, but when I do a search, it's returning many more highlighted results than it is supposed to. For example, if I enter the following query: http://localhost:8983/solr/chinese1/highlight?q=我国 I get
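
For readers of this thread: the snippets above name two analysis options, HMMChineseTokenizerFactory and CJKBigramFilterFactory, but the full configurations are cut off in the archive. The fragment below is only a rough sketch of how each might be declared in a Solr schema. The field type names (text_zh_hmm, text_cjk_bigram) are made up for illustration, the filter chains are assumptions rather than what the posters actually used, and the HMM tokenizer assumes the smartcn analysis-extras jars are on the classpath.

```xml
<!-- Sketch only. Field type names are hypothetical; filter chains are assumed. -->

<!-- Option 1: word segmentation with the HMM-based tokenizer, which replaces the
     deprecated SmartChineseSentenceTokenizerFactory and
     SmartChineseWordTokenFilterFactory pair. -->
<fieldType name="text_zh_hmm" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.HMMChineseTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

<!-- Option 2: overlapping two-character tokens built from the standard tokenizer. -->
<fieldType name="text_cjk_bigram" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.CJKWidthFilterFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.CJKBigramFilterFactory"/>
  </analyzer>
</fieldType>
```

Either field type can be compared on the Analysis screen of the Solr admin UI by entering the same Chinese text for both and inspecting the tokens each chain produces.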