Re: Deleting Solr index
Just run rm -r SOLR_DIR/data/index.

2008/6/18 Mihails Agafonovs [EMAIL PROTECTED]:
> How can I clear the whole Solr index?
>
> Regards,
> Mihails

--
regards
j.L
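For anyone who wants to script the rm -r step above, here is a minimal Python sketch. It assumes Solr is stopped before the directory is removed and that the index lives under data/index inside the Solr home, as in the reply; the function name clear_index is made up for illustration.

```python
import shutil
from pathlib import Path


def clear_index(solr_dir: str) -> bool:
    """Remove SOLR_DIR/data/index; return True if a directory was deleted.

    Assumption: Solr is not running while this is called.
    """
    index_dir = Path(solr_dir) / "data" / "index"
    if not index_dir.is_dir():
        # Nothing to delete (wrong path or index already cleared).
        return False
    shutil.rmtree(index_dir)
    return True
```

Calling clear_index("/path/to/solr") is meant to behave like the rm -r command, except that it reports whether anything was actually removed.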
Re: Chinese Language + Solr
On Thu, May 15, 2008 at 11:25 PM, Walter Underwood [EMAIL PROTECTED] wrote:
> I've worked with the Basis products. Solid, good support. Last time
> I talked to them, they were working on hooking them into Lucene.

I don't know the Basis products, but I know Google uses them, and in China google.cn is not better than Baidu. We always use baidu.com to search for Chinese information.

> For really good quality results from any of these, you need to add
> terms to the user dictionary of the segmenter. These may be local
> jargon, product names, personal names, place names, etc.

Yes, I agree with your point. From what I have read on the Internet, Baidu's analyzer works this way.

> Baidu has different problems than the rest of us, because their code
> has to be scary fast. They might even trade lower quality for more
> speed.

Can you say more about that? I think Baidu uses more cache servers and has an effective caching strategy.

> wunder

--
regards
j.L
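To illustrate the point about adding terms to the segmenter's user dictionary, here is a rough Python sketch using forward maximum matching. This is a generic textbook technique chosen for illustration, not how the Basis products or Baidu's analyzer actually work, and the tiny dictionaries are made-up examples.

```python
from typing import Iterable, List


def segment(text: str, dictionary: Iterable[str]) -> List[str]:
    """Greedy forward maximum matching against a word dictionary."""
    words = set(dictionary)
    max_len = max((len(w) for w in words), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, fall back to a single character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in words:
                tokens.append(piece)
                i += size
                break
    return tokens


base = {"北京", "大学"}
print(segment("北京大学生", base))               # ['北京', '大学', '生']
print(segment("北京大学生", base | {"北京大学"}))  # ['北京大学', '生']
```

Adding the single entry 北京大学 (a proper name) to the dictionary changes the segmentation, which is the kind of effect a user dictionary is meant to have.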
Anyone using Hadoop + Solr?
Can you talk about it? I may use Hadoop + Solr. Thanks for your advice.

--
regards
j.L
Re: Chinese Language + Solr
For commercial analyzers, I recommend http://www.hylanda.com/ (it is the best analyzer for Chinese).

On Thu, May 15, 2008 at 8:32 AM, j. L [EMAIL PROTECTED] wrote:
> You can try je-analyzer. I am building a search site with 17 million
> docs using Solr and je-analyzer.
>
> On Thu, May 15, 2008 at 6:44 AM, Walter Underwood [EMAIL PROTECTED] wrote:
>> N-gram works pretty well for Chinese, there are even studies to
>> back that up.
>>
>> Do not use the N-gram matches for highlighting. They look really
>> stupid to native speakers.
>>
>> wunder
>>
>> On 5/14/08 2:03 PM, Otis Gospodnetic [EMAIL PROTECTED] wrote:
>>> There are no free morphological analyzers for Chinese (are there
>>> for any language?) that I know. People tend to use one of the
>>> n-gram analyzers from Lucene contrib. I've used them before and
>>> they do OK.
>>>
>>> Otis
>>> --
>>> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>>>
>>> ----- Original Message -----
>>> From: Francisco Sanmartin [EMAIL PROTECTED]
>>> To: solr-user@lucene.apache.org
>>> Sent: Wednesday, May 14, 2008 4:54:05 PM
>>> Subject: Chinese Language + Solr
>>>
>>> I have had successful experiences using Solr with an English
>>> website, and now I am going to deploy Solr on a Chinese site.
>>> I've been looking through the mailing list and there is some
>>> useful information in the old posts. But we would like some
>>> feedback from people who have already deployed Solr in any CJK
>>> language.
>>>
>>> Is there any free and good analyzer? (Preferably morphological.)
>>> Among all the commercial analyzers, what would you recommend?
>>> Is there any of them that works OK out-of-the-box with Solr?
>>>
>>> Thanks in advance.
>>>
>>> Pako
>
> --
> regards
> j.L

--
regards
j.L
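As a rough picture of what the n-gram approach mentioned in this thread does, here is a small Python sketch that splits a run of Chinese text into overlapping two-character tokens. It is a simplified illustration of the idea only, not the code of the Lucene contrib analyzers; the function name cjk_bigrams is made up.

```python
from typing import List


def cjk_bigrams(text: str) -> List[str]:
    """Split text into overlapping two-character tokens (bigrams)."""
    chars = [c for c in text if not c.isspace()]
    if len(chars) < 2:
        # A single character (or nothing) cannot form a bigram.
        return ["".join(chars)] if chars else []
    return [chars[i] + chars[i + 1] for i in range(len(chars) - 1)]


print(cjk_bigrams("中文搜索"))  # ['中文', '文搜', '搜索']
```

The middle token 文搜 is not a real word, which hints at why highlighting n-gram matches can look odd to native speakers.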
Re: Chinese Language + Solr
I don't know the cost. I know the bigger Chinese search sites use it. Many Chinese people who study and use full-text search think it is the best Chinese analyzer you can buy.

Baidu (www.baidu.com) is the biggest Chinese search engine, and Google China is No. 2. Baidu does not use it (http://www.hylanda.com/); they use their own Chinese analyzer.

On Thu, May 15, 2008 at 8:45 AM, Otis Gospodnetic [EMAIL PROTECTED] wrote:
> Out of curiosity, what's the cost (the site is in Chinese, so I
> can't tell :( )? BasisTech are the main people for this type of
> stuff. Expensive, though, I believe.
>
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

--
regards
j.L
Re: question about fl=score
2008/3/20 李银松 [EMAIL PROTECTED]:
> 1. When I set fl=score, Solr returns the same as fl=*,score, not
> just the scores. Is it a bug, or is it done on purpose?

You can set fl=id,score. Solr does not support fl=score on its own.

> My customer wants to get the 1st to 10010th added docs, so I have
> to sort by timestamp to get the top 10010 docs' timestamps ……

limit 1, 10010 order by timestamp?

--
regards
j.L
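One way to express the two suggestions above (fl=id,score and "limit ... order by timestamp") as a Solr request is sketched below in Python. The host, port and /select path, the match-all query *:*, and the field names id and timestamp are assumptions for illustration; whether the sort parameter is accepted in this form depends on the Solr version in use.

```python
from urllib.parse import urlencode


def build_select_url(base_url: str, rows: int, start: int = 0) -> str:
    """Build a select URL returning only id and score, oldest docs first."""
    params = {
        "q": "*:*",                # assumed match-all query
        "fl": "id,score",          # id plus score, as suggested in the reply
        "sort": "timestamp asc",   # assumes an indexed field named timestamp
        "start": start,            # offset of the first doc to return
        "rows": rows,              # number of docs to return
    }
    return base_url + "?" + urlencode(params)


# Hypothetical local Solr instance; adjust to the real host and path.
print(build_select_url("http://localhost:8983/solr/select", rows=10010))
```

With start=0 and rows=10010 this asks for the first 10010 documents in timestamp order, roughly the Solr counterpart of the SQL-style limit in the reply.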
I think it is time to release a new Solr version
because Lucene 2.3.0 was released today.

--
regards
j.L