Re: searching using the CJKAnalyzer

2004-10-10 Thread Daan Hoogland
back to entities in the results, but I'm not sure what the criteria should be. It seems to be just binary data. How to conclude that 0Š4?0†3¨¦?0„4 means ÓÐÒ°? Thanks Che Dong Daan Hoogland wrote: LS, in http://issues.apache.org/eyebrowse/ReadMsg?listId=30msgNo=8980 Jon Schuster

Re: searching using the CJKAnalyzer

2004-10-12 Thread Daan Hoogland
(?, LATIN_1_SUPPLEMENT) token found : length: 1 char(?, LATIN_1_SUPPLEMENT) char(, LATIN_1_SUPPLEMENT) char(, LATIN_1_SUPPLEMENT) token found : length: 1 char(, LATIN_1_SUPPLEMENT) searching contents: This was a query for two japanese characters. -Original Message- From: Daan Hoogland

different analyzer all produce the same index?

2004-10-04 Thread Daan Hoogland
H all, I try to create different indices using different Analyzer-classes. I tried standard, german, russian, and cjk. They all produce exactly the same index file (md5-wise). There are over 280 pages so I expected at least some differences. Any ideas anyone? -- The information contained

indexing numeric entities?

2004-10-07 Thread Daan Hoogland
Hello, Does anyone do indexeing of numeric entities for japanese characters? I have (non-x)html containing those entities and need to index and search them. -- The information contained in this communication and any attachments is confidential and may be privileged, and is for the sole use

Re: indexing numeric entities?

2004-10-07 Thread Daan Hoogland
Daan Hoogland wrote: Daan Hoogland wrote: Hello, Does anyone do indexeing of numeric entities for japanese characters? I have (non-x)html containing those entities and need to index and search them. Can the CJKAnalyzer index a string like #9679;#20837;#31038;? It seems

Re: indexing numeric entities?

2004-10-07 Thread Daan Hoogland
maybe inline? html xmlns:xsi=http://www.w3.org/2001/XMLSchema-instance; head titlejapan/title /head body bgcolor=#FF alink=black p #12501;#12451;#12540;#12523;#12489;#12469;#12540;#12499;#12473;#12456;#12531;#12472;#12491;#12450; /p /html Indexing the above document using the

searching using the CJKAnalyzer

2004-10-08 Thread Daan Hoogland
LS, in http://issues.apache.org/eyebrowse/ReadMsg?listId=30msgNo=8980 Jon Schuster explains how to get a Japanese search system working. I followed his advice and got a index that luke shows as what I expected it to be. I don't know how to enter a search so that it gets passed to the engine

Re: IndexWriter failure leaves lock in place

2005-01-10 Thread Daan Hoogland
Joseph (and others), I'm not an expert on lucene either. Your mail just rang a bell and I thought I'd contribute the ring for any expert to use. I have found stale locks on a system running on solaris/iplanet with the FSDirectory. The same code does not pose a problem on a