Re: wana use CJKAnalyzer

2006-09-25 Thread James liu
2006/9/25, Walter Underwood <[EMAIL PROTECTED]>: This document has two problems. First, the document is not well-formed XML. Open it in Firefox and you will see this error: XML Parsing Error: mismatched tag. Expected: . Location: file:///Users/wunderwood/Desktop/jl.xml Line Number 15,

Re: wana use CJKAnalyzer

2006-09-25 Thread Walter Underwood
This document has two problems. First, the document is not well-formed XML. Open it in Firefox and you will see this error: XML Parsing Error: mismatched tag. Expected: . Location: file:///Users/wunderwood/Desktop/jl.xml Line Number 15, Column 3: After I fix that, it still is not legal

Re: wana use CJKAnalyzer

2006-09-22 Thread James liu
2006/9/23, Walter Underwood <[EMAIL PROTECTED]>: On 9/21/06 5:37 PM, "James liu" <[EMAIL PROTECTED]> wrote:> Yes,it working. the root of my problem is xml muse be encoded by utf-8.> if use php,it not about www browser. just notice that > curl header information must be utf-8.> if use post.sh,xml mu

Re: wana use CJKAnalyzer

2006-09-22 Thread Walter Underwood
On 9/22/06 10:22 AM, "Yonik Seeley" <[EMAIL PROTECTED]> wrote: > What I think might be ideal: If there is a charset definition, then > let the servlet handle it by requesting a Writer. If there isn't > a charset definition, request a byte-oriented InputStream from the > container and let the XML

Re: wana use CJKAnalyzer

2006-09-22 Thread Yonik Seeley
On 9/22/06, Walter Underwood <[EMAIL PROTECTED]> wrote: This might be a Solr bug. Solr should be able to accept XML in any of the required encodings (ASCII, Latin 1, UTF-8, and UTF-16). Getting XML content types exactly right is tricky, see RFC 3023. Right now Solr pays attention to Content-typ

Re: wana use CJKAnalyzer

2006-09-22 Thread Walter Underwood
On 9/21/06 5:37 PM, "James liu" <[EMAIL PROTECTED]> wrote: > Yes,it working. the root of my problem is xml muse be encoded by utf-8. > if use php,it not about www browser. just notice that > curl header information must be utf-8. > if use post.sh,xml muse be encoded by utf-8.(my editplus default e

Re: wana use CJKAnalyzer

2006-09-21 Thread James liu
2006/9/22, Chris Hostetter <[EMAIL PROTECTED]>: : i just wanna say: no your help,maybe i will give up.thk u again. : : http://www.flickr.com/photos/[EMAIL PROTECTED]/248815068/ : > thk Hoss,Nick Snels,Koji,Mike and everybody who helped me and wanna help : > me.. : > : > i can use solr wit

Re: wana use CJKAnalyzer

2006-09-21 Thread Yonik Seeley
On 9/21/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: : i just wanna say: no your help,maybe i will give up.thk u again. : : http://www.flickr.com/photos/[EMAIL PROTECTED]/248815068/ : > thk Hoss,Nick Snels,Koji,Mike and everybody who helped me and wanna help : > me.. : > : > i can use so

Re: wana use CJKAnalyzer

2006-09-21 Thread Chris Hostetter
: i just wanna say: no your help,maybe i will give up.thk u again. : : http://www.flickr.com/photos/[EMAIL PROTECTED]/248815068/ : > thk Hoss,Nick Snels,Koji,Mike and everybody who helped me and wanna help : > me.. : > : > i can use solr with Chinese Word. I'm sorry, i'm really confused now

Re: wana use CJKAnalyzer

2006-09-20 Thread James liu
i just wanna say: no your help,maybe i will give up.thk u again. http://www.flickr.com/photos/[EMAIL PROTECTED]/248815068/ 2006/9/21, James liu <[EMAIL PROTECTED]>: thk Hoss,Nick Snels,Koji,Mike and everybody who helped me and wanna help me.. i can use solr with Chinese Word. --

Re: wana use CJKAnalyzer

2006-09-20 Thread James liu
thk Hoss,Nick Snels,Koji,Mike and everybody who helped me and wanna help me.. i can use solr with Chinese Word.

Re: wana use CJKAnalyzer

2006-09-20 Thread James liu
i recompile it. when i ant dist...cmd shows some api is old and uncheck... Is it problem? my java version you can find http://www.flickr.com/photos/[EMAIL PROTECTED]/?saved=1 2006/9/21, James liu <[EMAIL PROTECTED]>: i dont know it is import i add junit when i use ant dist,,,it show

Re: wana use CJKAnalyzer

2006-09-20 Thread James liu
i dont know it is import i add junit when i use ant dist,,,it show me error information : not found junit,,,so i download and add it. Is it problem about CJKAnalyzer?

Re: wana use CJKAnalyzer

2006-09-20 Thread James liu
i use lukeall.jar to check indexdata. u can find picture from http://www.flickr.com/photos/[EMAIL PROTECTED]/?saved=1 solr.jpg is i use lukeall.jar to check solr's index data. lucene.jpg is i use lukeall.jar to check lucene's index data. now i use lucene is ok.

Re: wana use CJKAnalyzer

2006-09-20 Thread James liu
sorry,,it is wrong... my schema.xml id content 在06-9-21,James liu <[EMAIL PROTECTED]> 写道: to mike: " Are you testing the same field to which you are adding the analyzer? I noticed in another mail that you added this to the "text_lu" field type--the so

Re: wana use CJKAnalyzer

2006-09-20 Thread James liu
Hoss, thk for ur help 2006/9/21, Chris Hostetter <[EMAIL PROTECTED]>: : 6:start jetty,java -jar start.jar : 7:no error. : 8: http://localhost:8983/solr/admin,,,i click analyzer link,,,and try : test analyzer chinese word,but nothing happend. ...i don't know much about non latin characters bu

Re: wana use CJKAnalyzer

2006-09-20 Thread James liu
to mike: " Are you testing the same field to which you are adding the analyzer? I noticed in another mail that you added this to the "text_lu" field type--the solr example uses "text", as I recall." now my schema.xml: 111 姓名是刘平 112 姓名是小王 113 老婆不在家 but i m f

Re: wana use CJKAnalyzer

2006-09-20 Thread Chris Hostetter
: you find index data from my attachements. its name is solr.jpg and lucene : breaking well, its name is lucene.jpg FYI: the mailing list only allows text attachments, if you want to refer to images you have to send a URL to an image online somewhere instead. -Hoss

Re: wana use CJKAnalyzer

2006-09-20 Thread Chris Hostetter
: 6:start jetty,java -jar start.jar : 7:no error. : 8: http://localhost:8983/solr/admin,,,i click analyzer link,,,and try : test analyzer chinese word,but nothing happend. ...i don't know much about non latin characters but i tried making the same changes you did, and asked a coworker who speaks

Re: wana use CJKAnalyzer

2006-09-20 Thread Mike Klaas
On 9/19/06, James liu <[EMAIL PROTECTED]> wrote: 4:modify schema(conf/schema.conf), like yours,,just "" Are you testing the same field to which you are adding the analyzer? I noticed in another mail that you added this to the "text_lu" field type--the solr example uses "text", as I recall. -M

Re: wana use CJKAnalyzer

2006-09-20 Thread James liu
attachements:  schema.xml2006/9/20, James liu <[EMAIL PROTECTED]>: i m java newer. so i print these steps.solr tutorial i test is ok.anything you wanna know, mail me.-- regardsjl id content

Re: wana use CJKAnalyzer

2006-09-20 Thread James liu
i m java newer. so i print these steps. solr tutorial i test is ok. anything you wanna know, mail me.

Re: wana use CJKAnalyzer

2006-09-20 Thread James liu
2006/9/20, Yonik Seeley <[EMAIL PROTECTED]>: On 9/20/06, James liu <[EMAIL PROTECTED]> wrote: > My step to support CJK...: > 1:add lucene-analyzers-2.0.0.jar to > "C:\cygwin\tmp\solr-nightly\lib" > 2:use cmd, "cd C:\cygwin\tmp\solr-nightly","ant dist" > 3:copy "C:\cygwin\tmp\solr-nightly\dist\so

Re: wana use CJKAnalyzer

2006-09-20 Thread Yonik Seeley
On 9/20/06, James liu <[EMAIL PROTECTED]> wrote: My step to support CJK...: 1:add lucene-analyzers-2.0.0.jar to "C:\cygwin\tmp\solr-nightly\lib" 2:use cmd, "cd C:\cygwin\tmp\solr-nightly","ant dist" 3:copy "C:\cygwin\tmp\solr-nightly\dist\solr- 1.0.war" to "C:\cygwin\tmp\solr-nightly\example\weba

wana use CJKAnalyzer

2006-09-19 Thread James liu
My step to support CJK...:1:add lucene-analyzers-2.0.0.jar to "C:\cygwin\tmp\solr-nightly\lib"2:use cmd, "cd C:\cygwin\tmp\solr-nightly","ant dist"3:copy "C:\cygwin\tmp\solr-nightly\dist\solr- 1.0.war" to "C:\cygwin\tmp\solr-nightly\example\webapps\solr.war"4:modify schema(conf/schema.conf), like