This Luke tool seems to be pretty cool. I have installed and its very easy to
find out the indexes and what is being stored. thanks for this info.

I have tried in tomcat and things works fine without issues. Default
operator is OR in my case. i havent tried with setting up stanalone
indexsearcher but i believe it should work. Please let me know if any
issues.


叶双明 wrote:
> 
> And, you can use Tool luke to see what is in the index indeed.
> what is in the Query which put into IndexSearcher.search(), what is the
> defaultOperatoer of QueryParser.
> 
> Can you get hits by setup a simple IndexSearcher, no through  tomcat?
> 
> 2008/9/18 anandsarwade <[EMAIL PROTECTED]>
> 
>>
>> Hi,
>>
>> I do get the same string from Mysql and also in servlet request. I could
>> observe the actaul string in eclipse while debugging. it is stored as
>> UTF-8
>> format so retrievel is coming as stored.
>>
>> plz let me know if iam not clear
>>
>>
>> 叶双明 wrote:
>> >
>> > You must trace the string  in each step!
>> > Important step is get string from MYSQL and get parameter in servlet,
>> > please
>> > check it, do you get the right string?
>> > Chinese has the same problem too.
>> >
>> > 2008/9/17 anandsarwade <[EMAIL PROTECTED]>
>> >
>> >>
>> >> Hello Jimi,
>> >>
>> >> Thanks a lot for your valuable suggestion.
>> >>
>> >> I am using tomcat 5 . As per your suggestions ,checked the server.xml
>> but
>> >> found that no URIEncoding was set.
>> >> I have set now and to my great relief :-) i could see the Lucene
>> results
>> >> on
>> >> my browser for japanese string with request objects in UTF-8 now.
>> >>
>> >> Thanks again for your help.
>> >>
>> >> Regards,
>> >> Anand.
>> >>
>> >>
>> >> JimiH wrote:
>> >> >
>> >> > What webserver are you using? For example, with Tomcat, it could be
>> >> > because of the setting URIEncoding in server.xml.
>> >> >
>> >> > http://tomcat.apache.org/tomcat-5.5-doc/config/http.html
>> >> >
>> >> > /Jimi
>> >> >
>> >> > mogul | jimi hullegård | system developer | hudiksvallsgatan 4, 113
>> 30
>> >> > stockholm sweden | +46 8 506 66 172 | +46 765 27 19 55 |
>> >> > [EMAIL PROTECTED] | www.mogul.com
>> >> >
>> >> >
>> >> >> -----Original Message-----
>> >> >> From: anandsarwade [mailto:[EMAIL PROTECTED]
>> >> >> Sent: den 17 september 2008 16:42
>> >> >> To: java-user@lucene.apache.org
>> >> >> Subject: Lucene search fails for japanese characters in URL
>> >> >>
>> >> >>
>> >> >> Hi ,
>> >> >>
>> >> >> I am facing below problem. Please help me in this.
>> >> >>
>> >> >> I have integrated CJK Analyzer for Japanese characters. I am
>> >> >> able to save
>> >> >> japanese double byte characters in mysql database in UTF-8
>> >> >> format without
>> >> >> issues. I could that data is getted indexed. Now when i
>> >> >> search the Japanese
>> >> >> characters which were indexed using the URL below , returns
>> >> >> empty results.
>> >> >>
>> >> >> http://xml.demo.myaol.jp:8082/portal/gallery-search?first=1&ma
>> >> >> x=100&cap=言語
>> >> >>
>> >> >> Noticed that the above url gets converted to the following
>> >> >> URL having some
>> >> >> HTML encoded strings in search.
>> >> >>
>> >> >> http://xml.demo.myaol.jp:8082/portal/gallery-search?first=1&ma
>> >> >> x=100&cap=%E8%A8%80%E8%AA%9E
>> >> >>
>> >> >> This does not match with the existing lucene indexes
>> >> >> henceforth returns
>> >> >> empty results.  How do i solve this lucene search issue
>> >> >> having japanese
>> >> >> words in URLs.? Is there any way to convert such characters
>> >> >> back to Japanese
>> >> >> words???
>> >> >>
>> >> >> Any help/suggestions in this regards is highly appreciated.
>> >> >>
>> >> >> Thanks in Advance.
>> >> >>
>> >> >> Regards,
>> >> >> Anand
>> >> >>
>> >> >> --
>> >> >> View this message in context:
>> >> >> http://www.nabble.com/Lucene-search-fails-for-japanese-charact
>> >> >> ers-in-URL-tp19533647p19533647.html
>> >> >> Sent from the Lucene - Java Users mailing list archive at
>> Nabble.com.
>> >> >>
>> >> >>
>> >> >>
>> ---------------------------------------------------------------------
>> >> >> To unsubscribe, e-mail: [EMAIL PROTECTED]
>> >> >> For additional commands, e-mail: [EMAIL PROTECTED]
>> >> >>
>> >> >>
>> >> >
>> >> >
>> >>
>> >> --
>> >> View this message in context:
>> >>
>> http://www.nabble.com/Lucene-search-fails-for-japanese-characters-in-URL-tp19533647p19534342.html
>> >> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>> >>
>> >>
>> >> ---------------------------------------------------------------------
>> >> To unsubscribe, e-mail: [EMAIL PROTECTED]
>> >> For additional commands, e-mail: [EMAIL PROTECTED]
>> >>
>> >>
>> >
>> >
>> > --
>> > Sorry for my english!! 明
>> > Please help me to correct my english expression and error in syntax
>> >
>> >
>>
>> --
>> View this message in context:
>> http://www.nabble.com/Lucene-search-fails-for-japanese-characters-in-URL-tp19533647p19547081.html
>> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [EMAIL PROTECTED]
>> For additional commands, e-mail: [EMAIL PROTECTED]
>>
>>
> 
> 
> -- 
> Sorry for my english!! 明
> Please help me to correct my english expression and error in syntax
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Lucene-search-fails-for-japanese-characters-in-URL-tp19533647p19549854.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to