Re: character encoding issue...

2013-11-10 Thread Michael Sokolov
Thanks On Tue, Oct 29, 2013 at 9:20 PM, Chris wrote: Hi All, I get characters like - �� - CTA - in the solr index. I am adding Java beans to solr by the addBean() function. This seems to be a character encoding issue. Any pointers on how to resolve

Re: character encoding issue...

2013-11-09 Thread Chris
- � Can you please advic

Re: character encoding issue...

2013-11-05 Thread T. Kuro Kurosaka
am adding Java beans to solr by the addBean() function. This seems to be a character encoding issue. Any pointers on how to resolve this one? I have seen that this occurs mostly for japanese chinese characters. -- - T. "Kuro" Kurosaka • Senior Software Engineer

Re: character encoding issue...

2013-11-04 Thread Chris
I had faced similar issue with Chinese Characters and had resolved with the above config. Links for reference : http://zensarteam.wordpress.com/2011/11/25/6-steps-to-configure-solr-on-apache-tomcat-7-0-20/ http://blog.sidu.in/2007/05/tomcat-and-utf-8-encoded-uri-parameters.html#.Um_3P3Cw2X8 Thanks On Tue, Oct 29, 2013 at 9:20 PM, Chris wrote: Hi All, I get characters like - �� - CTA - in the solr index. I am adding Java beans to solr by the addBean() function. This seems to be a character encoding issue. Any pointers on how to resolve this one? I have seen that this occurs mostly for japanese chinese characters.

Re: character encoding issue...

2013-11-04 Thread Erick Erickson
e : http://zensarteam.wordpress.com/2011/11/25/6-steps-to-configure-solr-on-apache-tomcat-7-0-20/ http://blog.sidu.in/2007/05/tomcat-and-utf-8-encoded-uri-parameters.html#.Um_3P3Cw2X8

Re: character encoding issue...

2013-11-03 Thread Rajani Maski
/tomcat-and-utf-8-encoded-uri-parameters.html#.Um_3P3Cw2X8 Thanks On Tue, Oct 29, 2013 at 9:20 PM, Chris wrote: Hi All, I get characters like - �� - CTA - in the solr index. I am adding Java beans to solr by the addBean() function. This seems to be a character encoding issue. Any pointers on how to resolve this one? I have seen that this occurs mostly for japanese chinese characters.

Re: character encoding issue...

2013-10-31 Thread Chris
Tue, Oct 29, 2013 at 9:20 PM, Chris wrote: Hi All, I get characters like - �� - CTA - in the solr index. I am adding Java beans to solr by the addBean() function. This seems t

Re: character encoding issue...

2013-10-29 Thread Rajani Maski
ng Java beans to solr by the addBean() function. This seems to be a character encoding issue. Any pointers on how to resolve this one? I have seen that this occurs mostly for japanese chinese characters.

character encoding issue...

2013-10-29 Thread Chris
Hi All, I get characters like - �� - CTA - in the solr index. I am adding Java beans to solr by the addBean() function. This seems to be a character encoding issue. Any pointers on how to resolve this one? I have seen that this occurs mostly for japanese chinese
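
The cause in this thread is not visible in the snippets above. As a general illustration only, here is a minimal Python sketch of one common way the replacement character � (U+FFFD) ends up in stored text: bytes written in one encoding are decoded as another with lossy error handling. The choice of Shift_JIS as the "real" encoding and the helper name `decode_lossy` are assumptions made for the example, not details taken from the thread.

```python
# Sketch: how U+FFFD can appear when bytes are decoded with the wrong charset.
# Assumption: the source bytes are Shift_JIS but the reader assumes UTF-8.
REPLACEMENT = "\ufffd"  # the character displayed as a question-mark diamond


def decode_lossy(raw: bytes, assumed: str = "utf-8") -> str:
    """Decode bytes with an assumed charset, replacing undecodable bytes."""
    return raw.decode(assumed, errors="replace")


original = "日本"
raw = original.encode("shift_jis")   # bytes as produced by the source system
garbled = decode_lossy(raw)          # decoded with the wrong assumption

print(REPLACEMENT in garbled)        # the text now contains U+FFFD
print(garbled.encode("utf-8") == raw)  # the original bytes cannot be recovered
```

Once U+FFFD has been stored, the original bytes are gone, so the fix has to happen where the text is first decoded, before it reaches the index.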

[Solved] Re: VelocityResponseWriter/Solritas character encoding issue

2009-11-27 Thread Sascha Szott
Hi Erik, I've finally solved the problem. Unfortunately, the parameter v.contentType was not described in the Solr wiki (I've fixed that now). The point is, you must specify (in your solrconfig.xml) text/xml;charset=UTF-8 in order to receive correctly UTF-8 encoded HTML. That's it! B

Re: VelocityResponseWriter/Solritas character encoding issue

2009-11-18 Thread Sascha Szott
Hi Lance, Lance Norskog wrote: > What platform are you using? Windows does not use UTF-8 by default, > and this can cause subtle problems. If you can do the same thing on > other platforms (Linux, Mac) that would help narrow down the problem. My Solr server runs in a Tomcat server on a Ubuntu Linu

Re: VelocityResponseWriter/Solritas character encoding issue

2009-11-18 Thread Lance Norskog
What platform are you using? Windows does not use UTF-8 by default, and this can cause subtle problems. If you can do the same thing on other platforms (Linux, Mac) that would help narrow down the problem. On Wed, Nov 18, 2009 at 8:15 AM, Sascha Szott wrote: > Hi Erik, > > Erik Hatcher wrote: >>

Re: VelocityResponseWriter/Solritas character encoding issue

2009-11-18 Thread Sascha Szott
Hi Erik, Erik Hatcher wrote: Can you give me a test document that causes an issue? (maybe send me a Solr XML document in private e-mail). I'll see what I can do once I can see the issue first hand. Thank you! Just try the utf8-example.xml file in the exampledoc directory. After having index

Re: VelocityResponseWriter/Solritas character encoding issue

2009-11-18 Thread Erik Hatcher
Sascha, Can you give me a test document that causes an issue? (maybe send me a Solr XML document in private e-mail). I'll see what I can do once I can see the issue first hand. Erik On Nov 18, 2009, at 2:48 PM, Sascha Szott wrote: Hi, I've played around with Solr's VelocityR

VelocityResponseWriter/Solritas character encoding issue

2009-11-18 Thread Sascha Szott
Hi, I've played around with Solr's VelocityResponseWriter (which is indeed a very useful feature for rapid prototyping). I've realized that Velocity uses ISO-8859-1 as default character encoding. I've changed this setting to UTF-8 in my velocity.properties file (inside the conf directory), i.e

Re: character encoding issue

2009-11-04 Thread Jérôme Etévé
Hi, How do you post your data to solr? If it's by posting XML, then it should be properly encoded in UTF-8 (which is the XML default). Regardless of what's in the DB (which can be a mystery with MySQL). At query time, if the XML writer is used, then it's encoded in UTF-8. If the json one is used
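
A minimal Python sketch of the point about posting XML: build the update document as text, then serialize it to UTF-8 bytes explicitly, so that nothing depends on a platform default encoding. The `<add><doc><field name="...">` shape is Solr's XML update format; the field name `title`, the sample value, and the helper `build_add_xml` are hypothetical and only for illustration.

```python
# Sketch: serialize a Solr XML update message as UTF-8 bytes.
# The field names and values are illustrative assumptions.
import xml.etree.ElementTree as ET


def build_add_xml(doc_id: str, title: str) -> bytes:
    """Return an <add> message encoded as UTF-8 bytes."""
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    for name, value in (("id", doc_id), ("title", title)):
        field = ET.SubElement(doc, "field", name=name)
        field.text = value
    # Explicit encoding: the result is bytes, not str.
    return ET.tostring(add, encoding="utf-8")


payload = build_add_xml("1", "Café – 日本語")
print(type(payload).__name__)
```

The resulting bytes would then be sent as the request body, ideally with a Content-Type header that names the same charset, so that the server does not have to guess.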

Re: character encoding issue

2009-11-04 Thread Jonathan Hendler
Hi Peter, I have the same set of issues and will look for a response here. Sometimes those other chars can be created at the time of input (like extraction from a Microsoft Office doc by a third-party tool, for example). But MySQL looking OK in the browser might be because the encoding of MyS

character encoding issue

2009-11-04 Thread Peter Hedlund
I'm having a problem with character encoding. The data that I'm indexing with SOLR is being pulled from a MySQL database and then the index is being integrated into a PHP application. When I display the text from the SOLR index it's full of strange characters (–, é, etc...). However, when
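
The characters quoted above (–, é) match the pattern produced when UTF-8 bytes are read as a single-byte Western encoding. The following Python sketch reproduces that pattern and reverses it; it assumes the wrong encoding is Windows-1252 (`cp1252`), which is a guess, since the snippet does not say where in the MySQL, Solr, and PHP chain the misreading happens. The helper names `mojibake` and `repair` are made up for the example.

```python
# Sketch: reproduce and undo "UTF-8 read as Windows-1252" mojibake.
# Assumption: cp1252 is the encoding being wrongly applied.

def mojibake(text: str, wrong: str = "cp1252") -> str:
    """Encode as UTF-8, then decode with the wrong single-byte charset."""
    return text.encode("utf-8").decode(wrong)


def repair(garbled: str, wrong: str = "cp1252") -> str:
    """Reverse the mistake: recover the bytes, then decode them as UTF-8."""
    return garbled.encode(wrong).decode("utf-8")


for sample in ("–", "é"):
    broken = mojibake(sample)
    print(sample, "->", broken, "->", repair(broken))
```

Unlike text containing U+FFFD, this kind of damage is usually reversible, because no bytes were discarded. The more durable fix is still to make every hop (database connection, indexing client, page output) agree on UTF-8.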