Adding data as UTF-8

2007-03-10 Thread Morten Fangel
Hi, I've been working on adding some Solr-integration into my current project, but have run into a problem with non-ascii characters. I send a document like the following: --- ?xml version=1.0 encoding=UTF-8? adddoc field name=question_id228/field field name=question_titleVedhæft billede

Re: Adding data as UTF-8

2007-03-10 Thread Morten Fangel
On Saturday 10 March 2007 21:39, Bertrand Delacretaz wrote: On 3/10/07, Morten Fangel [EMAIL PROTECTED] wrote: ...I send a document like the following: --- ?xml version=1.0 encoding=UTF-8?... I assume you're using your own code to send the document? Indeed. Solr will be integrated

Re: Adding data as UTF-8

2007-03-10 Thread Walter Underwood
It is better to use application/xml. See RFC 3023. Using text/xml; charset=UTF-8 will override the XML encoding declaration. application/xml will not. wunder On 3/10/07 12:39 PM, Bertrand Delacretaz [EMAIL PROTECTED] wrote: On 3/10/07, Morten Fangel [EMAIL PROTECTED] wrote: ...I send a

Re: Adding data as UTF-8

2007-03-10 Thread Morten Fangel
On Saturday 10 March 2007 22:18, Walter Underwood wrote: It is better to use application/xml. See RFC 3023. Using text/xml; charset=UTF-8 will override the XML encoding declaration. application/xml will not. Thanks for the info. I've changed the header accordingly. -fangel

Re: Adding data as UTF-8

2007-03-10 Thread Bertrand Delacretaz
On 3/10/07, Walter Underwood [EMAIL PROTECTED] wrote: It is better to use application/xml. See RFC 3023. Using text/xml; charset=UTF-8 will override the XML encoding declaration. application/xml will not... I agree, but did you try this with our example setup, started with java -jar start.jar?

Re: Adding data as UTF-8

2007-03-10 Thread Walter Underwood
If it does something different, that is a bug. RFC 3023 is clear. --wunder On 3/10/07 1:49 PM, Bertrand Delacretaz [EMAIL PROTECTED] wrote: On 3/10/07, Walter Underwood [EMAIL PROTECTED] wrote: It is better to use application/xml. See RFC 3023. Using text/xml; charset=UTF-8 will override the

Re: Adding data as UTF-8

2007-03-10 Thread Bertrand Delacretaz
On 3/10/07, Walter Underwood [EMAIL PROTECTED] wrote: If it does something different, that is a bug. RFC 3023 is clear. --wunder.. Sure - just wanted to confirm what I'm seeing, thanks! -Bertrand