[ 
https://issues.apache.org/jira/browse/SOLR-412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12780284#action_12780284
 ] 

Age Jan Kuperus commented on SOLR-412:
--------------------------------------

I agree. Although I was pretty sure XSLT 2.0 was even stricter but could not 
immediately find a formal reference.
So I did some more research today and found the following confirmation in 
http://www.w3.org/TR/xslt-xquery-serialization/, which is part of XSLT 2.0:

"media-type     A string of Unicode characters specifying the media type (MIME 
content type) [RFC2046]; the charset parameter of the media type MUST NOT be 
specified explicitly in the value of the media-type parameter".

Therefore I would like you to have a look at my patch and comment on it (or 
even commit it ;-). Committing this patch would also require the patches for 
SOLR-233 and SOLR-514  to be undone (as their results are illegal in both XSLT 
1.0 and 2.0), and possibly has documentation consequences.

> XsltWriter does not output UTF-8 by default
> -------------------------------------------
>
>                 Key: SOLR-412
>                 URL: https://issues.apache.org/jira/browse/SOLR-412
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 1.2
>         Environment: Tomcat 5.5
> Linux Red Hat ES4  (2.6.9-5.ELsmp from 'uname -a')
>            Reporter: Lance Norskog
>         Attachments: diff-2009-10-22
>
>
> XsltWriter outputs XML text in ISO-8859-1 encoding by default.
> Tomcat 5.5 has URIEncoding="UTF-8" set in the <Connector> element as 
> described in the Wiki.
> This outout description in the XML: 
> <xsl:output method="xml" encoding="utf-8" />
> gives output with this header:
> HTTP/1.1 200 OK
> Server: Apache-Coyote/1.1
> Content-Type: text/xml;charset=ISO-8859-1
> Transfer-Encoding: chunked
> Date: Wed, 14 Nov 2007 17:49:11 GMT
> I had to change the <xsl:output> directive to this:
>  <xsl:output media-type="text/xml; charset=UTF-8" encoding="UTF-8"/>
> This is the root cause of SOLR-233.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to