[ 
https://issues.apache.org/jira/browse/SOLR-4265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13544517#comment-13544517
 ] 

Uwe Schindler commented on SOLR-4265:
-------------------------------------

I quickly deployed solr.war from the unpatched and patched build to Tomcat 
6.0.36 (which one is known to have problems). I only changed the port number in 
Tomcat to 8983 to make my test script working:

- On Tomcat with unpatched Solr test_utf8.sh fails:
{noformat}
bash-4.1$ ./test_utf8.sh
Solr server is up.
ERROR: HTTP GET is not accepting UTF-8
HTTP POST is accepting UTF-8
HTTP POST does not default to UTF-8
ERROR: HTTP GET is not accepting UTF-8 beyond the basic multilingual plane
HTTP POST is accepting UTF-8 beyond the basic multilingual plane
ERROR: HTTP POST + URL params is not accepting UTF-8 beyond the basic 
multilingual plane
{noformat}

- With this patch all is fine:
{noformat}
bash-4.1$ ./test_utf8.sh
Solr server is up.
HTTP GET is accepting UTF-8
HTTP POST is accepting UTF-8
HTTP POST defaults to UTF-8
HTTP GET is accepting UTF-8 beyond the basic multilingual plane
HTTP POST is accepting UTF-8 beyond the basic multilingual plane
HTTP POST + URL params is accepting UTF-8 beyond the basic multilingual plane
{noformat}

This is identical to Jetty. So we have a fix and no reconfiguration is needed 
in Tomcat anymore!

Please note: The last test (ERROR: Response can't return UTF-8 beyond the basic 
multilingual plane) fails for me on cygwin with any servlet container, but 
passes on linux with UTF-8; but this is caused by the shell script which does 
not work on windows default locale (windows-1252).
                
> Encoding problem from test console
> ----------------------------------
>
>                 Key: SOLR-4265
>                 URL: https://issues.apache.org/jira/browse/SOLR-4265
>             Project: Solr
>          Issue Type: Bug
>          Components: web gui
>    Affects Versions: 4.0
>         Environment: Windows but, environment independent
>            Reporter: Alex Rocher
>            Assignee: Uwe Schindler
>         Attachments: SOLR-4265.patch, SOLR-4265.patch, 
> SolrDispatchFilter.java.patch
>
>
> When you type an accent (in french language for example) in the console query 
> tester, there's no charset conversion (servlet request charset conversion)
> Eg.: "même" is converted into it's ISO-8859-1 representation ==> fail
> The reason : getCharacterEncoding from HTTPRequest is not tested. Il it's 
> null, il will assume to convert an UTF-8 encoding charset.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to