Add some tolerance to BuilderUtil.validateCharSetEncoding()
-----------------------------------------------------------
                 Key: AXIS2-4386
                 URL: https://issues.apache.org/jira/browse/AXIS2-4386
             Project: Axis 2.0 (Axis2)
          Issue Type: Improvement
          Components: client-api
    Affects Versions: 1.4.1
         Environment: IBM WebSphere Application Server 6.0.2.29 / IBM JRE 1.4.2 SR11
            Reporter: Christian Gosch

I am currently trying to use a given web service that returns responses with inconsistent encoding declarations: the HTTP header claims an ISO-8859-1 response, but the XML content itself claims to be UTF-8. As far as I can see, this causes my Axis2 1.4.1 client (generated based on xmlbeans 2.3) to throw the following exception:

org.apache.axis2.AxisFault: Character Set Encoding from transport information [ISO-8859-1] does not match with character set encoding in the received SOAP message [UTF-8]
	at org.apache.axis2.builder.BuilderUtil.validateCharSetEncoding(BuilderUtil.java:786)
	at org.apache.axis2.builder.SOAPBuilder.processDocument(SOAPBuilder.java:57)
	at org.apache.axis2.transport.TransportUtils.createDocumentElement(TransportUtils.java:164)
	at org.apache.axis2.transport.TransportUtils.createSOAPMessage(TransportUtils.java:112)
	at org.apache.axis2.transport.TransportUtils.createSOAPMessage(TransportUtils.java:88)
	at org.apache.axis2.description.OutInAxisOperationClient.handleResponse(OutInAxisOperation.java:353)
	at org.apache.axis2.description.OutInAxisOperationClient.send(OutInAxisOperation.java:416)
	at org.apache.axis2.description.OutInAxisOperationClient.executeImpl(OutInAxisOperation.java:228)
	at org.apache.axis2.client.OperationClient.execute(OperationClient.java:163)
	[...]
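
To make the mismatch concrete, here is a minimal standalone sketch of the kind of comparison that leads to this fault. It is not the Axis2 implementation: the class `EncodingCheckSketch`, its methods, and the `tolerant` flag are all hypothetical names chosen for illustration. The sketch reads the charset from a Content-Type value and the encoding from the XML declaration, and either rejects a mismatch or, in tolerant mode, simply prefers the XML declaration.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; none of these names exist in Axis2.
public class EncodingCheckSketch {

    // Matches the charset parameter of a Content-Type value, quoted or not.
    private static final Pattern CHARSET_PARAM =
            Pattern.compile("charset\\s*=\\s*\"?([^\";\\s]+)\"?", Pattern.CASE_INSENSITIVE);

    // Matches the encoding pseudo-attribute of an XML declaration at the start of the text.
    private static final Pattern XML_DECL_ENCODING =
            Pattern.compile("^<\\?xml[^>]*?encoding\\s*=\\s*[\"']([^\"']+)[\"']");

    /** Returns the charset named in the Content-Type value, or null if none is present. */
    public static String charsetFromContentType(String contentType) {
        Matcher m = CHARSET_PARAM.matcher(contentType);
        return m.find() ? m.group(1) : null;
    }

    /** Returns the encoding named in the XML declaration, or null if none is present. */
    public static String encodingFromXmlDeclaration(String xml) {
        Matcher m = XML_DECL_ENCODING.matcher(xml);
        return m.find() ? m.group(1) : null;
    }

    /**
     * Decides which encoding to use. In strict mode a mismatch is an error;
     * in tolerant mode (an assumed option) the XML declaration wins.
     */
    public static String resolveEncoding(String contentType, String xml, boolean tolerant) {
        String transport = charsetFromContentType(contentType);
        String declared = encodingFromXmlDeclaration(xml);
        if (declared == null) {
            return transport; // nothing to compare against
        }
        if (transport == null || transport.equalsIgnoreCase(declared) || tolerant) {
            return declared;
        }
        throw new IllegalStateException("Transport encoding [" + transport
                + "] does not match XML encoding [" + declared + "]");
    }

    public static void main(String[] args) {
        String contentType = "text/xml;charset=ISO-8859-1";
        String xml = "<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n<Envelope/>";
        // Tolerant mode accepts the response and uses the XML declaration.
        System.out.println(resolveEncoding(contentType, xml, true));
        // Strict mode rejects the same response.
        try {
            resolveEncoding(contentType, xml, false);
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

With the header and declaration values from this report, strict mode throws while tolerant mode returns the encoding from the XML declaration.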
The response itself looks like:

"HTTP/1.1 200 OK[\r][\n]"
"HTTP/1.1 200 OK[\r][\n]"
"Date: Tue, 16 Jun 2009 15:02:40 GMT[\r][\n]"
"Server: Apache-Coyote/1.1[\r][\n]"
"X-Powered-By: Servlet 2.4; Tomcat-5.0.28/JBoss-3.2.6 (build: CVSTag=JBoss_3_2_6 date=200410140106)[\r][\n]"
"Content-Type: text/xml;charset=ISO-8859-1[\r][\n]"
"Content-Length: 475[\r][\n]"
"[\r][\n]"
"<?xm"
"l version="1.0" encoding="UTF-8"?>[\n]"
"<Envelope xmlns="http://schemas.xmlsoap.org/soap/envelope/" xmlns:soapenv="http://schemas.xmlsoap.org/soap/envelope/" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"><Body><soapenv:Fault xmlns=""><faultcode>soapenv:Client</faultcode><faultstring>Error in parsing</faultstring><detail><detaildata>Content is not allowed in prolog.</detaildata></detail></soapenv:Fault></Body></Envelope>"

(as extracted from logs generated by "org.apache.commons.httpclient.Wire wire").

Formally that is OK, but in the real world it is all about interoperability, and that means tolerance, as long as it is clear what is "meant". Thus it would be nice to be able to deliberately "weaken" the validation process, especially for the encoding: it should be possible to switch off the check of the XML encoding against the HTTP header encoding and instead simply use the XML encoding.

Especially when a given remote service must be used, the client code implementer has no influence on what the service returns -- she simply has to arrange her code to match the given service. In my case, this seems to be impossible with the released Axis2 1.4.1 client code.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.