CRLF is translated to LF in scanCharData
----------------------------------------

         Key: XERCESC-1361
         URL: http://issues.apache.org/jira/browse/XERCESC-1361
     Project: Xerces-C++
        Type: Bug
  Components: SAX/SAX2  
    Versions: 2.6.0    
 Environment: win2k, Xerces-c 2.6(build the src with vc6+sp5) and Xerces-c 2.1 
binary version
    Reporter: ding hua


 When i parse a simple xml document. there is a CRLF between aaa and bbb. But 
saxparse call method characters, the string is translated to aaa LF bbb. It 
loses the char CR.


<?xml version="1.0" encoding="gb2312" standalone="no"?>
<dd><ddrow><text>aaa
bbb</text>
</ddrow></dd>

 And i trace the code, i find the char is eated up by handleEOL. I want keep 
the content unchanged. Is it reasonable? Thanks.

The call stack
xercesc_2_6::XMLReader::handleEOL(unsigned short & 0x000d, unsigned char 0x00) 
line 898
xercesc_2_6::XMLReader::getNextCharIfNot(const unsigned short 0x003c, unsigned 
short & 0x000d) line 789
xercesc_2_6::ReaderMgr::getNextCharIfNot(const unsigned short 0x003c, unsigned 
short & 0x000d) line 398
xercesc_2_6::IGXMLScanner::scanCharData(xercesc_2_6::XMLBuffer & {...}) line 
2630 + 17 bytes
xercesc_2_6::IGXMLScanner::scanContent() line 837
xercesc_2_6::IGXMLScanner::scanDocument(const xercesc_2_6::InputSource & {...}) 
line 204 + 8 bytes
xercesc_2_6::SAXParser::parse(const xercesc_2_6::InputSource & {...}) line 720


internal\XMLReader.hpp Ln895
                if ( fCharBuf[fCharIndex] == chLF              || 
                    ((fCharBuf[fCharIndex] == chNEL) && fNEL)  )
                {
                    fCharIndex++;
                }

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
If you want more information on JIRA, or have a bug to report see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to