Re: xerces-c createdocs.bat and the BOM character

2007-10-30 Thread Scott Morgan
Boris Kolpackov wrote: There is no such thing as BOM for UTF-8. http://unicode.org/faq/utf_bom.html#25 a BOM can be used as a signature no matter how the Unicode text is transformed: UTF-16, UTF-8, UTF-7, etc. The exact bytes comprising the BOM will be whatever the Unicode character FEFF is

RE: xerces-c createdocs.bat and the BOM character

2007-10-30 Thread Jesse Pelton
Actually, the XML spec discusses the UTF-8 BOM. See http://www.w3.org/TR/2006/REC-xml-20060816/#sec-guessing-no-ext-info. Whether it makes sense is another question. I suppose it could be used to quickly distinguish UTF-8 from ASCII and similar encodings. Since conforming processors are

Re: xerces-c createdocs.bat and the BOM character

2007-10-30 Thread Justin Dearing
On 10/30/07, David Bertoni [EMAIL PROTECTED] wrote: It would help to see the error message you're getting, and to know what tool is issuing it. My apologies. This is the java tool building version 2 of the xerces-c library on windows of course [XalanProcessor] Applying XSL sheet

XNI-based Xerces C++

2007-10-30 Thread Venky Raju
Hi, Bringing back a question from 2004 - is there a C++ version of Xerces based on XNI? From looking at the Xerces 2.8 code it does not appear that it is based on the XNI framework. Thanks in advance, Venky Raju Senior Staff Engineer Samsung Telecommunications America

RE: XNI-based Xerces C++

2007-10-30 Thread Venky Raju
Thanks, Michael! Venky -Original Message- From: Michael Glavassevich [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 30, 2007 1:36 PM To: c-dev@xerces.apache.org Subject: Re: XNI-based Xerces C++ Hi Venky, Venky Raju [EMAIL PROTECTED] wrote on 10/30/2007 02:28:02 PM: Hi, Bringing

Re: xerces-c createdocs.bat and the BOM character

2007-10-30 Thread Boris Kolpackov
Scott Morgan [EMAIL PROTECTED] writes: http://unicode.org/faq/utf_bom.html#25 a BOM can be used as a signature no matter how the Unicode text is transformed: UTF-16, UTF-8, UTF-7, etc. The exact bytes comprising the BOM will be whatever the Unicode character FEFF is converted into by that