Re: xerces-c createdocs.bat and the BOM character

2007-10-30 Thread David Bertoni
Justin Dearing wrote: On 10/30/07, David Bertoni <[EMAIL PROTECTED]> wrote: It would help to see the error message you're getting, and to know what tool is issuing it. My apologies. This is the Java tool building version 2 of the xerces-c library, on Windows of course: [XalanProcessor] Applying

Re: xerces-c createdocs.bat and the BOM character

2007-10-30 Thread Boris Kolpackov
Scott Morgan <[EMAIL PROTECTED]> writes: > http://unicode.org/faq/utf_bom.html#25 > > "a BOM can be used as a signature no matter how the Unicode text is > transformed: UTF-16, UTF-8, UTF-7, etc. The exact bytes comprising the > BOM will be whatever the Unicode character FEFF is converted into by

Re: xerces-c createdocs.bat and the BOM character

2007-10-30 Thread Justin Dearing
On 10/30/07, David Bertoni <[EMAIL PROTECTED]> wrote: > It would help to see the error message you're getting, and to know what > tool is issuing it. My apologies. This is the Java tool building version 2 of the xerces-c library, on Windows of course: [XalanProcessor] Applying XSL sheet "sbk:/style

Re: xerces-c createdocs.bat and the BOM character

2007-10-30 Thread David Bertoni
Justin Dearing wrote: Thank you all for your quick responses. On 10/30/07, Jesse Pelton <[EMAIL PROTECTED]> wrote: Actually, the XML spec discusses the UTF-8 BOM. See http://www.w3.org/TR/2006/REC-xml-20060816/#sec-guessing-no-ext-info. Whether it makes sense is another question. I suppose it

Re: xerces-c createdocs.bat and the BOM character

2007-10-30 Thread Justin Dearing
Thank you all for your quick responses. On 10/30/07, Jesse Pelton <[EMAIL PROTECTED]> wrote: > Actually, the XML spec discusses the UTF-8 BOM. See > http://www.w3.org/TR/2006/REC-xml-20060816/#sec-guessing-no-ext-info. > > Whether it makes sense is another question. I suppose it could be used > t

RE: xerces-c createdocs.bat and the BOM character

2007-10-30 Thread Jesse Pelton
required to handle UTF-8 and UTF-16, but no other encodings, this might have some value. -----Original Message----- From: Boris Kolpackov [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 30, 2007 10:37 AM To: c-dev@xerces.apache.org Subject: Re: xerces-c createdocs.bat and the BOM character Hi Justin

Re: xerces-c createdocs.bat and the BOM character

2007-10-30 Thread Scott Morgan
Boris Kolpackov wrote: > There is no such thing as BOM for UTF-8. > http://unicode.org/faq/utf_bom.html#25 "a BOM can be used as a signature no matter how the Unicode text is transformed: UTF-16, UTF-8, UTF-7, etc. The exact bytes comprising the BOM will be whatever the Unicode character FEFF
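The point of the FAQ passage quoted above can be checked directly. The following is a minimal Python sketch, not part of the original thread, that encodes the character U+FEFF in a few encodings and prints the resulting bytes; the helper name `bom_bytes` is made up for illustration.

```python
# Encode U+FEFF in several Unicode encodings to see which bytes act as
# the BOM (signature) in each one. The "-le"/"-be" codec names are used
# so that Python does not prepend a BOM of its own.
BOM_CHAR = "\ufeff"


def bom_bytes(encoding: str) -> bytes:
    """Return the bytes that U+FEFF becomes in the given encoding."""
    return BOM_CHAR.encode(encoding)


if __name__ == "__main__":
    for name in ("utf-8", "utf-16-le", "utf-16-be"):
        print(name, bom_bytes(name).hex())
```

In UTF-8 the result is the three bytes EF BB BF. They carry no byte-order information there, but they can still serve as a signature at the start of a file.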

Re: xerces-c createdocs.bat and the BOM character

2007-10-30 Thread Boris Kolpackov
Hi Justin, Justin Dearing <[EMAIL PROTECTED]> writes: > Gerald, the author of XML Copy Editor, seems to think the BOM should be > there as the docs are UTF-8 and it is a UTF-8 BOM. BOM (byte order marker) does not make any sense for UTF-8 since it is a 1-byte encoding. > 1) What is the intended

xerces-c createdocs.bat and the BOM character

2007-10-30 Thread Justin Dearing
Folks, I was editing the documentation using XML Copy Editor and I was getting an error when I ran createdocs.bat. I eventually tracked the error down to the editor inserting a BOM character at the beginning of the document. Gerald, the author of XML Copy Editor, seems to think the BOM should be there as
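One possible workaround for the situation described above is to remove the leading UTF-8 BOM from the edited files before running createdocs.bat. The thread excerpt does not say this is what was done, so the Python sketch below is only an assumption-laden illustration; the function names are hypothetical and the script is not part of the xerces-c build.

```python
import codecs


def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM (EF BB BF), if one is present."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


def strip_bom_in_file(path: str) -> bool:
    """Rewrite the file at path without a leading UTF-8 BOM.

    Returns True if the file was changed, False if it had no BOM.
    The path is whatever XML file the editor saved (hypothetical here).
    """
    with open(path, "rb") as f:
        data = f.read()
    cleaned = strip_utf8_bom(data)
    if len(cleaned) == len(data):
        return False
    with open(path, "wb") as f:
        f.write(cleaned)
    return True
```

Working on raw bytes avoids decoding and re-encoding the document, so everything after the BOM is left byte-for-byte unchanged.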