xerces-c createdocs.bat and the BOM character

Justin Dearing Tue, 30 Oct 2007 06:21:33 -0800

Folks,

I was editing the documentation using XML Copy Editor and I was getting an
error when I ran createdocs.bat. I eventually tracked the error down to it
inserting a BOM character  at the beginning of the document. Gerald, the
author of XML copy editor seems to think the BOM should be there as the docs
are UTF-8 and it is a UTF-8 BOM. I'm not sure what the encoding is or if
TortioseSVN is altering the encoding. I have a few questions regarding this:



1) What is the intended encoding of the documentation? Since the documents
are written in English my understanding is UTF-8 would work just fine but I
don't know a lot about unicode.
2) Does the java tool that builds the documentation handle BOMs correctly
for UTF-8 or is my editor at fault.
3) As a developer working on a windows platform how would I get encoding
information about a file?
4) As a developer working on a unix platform how would I get encoding
information about a file?

It seems the answers to 3 and 4 is there is no easy answer.

xerces-c createdocs.bat and the BOM character

Reply via email to