Folks, I was editing the documentation using XML Copy Editor and got an error when I ran createdocs.bat. I eventually tracked the error down to the editor inserting a BOM character at the beginning of the document. Gerald, the author of XML Copy Editor, seems to think the BOM should be there, as the docs are UTF-8 and it is a UTF-8 BOM. I'm not sure what the encoding is, or whether TortoiseSVN is altering the encoding. I have a few questions regarding this:
1) What is the intended encoding of the documentation? Since the documents are written in English, my understanding is that UTF-8 would work just fine, but I don't know a lot about Unicode.
2) Does the Java tool that builds the documentation handle BOMs correctly for UTF-8, or is my editor at fault?
3) As a developer working on a Windows platform, how would I get encoding information about a file?
4) As a developer working on a Unix platform, how would I get encoding information about a file?

It seems the answer to both 3 and 4 is that there is no easy answer.
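
For what it's worth, the presence of a BOM can at least be checked on either platform by looking at the first few bytes of the file. Below is a minimal sketch in Python, not anything the doc build actually uses. The function names (`detect_bom`, `sniff_file`, `strip_utf8_bom`) are made up for illustration. It only recognizes a BOM; a file without one could still be UTF-8, ASCII, or something else entirely, and this won't tell you which.

```python
import codecs

# Known BOM signatures. The UTF-32 entries come before UTF-16 because the
# UTF-32 LE BOM starts with the same two bytes as the UTF-16 LE BOM.
BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def detect_bom(data: bytes):
    """Return the encoding implied by a leading BOM, or None if there is no BOM."""
    for bom, name in BOMS:
        if data.startswith(bom):
            return name
    return None


def sniff_file(path):
    """Read the first four bytes of a file and report any BOM found there."""
    with open(path, "rb") as f:
        head = f.read(4)
    return detect_bom(head)


def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM; other input is returned unchanged."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data
```

Calling `sniff_file` on one of the documentation source files should return `"utf-8-sig"` if the editor wrote a UTF-8 BOM, and `None` if the file starts directly with the XML content.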
