[dom4j-user] Re: How do I get the encoding in a document?

James Strachan Mon, 24 Nov 2003 07:44:03 -0800

The trick is to write the document with an XMLWriter and set the encoding on an OutputFormat object.

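// XMLWriter takes the output target first, then the format.
// Passing an OutputStream (not a FileWriter, which uses the platform
// default charset) lets XMLWriter encode the bytes with the format's encoding.
OutputFormat format = new OutputFormat();
format.setEncoding(...);
XMLWriter writer = new XMLWriter(new FileOutputStream("foo.xml"), format);
writer.write(someDoc);
writer.close();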
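
For the reading side, the "pre-read" mentioned in the question below does not have to be done entirely by hand. One possibility is to let the JDK's StAX parser report the encoding named in the XML declaration. This is only a sketch, assuming Java 6 or later; the class name EncodingSniffer and the method declaredEncoding are made up for illustration and are not part of dom4j.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical helper, not a dom4j class.
final class EncodingSniffer {

    /**
     * Returns the encoding named in the XML declaration of the stream,
     * or "UTF-8" when the document declares none (BOM-based detection
     * of UTF-16 is not handled in this sketch).
     */
    static String declaredEncoding(InputStream in) throws XMLStreamException {
        XMLStreamReader reader = XMLInputFactory.newInstance().createXMLStreamReader(in);
        try {
            // Value of encoding="..." in <?xml ...?>, or null if absent.
            String declared = reader.getCharacterEncodingScheme();
            return declared != null ? declared : "UTF-8";
        } finally {
            reader.close(); // closes the reader only, not the underlying stream
        }
    }

    public static void main(String[] args) throws Exception {
        byte[] xml = "<?xml version=\"1.0\" encoding=\"ISO-8859-1\"?><a>\u00fc</a>"
                .getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(declaredEncoding(new ByteArrayInputStream(xml)));
    }
}
```

The returned name could then be passed to format.setEncoding(...) before writing, so that a document read as ISO-8859-1 is written back as ISO-8859-1. Note that the stream is consumed by the sniffing step, so the file would need to be opened again for the actual parse.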

On 24 Nov 2003, at 13:24, Klaus Engelhardt wrote:

Dear James Strachan,

I am unable to figure out how to get the original encoding of an XML
document using dom4j. I have seen email comments, etc. stating that
this is impossible. My question is: is this true, and if so, why?

The reason I need this is that some documents are in ISO-8859-1
and others are in UTF-8. If I read in ISO-8859-1 with German
special characters and write it back again as UTF-8, then the resulting
document is no longer viewable with a simple ASCII text editor, at
least as far as the special characters are concerned. So I really do
need to get the original encoding in order to rewrite the modified
document correctly.

If I have to, I will pre-read the XML document to get the encoding
by hand. But I would really love to spare this hopefully unnecessary
effort!

If you could send me a comment on this problem or, better yet, a solution, I would be very happy!

Thanks,
Klaus Engelhardt



James
-------
http://radio.weblogs.com/0112098/
