Hi all, I played around with $FORREST_HOME/etc/tidy-xml.pl in a custom project where I need to clean up the white spaces.
For now we have not set the encoding in our configuration, this however can lead to problems in combination with add-xml-decl: yes If you have a xml file that did not had a xml declaration, tidy will add one and use the default encoding which is "us-ascii". I needed to add "char-encoding: utf8" to the config to get rid of invalid character error that all my utf-8 characters had thrown. Another thing is the indent of all attributes. IMO that it just too much since if you have an element with 5 attributes you will have it now in 6 lines. I propose the following change: Index: etc/tidy-config.txt =================================================================== --- etc/tidy-config.txt (revision 748122) +++ etc/tidy-config.txt (working copy) @@ -1,8 +1,9 @@ add-xml-decl: yes +char-encoding: utf8 input-xml: yes output-xml:yes indent: auto -indent-attributes: yes +indent-attributes: no indent-spaces: 2 write-back: yes preserve-entities: yes wdyt? salu2 -- Thorsten Scherler <thorsten.at.apache.org> Open Source Java <consulting, training and solutions> Sociedad Andaluza para el Desarrollo de la Sociedad de la Información, S.A.U. (SADESI)