I meant the vdd.XML is Latin-1 encoded.
As a first workaround, I just add an encoding directive in the xml file at line
268 in VDD_generator.pm :
...
open OUTPUT, ">" . $arguments->{ output_directory } . "/" .
$VDD_DEFAULT_OUTPUT_FILE_XML;
+ print OUTPUT '<?xml version="1.0" encoding="iso-8859-1"?>';
print OUTPUT $vdd_data;
...
Then the process is going further :
- vdd.xml is correctly parsed, and transformed (no errors)
- vdd.sgml is generated (UTF-8 encoded apparently, but without encoding
attribute in <?xml version="1.0" ...>)
- fatal errors when generating PDF
- html generated, UTF-8 encoded, but without encoding attribute like
content="text/html; charset=UTF-8"
=> Bad display in my browser (bad guess of the encoding)
I also try to had a directive in the XSL transformation :
...
<?xml version="1.0" encoding="ISO-8859-1"?>
<xsl:stylesheet version="1.0"
xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
+ <xsl:output method="xml" encoding="iso-8859-1"/>
<xsl:template match="/">
...
It will explicitly control the encoding of the output
- vdd.sgml is generated, Latin-1 encoded
- html generated, Latin-1 encoded, but still without encoding attribute
=> Good guess, good display
My suggestions to avoid this kind of encoding problems
1 - stay with utf-8 encoding during all the process
2 - every xml / sgml / html file must have the encoding attribute
My questions
- In my case, the data in vdd.xml are Latin-1 encoded. I will investigate why
and try to change this. Can you tell me where I should look ?
- Do you know how to generate the contents/encoding attribute in the HTML with
docbook2html ?
About PDF generation error, I get the following messages in the log.
Do you know wnat I should do ?
...
! TeX capacity exceeded, sorry [save size=5000].
<recently read> \p@
l.5155 {start}}
\Link%
If you really absolutely need more capacity,
you can ask a wizard to enlarge me.
Here is how much of TeX's memory you used:
1126 strings out of 42085
26401 string characters out of 313418
250009 words of memory out of 1085000
14876 multiletter control sequences out of 10000+32500
23074 words of font info for 34 fonts, out of 500000 for 2000
580 hyphenation exceptions out of 8191
28i,11n,43p,217b,5000s stack positions out of 1500i,500n,1500p,200000b,5000s
PDF statistics:
472 PDF objects out of 300000
151 named destinations out of 131072
25 words of extra memory for PDF output out of 65536
! ==> Fatal error occurred, the output PDF file is not finished!
...
Regards
_______________________________________________
scmbug-users mailing list
[email protected]
http://lists.mkgnu.net/cgi-bin/mailman/listinfo/scmbug-users