We have an XML document in UTF-8 encoding. It contains characters for the Sami (Lappish) language. When we generate PostScript documents with FOP or XEP, three characters are missing from the output: XEP replaces each of them with the character \240, and FOP replaces each with '#'. The characters in question are encoded in UTF-8 as 0xc4 0x91 (đ), 0xc5 0x8b (ŋ) and 0xc4 0x8d (č). Why does this happen, and how can we solve it?
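
For anyone who wants to check which characters the byte sequences stand for, here is a minimal sketch in Python. It only decodes the three sequences quoted above and looks up their Unicode names; the `describe` helper is a name made up for this illustration, and the snippet says nothing about what FOP or XEP do with the characters internally.

```python
import unicodedata

# The three UTF-8 byte sequences that go missing in the output.
SEQUENCES = [b"\xc4\x91", b"\xc5\x8b", b"\xc4\x8d"]


def describe(raw):
    """Hypothetical helper: decode one UTF-8 sequence into
    (character, code point, Unicode name)."""
    ch = raw.decode("utf-8")
    return ch, "U+%04X" % ord(ch), unicodedata.name(ch)


results = [describe(raw) for raw in SEQUENCES]

for ch, code_point, name in results:
    print(ch, code_point, name)
```

All three decode to code points above U+00FF, so whether they can be printed depends on the selected font (Courier in the stylesheet) actually providing glyphs for them.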

I have attached a sample document which shows the problem.

Merry Christmas!

Asbjørn Thorsen
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform" xmlns:fo="http://www.w3.org/1999/XSL/Format" version="1.0">
  <xsl:template match="/">
    <fo:root xmlns:fo="http://www.w3.org/1999/XSL/Format"
	     font-family="Courier">
      <fo:layout-master-set>
	<fo:simple-page-master master-name="frame"
			       margin-top="10mm"
			       page-height="297mm" page-width="209mm"
			       margin-left="10mm" margin-right="10mm">
	  <fo:region-body region-name="frame-body"/>
	</fo:simple-page-master>
      </fo:layout-master-set>
      <fo:page-sequence master-reference="frame">
	<fo:flow flow-name="frame-body">
	  <fo:block>
	    <xsl:value-of select="brev/tekst"/>
	  </fo:block>
	</fo:flow>
      </fo:page-sequence>
    </fo:root>
  </xsl:template>
</xsl:stylesheet>
<?xml version="1.0" encoding="UTF-8"?>
<brev>
    <tekst>
        Ohcan sisaváldot sierra dahje oktagaslaš árvvoštallama vuođul
        lea hilgojuvvon, go oahppobáikki árvvoštallama mielde, du
        báhpirat eai atte vuođu vuoruhit dábálaš čuokkisnjuolggadusaid
        olggobeale. Don it leat ožžon fálaldaga maŋŋil sierra dahje
        oktagaslaš árvvoštallama, dan dihte go oahppobáiki oaivvilda
        ahte dus eai leat seamma máhtolašvuođat go ohcciin geat leat
        ožžon fálaldaga
    </tekst>
</brev>