I have a problem when i use pdftohtml -xml with a arabic pdf that give me
one character in each line , how can i fix this problem ?
<text top="270" left="2245" width="5" height="12" font="3"><b>¡</b></text>
<text top="270" left="2242" width="3" height="12" font="3"><b>É </b></text>
<text top="270" left="2236" width="3" height="12" font="3"><b>GC</b></text>
<text top="270" left="2231" width="5" height="12" font="3"><b>e</b></text>
<text top="270" left="2225" width="6" height="12" font="3"><b>ù</b></text>
<text top="270" left="2217" width="8" height="12" font="3"><b>¢</b></text>
<text top="270" left="2215" width="2" height="12" font="3"><b> G</b></text>
<text top="270" left="2208" width="4" height="12" font="3"><b>d</b></text>
<text top="270" left="2202" width="6" height="12" font="3"><b>ù</b></text>
<text top="270" left="2198" width="4" height="12" font="3"><b>°</b></text>
<text top="270" left="2194" width="4" height="12" font="3"><b>Ñ</b></text>
<text top="270" left="2184" width="10" height="12" font="3"><b>â</b></text>
_______________________________________________
poppler mailing list
[email protected]
https://lists.freedesktop.org/mailman/listinfo/poppler

Reply via email to