I have a problem when i use pdftohtml -xml with a arabic pdf that give me one character in each line , how can i fix this problem ? <text top="270" left="2245" width="5" height="12" font="3"><b>¡</b></text> <text top="270" left="2242" width="3" height="12" font="3"><b>É </b></text> <text top="270" left="2236" width="3" height="12" font="3"><b>GC</b></text> <text top="270" left="2231" width="5" height="12" font="3"><b>e</b></text> <text top="270" left="2225" width="6" height="12" font="3"><b>ù</b></text> <text top="270" left="2217" width="8" height="12" font="3"><b>¢</b></text> <text top="270" left="2215" width="2" height="12" font="3"><b> G</b></text> <text top="270" left="2208" width="4" height="12" font="3"><b>d</b></text> <text top="270" left="2202" width="6" height="12" font="3"><b>ù</b></text> <text top="270" left="2198" width="4" height="12" font="3"><b>°</b></text> <text top="270" left="2194" width="4" height="12" font="3"><b>Ñ</b></text> <text top="270" left="2184" width="10" height="12" font="3"><b>â</b></text>
_______________________________________________ poppler mailing list [email protected] https://lists.freedesktop.org/mailman/listinfo/poppler
