If you do not provide file (hocr) there is no way how to help you - error message is clear:
not well-formed (invalid token): line 135, column 397 Zdenko On Thu, Jan 30, 2014 at 8:49 PM, peiman F. <[email protected]> wrote: > hello > i got this error when tried to make a pdf file from images and hocr files > is there any body else to face this? > > python ./hocr-pdf a > Traceback (most recent call last): > File "./hocr-pdf", line 137, in <module> > export_pdf(sys.argv[1], 300) > File "./hocr-pdf", line 53, in export_pdf > add_text_layer(pdf, image, height, dpi) > File "./hocr-pdf", line 63, in add_text_layer > hocr.parse(hocrfile) > File "/usr/lib/python2.7/xml/etree/ElementTree.py", line 656, in parse > parser.feed(data) > File "/usr/lib/python2.7/xml/etree/ElementTree.py", line 1643, in feed > self._raiseerror(v) > File "/usr/lib/python2.7/xml/etree/ElementTree.py", line 1507, in > _raiseerror > raise err > xml.etree.ElementTree.ParseError: not well-formed (invalid token): line > 135, column 397 > > i checked filea are formated with correct name > > thank you > > -- > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > > --- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > For more options, visit https://groups.google.com/groups/opt_out. > -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

