3.03 has not been released yet; it is only in the svn repository (i.e. you need to compile it. If you are not familiar with compiling from source, then you need to wait ;-) ).
But as I wrote: I used the patched hocr-pdf tool (on Linux) with your files and it worked (no need to use the dev version of tesseract)...

Zdenko

On Thu, Jan 30, 2014 at 11:12 PM, peiman F. <[email protected]> wrote:
> where is 3.03?
> i can't find it in google code!
>
> for this issue i got a trick in php
>
> $wrongFileData = file_get_contents($html.'hocr');
> $wrong = array('<em>','</em>');
> $corrected = str_replace($wrong, "", $wrongFileData);
> file_put_contents($html.'hocr' , $corrected );
>
> and resolved.
>
> --
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>
> ---
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> For more options, visit https://groups.google.com/groups/opt_out.

