3.03 was not released yet - it is in svn repository (e.g. you need to
compile it. If you are not familiar with compiling from source than you
need to wait ;-) ).

But as I wrote: I used patched hocr-pdf tool (on linux) and your files and
it worked (no need to used dev version of tesseract)...

Zdenko


On Thu, Jan 30, 2014 at 11:12 PM, peiman F. <[email protected]> wrote:

> where is 3.03?
> i cant find it in google code!
>
> for this issue i got a trick in php
>
>                     $wrongFileData = file_get_contents($html.'hocr');
>                     $wrong = array('<em>','</em>');
>                     $corrected = str_replace($wrong, "", $wrongFileData);
>                     file_put_contents($html.'hocr' , $corrected );
>
> and resoved.
>
> --
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>
> ---
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> For more options, visit https://groups.google.com/groups/opt_out.
>

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to