* Janusz S. Bień jsb...@mimuw.edu.pl, 2010-03-05, 06:30:
[...]
ocrodjvu indeed crashes, but on the garbage-in-garbage-out principle. If
you run ocrodjvu with the --debug option, you'll see that resulting hOCR
files don't contain anything legible. In fact, hOCR for page 2 contains
also some
On Fri, 5 Mar 2010 Jakub Wilk jw...@debian.org wrote:
[...]
That said, if you insist on ignoring exceptions, you can easily achieve
that with a simple shell script like:
cp in.djvu out.djvu
djvused -e remove-txt out.djvu
for p in $(seq 1 $(djvused -e n out.djvu))
do
ocrodjvu -p $p
Package: ocrodjvu
Version: 0.4.2-1
Severity: normal
On Mon, 01 Mar 2010 jsb...@mimuw.edu.pl wrote:
The input file is temporarily available at
http://fleksem.klf.uw.edu.pl/~jsbien/tmp/in.djvu.
Now I get:
retitle 572522 ocrodjvu: crashes with ValueError on malformed hOCR
severity 572522 minor
thanks
ocrodjvu --render all --engine cuneiform --language pol --clear-text -o
out.djvu in.djvu
Processing 'in.djvu':
- Page #1
- Page #2
Exception in thread Thread-2:
Traceback (most recent call last):
On Thu, 4 Mar 2010 Jakub Wilk jw...@debian.org wrote:
[...]
ocrodjvu indeed crashes, but on the garbage-in-garbage-out principle. If
you run ocrodjvu with the --debug option, you'll see that resulting hOCR
files don't contain anything legible. In fact, hOCR for page 2 contains
also some
5 matches
Mail list logo