Bug#572522: ocrodjvu: new problem with cuneiform engine

2010-03-05 Thread Jakub Wilk
* Janusz S. Bień jsb...@mimuw.edu.pl, 2010-03-05, 06:30: [...] ocrodjvu indeed crashes, but on the garbage-in-garbage-out principle. If you run ocrodjvu with the --debug option, you'll see that resulting hOCR files don't contain anything legible. In fact, hOCR for page 2 contains also some

Bug#572522: ocrodjvu: new problem with cuneiform engine

2010-03-05 Thread Janusz S. =?UTF-8?Q?Bie=C5=84
On Fri, 5 Mar 2010 Jakub Wilk jw...@debian.org wrote: [...] That said, if you insist on ignoring exceptions, you can easily achieve that with a simple shell script like: cp in.djvu out.djvu djvused -e remove-txt out.djvu for p in $(seq 1 $(djvused -e n out.djvu)) do ocrodjvu -p $p

Bug#572522: ocrodjvu: new problem with cuneiform engine

2010-03-04 Thread jsbien
Package: ocrodjvu Version: 0.4.2-1 Severity: normal On Mon, 01 Mar 2010 jsb...@mimuw.edu.pl wrote: The input file is temporarily available at http://fleksem.klf.uw.edu.pl/~jsbien/tmp/in.djvu. Now I get:

Bug#572522: ocrodjvu: new problem with cuneiform engine

2010-03-04 Thread Jakub Wilk
retitle 572522 ocrodjvu: crashes with ValueError on malformed hOCR severity 572522 minor thanks ocrodjvu --render all --engine cuneiform --language pol --clear-text -o out.djvu in.djvu Processing 'in.djvu': - Page #1 - Page #2 Exception in thread Thread-2: Traceback (most recent call last):

Bug#572522: ocrodjvu: new problem with cuneiform engine

2010-03-04 Thread Janusz S. =?UTF-8?Q?Bie=C5=84
On Thu, 4 Mar 2010 Jakub Wilk jw...@debian.org wrote: [...] ocrodjvu indeed crashes, but on the garbage-in-garbage-out principle. If you run ocrodjvu with the --debug option, you'll see that resulting hOCR files don't contain anything legible. In fact, hOCR for page 2 contains also some