Package: ocropus
Version: 0.3.1-3
Severity: normal

Running ocropus on a 64-bit system (amd64), I get an error that does not
occur on a 32-bit system (i386):

  $ /usr/bin/ocroscript recognize --charboxes 000069.pnm
  ocroscript: unicharset.cpp:76: const UNICHAR_ID UNICHARSET::unichar_to_id(const char*, int) const: Assertion `ids.contains(unichar_repr, length)' failed.
  Aborted

The problem is probably actually in tesseract, but I don't know how to
run tesseract directly to reproduce it.

This may be the same issue as
http://code.google.com/p/tesseract-ocr/issues/detail?id=265#c0
which is also reported as
https://bugs.launchpad.net/ubuntu/+source/tesseract/+bug/565688

It is apparently fixed in tesseract 3.0.

-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (500, 'testing')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.32-3-amd64 (SMP w/1 CPU core)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages ocropus depends on:
ii  libc6         2.11.2-2          Embedded GNU C Library: Shared lib
ii  libgcc1       1:4.4.4-7         GCC support library
ii  libiulib0     0.3-1+b1          C++ library of image understanding
ii  libjpeg62     6b1-1             The Independent JPEG Group's JPEG
ii  liblua5.1-0   5.1.4-5           Simple, extensible, embeddable pro
ii  libpng12-0    1.2.44-1          PNG library - runtime
ii  libstdc++6    4.4.4-7           The GNU Standard C++ Library v3
ii  libtiff4      3.9.4-1           Tag Image File Format (TIFF) libra
ii  ocropus-data  0.3.1-3           document analysis and OCR system -
ii  zlib1g        1:1.2.3.4.dfsg-3  compression library - runtime

Versions of packages ocropus recommends:
ii  tesseract-ocr  2.04-2  Command line OCR tool

ocropus suggests no packages.

-- no debconf information

