Package: gscan2pdf Version: 0.9.25-1 Severity: important
When I import a tiff file created by scanimage, it can be "OCRised" by tesseract. But if I crop the image (it's necessary with multi-columns text), I got the next message in the console from where I've loaded gscan2pdf: Tesseract Open Source OCR Engine check_legal_image_size:Error:Only 1,2,4,5,6,8 bpp are supported:16 *** unhandled exception in callback: *** Error: cannot open /tmp/NLA9Ssq5aV/DWUea9s5Xc.txt *** ignoring at /usr/bin/gscan2pdf line 1114. The same cropped image saved as a tiff file gives such information with tiffinfo: g...@fantasio:~$ tiffinfo texte3.tif TIFF Directory at offset 0xec7f8 (968696) Image Width: 961 Image Length: 504 Resolution: 300, 300 pixels/inch Position: 0.01, 1.04333 Bits/Sample: 16 Compression Scheme: None Photometric Interpretation: min-is-black FillOrder: msb-to-lsb Orientation: row 0 top, col 0 lhs Samples/Pixel: 1 Rows/Strip: 4 Planar Configuration: single image plane DocumentName: /tmp/NLA9Ssq5aV/8SVdkwLXUI.tif ImageDescription: SANE data follows Of course, she can't be treated by tesseract separatly. The choice of a mode of compression (of none) doesn't affect tesseract. Once cropped, the image can't be treated by unpaper. Error message on the console: Avertissement : Format d'image non reconnu at /usr/bin/gscan2pdf line 1394. *** unhandled exception in callback: *** `' is not of type Gtk2::Gdk::Pixbuf at /usr/share/perl5/Gtk2/Ex/Simple /TiedCommon.pm line 65. *** ignoring at /usr/bin/gscan2pdf line 1114. I've not tested other features of gscan2pdf, my needs was only to find a simple guy to proceed the OCR with tesseract. Regards, G.Vandemoortele -- System Information: Debian Release: 5.0.3 APT prefers stable APT policy: (500, 'stable') Architecture: i386 (i686) Kernel: Linux 2.6.26-1-686 (SMP w/1 CPU core) Locale: LANG=fr_BE.UTF-8, LC_CTYPE=fr_BE.UTF-8 (charmap=UTF-8) Shell: /bin/sh linked to /bin/bash Versions of packages gscan2pdf depends on: ii imagemagick 7:6.3.7.9.dfsg2-1~lenny3 image manipulation programs ii libconfig-gener 2.40-1 Generic Configuration Module ii libgtk2-ex-simp 0.50-1.1 A simple interface to Gtk2's compl ii libgtk2-imagevi 0.04-1+b1 Perl bindings for the GtkImageView ii liblocale-gette 1.05-4 Using libc functions for internati ii libpdf-api2-per 0.69-2 create or modify PDF documents in ii librsvg2-common 2.22.2-2lenny1 SAX-based renderer library for SVG ii libsane 1.0.19-23 API library for scanners ii libtiff-tools 3.8.2-11.2 TIFF manipulation and conversion t ii perlmagick 7:6.3.7.9.dfsg2-1~lenny3 Perl interface to the libMagick gr ii sane-utils 1.0.19-23 API library for scanners -- utilit Versions of packages gscan2pdf recommends: ii djvulibre-bin 3.5.20-8+lenny1 Utilities for the DjVu image forma ii gocr 0.45-2 A command line OCR ii libgtk2-ex-podviewer-per 0.17-2 Perl Gtk2 widget for displaying Pl ii sane 1.0.14-7 scanner graphical frontends ii tesseract-ocr 2.03-2 Command line OCR tool ii unpaper 0.3-1 post-processing tool for scanned p ii xdg-utils 1.0.2-6.1 desktop integration utilities from gscan2pdf suggests no packages. -- no debconf information -- To UNSUBSCRIBE, email to [email protected] with a subject of "unsubscribe". Trouble? Contact [email protected]

