Your message dated Wed, 19 May 2010 09:30:05 +0200
with message-id <20100519073005.gc2...@eeepc>
and subject line Bug#557827: gscan2pdf: cropped tiff files are 16bits-sampled
has caused the Debian Bug report #557827,
regarding gscan2pdf: cropped tiff files are 16bits-sampled and unusable by
tesseract
to be marked as done.
This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.
(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [email protected]
immediately.)
--
557827: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=557827
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Package: gscan2pdf
Version: 0.9.25-1
Severity: important
When I import a TIFF file created by scanimage, tesseract can OCR it.
But if I crop the image (which is necessary for multi-column text),
I get the following message in the console from which I launched gscan2pdf:
Tesseract Open Source OCR Engine
check_legal_image_size:Error:Only 1,2,4,5,6,8 bpp are supported:16
*** unhandled exception in callback:
*** Error: cannot open /tmp/NLA9Ssq5aV/DWUea9s5Xc.txt
*** ignoring at /usr/bin/gscan2pdf line 1114.
Saving the same cropped image as a TIFF file and running tiffinfo on it
gives the following:
g...@fantasio:~$ tiffinfo texte3.tif
TIFF Directory at offset 0xec7f8 (968696)
Image Width: 961 Image Length: 504
Resolution: 300, 300 pixels/inch
Position: 0.01, 1.04333
Bits/Sample: 16
Compression Scheme: None
Photometric Interpretation: min-is-black
FillOrder: msb-to-lsb
Orientation: row 0 top, col 0 lhs
Samples/Pixel: 1
Rows/Strip: 4
Planar Configuration: single image plane
DocumentName: /tmp/NLA9Ssq5aV/8SVdkwLXUI.tif
ImageDescription: SANE data follows
Of course, tesseract cannot process this file when run on it directly
either. The choice of compression mode (or none) makes no difference to
tesseract.
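The depth mismatch can be checked mechanically. The following is a minimal Python sketch, not part of gscan2pdf or tesseract: it parses tiffinfo output like the listing above and compares the resulting bits per pixel with the depths named in the tesseract error message. The function names are hypothetical, and it assumes that bits per pixel equals Bits/Sample multiplied by Samples/Pixel.

```python
import re

# Depths listed in the tesseract error message quoted in this report.
SUPPORTED_BPP = {1, 2, 4, 5, 6, 8}


def _field(name: str, text: str) -> int:
    """Return the integer value of a 'Name: value' field in tiffinfo output."""
    match = re.search(rf"{re.escape(name)}:\s*(\d+)", text)
    if match is None:
        raise ValueError(f"field {name!r} not found")
    return int(match.group(1))


def bits_per_pixel(tiffinfo_output: str) -> int:
    # Assumption: bpp = Bits/Sample * Samples/Pixel.
    bits = _field("Bits/Sample", tiffinfo_output)
    samples = _field("Samples/Pixel", tiffinfo_output)
    return bits * samples


def tesseract_can_read(tiffinfo_output: str) -> bool:
    """True if the computed depth is one of the depths in SUPPORTED_BPP."""
    return bits_per_pixel(tiffinfo_output) in SUPPORTED_BPP


# Excerpt of the tiffinfo listing from this report.
REPORT_EXCERPT = """\
  Image Width: 961 Image Length: 504
  Bits/Sample: 16
  Samples/Pixel: 1
"""

if __name__ == "__main__":
    print(bits_per_pixel(REPORT_EXCERPT), tesseract_can_read(REPORT_EXCERPT))
```

For the listing in this report the computed depth is 16 bits per pixel, which is outside the supported set and matches the trailing ":16" in the error message.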
Once cropped, the image cannot be processed by unpaper either.
Error message on the console (the first line is French for
"Warning: unrecognized image format"):
Avertissement : Format d'image non reconnu at /usr/bin/gscan2pdf line 1394.
*** unhandled exception in callback:
*** `' is not of type Gtk2::Gdk::Pixbuf at /usr/share/perl5/Gtk2/Ex/Simple/TiedCommon.pm line 65.
*** ignoring at /usr/bin/gscan2pdf line 1114.
I have not tested the other features of gscan2pdf; my only need was to
find a simple GUI for running OCR with tesseract.
Regards,
G.Vandemoortele
-- System Information:
Debian Release: 5.0.3
APT prefers stable
APT policy: (500, 'stable')
Architecture: i386 (i686)
Kernel: Linux 2.6.26-1-686 (SMP w/1 CPU core)
Locale: LANG=fr_BE.UTF-8, LC_CTYPE=fr_BE.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash
Versions of packages gscan2pdf depends on:
ii imagemagick 7:6.3.7.9.dfsg2-1~lenny3 image manipulation programs
ii libconfig-gener 2.40-1 Generic Configuration Module
ii libgtk2-ex-simp 0.50-1.1 A simple interface to Gtk2's compl
ii libgtk2-imagevi 0.04-1+b1 Perl bindings for the GtkImageView
ii liblocale-gette 1.05-4 Using libc functions for internati
ii libpdf-api2-per 0.69-2 create or modify PDF documents in
ii librsvg2-common 2.22.2-2lenny1 SAX-based renderer library for SVG
ii libsane 1.0.19-23 API library for scanners
ii libtiff-tools 3.8.2-11.2 TIFF manipulation and conversion t
ii perlmagick 7:6.3.7.9.dfsg2-1~lenny3 Perl interface to the libMagick gr
ii sane-utils 1.0.19-23 API library for scanners -- utilit
Versions of packages gscan2pdf recommends:
ii djvulibre-bin 3.5.20-8+lenny1 Utilities for the DjVu image forma
ii gocr 0.45-2 A command line OCR
ii libgtk2-ex-podviewer-per 0.17-2 Perl Gtk2 widget for displaying Pl
ii sane 1.0.14-7 scanner graphical frontends
ii tesseract-ocr 2.03-2 Command line OCR tool
ii unpaper 0.3-1 post-processing tool for scanned p
ii xdg-utils 1.0.2-6.1 desktop integration utilities from
gscan2pdf suggests no packages.
-- no debconf information
--- End Message ---
--- Begin Message ---
I'm closing this due to lack of response, assuming that updating
ImageMagick fixed things. Please reopen if you can still reproduce it.
Regards
Jeff
--- End Message ---