Your message dated Wed, 19 May 2010 09:30:05 +0200
with message-id <20100519073005.gc2...@eeepc>
and subject line Bug#557827: gscan2pdf: cropped tiff files are 16bits-sampled
has caused the Debian Bug report #557827,
regarding gscan2pdf: cropped tiff files are 16bits-sampled and unusable by 
tesseract
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [email protected]
immediately.)


-- 
557827: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=557827
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Package: gscan2pdf
Version: 0.9.25-1
Severity: important


When I import a tiff file created by scanimage, it can be "OCRised" by 
tesseract. But if I crop the image (it's necessary with multi-columns text), 
I got the next message in the console from where I've loaded gscan2pdf:

Tesseract Open Source OCR Engine
 check_legal_image_size:Error:Only 1,2,4,5,6,8 bpp are supported:16
 *** unhandled exception in callback:
 ***   Error: cannot open /tmp/NLA9Ssq5aV/DWUea9s5Xc.txt
 ***  ignoring at /usr/bin/gscan2pdf line 1114.

The same cropped image saved as a tiff file gives such information 
with tiffinfo:

g...@fantasio:~$ tiffinfo texte3.tif 
  TIFF Directory at offset 0xec7f8 (968696)
  Image Width: 961 Image Length: 504
  Resolution: 300, 300 pixels/inch
  Position: 0.01, 1.04333
  Bits/Sample: 16
  Compression Scheme: None
  Photometric Interpretation: min-is-black
  FillOrder: msb-to-lsb
  Orientation: row 0 top, col 0 lhs
  Samples/Pixel: 1
  Rows/Strip: 4
  Planar Configuration: single image plane
  DocumentName: /tmp/NLA9Ssq5aV/8SVdkwLXUI.tif
  ImageDescription:  SANE data follows 

Of course, she can't be treated by tesseract separatly. The choice of a
mode of compression (of none) doesn't affect tesseract.

Once cropped, the image can't be treated by unpaper. 

Error message on the console: 

Avertissement : Format d'image non reconnu at /usr/bin/gscan2pdf line 1394.
 *** unhandled exception in callback:
 ***   `' is not of type Gtk2::Gdk::Pixbuf at /usr/share/perl5/Gtk2/Ex/Simple 
/TiedCommon.pm line 65.
 ***  ignoring at /usr/bin/gscan2pdf line 1114.

I've not tested other features of gscan2pdf, my needs was only to find a
simple guy to proceed the OCR with tesseract.

Regards,

G.Vandemoortele  

-- System Information:
Debian Release: 5.0.3
  APT prefers stable
  APT policy: (500, 'stable')
Architecture: i386 (i686)

Kernel: Linux 2.6.26-1-686 (SMP w/1 CPU core)
Locale: LANG=fr_BE.UTF-8, LC_CTYPE=fr_BE.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages gscan2pdf depends on:
ii  imagemagick     7:6.3.7.9.dfsg2-1~lenny3 image manipulation programs
ii  libconfig-gener 2.40-1                   Generic Configuration Module
ii  libgtk2-ex-simp 0.50-1.1                 A simple interface to Gtk2's compl
ii  libgtk2-imagevi 0.04-1+b1                Perl bindings for the GtkImageView
ii  liblocale-gette 1.05-4                   Using libc functions for internati
ii  libpdf-api2-per 0.69-2                   create or modify PDF documents in 
ii  librsvg2-common 2.22.2-2lenny1           SAX-based renderer library for SVG
ii  libsane         1.0.19-23                API library for scanners
ii  libtiff-tools   3.8.2-11.2               TIFF manipulation and conversion t
ii  perlmagick      7:6.3.7.9.dfsg2-1~lenny3 Perl interface to the libMagick gr
ii  sane-utils      1.0.19-23                API library for scanners -- utilit

Versions of packages gscan2pdf recommends:
ii  djvulibre-bin            3.5.20-8+lenny1 Utilities for the DjVu image forma
ii  gocr                     0.45-2          A command line OCR
ii  libgtk2-ex-podviewer-per 0.17-2          Perl Gtk2 widget for displaying Pl
ii  sane                     1.0.14-7        scanner graphical frontends
ii  tesseract-ocr            2.03-2          Command line OCR tool
ii  unpaper                  0.3-1           post-processing tool for scanned p
ii  xdg-utils                1.0.2-6.1       desktop integration utilities from

gscan2pdf suggests no packages.

-- no debconf information



--- End Message ---
--- Begin Message ---
I'm closing this due to lack of response, assuming that updating
ImageMagick fixed things. Please reopen if you can still reproduce it.

Regards

Jeff

Attachment: signature.asc
Description: Digital signature


--- End Message ---

Reply via email to