Package: gscan2pdf
Version: 0.9.25-1
Severity: important

When I import a TIFF file created by scanimage, tesseract can OCR it.
But if I crop the image (which is necessary for multi-column text), I get
the following message in the console from which I started gscan2pdf:

Tesseract Open Source OCR Engine
 check_legal_image_size:Error:Only 1,2,4,5,6,8 bpp are supported:16
 *** unhandled exception in callback:
 ***   Error: cannot open /tmp/NLA9Ssq5aV/DWUea9s5Xc.txt
 ***  ignoring at /usr/bin/gscan2pdf line 1114.

Saving the same cropped image as a TIFF file and running tiffinfo on it
gives:

g...@fantasio:~$ tiffinfo texte3.tif 
  TIFF Directory at offset 0xec7f8 (968696)
  Image Width: 961 Image Length: 504
  Resolution: 300, 300 pixels/inch
  Position: 0.01, 1.04333
  Bits/Sample: 16
  Compression Scheme: None
  Photometric Interpretation: min-is-black
  FillOrder: msb-to-lsb
  Orientation: row 0 top, col 0 lhs
  Samples/Pixel: 1
  Rows/Strip: 4
  Planar Configuration: single image plane
  DocumentName: /tmp/NLA9Ssq5aV/8SVdkwLXUI.tif
  ImageDescription:  SANE data follows 

Of course, tesseract cannot process this file when run on its own either.
The choice of compression scheme (or none) makes no difference to tesseract.
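
The tiffinfo output above reports Bits/Sample: 16, which is not among the
depths listed in the tesseract error message. A minimal sketch of that check
follows. It assumes a single-sample (greyscale) image as in this report, so
that Bits/Sample equals bits per pixel. The function names are hypothetical
and are not part of gscan2pdf. The ImageMagick "-depth 8" command is only a
possible workaround that I have not tested against this file.

```python
import re

# Depths listed in the tesseract error message quoted in this report.
SUPPORTED_BPP = {1, 2, 4, 5, 6, 8}


def bits_per_sample(tiffinfo_output):
    """Return the Bits/Sample value found in tiffinfo text, or None."""
    match = re.search(r"^\s*Bits/Sample:\s*(\d+)", tiffinfo_output, re.MULTILINE)
    return int(match.group(1)) if match else None


def tesseract_can_read(tiffinfo_output):
    """True if the reported depth is in SUPPORTED_BPP.

    Assumes Samples/Pixel is 1; returns False when no depth is reported.
    """
    return bits_per_sample(tiffinfo_output) in SUPPORTED_BPP


def depth_fix_command(src, dst):
    """Build (but do not run) an ImageMagick command writing src at 8 bits."""
    return ["convert", src, "-depth", "8", dst]


if __name__ == "__main__":
    sample = """\
  Image Width: 961 Image Length: 504
  Bits/Sample: 16
  Samples/Pixel: 1
"""
    print(bits_per_sample(sample))
    print(tesseract_can_read(sample))
    print(" ".join(depth_fix_command("texte3.tif", "texte3-8bit.tif")))
```

If the workaround holds, cropping inside gscan2pdf could write the image at
the depth of the original scan instead of 16 bits.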

Once cropped, the image cannot be processed by unpaper either.

Error message on the console (the first line is French for "Warning:
unrecognized image format"):

Avertissement : Format d'image non reconnu at /usr/bin/gscan2pdf line 1394.
 *** unhandled exception in callback:
 ***   `' is not of type Gtk2::Gdk::Pixbuf at /usr/share/perl5/Gtk2/Ex/Simple/TiedCommon.pm line 65.
 ***  ignoring at /usr/bin/gscan2pdf line 1114.

I have not tested other features of gscan2pdf; I only needed a simple GUI
to run OCR with tesseract.

Regards,

G.Vandemoortele  

-- System Information:
Debian Release: 5.0.3
  APT prefers stable
  APT policy: (500, 'stable')
Architecture: i386 (i686)

Kernel: Linux 2.6.26-1-686 (SMP w/1 CPU core)
Locale: LANG=fr_BE.UTF-8, LC_CTYPE=fr_BE.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages gscan2pdf depends on:
ii  imagemagick     7:6.3.7.9.dfsg2-1~lenny3 image manipulation programs
ii  libconfig-gener 2.40-1                   Generic Configuration Module
ii  libgtk2-ex-simp 0.50-1.1                 A simple interface to Gtk2's compl
ii  libgtk2-imagevi 0.04-1+b1                Perl bindings for the GtkImageView
ii  liblocale-gette 1.05-4                   Using libc functions for internati
ii  libpdf-api2-per 0.69-2                   create or modify PDF documents in 
ii  librsvg2-common 2.22.2-2lenny1           SAX-based renderer library for SVG
ii  libsane         1.0.19-23                API library for scanners
ii  libtiff-tools   3.8.2-11.2               TIFF manipulation and conversion t
ii  perlmagick      7:6.3.7.9.dfsg2-1~lenny3 Perl interface to the libMagick gr
ii  sane-utils      1.0.19-23                API library for scanners -- utilit

Versions of packages gscan2pdf recommends:
ii  djvulibre-bin            3.5.20-8+lenny1 Utilities for the DjVu image forma
ii  gocr                     0.45-2          A command line OCR
ii  libgtk2-ex-podviewer-per 0.17-2          Perl Gtk2 widget for displaying Pl
ii  sane                     1.0.14-7        scanner graphical frontends
ii  tesseract-ocr            2.03-2          Command line OCR tool
ii  unpaper                  0.3-1           post-processing tool for scanned p
ii  xdg-utils                1.0.2-6.1       desktop integration utilities from

gscan2pdf suggests no packages.

-- no debconf information


