Package: ocropus
Version: 0.2-1
Severity: normal

According to the hOCR specification [1]:

| The OCR system is required to indicate the following using meta tags
| in the header:
| * name=ocr-system content=“name version”
| * name=ocr-capabilities content=capabilities

However, OCRopus does not include the ocr-system information:

$ wget -q http://ocropus.googlecode.com/svn/trunk/data/pages/alice_1.png

$ ocroscript rec-tess alice_1.png | grep '<meta'
        <meta name="ocr-capabilities" content="ocr_line ocr_page" />
        <meta name="ocr-langs" content="en" />
        <meta name="ocr-scripts" content="Latn" />
        <meta name="ocr-microformats" content="" />
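
For reference, the check can also be automated. What follows is only a rough sketch, not part of OCRopus or of any hOCR tooling: it assumes the two meta names quoted from the specification above are the complete required set, and the names REQUIRED_META, MetaNameCollector and missing_required_meta are hypothetical ones chosen for illustration. It uses only the Python standard library.

```python
from html.parser import HTMLParser

# Assumption: these are the two meta names the hOCR specification
# excerpt quoted in this report requires in the header.
REQUIRED_META = ("ocr-system", "ocr-capabilities")


class MetaNameCollector(HTMLParser):
    """Collect the name attribute of every <meta> element seen."""

    def __init__(self):
        super().__init__()
        self.names = set()

    def handle_starttag(self, tag, attrs):
        # Also called for self-closing tags such as <meta ... />.
        if tag == "meta":
            name = dict(attrs).get("name")
            if name:
                self.names.add(name)


def missing_required_meta(hocr_text):
    """Return the required meta names that are absent from hocr_text."""
    collector = MetaNameCollector()
    collector.feed(hocr_text)
    collector.close()
    return [name for name in REQUIRED_META if name not in collector.names]


if __name__ == "__main__":
    # The meta lines produced by ocroscript, as shown in this report.
    sample = """
        <meta name="ocr-capabilities" content="ocr_line ocr_page" />
        <meta name="ocr-langs" content="en" />
        <meta name="ocr-scripts" content="Latn" />
        <meta name="ocr-microformats" content="" />
    """
    print(missing_required_meta(sample))
```

Run against the four meta lines shown above, this should report ocr-system as the only missing name. It checks for presence of the tags only and does not validate their content attributes.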


[1] https://docs.google.com/View?id=dfxcv4vc_67g844kf

-- System Information:
Debian Release: squeeze/sid
  APT prefers unstable
  APT policy: (900, 'unstable'), (500, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.30-1-686 (SMP w/2 CPU cores)
Locale: LANG=C, LC_CTYPE=pl_PL.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages ocropus depends on:
ii  libc6                    2.9-27          GNU C Library: Shared libraries
ii  libedit2                 2.11-20080614-1 BSD editline and history libraries
ii  libgcc1                  1:4.4.1-6       GCC support library
ii  libjpeg62                6b-15           The Independent JPEG Group's JPEG
ii  libpng12-0               1.2.40-1        PNG library - runtime
ii  libstdc++6               4.4.1-6         The GNU Standard C++ Library v3
ii  libtiff4                 3.9.1-1         Tag Image File Format (TIFF) libra
ii  ocropus-data             0.2-1           document analysis and OCR system
ii  tesseract-ocr            2.04-1          Command line OCR tool

ocropus recommends no packages.

ocropus suggests no packages.

-- no debconf information

--
Jakub Wilk


