Package: ocropus
Version: 0.2-1
Severity: normal
According to the hOCR specification [1]:
| The OCR system is required to indicate the following using meta tags
| in the header:
| * name=ocr-system content=“name version”
| * name=ocr-capabilities content=capabilities
However, the hOCR output produced by OCRopus does not include the ocr-system meta tag:
$ wget -q http://ocropus.googlecode.com/svn/trunk/data/pages/alice_1.png
$ ocroscript rec-tess alice_1.png | grep '<meta'
<meta name="ocr-capabilities" content="ocr_line ocr_page" />
<meta name="ocr-langs" content="en" />
<meta name="ocr-scripts" content="Latn" />
<meta name="ocr-microformats" content="" />
[1] https://docs.google.com/View?id=dfxcv4vc_67g844kf
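The check above could also be automated. The following is only a minimal sketch, not part of OCRopus: it assumes the hOCR output is available as a string, and the names REQUIRED_META, MetaCollector and missing_meta are hypothetical helpers made up for illustration. It uses only the Python standard library and reports which of the two meta tags required by the quoted passage are absent.

```python
from html.parser import HTMLParser

# Meta names that the quoted passage of the hOCR specification requires.
REQUIRED_META = ("ocr-system", "ocr-capabilities")

# The <meta> lines printed by the command shown in this report.
SAMPLE_OUTPUT = """
<meta name="ocr-capabilities" content="ocr_line ocr_page" />
<meta name="ocr-langs" content="en" />
<meta name="ocr-scripts" content="Latn" />
<meta name="ocr-microformats" content="" />
"""


class MetaCollector(HTMLParser):
    """Collect the name/content pairs of all <meta> tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        # Self-closing tags such as <meta ... /> are also routed here,
        # because the default handle_startendtag() calls handle_starttag().
        if tag == "meta":
            attributes = dict(attrs)
            if "name" in attributes:
                self.meta[attributes["name"]] = attributes.get("content", "")


def missing_meta(hocr_text):
    """Return the required meta names that do not occur in hocr_text."""
    collector = MetaCollector()
    collector.feed(hocr_text)
    collector.close()
    return [name for name in REQUIRED_META if name not in collector.meta]
```

Calling missing_meta(SAMPLE_OUTPUT) on the output quoted above returns ['ocr-system'], which matches the problem described in this report.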
-- System Information:
Debian Release: squeeze/sid
APT prefers unstable
APT policy: (900, 'unstable'), (500, 'experimental')
Architecture: i386 (i686)
Kernel: Linux 2.6.30-1-686 (SMP w/2 CPU cores)
Locale: LANG=C, LC_CTYPE=pl_PL.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Versions of packages ocropus depends on:
ii libc6 2.9-27 GNU C Library: Shared libraries
ii libedit2 2.11-20080614-1 BSD editline and history libraries
ii libgcc1 1:4.4.1-6 GCC support library
ii libjpeg62 6b-15 The Independent JPEG Group's JPEG
ii libpng12-0 1.2.40-1 PNG library - runtime
ii libstdc++6 4.4.1-6 The GNU Standard C++ Library v3
ii libtiff4 3.9.1-1 Tag Image File Format (TIFF) libra
ii ocropus-data 0.2-1 document analysis and OCR system
ii tesseract-ocr 2.04-1 Command line OCR tool
ocropus recommends no packages.
ocropus suggests no packages.
-- no debconf information
--
Jakub Wilk