Re: [Bug 623438] Re: Font size not correct in merged sandvich PDF

2011-08-10 Thread Igor Filippov
To be fair there are also OCRAD, GOCR, and Tesseract. Igor On Wed, 2011-08-10 at 08:53 +, Martin Wildam wrote: On Mon, Aug 8, 2011 at 09:40, Jussi Pakkanen jussi.pakka...@canonical.com wrote: I'd like to remind everyone that Cuneiform is currently unmaintained. No-one is working on

Re: [Cuneiform] [Bug 623438] Re: Font size not correct in merged sandvich PDF

2010-09-10 Thread Igor Filippov
Martin, Have you tried other OCR engines which can generate hOCR output? I'm not sure all of them can but here are a few free and open source OCR engines I've run on Linux: GOCR OCRAD Tesseract Does this issue affect them as well? Best, Igor On Fri, 2010-09-10 at 11:45 +, Martin Wildam

Re: [Cuneiform] [Bug 623438] Re: Font size not correct in merged sandvich PDF

2010-09-10 Thread Igor Filippov
Martin, I'm not using this functionality myself, so you most likely know best, but OCRAD is producing ORF output with -x command-line option. According to the README ORF file will contain bounding boxes for OCRed characters and lines. Igor On Fri, 2010-09-10 at 17:52 +, Martin Wildam wrote: