Re: [tesseract-ocr] tesseract output is of first page only

2019-08-08 Thread Zdenko Podobny
Provide exact information what you did. Make sure you use the latest tesseract and leptonica. Zdenko pi 9. 8. 2019 o 7:41 ilevy napísal(a): > I'm trying tesseract for the first time with a png of a multipage document > I saved out of a pdf (which itself was just an image). > > When I run tesse

[tesseract-ocr] Re: tesseract output is of first page only

2019-08-08 Thread ElGato ElMago
Is it possible to have multiple pages in a png file in the first place? 2019年8月9日金曜日 14時41分15秒 UTC+9 ilevy: > > I'm trying tesseract for the first time with a png of a multipage document > I saved out of a pdf (which itself was just an image). > > When I run tesseract, I get an output of the firs

[tesseract-ocr] tesseract output is of first page only

2019-08-08 Thread ilevy
I'm trying tesseract for the first time with a png of a multipage document I saved out of a pdf (which itself was just an image). When I run tesseract, I get an output of the first page, but that's all. I notice that there's a control-L (^L) at the end of the text file. How do I get the entire

Re: [tesseract-ocr] Trained data for E13B font

2019-08-08 Thread ElGato ElMago
Here's my sharing on GitHub. Hope it's of any use for somebody. https://github.com/ElMagoElGato/tess_e13b_training 2019年8月8日木曜日 9時35分17秒 UTC+9 ElGato ElMago: > > OK, I'll do so. I need to reorganize naming and so on a little bit. Will > be out there soon. > > 2019年8月7日水曜日 21時11分01秒 UTC+9 Mama

Re: [tesseract-ocr] Support for alto - option in Tesseract for linux

2019-08-08 Thread shree
I hope other members who use tesseract with python will provide the needed guidance. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@

Re: [tesseract-ocr] Support for alto - option in Tesseract for linux

2019-08-08 Thread Tommy Klausen
Thank you. Did you see my attached file above? Tommy -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view thi

Re: [tesseract-ocr] Re: Using my own detection instead of tesseract's

2019-08-08 Thread Timothy Snyder
On my project I detect and crop down to textline level on my own. Then, with PSM 13, I give tesseract a single line of text. On Wed, Aug 7, 2019 at 4:50 AM 'Nima Afshar' via tesseract-ocr < tesseract-ocr@googlegroups.com> wrote: > By detection i mean text detection,by the way your right i should'

Re: [tesseract-ocr] Support for alto - option in Tesseract for linux

2019-08-08 Thread Shree Devi Kumar
Yes, it should be similar to hocr and you can write it at end of command. examples: tesseract phototest.tif - alto tesseract phototest.tif - -c tessedit_create_alto=1 http://www.loc.gov/standards/alto/ns-v3#"; xmlns:xlink=" http://www.w3.org/1999/xlink"; xmlns:xsi=" http://www.w3.org/2001/XM

Re: [tesseract-ocr] Support for alto - option in Tesseract for linux

2019-08-08 Thread Tommy Klausen
Take look at the attached file. How can I implement ALTO in it and what will the command look like in teminal? Tommy torsdag 8. august 2019 12.04.23 UTC+2 skrev Tommy Klausen følgende: > > Ok. > > Because if a config file for alto exists (which didn`t for some reason in > the install) I can ju

Re: [tesseract-ocr] Support for alto - option in Tesseract for linux

2019-08-08 Thread Tommy Klausen
Ok. Because if a config file for alto exists (which didn`t for some reason in the install) I can just write the command with "alto" in the end, right? Can you give me the two different commands for reading an image (with and without the confg file)? torsdag 8. august 2019 11.51.27 UTC+2 skrev

Re: [tesseract-ocr] Support for alto - option in Tesseract for linux

2019-08-08 Thread Shree Devi Kumar
https://github.com/tesseract-ocr/tesseract/blob/master/tessdata/configs/alto You can use `alto` config file or use the config variable as part of command -c tessedit_create_alto=1 On Thu, Aug 8, 2019 at 2:59 PM Tommy Klausen wrote: > Hi. > > Is the ALTO config option supported in the last lin

[tesseract-ocr] Support for alto - option in Tesseract for linux

2019-08-08 Thread Tommy Klausen
Hi. Is the ALTO config option supported in the last linux version of Tesseract? I have managed to use the HOCR but not ALTO. Is it something I need to do with the config files? Tommy -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscr

Re: [tesseract-ocr] Re: Tesseract 4.1.0 released

2019-08-08 Thread Alex Cohn
I believe that there is no true need to change anything. To run unittest (and even training) on Android, it's enough to choose __ANDROID_API__=28 (or higher). Methinks that this is a reasonable restriction. The production version of the library can still be built with __ANDROID_API__=16 and ex