[tesseract-ocr] Re: Tesseract security considerations

2017-01-13 Thread James R Barlow
On Thursday, December 8, 2016 at 9:39:20 PM UTC-8, José Luis Mendoza Azanza wrote:
> I am integrating Tesseract into an application, but I have some questions before going further with the process.
> I think every application should have security filters and considerations in order to avoid

Re: [tesseract-ocr] Re: pdf -> searchable PDF

2017-01-13 Thread ShreeDevi Kumar
Please see https://github.com/tesseract-ocr/tesseract/issues/83 and other PDF-related issues in the GitHub repo with similar discussion.
- excuse the brevity, sent from mobile
On 13-Jan-2017 10:15 PM, "James R Barlow" wrote:
> Tesseract cannot rasterize PDFs. It is fairly

[tesseract-ocr] Re: pdf -> searchable PDF

2017-01-13 Thread James R Barlow
Tesseract cannot rasterize PDFs. It is fairly straightforward to write a PDF like Tesseract does, but very complex to rasterize one. Programs like OCRmyPDF (which I develop) use Ghostscript, Tesseract, and other tools to handle PDF to searchable PDF conversion.
On Tuesday, January 10, 2017 at 9:34:57 PM
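For anyone wanting to see the rough shape of such a pipeline, here is a minimal Python sketch: Ghostscript rasterizes each page to a PNG, then Tesseract writes a one-page searchable PDF per image. This is only an illustration and not how OCRmyPDF is implemented. The function names, the 300 dpi default, and the `page-%03d.png` naming are assumptions made for the example, and it assumes `gs` and `tesseract` are on the PATH. Merging the per-page PDFs back into one file is left out.

```python
import subprocess
from pathlib import Path


def ghostscript_cmd(pdf_path, out_pattern, dpi=300):
    """Build a Ghostscript command that rasterizes each PDF page to a PNG.

    out_pattern should contain a printf-style page counter such as %03d.
    """
    return [
        "gs", "-dSAFER", "-dBATCH", "-dNOPAUSE",
        "-sDEVICE=png16m",               # 24-bit colour PNG output device
        f"-r{dpi}",                      # rasterization resolution in dpi
        f"-sOutputFile={out_pattern}",
        str(pdf_path),
    ]


def tesseract_cmd(image_path, out_base):
    """Build a Tesseract command that writes <out_base>.pdf with a text layer."""
    return ["tesseract", str(image_path), str(out_base), "pdf"]


def pdf_to_searchable_pages(pdf_path, work_dir):
    """Hypothetical helper: rasterize a PDF, OCR each page, return the page PDFs."""
    work_dir = Path(work_dir)
    work_dir.mkdir(parents=True, exist_ok=True)

    # Step 1: Ghostscript does the rasterization that Tesseract cannot do.
    pattern = work_dir / "page-%03d.png"
    subprocess.run(ghostscript_cmd(pdf_path, pattern), check=True)

    # Step 2: Tesseract turns each page image into a searchable PDF.
    outputs = []
    for image in sorted(work_dir.glob("page-*.png")):
        out_base = image.with_suffix("")          # e.g. work_dir/page-001
        subprocess.run(tesseract_cmd(image, out_base), check=True)
        outputs.append(out_base.with_suffix(".pdf"))
    return outputs
```

Calling `pdf_to_searchable_pages("scan.pdf", "work")` would leave one searchable PDF per page in `work/`. A real tool also has to deal with existing text layers, page rotation, image quality, and reassembling the pages, which is where most of the complexity lives.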