Re: Improving Tika OCR

2017-04-17 Thread Kranthi Kiran G V
Hello Luis, Yes, tesseract 4.0 is not yet a stable release. VGG group's model has a 3-clause BSD license. I see it as a long term effort which would help the Tika's community experience near state of art OCR. This is an investigation into it to see if we can try out this direction. Thanks for

Re: Improving Tika OCR

2017-04-17 Thread Luís Filipe Nassif
Hi Kranthi, That is an interesting comparison! But I think Tesseract 4.0 is still alpha? And do you know the VGG software license? Best, Luis Em 17 de abr de 2017 8:46 AM, "Kranthi Kiran G V" < kkran...@student.nitw.ac.in> escreveu: Hello Tim Allison, I am currently working on improving

Re: 1.15?

2017-04-17 Thread David Meikle
+1 from me too. Cheers, Dave On 13 April 2017 at 13:08, Konstantin Gribov wrote: > Preliminary +1 from me, I'll the a closer look this weekend > > чт, 13 апр. 2017, 0:00 Allison, Timothy B. : > > > All, > > POI is voting on rc1 of the next release.

Re: Improving Tika OCR

2017-04-17 Thread Thamme Gowda
Thanks, Kranthi, for volunteering to do this evaluation :-) Best, Thamme -- Thamme Gowda TG | @thammegowda ~Sent via somebody's IMAP server On Apr 17, 2017 4:46 AM, "Kranthi Kiran G V" wrote: Hello Tim Allison, I am currently working on improving Tika's OCR

Improving Tika OCR

2017-04-17 Thread Kranthi Kiran G V
Hello Tim Allison, I am currently working on improving Tika's OCR capabilities. After suggestion from Thamme Gowda (@thammegowda ), I started to work on comparison of Tesseract 4.0's neural network