Re: Improving Tika OCR

Kranthi Kiran G V Wed, 19 Apr 2017 06:12:57 -0700

Hello community,
I have successfully tested Tesseract 4.0 on images of various sizes,
orientations and lighting conditions. I will publish the results on a blog
in the next few days for you to have a look at.

Although I am able to reliably measure the clock time, accuracy, etc., I
have not been able to come up with a method to reliably measure the memory
consumed. Any pointers on this from the developer community would be
appreciated.
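
To make the question concrete, here is a minimal sketch of one possible
approach, assuming the OCR engine is run as a separate command-line process
on a POSIX system. It asks the operating system for the peak resident set
size of that one child process. The function name run_and_measure and the
returned dictionary keys are made up for illustration, and this is not
something I have validated against Tesseract itself.

```python
import os
import sys
import time


def run_and_measure(cmd):
    """Run cmd (a list of strings) and report wall time and peak memory.

    Relies on os.wait4, which returns the kernel's resource usage record
    for the child it waited on. POSIX only; not available on Windows.
    """
    start = time.perf_counter()
    pid = os.posix_spawnp(cmd[0], cmd, os.environ)
    _, status, usage = os.wait4(pid, 0)
    elapsed = time.perf_counter() - start

    # ru_maxrss is the peak resident set size: kilobytes on Linux,
    # bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return {
        "exit_code": os.WEXITSTATUS(status) if os.WIFEXITED(status) else None,
        "wall_seconds": elapsed,
        "peak_rss_mib": usage.ru_maxrss / divisor,
    }
```

A call might look like run_and_measure(["tesseract", "page.png", "stdout"]),
where page.png is a hypothetical test image. Peak RSS is only one notion of
"memory consumed"; it says nothing about average usage or GPU memory.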

The VGG group has released two models
<http://www.robots.ox.ac.uk/~vgg/research/text/#sec-models>. I have not been
able to test either of them yet because of backward-compatibility problems
with the MatConvNet version they use; I use a recent version of MATLAB. For
now, I am trying to get around this by updating parts of the code. I am also
contacting the maintainers of the repository for help with these issues.
I am hopeful that I will get the models running.

Addressing Luis' concern, we won't be building VGG's models into Tika's
source. We would only be helping the user deploy a REST API to which Tika's
OCR subsystem passes the images and from which it retrieves the recognized
text in the form of a string.
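
As a rough illustration of that arrangement, here is a minimal sketch using
only the Python standard library. Everything in it is an assumption made for
the example: the /ocr endpoint, the function names, and the recognize()
placeholder, which stands in for whatever model the user deploys. It is not
Tika code and not the VGG API; it only shows the shape of "image bytes in,
string out".

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


def recognize(image_bytes):
    """Placeholder for the real model call (hypothetical)."""
    return "recognized %d bytes" % len(image_bytes)


class OcrHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        if self.path != "/ocr":  # hypothetical endpoint name
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        text = recognize(self.rfile.read(length)).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; charset=utf-8")
        self.send_header("Content-Length", str(len(text)))
        self.end_headers()
        self.wfile.write(text)

    def log_message(self, format, *args):
        pass  # keep the sketch quiet


def start_server(port=0):
    """Serve on a background thread; port 0 lets the OS pick a free port."""
    server = ThreadingHTTPServer(("127.0.0.1", port), OcrHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server


def ocr_via_rest(host, port, image_bytes):
    """Client side: send the raw image bytes, get the text back."""
    conn = http.client.HTTPConnection(host, port, timeout=30)
    try:
        conn.request("POST", "/ocr", body=image_bytes,
                     headers={"Content-Type": "application/octet-stream"})
        return conn.getresponse().read().decode("utf-8")
    finally:
        conn.close()
```

The point of the design is that the model and its dependencies stay outside
Tika; the caller only needs to know a host, a port and a path.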

Thank you,
Kranthi Kiran GV,
CS 3/4 Undergrad,
NIT Warangal

On Tue, Apr 18, 2017 at 8:43 AM, Kranthi Kiran G V <
kkran...@student.nitw.ac.in> wrote:

> Hello Luis,
> Yes, Tesseract 4.0 is not yet a stable release. The VGG group's model has
> a 3-clause BSD license.
>
> I see it as a long-term effort that would help the Tika community
> experience near state-of-the-art OCR.
>
> This is an investigation to see whether this direction is worth pursuing.
> Thanks for expressing your views.
>
> Thank you,
> Kranthi Kiran GV
>
> On Apr 18, 2017 2:44 AM, "Luís Filipe Nassif" <lfcnas...@gmail.com> wrote:
>
> Hi Kranthi,
>
> That is an interesting comparison! But I think Tesseract 4.0 is still
> alpha? And do you know the VGG software license?
>
> Best,
> Luis
>
> On Apr 17, 2017 8:46 AM, "Kranthi Kiran G V" <
> kkran...@student.nitw.ac.in> wrote:
>
> Hello Tim Allison,
>
> I am currently working on improving Tika's OCR capabilities.
> Following a suggestion from Thamme Gowda (@thammegowda
> <https://issues.apache.org/jira/secure/ViewProfile.jspa?name=thammegowda>),
> I started working on a comparison of Tesseract 4.0's neural network
> <https://github.com/tesseract-ocr/tesseract/wiki/NeuralNetsInTesseract4.00>
> subsystem and the Visual Geometry Group's (VGG) models
> <http://www.robots.ox.ac.uk/~vgg/research/text/>.
>
> It would be great if you could provide the dataset for testing OCR that
> you mentioned in one of the issues.
>
> I will compare their running time, accuracy, memory consumption and
> invariance to lighting, orientation, etc., and then integrate the
> appropriate models into Tika's OCR.
>
> Thank you,
> Kranthi Kiran GV,
> CS 3/4 Undergrad,
> NIT Warangal
>
>
>
