I have opened this as an issue at https://github.com/tesserac
t-ocr/tessdata/issues/77
You can provide additional feedback there.
@theraysmith is doing the training at Google. The examples you provide
will be helpful to him and improve future training.
ShreeDevi
spa and latin within best folders are moreless equivalent, there is no
significant difference, although there are several failures they are quite
reasonable. The one that provide real bad output are the official ones that
are automatically installed.
Do you need help training the data? (is a
Also see https://github.com/tesseract-ocr/tesseract/issues/221
On 29-Aug-2017 3:26 PM, "ShreeDevi Kumar" wrote:
> Check where the osd.traineddata and eng.trsineddata are installed.
> Download other trained data to same directory.
>
> On Linux, it is usually
Check where the osd.traineddata and eng.trsineddata are installed. Download
other trained data to same directory.
On Linux, it is usually /use/share/tessdata
On 29-Aug-2017 1:58 PM, "vikram charan" wrote:
> Hello,
> I'm working on project which base on scan many kind of
Hello,
I'm working on project which base on scan many kind of documents (like: -
Image that contain text, file, inquiry forms, documents etc.) . I'm using
Tesseract library to scan these documents. As mention on Github i followed
all step to setup Tesseract. I drag and drop tessdata folder
Take a look at improve quality page in wiki.
On 28-Aug-2017 6:16 PM, "Lada Tylich" wrote:
> Hi,
> I am confused that for the attached image it gives with parameter *-psm
> 7* result *88C. *It should detect such a picture, I guess.
> Am I missing something something?
>
>
Try first with
best/Latin.traineddata
that should handle text with diacritics
---
>>Pango suggested font Gandhari Unicode.
Use "Gandhari Unicode" within quotes as Font name
>>ERROR: Could not find training text file /usr/local/share/tessdata//
eng/eng.training_text
give script_dir
Hi,
Im new to tesseract and have a pdf file with diacritical marks. I tried to
run tesseract 4.0.0 with language eng. I see that it is not able to
recognize the text with diacritical marks. I found a font that can detect
diacritical mark.
Gandhari Unicode 5.1
8 matches
Mail list logo