As far as I know, no one has replicated the from-scratch LSTM training done
by Ray.
On Wed, Mar 25, 2020, 01:35 Essam Zaky wrote:
> Hi Dears,
>
> I would like to build *.traineddata from scratch, especially for English and
> Arabic.
>
> So let's talk about English as an example.
> my question how to
Hi Dears,
I would like to build *.traineddata from scratch, especially for English and
Arabic.
So let's talk about English as an example.
My question is: how do I prepare the fonts folder?
I read the
https://github.com/tesseract-ocr/tesseract/blob/master/src/training/language-specific.sh
file
I found the
Please see
https://github.com/Shreeshrii/tesstrain-xsa/blob/master/langdata/latin2unicode.sh
It has sed substitution commands for going from transliteration to Unicode
for xsa, based on mapping shown in Wikipedia and other web pages.
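For anyone who prefers Python to sed, the same idea (replace each transliteration token with its Unicode character) could be sketched roughly as below. This is only an illustration, not the content of the linked script: the `TABLE` entries and the `build_transliterator` helper are hypothetical, and the code points are my assumption of a few Old South Arabian letters. Take the real mapping from the script or from the Unicode chart.

```python
import re

# Hypothetical, illustrative mapping only; NOT copied from latin2unicode.sh.
# Keys are Latin transliteration tokens, values are assumed Unicode code points.
TABLE = {
    "h": "\U00010A60",
    "l": "\U00010A61",
    "m": "\U00010A63",
    "s1": "\U00010A6A",
    "s2": "\U00010A66",
}

def build_transliterator(table):
    """Return a function that replaces every key of `table` found in a string."""
    # Try longer tokens first so that "s1" is matched before a shorter token.
    keys = sorted(table, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(k) for k in keys))
    # Characters that are not in the table are left unchanged.
    return lambda text: pattern.sub(lambda m: table[m.group(0)], text)

to_unicode = build_transliterator(TABLE)
```

Calling `to_unicode(line)` on each line of the transliterated training text would then give the Unicode version, much like running the sed commands over the file.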
On Mon, Mar 23, 2020, 01:58 Wincent Balin wrote:
> Hi
>
> How come all the characters appearing are Unicode replacement characters? Did
> I misconfigure something?
>
This could be a locale or encoding issue. The training text needs to be a
Unicode text file. I open it in Notepad++ on Windows 10 and encode it as
UTF-8. I run the training remotely on an Ubuntu machine.
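A quick way to rule out the encoding side is to check the text file before training. Here is a minimal Python sketch, assuming the training text is a single file on disk; the `check_utf8` name is just something I made up for this example.

```python
from pathlib import Path

REPLACEMENT = "\ufffd"  # U+FFFD, the Unicode replacement character

def check_utf8(path):
    """Return (is_valid_utf8, replacement_count) for the file at `path`."""
    data = Path(path).read_bytes()
    try:
        data.decode("utf-8")  # strict decoding raises on invalid bytes
        valid = True
    except UnicodeDecodeError:
        valid = False
    # Decode leniently so that invalid bytes show up as U+FFFD and can be counted.
    text = data.decode("utf-8", errors="replace")
    return valid, text.count(REPLACEMENT)
```

If the first value is False, the file was probably saved in another encoding. If it is True but the count is above zero, the replacement characters were already written into the file by an earlier conversion.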
>
> Is the
Hello,
I am using the following versions of the software:
tesseract 4.0.0
leptonica-1.76.0
libjpeg 9c : libpng 1.6.37 : libtiff 4.0.10 : zlib 1.2.11 : libwebp 1.0.1
: libopenjp2 2.3.0
Found AVX512BW
Found AVX512F
Found AVX2
Found AVX
Found SSE
I am trying to convert a .tif into a PDF within a