I am working on training data for the Hebrew language when I have free
time. currently on v2.04
Is the training process different on v3.0 ? if so where can i read about it
?


Roi


On Fri, Dec 4, 2009 at 7:19 PM, Sven Pedersen <[email protected]>wrote:

> That list of languages possibly supported in 3.0 release is currently:
>
> bul -- Bulgarian
> cat -- Catalan / Valencian
> ces -- Czech
> dan -- Danish
> deu -- German
> ell -- Greek
> eng -- English
> fin -- Finish
> fra -- French
> hun -- Hungarian
> ind -- Indonesian / Bahasa Indonesia / Malay?
> ita -- Italian
> lav -- Latvian
> lit -- Lithuanian
> nld -- Dutch
> nor -- Norwegian
> pol -- Polish
> por -- Portuguese
> ron -- Romanian
> rus -- Russian
> slk -- Slovak
> slv -- Slovenian
> spa -- Spanish
> srp -- Serbian / Croatian
> swe -- Swedish
> tgl -- Tagalog
> tur -- Turkish
> ukr -- Ukrainian
> vie -- Vietnamese
>
> In addition, there is support from some community projects for various
> Indian languages with Indic scripts, and I believe someone was working
> on Chinese. A few of us are interested in seeing Arabic and Hebrew
> support, but there is a need (mentioned in the FAQ, I believe) for a
> de-italicizing algorithm to be implemented and some other clever
> stuff...
> --Sven
>
>
> On Fri, Dec 4, 2009 at 11:58 AM, nguyenq <[email protected]> wrote:
> > If you look in the tessdata folder at
> http://tesseract-ocr.googlecode.com/svn/trunk/
> > , there currently are more than two dozens.
> >
> > On Dec 2, 7:44 am, andmor <[email protected]> wrote:
> >> Hi,
> >>
> >> Which languages will be supported with the 3.0 release ?
> >> On the TessearctProjects page, Ray says that Google are working on
> >> many for the next release.
> >> Any idea when these extra languages will be availalbe ?
> >>
> >> Thanks in advance,
> >> Andrew
> >
> > --
> >
> > You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> > To post to this group, send email to [email protected].
> > To unsubscribe from this group, send email to
> [email protected]<tesseract-ocr%[email protected]>
> .
> > For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en.
> >
> >
> >
>
> --
>
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to
> [email protected]<tesseract-ocr%[email protected]>
> .
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en.
>
>
>

--

You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.


Reply via email to