Re: [Lazarus] Guessing the encoding of some text

José Mejuto via Lazarus Thu, 16 Nov 2017 04:03:21 -0800

El 16/11/2017 a las 11:25, Torsten Bonde Christiansen via Lazarus escribió:

Hi List.
I am reading some text of some .csv files, but the encoding of the filesis not always the same. In fact it may vary greatly from a lot ofeuropean encodings, UTF8 and asian encoding.


Hello,

Some years ago I wrote a code that must be trained which guess encodingand language. The problems are that it must be trained with large textsand of course the result are only statistical and only quite good overquite large texts (like 1000 chars or more) so it is not good for singlesentences.


If you are interested I can dive into old codes to catch it.


--

--
_______________________________________________
Lazarus mailing list
[email protected]
https://lists.lazarus-ide.org/listinfo/lazarus

Re: [Lazarus] Guessing the encoding of some text

Reply via email to