The file command probably recognizes the UTF-8 Byte Order Mark, just as it
does other magic numbers.

The UTF-8 BOM is the three-byte sequence 0xEF 0xBB 0xBF, a signature that
indicates the encoding of the file is UTF-8.  If you have an application that
is reading the file and needs to know, read the first three bytes of the file
and act accordingly.
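
A minimal sketch of that check in Python, in case it helps. The function
name has_utf8_bom is only illustrative, not part of any library. Note that the
BOM is three bytes long and is optional in UTF-8, so a missing BOM does not
mean the file is not UTF-8; this only detects the signature.

```python
import codecs


def has_utf8_bom(path):
    """Return True if the file starts with the UTF-8 BOM (EF BB BF).

    has_utf8_bom is a hypothetical helper name used for illustration.
    """
    with open(path, "rb") as f:
        # Read only the first three bytes and compare them to the signature.
        return f.read(3) == codecs.BOM_UTF8
```

If the BOM is present, an application would normally skip those three bytes
before processing the rest of the file.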

FWIW:  Here's a useful URL: http://www.unicode.org/faq/utf_bom.html#22

Dave McLellan --Consulting Software Engineer - SPEA Engineering
EMC Corporation
228 South St. Mail Stop: 228 LL/AA-24
Hopkinton, MA 01748  USA
+1-508-249-1257 F: +1-508-497-8030  [EMAIL PROTECTED]


-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Peter BENKO,VSE IT
Sluzby,+421-55-610-2045,+421-903-855532
Sent: Wednesday, August 17, 2005 1:11 PM
To: openssl-users@openssl.org
Subject: Re: How to validate UTF-8 in a file

On Wed, Aug 17, 2005 at 10:01:26AM -0500, Silvia Gisela Pavon Velasco wrote:
>
> Hello,
> 
> I would like some advice about how can I validate that a file is in utf-8
> format. I have set the proper unix environment variables to work with the
> utf-8 format; but however, I have the need to validate if a file is in
> that format.
Under Linux, you can use the 'file' command to find out whether a file is
UTF-8.

Example:
file aaa.txt
aaa.txt: UTF-8 Unicode English text
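
If you need to do the check from a program rather than from the shell, one
possible approach (an assumption on my part, not what 'file' does internally)
is to decode the whole file strictly as UTF-8 and see whether it fails. A rough
Python sketch follows; is_valid_utf8 and chunk_size are illustrative names
only. Keep in mind that a pure ASCII file also passes, since ASCII is a subset
of UTF-8.

```python
import codecs


def is_valid_utf8(path, chunk_size=65536):
    """Return True if every byte sequence in the file decodes as UTF-8.

    is_valid_utf8 is a hypothetical helper; it reads the file in chunks
    so that large files do not have to fit in memory.
    """
    decoder = codecs.getincrementaldecoder("utf-8")(errors="strict")
    try:
        with open(path, "rb") as f:
            for chunk in iter(lambda: f.read(chunk_size), b""):
                # The incremental decoder buffers a multi-byte sequence
                # that is split across two chunks.
                decoder.decode(chunk)
        # final=True raises if the file ends in the middle of a sequence.
        decoder.decode(b"", final=True)
    except UnicodeDecodeError:
        return False
    return True
```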

> 
> Regards,
> 
> Silvia Pavón
>
> ______________________________________________________________________
> OpenSSL Project                                 http://www.openssl.org
> User Support Mailing List                    openssl-users@openssl.org
> Automated List Manager                           [EMAIL PROTECTED]
