ICU doesn't have a tool specifically to do that, but it does have API support for that (and character conversion), so it'd be very simple for you to write such a tool -- just opening the file (with whatever conversion is required) and scanning the contents. See http://oss.software.ibm.com/icu.
Mark __________ http://www.macchiato.com ◄ “Eppur si muove” ► ----- Original Message ----- From: "chuck clemens" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Wednesday, September 25, 2002 21:02 Subject: script detection program > Does anyone have a program or tool that can identify the scripts which the > characters in a UTF-16 encoded file belong to? > > I'd like a program that can scan the data and return script tag such as used > in http://www.unicode.org/unicode/reports/tr24/ > > so if I had a UTF-16 encoding file with latin and cyrillic characters, the > tool/program would scan the text and return the name "latn" and "cyrl" > > > > > _________________________________________________________________ > Send and receive Hotmail on your mobile device: http://mobile.msn.com > >

