On 01/16/2013 07:15 PM, Nishok Love wrote:
Thanks, Bob. Your command works but the same results occur. Further 
investigations here found this

When Pages is used to export as "Text", the resulting file may be of two kinds:

(1) if the document contained only characters included in Apple MacRoman 
charset, the file is a pure text file based on Apple MacRoman encoding.

(2) if the document contained extraneous characters the created text file take care of 
this feature and uses the UTF encoding (two bytes per character) and starts with the 
logical BOM: "FE FF".

which I've copied from the discussion on  
https://discussions.apple.com/message/9518841?messageID=9518841#9518841?messageID=9518841

Opening both files with TextEdit (which displays both of them correctly, ie 
without all those extra spaces), duplicating them and then watching the save 
options shows that one file (the one from Pages) is using UTF-16 whilst Word's 
Western (Mac OS Roman) export is in UTF-8. Using GetInfo I can now see that the 
UTF-16 file is twice the size of the other.

In short, text files are not as simple as they used to be!

So I'm still looking for a way for LiveCode to spot whether it's opening a file 
in UTF-8 or UTF-16 (or something else - aaarrgh!). Can I access the file 
header? read from file just gives me the data...

I could read the file, count the number of characters and how many of them are 
spaces and from that I could infer which format is being used. Probably this 
would be reliable for my purposes - just not very elegant!

Nishok



 Why not use RTF?

Richmond.

_______________________________________________
use-livecode mailing list
use-livecode@lists.runrev.com
Please visit this url to subscribe, unsubscribe and manage your subscription 
preferences:
http://lists.runrev.com/mailman/listinfo/use-livecode

Reply via email to