On May 5, 2013, at 10:54 AM, Paolo Bonzini wrote: > Il 04/05/2013 18:51, Holger Hans Peter Freyther ha scritto: >> Hi, >> >> I am trying to convert/import all of Iliad (since the development had >> moved to Pharo) and I stumble up-on an issue I already had when I tried >> to convert Aida. >> >> This[1] monticello file in line 6417 starts to use UCS-4 for a string >> and then stops at some characters later. Importing the file using >> FileStream is failing in Pharo as well. > > And how does monticello file it in then?!? I'm adding the Pharo list. > > Paolo
It's a dirty little secret… The .st files stored in MC packages (at least written from Pharo/Squeak) are not Unicode-safe. The way it's concealed is that, by default, the binary format file in the MC package is used, from which WideStrings are read correctly. The way to "fix" it when porting to other dialects, is either a) Remove any non-latin1 from the source code b) Write a reader for the binary format (which isn't all that hard, there are a limited set of classes involved that needs to be converted to whatever the dialect-equivalent is) Cheers, Henry