Re: [MacRuby-devel] Strings, Encodings and IO

Manfred Stienstra Mon, 06 Apr 2009 23:15:27 -0700


On Apr 7, 2009, at 7:47 AM, Vincent Isambart wrote:


I have two small comments and a general statement about your essay;

A few functions of 1.9 may also be disabled (like force_encoding). Of
course it would be possible to add the full functionality of Ruby 1.9
strings on ByteString but it wouldn't be worth it.

The force_encoding method will be absolutely _vital_ to working withencodings in Ruby. Most library authors don't know anything aboutcharacter encoding and _will_ do the wrong things. And I'm not eventalking about libraries written for 1.8 which are totally unaware ofthe String changes. For example, in a fictional HTTP library thattotally doesn't exist today:


  response = HTTP.get('http://www.google.com')
  response.body.encoding #=> #<Encoding:US-ASCII>

Even though the headers clearly say: "Content-Type: text/html;charset=UTF-8". So we need force_encoding to fix these problems. Eventhe library author probably needs force_encoding method becausesomewhere deep down in the library there might be C / Obj-C code thatreturns a byte string to Ruby without specifying the encoding.

Ruby 1.9 also has default code and default external encodings
different depending on the environment, but I think always both of
them set to UTF-8 would be the best. (we may even completely ignore
the encoding pragmas in the code not to complicate the parser).

Also, a no-go. ERB uses this pragma to signal what the encoding of thetemplate is, encoding will break when you ignore this.

Finally; I don't think it's a good idea to discuss this a great lengthwithout actual code but in order to write a compatible implementationmost (if not all) of the String awkwardness will have to be implemented.


Manfred
_______________________________________________
MacRuby-devel mailing list
MacRuby-devel@lists.macosforge.org
http://lists.macosforge.org/mailman/listinfo.cgi/macruby-devel

Re: [MacRuby-devel] Strings, Encodings and IO

Reply via email to