On Fri, Sep 11, 2009 at 5:06 PM, Chris Hostetter <hossman_luc...@fucit.org> wrote:
> I must be missunderstanding something still ... based on your description, > it sounds like it shouldn't matter if the encoder knows that it's one > logical character or not, either way it should wind up outputing the same > number of bytes.... yes, the # of bytes is different: 6 bytes versus 4 bytes in UTF-8 -- Robert Muir rcm...@gmail.com