Re[2]: [PHP-DEV] unserialize() & unicode issues

val khokhlov Wed, 14 Sep 2005 06:53:42 -0700

Hello Tex,

Wednesday, September 14, 2005, 1:07:26 PM, you wrote:


TT> Bear with me, I don't know about how PHP uses the serialized info, but if
TT> your goal is to minimize the overhead for unicode data when
         no, now we're discussing the following issues:
1) should serialize format be portable between unicode and non-uniocde modes
   (Andrei says that no, for example)?
2) if yes, should it contain data in the fixed encoding (unicode), so that
   serialize() in non-unicode mode will do conversion, or in the script
   encoding (so, no conversion in serialize() is required, but we need to
   specify data encoding somehow)?

         now we have following ways of implementing serialize() in php6:
1) non-portable way: no conversion at all; serialize data created in
   unicode mode won't be unserialized in non-unicode mode and vice versa
2) quasi-portable way: no conversion at all; we will just unserialize what
   we get; but if we unserialize data in a script with different encoding,
   it can malfunction
3) unicode way: store all encoding-specific data in unicode (so, convert
   them if serialize() is called in non-unicode way); downconvert to script
   encoding in unserialize() is called in non-unicode way (so, you can see a
   double conversion in this particular case)
4) varying encoding way: store all data as-is, but also store script
   encoding; if we call unserialize() in a script with different encoding,
   convert them (double conversion can still occur, but not so often)

ps: sorry for mistakes, i'm writing in a hurry


-- 
Best regards,
 val                            mailto:[EMAIL PROTECTED]

-- 
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: http://www.php.net/unsub.php

Re[2]: [PHP-DEV] unserialize() & unicode issues

Reply via email to