https://bugzilla.wikimedia.org/show_bug.cgi?id=41790

Xavier Combelle <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |[email protected]

--- Comment #4 from Xavier Combelle <[email protected]> ---
A side problem is that the unicode character don't need to be escaped per JSON
spec so

the bit could be rewrited in

<text xml:space="preserve"
bytes="8538">{&quot;label&quot;:{&quot;en&quot;:&quot;Barack
Obama&quot;,&quot;fr&quot;:&quot;Barack
Obama&quot;,&quot;ar&quot;:&quot;باراك
أوباما&quot;,&quot;ru&quot;:&quot;Барак
Обама&quot;,&quot;nb&quot;:&quot;Barack
Obama&quot;,&quot;it&quot;:&quot;Barack Obama&quot;,&quot;de&quot;:&quot;Barack
Obama&quot;,&quot;be-tarask&quot;:&quot;Барак
Абама&quot;,&quot;nan&quot;:&quot;Barack
Obama&quot;,&quot;ca&quot;:&quot;Barack
Obama&quot;},&quot;description&quot;:{&quot;en&quot;:&quot;President of the
United States of America

which is 1.2 smaller in byte

but the big win is ofcourse CDATA escaping

<text xml:space="preserve"
bytes="8538"><CDATA[[{"label":{"en":"Barack
Obama","fr":"Barack
Obama","ar":"باراك
أوباما","ru":"Барак
Обама","nb":"Barack
Obama","it":"Barack Obama","de":"Barack
Obama","be-tarask":"Барак
Абама","nan":"Barack
Obama","ca":"Barack
Obama"},"description":{"en":"President of the
United States of America

which is twice smaller

if CDATA way is chosen one should take care transforming ]]> sequences in
]]]]><CDATA[[> as explained here:
http://stackoverflow.com/questions/223652/is-there-a-way-to-escape-a-cdata-end-token-in-xml

-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are watching all bug changes.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to