Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-08 Thread Aryeh Gregor
On Tue, Jun 8, 2010 at 10:28 AM, Paul Houle p...@ontology2.com wrote: As a person who has labored mightily to make sense of DBpedia, I think that one reason why varbinary is preferable to varchar in many applications in Wikimedia is that varchar() string comparisons are case-insensitive
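
To illustrate the point about comparisons, here is a minimal Python sketch of the difference between a byte-wise comparison (roughly how varbinary columns compare) and a case-insensitive one (roughly how varchar behaves under a case-insensitive collation). The function names are hypothetical, and `str.casefold()` is only a stand-in: it is an assumption for illustration and does not reproduce any actual MySQL collation.

```python
def binary_equal(a: str, b: str) -> bool:
    """Compare two strings byte by byte, as their UTF-8 encodings."""
    return a.encode("utf-8") == b.encode("utf-8")


def case_insensitive_equal(a: str, b: str) -> bool:
    """Compare two strings ignoring case (casefold is an approximation only)."""
    return a.casefold() == b.casefold()


if __name__ == "__main__":
    # Two titles that differ only in letter case.
    first, second = "Main Page", "main page"
    print(binary_equal(first, second))
    print(case_insensitive_equal(first, second))
```

Under the byte-wise comparison the two titles are distinct; under the case-insensitive one they collide, which matters for data where case is significant.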

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-08 Thread Gerard Meijssen
Hoi, The difference is that it actually does sort according to the CLDR. It would be really nice if we did that. Thanks, GerardM On 8 June 2010 21:38, Aryeh Gregor simetrical+wikil...@gmail.com wrote: On Tue, Jun 8, 2010 at 10:28 AM, Paul Houle

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-07 Thread Ryan Chan
Hi, On Mon, Jun 7, 2010 at 1:36 AM, Platonides platoni...@gmail.com wrote: No. The BMP problem is only present with the MySQL 4.1/5.0 UTF-8 option. It seems 5.0 and 5.1 support for UTF-8 are the same http://dev.mysql.com/doc/refman/5.0/en/charset-unicode.html
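
As background on the BMP problem mentioned here: MySQL's `utf8` character set in these versions stores at most three bytes per character, which covers only the Basic Multilingual Plane (code points up to U+FFFF). The following is a minimal Python sketch, with hypothetical helper names, of how one might check whether a string would fit under that limit. It only inspects code points and does not talk to a database.

```python
def fits_in_bmp(text: str) -> bool:
    """Return True if every character is in the Basic Multilingual Plane."""
    return all(ord(ch) <= 0xFFFF for ch in text)


def max_utf8_bytes_per_char(text: str) -> int:
    """Return the largest UTF-8 encoded length of any single character."""
    return max((len(ch.encode("utf-8")) for ch in text), default=0)


if __name__ == "__main__":
    samples = ["Wikipedia", "\u00e9", "\U00010348"]  # ASCII, Latin-1 letter, Gothic letter
    for sample in samples:
        print(repr(sample), fits_in_bmp(sample), max_utf8_bytes_per_char(sample))
```

Characters outside the BMP need four bytes in UTF-8, so they cannot be stored in a three-byte-per-character column, whereas a binary column stores the raw bytes as given.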

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-07 Thread Gerard Meijssen
Hoi, Do you mean to say that the way MySQL supports Unicode is the same, i.e. the way it is stored? Thanks, GerardM On 7 June 2010 17:17, Ryan Chan ryanchan...@gmail.com wrote: Hi, On Mon, Jun 7, 2010 at 1:36 AM, Platonides platoni...@gmail.com wrote: No. The BMP problem is only present with

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-07 Thread Aryeh Gregor
On Mon, Jun 7, 2010 at 11:17 AM, Ryan Chan ryanchan...@gmail.com wrote: It seems 5.0 and 5.1 support for UTF-8 are the same http://dev.mysql.com/doc/refman/5.0/en/charset-unicode.html http://dev.mysql.com/doc/refman/5.1/en/charset-unicode.html Yes, but the binary encoding doesn't use MySQL's

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-06 Thread Platonides
Ryan Chan wrote: Hello all, I remember that in the old days UTF-8 strings were stored as varbinary; is there a reason for the change to varchar(255) binary? Also, what are the default server/connection/client character set settings now? Thanks. MediaWiki supports both ways. Wikipedia still uses the

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-06 Thread Ryan Chan
Hello, On Sun, Jun 6, 2010 at 11:12 PM, Platonides platoni...@gmail.com wrote: MediaWiki supports both ways. Wikipedia still uses the MySQL 4-compatible options, and since MySQL chars only support the BMP, it isn't likely to change. It all depends on what you choose on install. Currently, the

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-06 Thread Platonides
Ryan Chan wrote: Hello, On Sun, Jun 6, 2010 at 11:12 PM, Platonides platoni...@gmail.com wrote: MediaWiki supports both ways. Wikipedia still uses the MySQL 4-compatible options, and since MySQL chars only support the BMP, it isn't likely to change. It all depends on what you choose on

[Wikitech-l] varchar(255) binary in tables.sql

2010-06-05 Thread Ryan Chan
Hello all, I remember that in the old days UTF-8 strings were stored as varbinary; is there a reason for the change to varchar(255) binary? Also, what are the default server/connection/client character set settings now? Thanks.