https://bugzilla.wikimedia.org/show_bug.cgi?id=29102

--- Comment #20 from Ralf Baechle <[email protected]> 2011-12-09 23:12:30 UTC 
---
Something had already gone wrong with the non-ASCII characters in my wiki. 
Some of öäüßÖÄÜé were correct, other instances of the same characters were
corrupted.  So when I ran into the upgrade issue, I decided to go for the
sledgehammer solution.  With the mixed charsets already in the table I
converted the table from the old charset which was latin1 to UTF-8 stored in
binary format by executing something like

  ALTER TABLE cur CONVERT TO CHARACTER SET utf8;
  ALTER TABLE cur CONVERT TO CHARACTER SET binary;

on every table.  This may have done bad things to the non-latin1 (ISO-8859-1)
characters in the table but that was hard to avoid at this stage.

The ALTER commands will fail on a bunch of tables that have indexes on columns
that need to be converted between charsets.  I dealt with those cases by
dropping and re-creating the index.  I'm sure the SQL wizard are screaming now
and I'm interested in there prefered solution.

Anyway, after this procedure the Mediawiki upgradeskript ran successful; I then
went through all pages manually fixing up the damaged non-ASCII characters. 
Fortunately I had to do this part only for one of my 6 affected wikis and that
was small wiki only.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to