Bean49 has uploaded a new change for review. Change subject: (bug 13721) Truncate comment at 255 Bytes ......................................................................
(bug 13721) Truncate comment at 255 Bytes If a 2-byte or 3-byte char is cut off at the 255-byte boundary when stored, it becomes an invalid char. The XML dump outputter runs UTF-8 validation and turns the bad char into a valid U+FFFD ... which is 3 bytes of UTF-8, over the 255-char limit again. Truncate comment at 255 Bytes. Change-Id: Ieff7eba1523c2d89465141fe59a4aef1ca42c1f4 --- M src/org/mediawiki/importer/SqlWriter.java M src/org/mediawiki/importer/SqlWriter15.java 2 files changed, 13 insertions(+), 1 deletion(-) git pull ssh://gerrit.wikimedia.org:29418/mediawiki/tools/mwdumper refs/changes/32/30932/1 -- To view, visit https://gerrit.wikimedia.org/r/30932 To unsubscribe, visit https://gerrit.wikimedia.org/r/settings Gerrit-MessageType: newchange Gerrit-Change-Id: Ieff7eba1523c2d89465141fe59a4aef1ca42c1f4 Gerrit-PatchSet: 1 Gerrit-Project: mediawiki/tools/mwdumper Gerrit-Branch: master Gerrit-Owner: Bean49 <[email protected]> _______________________________________________ MediaWiki-commits mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/mediawiki-commits
