Bean49 has uploaded a new change for review.

Change subject: (bug 13721) Truncate comment at 255 Bytes
......................................................................

(bug 13721) Truncate comment at 255 Bytes

If a 2-byte or 3-byte char is cut off at the 255-byte boundary when
stored, it becomes an invalid char. The XML dump outputter runs UTF-8
validation and turns the bad char into a valid U+FFFD ... which is 3 bytes
of UTF-8, over the 255-char limit again. Truncate comment at 255 Bytes.

Change-Id: Ieff7eba1523c2d89465141fe59a4aef1ca42c1f4
---
M src/org/mediawiki/importer/SqlWriter.java
M src/org/mediawiki/importer/SqlWriter15.java
2 files changed, 13 insertions(+), 1 deletion(-)


  git pull ssh://gerrit.wikimedia.org:29418/mediawiki/tools/mwdumper 
refs/changes/32/30932/1
--
To view, visit https://gerrit.wikimedia.org/r/30932
To unsubscribe, visit https://gerrit.wikimedia.org/r/settings

Gerrit-MessageType: newchange
Gerrit-Change-Id: Ieff7eba1523c2d89465141fe59a4aef1ca42c1f4
Gerrit-PatchSet: 1
Gerrit-Project: mediawiki/tools/mwdumper
Gerrit-Branch: master
Gerrit-Owner: Bean49 <[email protected]>

_______________________________________________
MediaWiki-commits mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/mediawiki-commits

Reply via email to