ArielGlenn added a comment.
Perhaps there should be a maintenance script to delete such revisions; if we
have them, other folks might have them also. I can see such a script getting
the raw revision text for a specified id/batch of ids, checking that each one
is a self-reference of that sort, and then deleting it without serialization,
if there is a code path which avoids that.
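A rough sketch of that flow, only to make the idea concrete. Everything here is hypothetical: the function name and the three callables are placeholders rather than MediaWiki APIs, and the self-reference check is left to the caller because it depends on how the bad revisions are stored.

```php
<?php
// Hypothetical sketch; none of these names are existing MediaWiki APIs.
function deleteSelfReferencingRevisions(
    array $revIds,
    callable $fetchRawText,    // takes a revision id, returns the raw stored text or null
    callable $isSelfReference, // takes ( id, raw text ), returns bool
    callable $deleteRevision   // takes a revision id; assumed not to unserialize content
): array {
    $deleted = [];
    foreach ( $revIds as $id ) {
        $raw = $fetchRawText( $id );
        if ( $raw === null || !$isSelfReference( $id, $raw ) ) {
            continue; // leave anything that does not match untouched
        }
        $deleteRevision( $id );
        $deleted[] = $id;
    }
    return $deleted; // ids actually removed, for logging or review
}
```

Returning the list of deleted ids would let the script report what it did, which seems useful for a destructive operation.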
As for the dumps in general, they don't expand wikitext, except for the
abstracts dumps. For that case, in AbstractFilter.php we can likely add
something to writeClosePage that catches a fatal error of the
MWContentSerializationException type, logs the issue, and does
$xml .= Xml::element( 'abstract', [ 'serialization-error' => '' ] ) . "\n";
so that the script can continue on. Thoughts?
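A minimal standalone sketch of that catch-and-continue pattern, not the actual AbstractFilter code. The exception class below is a local stand-in so the snippet runs outside MediaWiki, and buildAbstractXml plus its two callables are hypothetical names; in the real filter the placeholder element would come from the Xml::element call quoted above.

```php
<?php
// Stand-in for MediaWiki's exception class so this runs on its own.
class MWContentSerializationException extends Exception {
}

// Hypothetical helper: returns the <abstract> XML for one page, or a
// placeholder element if the content cannot be unserialized.
function buildAbstractXml( callable $extractAbstract, callable $logger ): string {
    try {
        $text = $extractAbstract(); // may throw on bad revision content
        return '<abstract>' . htmlspecialchars( $text, ENT_QUOTES | ENT_XML1 )
            . "</abstract>\n";
    } catch ( MWContentSerializationException $e ) {
        // Log and emit a marker element instead of aborting the dump run.
        $logger( 'abstract skipped: ' . $e->getMessage() );
        return "<abstract serialization-error=\"\" />\n";
    }
}

$logger = function ( string $msg ) {
    error_log( $msg );
};
echo buildAbstractXml( function () {
    throw new MWContentSerializationException( 'bad content blob' );
}, $logger );
```

Catching only this one exception type means other failures would still stop the run, which is probably what we want.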
TASK DETAIL
https://phabricator.wikimedia.org/T217329
To: ArielGlenn
Cc: Addshore, WMDE-leszek, Lucas_Werkmeister_WMDE, hashar, ArielGlenn,
alaa_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, Lunewa, QZanden,
LawExplorer, _jensen, rosalieper, Jonas, gnosygnu, Wikidata-bugs, aude,
Lydia_Pintscher, Jdforrester-WMF, Mbch331, Jay8g, Krenair