hoo added a comment.

I just ran a patched version of dumpJson alongside an unmodified version (on mwdebug1001 and mwdebug1002). Although both servers have the same specs, the runs might not be fully comparable, and I only did one run of each.

I ran:

    sudo -u www-data timeout 1800 php /srv/mediawiki/multiversion/MWScript.php extensions/Wikidata/extensions/Wikibase/repo/maintenance/dumpJson.php --wiki wikidatawiki --sharding-factor 4 --shard 0 --snippet > /dev/null

The currently deployed version managed to dump 109223 entities, while the modified version managed to dump 194039 in the same time (the 1800-second timeout). That's a speedup of 77.7%!
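For anyone who wants to check the arithmetic, here is a minimal sketch in Python. It assumes both runs used the full 1800-second timeout window, and the function names are purely illustrative (they are not part of dumpJson or Wikibase).

```python
# Entity counts reported above; the window is the `timeout 1800` value.
BASELINE_ENTITIES = 109223   # currently deployed version
PATCHED_ENTITIES = 194039    # modified version
WINDOW_SECONDS = 1800        # assumption: both runs hit the timeout


def speedup_percent(baseline: int, patched: int) -> float:
    """Relative throughput gain of the patched run over the baseline, in percent."""
    return (patched / baseline - 1) * 100


def entities_per_second(count: int, seconds: int) -> float:
    """Average dump rate over the whole run."""
    return count / seconds


if __name__ == "__main__":
    gain = speedup_percent(BASELINE_ENTITIES, PATCHED_ENTITIES)
    print(f"speedup: {gain:.1f}%")
    print(f"baseline rate: {entities_per_second(BASELINE_ENTITIES, WINDOW_SECONDS):.2f} entities/s")
    print(f"patched rate:  {entities_per_second(PATCHED_ENTITIES, WINDOW_SECONDS):.2f} entities/s")
```

Since both counts come from a single run each, the result is only a rough indication.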

It should be noted that these results might be inflated, as entities with lower IDs probably have a much better hit rate in the entity cache than those with higher IDs (which are used less often).


TASK DETAIL
https://phabricator.wikimedia.org/T178247