hoo added a comment.

Using sql wikidatawiki "SELECT page_title FROM (SELECT page_title, page_namespace FROM page ORDER BY page_id DESC LIMIT 1100000) AS 1Mpages WHERE page_namespace = 0 ORDER BY rand() LIMIT 2500" I got a random list of 2500 recently created Items.

I just re-did the above tests with --list-file /tmp/2500-ids.txt (no shard set):

Non-optimized nt vs. ttl:
(38.450+38.485+39.022+38.650+38.813+38.278)/(31.397+30.718+30.874+30.857+31.300+31.566) = 1.240

Optimized nt vs. ttl:
(35.546+34.241+34.222+34.829+34.638+34.891)/(30.747+30.314+30.968+30.761+30.607+31.159) = 1.129

Ratio non-optimized/ optimized version:
(38.450+38.485+39.022+38.650+38.813+38.278)/(35.546+34.241+34.222+34.829+34.638+34.891) = 1.112

Ratio first/ second ttl run (in a perfect world this would be 1):
(31.397+30.718+30.874+30.857+31.300+31.566)/(30.747+30.314+30.968+30.761+30.607+31.159) = 1.012


TASK DETAIL
https://phabricator.wikimedia.org/T176844

EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: hoo
Cc: thiemowmde, Stashbot, gerritbot, Aklapper, daniel, Lydia_Pintscher, aude, hoo, ArielGlenn, GoranSMilovanovic, QZanden, Izno, Wikidata-bugs, Svick, Mbch331, jeremyb
_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to