| hoo added a comment. |
Using sql wikidatawiki "SELECT page_title FROM (SELECT page_title, page_namespace FROM page ORDER BY page_id DESC LIMIT 1100000) AS 1Mpages WHERE page_namespace = 0 ORDER BY rand() LIMIT 2500" I got a random list of 2500 recently created Items.
I just re-did the above tests with --list-file /tmp/2500-ids.txt (no shard set):
Non-optimized nt vs. ttl:
(38.450+38.485+39.022+38.650+38.813+38.278)/(31.397+30.718+30.874+30.857+31.300+31.566) = 1.240
Optimized nt vs. ttl:
(35.546+34.241+34.222+34.829+34.638+34.891)/(30.747+30.314+30.968+30.761+30.607+31.159) = 1.129
Ratio non-optimized/ optimized version:
(38.450+38.485+39.022+38.650+38.813+38.278)/(35.546+34.241+34.222+34.829+34.638+34.891) = 1.112
Ratio first/ second ttl run (in a perfect world this would be 1):
(31.397+30.718+30.874+30.857+31.300+31.566)/(30.747+30.314+30.968+30.761+30.607+31.159) = 1.012
Cc: thiemowmde, Stashbot, gerritbot, Aklapper, daniel, Lydia_Pintscher, aude, hoo, ArielGlenn, GoranSMilovanovic, QZanden, Izno, Wikidata-bugs, Svick, Mbch331, jeremyb
_______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
