ArielGlenn added a comment.
I looked at the author list. But even with around 2000 authors, if we gave
each one of them 80 bytes (plenty for first name, last name and an id) we'd
have 160k of data, not 1.5 megabytes.
However, a single author is represented this way:
{"mainsnak":
{
"snaktype":"value",
"property":"P2093",
"hash":"daf91abd8b2cac6e9057fd945ca97ff06624",
"datavalue":
{
"value":"G. Akimoto",
"type":"string"
}
},
"type":"statement",
"qualifiers":
{
"P1545":
[
{
"snaktype":"value",
"property":"P1545",
"hash":"d8baedaa705c5d31356a6c9dd39d4b5b185d1882",
"datavalue":
{
"value":"28",
"type":"string"
}
}
]
},
"qualifiers-order": ["P1545"],
"id":"Q57009452$47838677-5214-47D0-AB3B-9F1F35EE82BB",
"rank":"normal"
},
(spaces added for readability). That's over 400 bytes per author, so it's no
wonder the article size is ballooning.
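For anyone who wants to check the arithmetic, here is a rough sketch in Python
that measures the statement quoted above once the added whitespace is stripped,
and compares it with the 80-bytes-per-author estimate. The variable names are
only illustrative, the author count of 2000 is the approximate figure mentioned
earlier, and the projection assumes every author statement is about the same
size as this one.

```python
import json

# The single author statement quoted above, copied field for field.
statement = {
    "mainsnak": {
        "snaktype": "value",
        "property": "P2093",
        "hash": "daf91abd8b2cac6e9057fd945ca97ff06624",
        "datavalue": {"value": "G. Akimoto", "type": "string"},
    },
    "type": "statement",
    "qualifiers": {
        "P1545": [
            {
                "snaktype": "value",
                "property": "P1545",
                "hash": "d8baedaa705c5d31356a6c9dd39d4b5b185d1882",
                "datavalue": {"value": "28", "type": "string"},
            }
        ]
    },
    "qualifiers-order": ["P1545"],
    "id": "Q57009452$47838677-5214-47D0-AB3B-9F1F35EE82BB",
    "rank": "normal",
}

# Serialize without any optional whitespace, then count UTF-8 bytes.
compact = json.dumps(statement, separators=(",", ":"), ensure_ascii=False)
per_author = len(compact.encode("utf-8"))

authors = 2000           # "around 2000 authors" (approximate)
naive_per_author = 80    # the back-of-the-envelope budget per author

naive_total = authors * naive_per_author
projected_total = authors * per_author  # assumes all statements are similar

print(f"bytes per author statement: {per_author}")
print(f"naive estimate for {authors} authors: {naive_total} bytes")
print(f"projected size for {authors} authors: {projected_total} bytes")
```

This only accounts for the author statements themselves; anything else stored
in the same entity would come on top of that.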
Isn't there some way to make these entries more concise? So far there are only
around 250 such articles, but each one takes up over 1GB of data across all of
its revisions, *compressed*. We generally expect articles to take a while to
get huge...
TASK DETAIL
https://phabricator.wikimedia.org/T221504