Ladsgroup added a comment.
Okay, I have had enough of the json dumps moving around breaking the script.
I ran this hadoop query (that's basically two weeks old)
SELECT
regexp_extract(claim.mainsnak.datavalue.value,',\"language\"\\:"(.+?)"',1),
count(*) as hitcount
FROM wmf.wikidata_entity
LATERAL VIEW explode(claims) t AS claim
WHERE snapshot='2021-06-07'
AND typ = 'item' and claim.mainsnak.datatype = 'monolingualtext'
group by
regexp_extract(claim.mainsnak.datavalue.value,',\"language\"\\:"(.+?)"',1)
order by hitcount desc
LIMIT 1000;
The result is P16694 <https://phabricator.wikimedia.org/P16694>
TASK DETAIL
https://phabricator.wikimedia.org/T180771
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Mbch331, Ladsgroup
Cc: Manuel, noarave, Mbch331, Nikki, Lydia_Pintscher, Nikerabbit,
wikibugs-l-list, Nemo_bis, siebrand, liangent, bzimport, Amire80, Ladsgroup,
Ab6399, Kizule, jhsoby, GerardM, Davidzdh, Yejianfei, Liuxinyu970226, Aklapper,
C933103, Biggs657, Invadibot, Lalamarie69, maantietaja, Alter-paule, Beast1978,
Un1tY, Akuckartz, Hook696, Iflorez, Kent7301, alaa_wmde, joker88john, CucyNoiD,
Nandana, Gaboe420, lucamauri, Giuliamocci, Cpaulf30, Lahi, Gq86, Af420,
Bsandipan, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, Lewizho99,
Maathavan, _jensen, rosalieper, Scott_WUaS, Jonas, Wikidata-bugs, aude
_______________________________________________
Wikidata-bugs mailing list -- [email protected]
To unsubscribe send an email to [email protected]