Update: dawiki category "Personer" seems to have some category tree cycles in higher depths. Here are articles from that category (one layer deep) with no P31 in the item: http://petscan.wmflabs.org/?psid=8128462
On Tue, Mar 5, 2019 at 1:00 PM Magnus Manske <[email protected]> wrote: > If you make the gender optional, you also get the items without gender: > http://tinyurl.com/yygze9da > > "People on Danish Wikipedia but not on Wikidata" is either: > * a subset of "Danish Wikipedia articles not on Wikidata". You can get all > of these (currently, 91) via my tool: > https://tools.wmflabs.org/wikidata-todo/duplicity.php?wiki=dawiki&mode=list > * "people on Danish Wikipedia with an item but no P31". Using dawiki > category "Personer", I am currently running PetScan but it's slow, will > keep you posted > (for all dawiki items without P31 or P279, see http://tinyurl.com/y4u6lwyj > ) > > On Tue, Mar 5, 2019 at 11:55 AM <[email protected]> wrote: > >> Dear any Wikidata Query Service expert, >> >> >> In connection with an editathon, I have made statistics of the number of >> women and men on the Danish Wikipedia. I have used WDQS for that and the >> query is listed below: >> >> SELECT ?count ?gender ?genderLabel >> WITH { >> SELECT ?gender (COUNT(*) AS ?count) WHERE { >> ?item wdt:P31 wd:Q5 . >> ?item wdt:P21 ?gender . >> ?article schema:about ?item. >> ?article schema:isPartOf <https://da.wikipedia.org/> >> } >> GROUP BY ?gender >> } AS %results >> WHERE { >> INCLUDE %results >> SERVICE wikibase:label { bd:serviceParam wikibase:language "da,en". } >> } >> ORDER BY DESC(?count) >> LIMIT 25 >> >> http://tinyurl.com/y8twboe5 >> >> As the statistics could potentially create some discussion (and ready >> seems to have) I am wondering whether there are some experts that could >> peer review the SPARQL query and tell me if there are any issues. I hope >> I have not made a blunder... >> >> The minor issues I can think of are: >> >> - Missing gender in Wikidata. We have around 360 of these. >> >> - People on the Danish Wikipedia not on Wikidata. Probably tens-ish or >> hundreds-ish!? >> >> - People not being humans. The gendered items I sampled were all >> fictional humans. >> >> >> We previously reached 17.2% females. Now we are below 17% due to >> mass-import of Japanese football players, - as far as we can see. >> >> >> best regards >> Finn Årup Nielsen >> http://people.compute.dtu.dk/faan/ >> >> _______________________________________________ >> Wikidata mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/wikidata >> >
_______________________________________________ Wikidata mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata
