Update: the dawiki category "Personer" seems to have some category tree
cycles at greater depths. Here are the articles from that category (one
level deep) whose items have no P31:
http://petscan.wmflabs.org/?psid=8128462


On Tue, Mar 5, 2019 at 1:00 PM Magnus Manske <[email protected]>
wrote:

> If you make the gender optional, you also get the items without gender:
> http://tinyurl.com/yygze9da
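>
> A minimal sketch of that change, assuming the short link holds roughly
> the inner query from the original mail (the link is not expanded here):
> wrapping the P21 pattern in OPTIONAL keeps humans that have no gender
> statement, and they are counted in the row where ?gender is unbound.
>
> SELECT ?gender (COUNT(*) AS ?count) WHERE {
>   ?item wdt:P31 wd:Q5 .
>   # gender is now optional, so items without P21 are no longer dropped
>   OPTIONAL { ?item wdt:P21 ?gender . }
>   ?article schema:about ?item .
>   ?article schema:isPartOf <https://da.wikipedia.org/>
> }
> GROUP BY ?gender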
>
> "People on Danish Wikipedia but not on Wikidata" is either:
> * a subset of "Danish Wikipedia articles not on Wikidata". You can get all
> of these (currently, 91) via my tool:
> https://tools.wmflabs.org/wikidata-todo/duplicity.php?wiki=dawiki&mode=list
> * "people on Danish Wikipedia with an item but no P31". Using dawiki
> category "Personer", I am currently running PetScan but it's slow, will
> keep you posted
> (for all dawiki items without P31 or P279, see http://tinyurl.com/y4u6lwyj
> )
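>
> As a rough sketch of what such a query might look like (an assumption,
> the short link is not expanded here), dawiki items with neither P31 nor
> P279 could be listed as follows; it may be slow or time out on WDQS:
>
> SELECT ?item ?article WHERE {
>   ?article schema:about ?item ;
>            schema:isPartOf <https://da.wikipedia.org/> .
>   # keep only items with no "instance of" and no "subclass of" statement
>   FILTER NOT EXISTS { ?item wdt:P31 [] }
>   FILTER NOT EXISTS { ?item wdt:P279 [] }
> }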
>
> On Tue, Mar 5, 2019 at 11:55 AM <[email protected]> wrote:
>
>> Dear any Wikidata Query Service expert,
>>
>>
>> In connection with an editathon, I have made statistics of the number of
>> women and men on the Danish Wikipedia. I have used WDQS for that and the
>> query is listed below:
>>
>> SELECT ?count ?gender ?genderLabel
>> WITH {
>>    SELECT ?gender (COUNT(*) AS ?count) WHERE {
>>      ?item wdt:P31 wd:Q5 .
>>      ?item wdt:P21 ?gender .
>>      ?article schema:about ?item.
>>      ?article schema:isPartOf <https://da.wikipedia.org/>
>>    }
>>    GROUP BY ?gender
>> } AS %results
>> WHERE {
>>    INCLUDE %results
>>    SERVICE wikibase:label { bd:serviceParam wikibase:language "da,en". }
>> }
>> ORDER BY DESC(?count)
>> LIMIT 25
>>
>> http://tinyurl.com/y8twboe5
>>
>> As the statistics could potentially create some discussion (and already
>> seem to have), I am wondering whether there are some experts that could
>> peer review the SPARQL query and tell me if there are any issues. I hope
>> I have not made a blunder...
>>
>> The minor issues I can think of are:
>>
>> - Missing gender in Wikidata. We have around 360 of these.
>>
>> - People on the Danish Wikipedia not on Wikidata. Probably tens-ish or
>> hundreds-ish!?
>>
>> - People not being humans. The gendered items I sampled were all
>> fictional humans.
>>
>>
>> We previously reached 17.2% females. Now we are below 17% due to a
>> mass import of Japanese football players, as far as we can see.
>>
>>
>> best regards
>> Finn Årup Nielsen
>> http://people.compute.dtu.dk/faan/
>>
>> _______________________________________________
>> Wikidata mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>
>
