Re: [Wikimedia-l] Fresh data on the gender gap in content

2020-02-10 Thread Asaf Bartov
On Mon, Feb 10, 2020 at 6:14 PM Robert Fernandez 
wrote:

> How do you get negative one biographies?
>
> 149 Mongolian 2 -1 -50.00
> 148 Punjabi 2 -1
>

The lines you quote were from the "diff" section, i.e. there were two
articles about men more than the previous period, and one less about
women.  (This can mean an article about a woman was deleted, or it could
mean three articles about women were created but four were deleted, etc.)

   A.

-- 
Asaf Bartov
Wikimedia Foundation 

Imagine a world in which every single human being can freely share in the
sum of all knowledge. Help us make it a reality!
https://donate.wikimedia.org
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines and 
https://meta.wikimedia.org/wiki/Wikimedia-l
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2020-02-10 Thread Robert Fernandez
How do you get negative one biographies?

149 Mongolian 2 -1 -50.00
148 Punjabi 2 -1

On Thu, Jun 16, 2016 at 5:15 PM Florence Devouard 
wrote:

> Hello Asaf
>
> Just making sure that you knew about WHGI :
> http://whgi.wmflabs.org/gender-by-language.html
>
> Do you know if there are differences in analysis between the two analysis ?
>
> I checked a few figures and it fits pretty well.
>
> Flo
>
>
> Le 16/06/16 à 21:14, Asaf Bartov a écrit :
> > Hullo everyone.
> >
> > I was asked by a volunteer for help getting stats on the gender gap in
> > content on a certain Wikipedia, and came up with simple Wikidata Query
> > Service[1] queries that pulled the total number of articles on a given
> > Wikipedia about men and about women, to calculate *the proportion of
> > articles about women out of all articles about humans*.
> >
> > Then I was curious about how that wiki compared to other wikis, so I ran
> > the queries on a bunch of languages, and gathered the results into a
> table,
> > here:
> > https://meta.wikimedia.org/wiki/User:Ijon/Content_gap
> >
> > (please see the *caveat* there.)
> >
> > I don't have time to fully write-up everything I find interesting in
> those
> > results, but I will quickly point out the following:
> >
> > 1. The Nepali statistic is simply astonishing! There must be a story
> > there.  I'm keen on learning more about this, if anyone can shed light.
> >
> > 2. Evidently, ~13%-17% seems like a robust average of the proportion of
> > articles about women among all biographies.
> >
> > 3. among the top 10 largest wikis, Japanese is the least imbalanced.
> Good
> > job, Japanese Wikipedians!  I wonder if you have a good sense of what
> > drives this relatively better balance. (my instinctive guess is pop
> culture
> > coverage.)
> >
> > 4. among the top 10 largest wikis, Russian is the most imbalanced.
> >
> > 5. I intend to re-generate these stats every two months or so, to
> > eventually have some sense of trends and changes.
> >
> > 6. Your efforts, particularly on small-to-medium wikis, can really make a
> > dent in these numbers!  For example, it seems I am personally
> > responsible[2] for almost 1% of the coverage of women on Hebrew
> Wikipedia!
> > :)
> >
> > 7. I encourage you to share these numbers with your communities.  Perhaps
> > you'd like to overtake the wiki just above yours? :)
> >
> > 8. I'm happy to add additional languages to the table, by request.  Or
> you
> > can do it yourself, too. :)
> >
> >A.
> >
> > [1] https://query.wikidata.org/
> > [2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays
> >
>
>
>
> ___
> Wikimedia-l mailing list, guidelines at:
> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> New messages to: Wikimedia-l@lists.wikimedia.org
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> 
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines and 
https://meta.wikimedia.org/wiki/Wikimedia-l
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2020-02-10 Thread Magnus Manske via Wikimedia-l
I also had a tweet, about gender ratio in Wikidata, grouped by external
property:
https://twitter.com/MagnusManske/status/1224703188693069827

On Sat, Feb 8, 2020 at 9:25 AM Asaf Bartov  wrote:

> Hello, everyone.
>
> Back in June 2016, on this thread, I had published some data on the
> *content
> gender gap* in Wikipedia biographies (i.e. how many articles are about men
> versus how many are about women). I intended to keep generating snapshots
> of these stats, to track progress against the gap, but then realized the
> excellent WHGI project[1] is already collecting and preserving snapshots of
> the data, so I did not do so myself.
>
> However, being curious about the numbers for a particular community, I
> realized WHGI does not provide an easy way to compare snapshots across
> time.  The data is all there[2], but a convenient visualization and
> comparison tool between two arbitrary snapshots is not available yet.  So
> while I did not build such a tool myself, I did update my little page with
> fresh numbers, *plus a comparison to the June 2016 *numbers.
>
> You may find the data interesting:
>
> https://meta.wikimedia.org/wiki/User:Ijon/Content_gap#Data_as_of_February_2020
> (note that the table is sortable by clicking the column heads)
>
> While it is difficult to ascribe the change to specific programs, overall
> there can be little doubt that the increased attention to the content
> gender gap, alongside specific programs and groups such as Women in Red,
> Art+Feminism, WikiMujeres, WikiDonne, and even certain runs of
> #100wikidays, have all contributed to narrowing the gap on most Wikipedias.
>
> Special congratulations to the Punjabi, Malayalam, and Odia Wikipedias,
> which have all narrowed the gap by more than 10 points!  Of the larger
> wikis, great progress was made by the Vietnamese and Armenian Wikipedias.
>
> As before, I'll note:
>
> 1. Your efforts, particularly on small-to-medium wikis, can really make a
> dent in these numbers!
>
> 2. I encourage you to share these numbers with your communities.  Perhaps
> you'd like to overtake the wiki just above yours? :)
>
> Cheers,
>
>A.
>
> [1] https://whgi.wmflabs.org/gender-by-language.html
> [2] https://whgi.wmflabs.org/snapshot_data/
>
> On Thu, Jun 16, 2016 at 10:14 PM Asaf Bartov 
> wrote:
>
> > Hullo everyone.
> >
> > I was asked by a volunteer for help getting stats on the gender gap in
> > content on a certain Wikipedia, and came up with simple Wikidata Query
> > Service[1] queries that pulled the total number of articles on a given
> > Wikipedia about men and about women, to calculate *the proportion of
> > articles about women out of all articles about humans*.
> >
> > Then I was curious about how that wiki compared to other wikis, so I ran
> > the queries on a bunch of languages, and gathered the results into a
> table,
> > here:
> > https://meta.wikimedia.org/wiki/User:Ijon/Content_gap
> >
> > (please see the *caveat* there.)
> >
> > I don't have time to fully write-up everything I find interesting in
> those
> > results, but I will quickly point out the following:
> >
> > 1. The Nepali statistic is simply astonishing! There must be a story
> > there.  I'm keen on learning more about this, if anyone can shed light.
> >
> > 2. Evidently, ~13%-17% seems like a robust average of the proportion of
> > articles about women among all biographies.
> >
> > 3. among the top 10 largest wikis, Japanese is the least imbalanced.
> Good
> > job, Japanese Wikipedians!  I wonder if you have a good sense of what
> > drives this relatively better balance. (my instinctive guess is pop
> culture
> > coverage.)
> >
> > 4. among the top 10 largest wikis, Russian is the most imbalanced.
> >
> > 5. I intend to re-generate these stats every two months or so, to
> > eventually have some sense of trends and changes.
> >
> > 6. Your efforts, particularly on small-to-medium wikis, can really make a
> > dent in these numbers!  For example, it seems I am personally
> > responsible[2] for almost 1% of the coverage of women on Hebrew
> Wikipedia!
> > :)
> >
> > 7. I encourage you to share these numbers with your communities.  Perhaps
> > you'd like to overtake the wiki just above yours? :)
> >
> > 8. I'm happy to add additional languages to the table, by request.  Or
> you
> > can do it yourself, too. :)
> >
> >A.
> >
> > [1] https://query.wikidata.org/
> > [2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays
> > --
> > Asaf Bartov
> > Wikimedia Foundation 
> >
> > Imagine a world in which every single human being can freely share in the
> > sum of all knowledge. Help us make it a reality!
> > https://donate.wikimedia.org
> >
>
>
> --
> Asaf Bartov
> Wikimedia Foundation 
>
> Imagine a world in which every single human being can freely share in the
> sum of all knowledge. Help us make it a reality!
> https://donate.wikimedia.org
> 

Re: [Wikimedia-l] Fresh data on the gender gap in content

2020-02-08 Thread Asaf Bartov
Hello, everyone.

Back in June 2016, on this thread, I had published some data on the *content
gender gap* in Wikipedia biographies (i.e. how many articles are about men
versus how many are about women). I intended to keep generating snapshots
of these stats, to track progress against the gap, but then realized the
excellent WHGI project[1] is already collecting and preserving snapshots of
the data, so I did not do so myself.

However, being curious about the numbers for a particular community, I
realized WHGI does not provide an easy way to compare snapshots across
time.  The data is all there[2], but a convenient visualization and
comparison tool between two arbitrary snapshots is not available yet.  So
while I did not build such a tool myself, I did update my little page with
fresh numbers, *plus a comparison to the June 2016 *numbers.

You may find the data interesting:
https://meta.wikimedia.org/wiki/User:Ijon/Content_gap#Data_as_of_February_2020
(note that the table is sortable by clicking the column heads)

While it is difficult to ascribe the change to specific programs, overall
there can be little doubt that the increased attention to the content
gender gap, alongside specific programs and groups such as Women in Red,
Art+Feminism, WikiMujeres, WikiDonne, and even certain runs of
#100wikidays, have all contributed to narrowing the gap on most Wikipedias.

Special congratulations to the Punjabi, Malayalam, and Odia Wikipedias,
which have all narrowed the gap by more than 10 points!  Of the larger
wikis, great progress was made by the Vietnamese and Armenian Wikipedias.

As before, I'll note:

1. Your efforts, particularly on small-to-medium wikis, can really make a
dent in these numbers!

2. I encourage you to share these numbers with your communities.  Perhaps
you'd like to overtake the wiki just above yours? :)

Cheers,

   A.

[1] https://whgi.wmflabs.org/gender-by-language.html
[2] https://whgi.wmflabs.org/snapshot_data/

On Thu, Jun 16, 2016 at 10:14 PM Asaf Bartov  wrote:

> Hullo everyone.
>
> I was asked by a volunteer for help getting stats on the gender gap in
> content on a certain Wikipedia, and came up with simple Wikidata Query
> Service[1] queries that pulled the total number of articles on a given
> Wikipedia about men and about women, to calculate *the proportion of
> articles about women out of all articles about humans*.
>
> Then I was curious about how that wiki compared to other wikis, so I ran
> the queries on a bunch of languages, and gathered the results into a table,
> here:
> https://meta.wikimedia.org/wiki/User:Ijon/Content_gap
>
> (please see the *caveat* there.)
>
> I don't have time to fully write-up everything I find interesting in those
> results, but I will quickly point out the following:
>
> 1. The Nepali statistic is simply astonishing! There must be a story
> there.  I'm keen on learning more about this, if anyone can shed light.
>
> 2. Evidently, ~13%-17% seems like a robust average of the proportion of
> articles about women among all biographies.
>
> 3. among the top 10 largest wikis, Japanese is the least imbalanced.  Good
> job, Japanese Wikipedians!  I wonder if you have a good sense of what
> drives this relatively better balance. (my instinctive guess is pop culture
> coverage.)
>
> 4. among the top 10 largest wikis, Russian is the most imbalanced.
>
> 5. I intend to re-generate these stats every two months or so, to
> eventually have some sense of trends and changes.
>
> 6. Your efforts, particularly on small-to-medium wikis, can really make a
> dent in these numbers!  For example, it seems I am personally
> responsible[2] for almost 1% of the coverage of women on Hebrew Wikipedia!
> :)
>
> 7. I encourage you to share these numbers with your communities.  Perhaps
> you'd like to overtake the wiki just above yours? :)
>
> 8. I'm happy to add additional languages to the table, by request.  Or you
> can do it yourself, too. :)
>
>A.
>
> [1] https://query.wikidata.org/
> [2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays
> --
> Asaf Bartov
> Wikimedia Foundation 
>
> Imagine a world in which every single human being can freely share in the
> sum of all knowledge. Help us make it a reality!
> https://donate.wikimedia.org
>


-- 
Asaf Bartov
Wikimedia Foundation 

Imagine a world in which every single human being can freely share in the
sum of all knowledge. Help us make it a reality!
https://donate.wikimedia.org
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines and 
https://meta.wikimedia.org/wiki/Wikimedia-l
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-18 Thread Andre Engels
On Thu, Jun 16, 2016 at 9:14 PM, Asaf Bartov  wrote:

> 1. The Nepali statistic is simply astonishing! There must be a story
> there.  I'm keen on learning more about this, if anyone can shed light.

I looked, and it seems that a large proportion consists of porn
actresses... See the attached sample, giving an almost-randomly chosen
sample of 11 of the 1400 or so identified people on Nepali Wikipedia.


-- 
André Engels, andreeng...@gmail.com
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-18 Thread Tanweer Morshed
Hi Asaf,

I asked Ganesh Paudel, one of the veteran Nepali Wikipedians, to know their
story behind such astonishing statistics (48% women biographies). Here is
what he said:

'We have been conducting WikiWomen editathon since march 2013. That was
focused to increase quality n quantity of women related issues. This
focused approach might have grown the contents.'



Thanks.

On Fri, Jun 17, 2016 at 1:14 AM, Asaf Bartov  wrote:

> Hullo everyone.
>
> I was asked by a volunteer for help getting stats on the gender gap in
> content on a certain Wikipedia, and came up with simple Wikidata Query
> Service[1] queries that pulled the total number of articles on a given
> Wikipedia about men and about women, to calculate *the proportion of
> articles about women out of all articles about humans*.
>
> Then I was curious about how that wiki compared to other wikis, so I ran
> the queries on a bunch of languages, and gathered the results into a table,
> here:
> https://meta.wikimedia.org/wiki/User:Ijon/Content_gap
>
> (please see the *caveat* there.)
>
> I don't have time to fully write-up everything I find interesting in those
> results, but I will quickly point out the following:
>
> 1. The Nepali statistic is simply astonishing! There must be a story
> there.  I'm keen on learning more about this, if anyone can shed light.
>
> 2. Evidently, ~13%-17% seems like a robust average of the proportion of
> articles about women among all biographies.
>
> 3. among the top 10 largest wikis, Japanese is the least imbalanced.  Good
> job, Japanese Wikipedians!  I wonder if you have a good sense of what
> drives this relatively better balance. (my instinctive guess is pop culture
> coverage.)
>
> 4. among the top 10 largest wikis, Russian is the most imbalanced.
>
> 5. I intend to re-generate these stats every two months or so, to
> eventually have some sense of trends and changes.
>
> 6. Your efforts, particularly on small-to-medium wikis, can really make a
> dent in these numbers!  For example, it seems I am personally
> responsible[2] for almost 1% of the coverage of women on Hebrew Wikipedia!
> :)
>
> 7. I encourage you to share these numbers with your communities.  Perhaps
> you'd like to overtake the wiki just above yours? :)
>
> 8. I'm happy to add additional languages to the table, by request.  Or you
> can do it yourself, too. :)
>
>A.
>
> [1] https://query.wikidata.org/
> [2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays
> --
> Asaf Bartov
> Wikimedia Foundation 
>
> Imagine a world in which every single human being can freely share in the
> sum of all knowledge. Help us make it a reality!
> https://donate.wikimedia.org
> ___
> Wikimedia-l mailing list, guidelines at:
> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> New messages to: Wikimedia-l@lists.wikimedia.org
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> 




-- 
Regards,
Tanweer
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-17 Thread Magnus Manske
Nepali seems to have come a long way since I ran my analysis:
http://magnusmanske.de/wordpress/?p=250
(see the big dot-plot; note that I compare the length of the articles,
rather than their number)

On Fri, Jun 17, 2016 at 11:01 AM Ting Chen  wrote:

> Hello Asaf, hello Florence,
>
> the inbalance is surely partly due to the cultural bias (both
> contemprary as well as historical) of our world. Across the cultures in
> before 18th century women found less notice in the historical
> documentation. As far as I know about Japanese history, this bias was
> less prominant in the Japanese history, this only change since the rise
> of the Shoguns (about 12th century). Could this be part of the reason of
> the ja-wp result?
>
> If it is the case than Wikipedia and Wikidata could be a very valuable
> resource for cultural history researches since never before there was
> such a systematic gathering of so much correlated data.
>
> Take the 20th century, would the data reflect the change (or reluctance
> to change) of the bias in art, politics and economy?
>
> If yes, I think we should spread word inside of the research community
> because in my opinion the research community had until now still didn't
> pay attention to this pile of data that all the volunteers had put
> together and are just waiting for them to mine.
>
> Greetings
> Ting
>
>
> Am 16.06.2016 um 23:14 schrieb Florence Devouard:
> > Hello Asaf,
> > Just making sure that you knew about WHGI :
> > http://whgi.wmflabs.org/gender-by-language.html
> >
> > Do you know if there are differences in analysis between the two
> > analysis ?
> >
> > I checked a few figures and it fits pretty well.
> >
> > Flo
> >
> >
> > Le 16/06/16 à 21:14, Asaf Bartov a écrit :
> >> Hullo everyone.
> >>
> >> I was asked by a volunteer for help getting stats on the gender gap in
> >> content on a certain Wikipedia, and came up with simple Wikidata Query
> >> Service[1] queries that pulled the total number of articles on a given
> >> Wikipedia about men and about women, to calculate *the proportion of
> >> articles about women out of all articles about humans*.
> >>
> >> Then I was curious about how that wiki compared to other wikis, so I ran
> >> the queries on a bunch of languages, and gathered the results into a
> >> table,
> >> here:
> >> https://meta.wikimedia.org/wiki/User:Ijon/Content_gap
> >>
> >> (please see the *caveat* there.)
> >>
> >> I don't have time to fully write-up everything I find interesting in
> >> those
> >> results, but I will quickly point out the following:
> >>
> >> 1. The Nepali statistic is simply astonishing! There must be a story
> >> there.  I'm keen on learning more about this, if anyone can shed light.
> >>
> >> 2. Evidently, ~13%-17% seems like a robust average of the proportion of
> >> articles about women among all biographies.
> >>
> >> 3. among the top 10 largest wikis, Japanese is the least imbalanced.
> >> Good
> >> job, Japanese Wikipedians!  I wonder if you have a good sense of what
> >> drives this relatively better balance. (my instinctive guess is pop
> >> culture
> >> coverage.)
> >>
> >> 4. among the top 10 largest wikis, Russian is the most imbalanced.
> >>
> >> 5. I intend to re-generate these stats every two months or so, to
> >> eventually have some sense of trends and changes.
> >>
> >> 6. Your efforts, particularly on small-to-medium wikis, can really
> >> make a
> >> dent in these numbers!  For example, it seems I am personally
> >> responsible[2] for almost 1% of the coverage of women on Hebrew
> >> Wikipedia!
> >> :)
> >>
> >> 7. I encourage you to share these numbers with your communities.
> >> Perhaps
> >> you'd like to overtake the wiki just above yours? :)
> >>
> >> 8. I'm happy to add additional languages to the table, by request.
> >> Or you
> >> can do it yourself, too. :)
> >>
> >>A.
> >>
> >> [1] https://query.wikidata.org/
> >> [2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays
> >>
> >
> >
> >
> > ___
> > Wikimedia-l mailing list, guidelines at:
> > https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> > New messages to: Wikimedia-l@lists.wikimedia.org
> > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> > 
>
>
>
> ___
> Wikimedia-l mailing list, guidelines at:
> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> New messages to: Wikimedia-l@lists.wikimedia.org
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> 
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 

Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-17 Thread Ting Chen

Hello Asaf, hello Florence,

the inbalance is surely partly due to the cultural bias (both 
contemprary as well as historical) of our world. Across the cultures in 
before 18th century women found less notice in the historical 
documentation. As far as I know about Japanese history, this bias was 
less prominant in the Japanese history, this only change since the rise 
of the Shoguns (about 12th century). Could this be part of the reason of 
the ja-wp result?


If it is the case than Wikipedia and Wikidata could be a very valuable 
resource for cultural history researches since never before there was 
such a systematic gathering of so much correlated data.


Take the 20th century, would the data reflect the change (or reluctance 
to change) of the bias in art, politics and economy?


If yes, I think we should spread word inside of the research community 
because in my opinion the research community had until now still didn't 
pay attention to this pile of data that all the volunteers had put 
together and are just waiting for them to mine.


Greetings
Ting


Am 16.06.2016 um 23:14 schrieb Florence Devouard:

Hello Asaf,
Just making sure that you knew about WHGI : 
http://whgi.wmflabs.org/gender-by-language.html


Do you know if there are differences in analysis between the two 
analysis ?


I checked a few figures and it fits pretty well.

Flo


Le 16/06/16 à 21:14, Asaf Bartov a écrit :

Hullo everyone.

I was asked by a volunteer for help getting stats on the gender gap in
content on a certain Wikipedia, and came up with simple Wikidata Query
Service[1] queries that pulled the total number of articles on a given
Wikipedia about men and about women, to calculate *the proportion of
articles about women out of all articles about humans*.

Then I was curious about how that wiki compared to other wikis, so I ran
the queries on a bunch of languages, and gathered the results into a 
table,

here:
https://meta.wikimedia.org/wiki/User:Ijon/Content_gap

(please see the *caveat* there.)

I don't have time to fully write-up everything I find interesting in 
those

results, but I will quickly point out the following:

1. The Nepali statistic is simply astonishing! There must be a story
there.  I'm keen on learning more about this, if anyone can shed light.

2. Evidently, ~13%-17% seems like a robust average of the proportion of
articles about women among all biographies.

3. among the top 10 largest wikis, Japanese is the least imbalanced.  
Good

job, Japanese Wikipedians!  I wonder if you have a good sense of what
drives this relatively better balance. (my instinctive guess is pop 
culture

coverage.)

4. among the top 10 largest wikis, Russian is the most imbalanced.

5. I intend to re-generate these stats every two months or so, to
eventually have some sense of trends and changes.

6. Your efforts, particularly on small-to-medium wikis, can really 
make a

dent in these numbers!  For example, it seems I am personally
responsible[2] for almost 1% of the coverage of women on Hebrew 
Wikipedia!

:)

7. I encourage you to share these numbers with your communities.  
Perhaps

you'd like to overtake the wiki just above yours? :)

8. I'm happy to add additional languages to the table, by request.  
Or you

can do it yourself, too. :)

   A.

[1] https://query.wikidata.org/
[2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays





___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines

New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 





___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-17 Thread Tanweer Morshed
Hi Asaf,

The table is quite interesting! This is the first time I could imagine the
actual situation about the coverage of women biographies in various
Wikipedias, especially in my home Wiki, the Bengali Wikipedia. I'm going to
spread this statistics among the community.

And I have asked Ganesh, one of the veteran Nepali Wikipedian, as to why
the statistics of Nepali WP is that much astonishing. I'll convey his reply
to this thread.


Thanks.

On Fri, Jun 17, 2016 at 6:01 AM, Florence Devouard 
wrote:

> I think the project evolved over time. It may be that this page did not
> exist in the first version.
>
> Clearly, the site has been modified recently because the name changed from
> WIGI to WHGI. And I see the main four pages has been updated.
>
> If you look at "Gender by language" for example, it mentions "As of
> January 2016 about 98% of biographies were attached to at least one
> Wikipedia site, so this data is mostly complete."
>
> Gender by date of birth and death mentions "As of January 2016, only about
> 72% and 36% of biographies, had date of birth and date of death data,
> respectively, so this data is incomplete."
>
> Gender by country mentions "As of January 2016, only about 30% of
> biographies, had place of birth, so this data is incomplete."
>
> and "As of January 2016, only about 65% of biographies, had culture, so
> this data is incomplete."
>
> Recently, these dates were much further in the past. I know because I
> noticed and ask Max for fresher data
> https://groups.google.com/forum/#!topic/wigi-project/CCLSKN2tmQE
> A small table published there may answer part of your questions.
>
> Ant
>
> Le 17/06/16 à 00:22, Asaf Bartov a écrit :
>
> Thanks, Florence!
>>
>> I was aware of the WIGI research project (and have linked to it in the
>> ==See Also== section of my page), but I was not aware of this page.  Neat!
>> So I won't need to update my own page. :)
>>
>>A.
>>
>> On Thu, Jun 16, 2016 at 2:14 PM, Florence Devouard 
>> wrote:
>>
>> Hello Asaf
>>>
>>> Just making sure that you knew about WHGI :
>>> http://whgi.wmflabs.org/gender-by-language.html
>>>
>>> Do you know if there are differences in analysis between the two
>>> analysis ?
>>>
>>> I checked a few figures and it fits pretty well.
>>>
>>> Flo
>>>
>>>
>>> Le 16/06/16 à 21:14, Asaf Bartov a écrit :
>>>
>>> Hullo everyone.

 I was asked by a volunteer for help getting stats on the gender gap in
 content on a certain Wikipedia, and came up with simple Wikidata Query
 Service[1] queries that pulled the total number of articles on a given
 Wikipedia about men and about women, to calculate *the proportion of
 articles about women out of all articles about humans*.

 Then I was curious about how that wiki compared to other wikis, so I ran
 the queries on a bunch of languages, and gathered the results into a
 table,
 here:
 https://meta.wikimedia.org/wiki/User:Ijon/Content_gap

 (please see the *caveat* there.)

 I don't have time to fully write-up everything I find interesting in
 those
 results, but I will quickly point out the following:

 1. The Nepali statistic is simply astonishing! There must be a story
 there.  I'm keen on learning more about this, if anyone can shed light.

 2. Evidently, ~13%-17% seems like a robust average of the proportion of
 articles about women among all biographies.

 3. among the top 10 largest wikis, Japanese is the least imbalanced.
 Good
 job, Japanese Wikipedians!  I wonder if you have a good sense of what
 drives this relatively better balance. (my instinctive guess is pop
 culture
 coverage.)

 4. among the top 10 largest wikis, Russian is the most imbalanced.

 5. I intend to re-generate these stats every two months or so, to
 eventually have some sense of trends and changes.

 6. Your efforts, particularly on small-to-medium wikis, can really make
 a
 dent in these numbers!  For example, it seems I am personally
 responsible[2] for almost 1% of the coverage of women on Hebrew
 Wikipedia!
 :)

 7. I encourage you to share these numbers with your communities.
 Perhaps
 you'd like to overtake the wiki just above yours? :)

 8. I'm happy to add additional languages to the table, by request.  Or
 you
 can do it yourself, too. :)

A.

 [1] https://query.wikidata.org/
 [2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays



>>>
>>> ___
>>> Wikimedia-l mailing list, guidelines at:
>>> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
>>> New messages to: Wikimedia-l@lists.wikimedia.org
>>> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
>>> 
>>>
>>>
>>
>>
>>
>
>
> 

Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-16 Thread Florence Devouard
I think the project evolved over time. It may be that this page did not 
exist in the first version.


Clearly, the site has been modified recently because the name changed 
from WIGI to WHGI. And I see the main four pages has been updated.


If you look at "Gender by language" for example, it mentions "As of 
January 2016 about 98% of biographies were attached to at least one 
Wikipedia site, so this data is mostly complete."


Gender by date of birth and death mentions "As of January 2016, only 
about 72% and 36% of biographies, had date of birth and date of death 
data, respectively, so this data is incomplete."


Gender by country mentions "As of January 2016, only about 30% of 
biographies, had place of birth, so this data is incomplete."


and "As of January 2016, only about 65% of biographies, had culture, so 
this data is incomplete."


Recently, these dates were much further in the past. I know because I 
noticed and ask Max for fresher data

https://groups.google.com/forum/#!topic/wigi-project/CCLSKN2tmQE
A small table published there may answer part of your questions.

Ant

Le 17/06/16 à 00:22, Asaf Bartov a écrit :

Thanks, Florence!

I was aware of the WIGI research project (and have linked to it in the
==See Also== section of my page), but I was not aware of this page.  Neat!
So I won't need to update my own page. :)

   A.

On Thu, Jun 16, 2016 at 2:14 PM, Florence Devouard 
wrote:


Hello Asaf

Just making sure that you knew about WHGI :
http://whgi.wmflabs.org/gender-by-language.html

Do you know if there are differences in analysis between the two analysis ?

I checked a few figures and it fits pretty well.

Flo


Le 16/06/16 à 21:14, Asaf Bartov a écrit :


Hullo everyone.

I was asked by a volunteer for help getting stats on the gender gap in
content on a certain Wikipedia, and came up with simple Wikidata Query
Service[1] queries that pulled the total number of articles on a given
Wikipedia about men and about women, to calculate *the proportion of
articles about women out of all articles about humans*.

Then I was curious about how that wiki compared to other wikis, so I ran
the queries on a bunch of languages, and gathered the results into a
table,
here:
https://meta.wikimedia.org/wiki/User:Ijon/Content_gap

(please see the *caveat* there.)

I don't have time to fully write-up everything I find interesting in those
results, but I will quickly point out the following:

1. The Nepali statistic is simply astonishing! There must be a story
there.  I'm keen on learning more about this, if anyone can shed light.

2. Evidently, ~13%-17% seems like a robust average of the proportion of
articles about women among all biographies.

3. among the top 10 largest wikis, Japanese is the least imbalanced.  Good
job, Japanese Wikipedians!  I wonder if you have a good sense of what
drives this relatively better balance. (my instinctive guess is pop
culture
coverage.)

4. among the top 10 largest wikis, Russian is the most imbalanced.

5. I intend to re-generate these stats every two months or so, to
eventually have some sense of trends and changes.

6. Your efforts, particularly on small-to-medium wikis, can really make a
dent in these numbers!  For example, it seems I am personally
responsible[2] for almost 1% of the coverage of women on Hebrew Wikipedia!
:)

7. I encourage you to share these numbers with your communities.  Perhaps
you'd like to overtake the wiki just above yours? :)

8. I'm happy to add additional languages to the table, by request.  Or you
can do it yourself, too. :)

   A.

[1] https://query.wikidata.org/
[2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays





___
Wikimedia-l mailing list, guidelines at:
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,










___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-16 Thread Asaf Bartov
Hi, Giuseppe.

On Thu, Jun 16, 2016 at 1:10 PM, Giuseppe Profiti 
wrote:

> Thanks Asaf, it is quite interesting.
> I have a couple of questions and one comment:
> - why is Italian wikipedia missing? Is there a problem with wikidata links?
>

Just an oversight -- I accidentally overwrote that line when posting.  Now
added. (~14.6%)

- is this kind of analysis possible also to the average user?
>

Sure!  Here's a query for number of article about women on ITWP:
http://tinyurl.com/hzrdfub

And here's one about men:
http://tinyurl.com/h7obnnq

And you can just change the language code to get other languages.  The
queries are split by gender even though it's possible to get both counts in
one query, to decrease the odds of running out of time for the query (an
unfortunate limitation due to lacking hardware, which also makes it
impossible to count articles about men on ENWP without help from a WMF
engineer.)


> The comment is about measuring such ratio in other similar products
> (encyclopedia, books with biographies etc) to see how wel we fare
> compared to others.
>

That would indeed be interesting, but I'm leaving that as an exercise for
someone else. :)

   A.
-- 
Asaf Bartov
Wikimedia Foundation 

Imagine a world in which every single human being can freely share in the
sum of all knowledge. Help us make it a reality!
https://donate.wikimedia.org
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-16 Thread Vitor Mazuco
So interesting this list Asaf.

2016-06-16 16:14 GMT-03:00, Asaf Bartov :
> Hullo everyone.
>
> I was asked by a volunteer for help getting stats on the gender gap in
> content on a certain Wikipedia, and came up with simple Wikidata Query
> Service[1] queries that pulled the total number of articles on a given
> Wikipedia about men and about women, to calculate *the proportion of
> articles about women out of all articles about humans*.
>
> Then I was curious about how that wiki compared to other wikis, so I ran
> the queries on a bunch of languages, and gathered the results into a table,
> here:
> https://meta.wikimedia.org/wiki/User:Ijon/Content_gap
>
> (please see the *caveat* there.)
>
> I don't have time to fully write-up everything I find interesting in those
> results, but I will quickly point out the following:
>
> 1. The Nepali statistic is simply astonishing! There must be a story
> there.  I'm keen on learning more about this, if anyone can shed light.
>
> 2. Evidently, ~13%-17% seems like a robust average of the proportion of
> articles about women among all biographies.
>
> 3. among the top 10 largest wikis, Japanese is the least imbalanced.  Good
> job, Japanese Wikipedians!  I wonder if you have a good sense of what
> drives this relatively better balance. (my instinctive guess is pop culture
> coverage.)
>
> 4. among the top 10 largest wikis, Russian is the most imbalanced.
>
> 5. I intend to re-generate these stats every two months or so, to
> eventually have some sense of trends and changes.
>
> 6. Your efforts, particularly on small-to-medium wikis, can really make a
> dent in these numbers!  For example, it seems I am personally
> responsible[2] for almost 1% of the coverage of women on Hebrew Wikipedia!
> :)
>
> 7. I encourage you to share these numbers with your communities.  Perhaps
> you'd like to overtake the wiki just above yours? :)
>
> 8. I'm happy to add additional languages to the table, by request.  Or you
> can do it yourself, too. :)
>
>A.
>
> [1] https://query.wikidata.org/
> [2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays
> --
> Asaf Bartov
> Wikimedia Foundation 
>
> Imagine a world in which every single human being can freely share in the
> sum of all knowledge. Help us make it a reality!
> https://donate.wikimedia.org
> ___
> Wikimedia-l mailing list, guidelines at:
> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> New messages to: Wikimedia-l@lists.wikimedia.org
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> 

___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-16 Thread Giuseppe Profiti
Thanks Asaf, it is quite interesting.
I have a couple of questions and one comment:
- why is Italian wikipedia missing? Is there a problem with wikidata links?
- is this kind of analysis possible also to the average user?

The comment is about measuring such ratio in other similar products
(encyclopedia, books with biographies etc) to see how wel we fare
compared to others.

Best,
Giuseppe

2016-06-16 21:14 GMT+02:00 Asaf Bartov :
> Hullo everyone.
>
> I was asked by a volunteer for help getting stats on the gender gap in
> content on a certain Wikipedia, and came up with simple Wikidata Query
> Service[1] queries that pulled the total number of articles on a given
> Wikipedia about men and about women, to calculate *the proportion of
> articles about women out of all articles about humans*.
>
> Then I was curious about how that wiki compared to other wikis, so I ran
> the queries on a bunch of languages, and gathered the results into a table,
> here:
> https://meta.wikimedia.org/wiki/User:Ijon/Content_gap
>
> (please see the *caveat* there.)
>
> I don't have time to fully write-up everything I find interesting in those
> results, but I will quickly point out the following:
>
> 1. The Nepali statistic is simply astonishing! There must be a story
> there.  I'm keen on learning more about this, if anyone can shed light.
>
> 2. Evidently, ~13%-17% seems like a robust average of the proportion of
> articles about women among all biographies.
>
> 3. among the top 10 largest wikis, Japanese is the least imbalanced.  Good
> job, Japanese Wikipedians!  I wonder if you have a good sense of what
> drives this relatively better balance. (my instinctive guess is pop culture
> coverage.)
>
> 4. among the top 10 largest wikis, Russian is the most imbalanced.
>
> 5. I intend to re-generate these stats every two months or so, to
> eventually have some sense of trends and changes.
>
> 6. Your efforts, particularly on small-to-medium wikis, can really make a
> dent in these numbers!  For example, it seems I am personally
> responsible[2] for almost 1% of the coverage of women on Hebrew Wikipedia!
> :)
>
> 7. I encourage you to share these numbers with your communities.  Perhaps
> you'd like to overtake the wiki just above yours? :)
>
> 8. I'm happy to add additional languages to the table, by request.  Or you
> can do it yourself, too. :)
>
>A.
>
> [1] https://query.wikidata.org/
> [2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays
> --
> Asaf Bartov
> Wikimedia Foundation 
>
> Imagine a world in which every single human being can freely share in the
> sum of all knowledge. Help us make it a reality!
> https://donate.wikimedia.org
> ___
> Wikimedia-l mailing list, guidelines at: 
> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> New messages to: Wikimedia-l@lists.wikimedia.org
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
> 

___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-16 Thread Florence Devouard

Hello Asaf

Just making sure that you knew about WHGI : 
http://whgi.wmflabs.org/gender-by-language.html


Do you know if there are differences in analysis between the two analysis ?

I checked a few figures and it fits pretty well.

Flo


Le 16/06/16 à 21:14, Asaf Bartov a écrit :

Hullo everyone.

I was asked by a volunteer for help getting stats on the gender gap in
content on a certain Wikipedia, and came up with simple Wikidata Query
Service[1] queries that pulled the total number of articles on a given
Wikipedia about men and about women, to calculate *the proportion of
articles about women out of all articles about humans*.

Then I was curious about how that wiki compared to other wikis, so I ran
the queries on a bunch of languages, and gathered the results into a table,
here:
https://meta.wikimedia.org/wiki/User:Ijon/Content_gap

(please see the *caveat* there.)

I don't have time to fully write-up everything I find interesting in those
results, but I will quickly point out the following:

1. The Nepali statistic is simply astonishing! There must be a story
there.  I'm keen on learning more about this, if anyone can shed light.

2. Evidently, ~13%-17% seems like a robust average of the proportion of
articles about women among all biographies.

3. among the top 10 largest wikis, Japanese is the least imbalanced.  Good
job, Japanese Wikipedians!  I wonder if you have a good sense of what
drives this relatively better balance. (my instinctive guess is pop culture
coverage.)

4. among the top 10 largest wikis, Russian is the most imbalanced.

5. I intend to re-generate these stats every two months or so, to
eventually have some sense of trends and changes.

6. Your efforts, particularly on small-to-medium wikis, can really make a
dent in these numbers!  For example, it seems I am personally
responsible[2] for almost 1% of the coverage of women on Hebrew Wikipedia!
:)

7. I encourage you to share these numbers with your communities.  Perhaps
you'd like to overtake the wiki just above yours? :)

8. I'm happy to add additional languages to the table, by request.  Or you
can do it yourself, too. :)

   A.

[1] https://query.wikidata.org/
[2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays





___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-16 Thread Shani
This is so cool! Thanks. :-)

(CCing some wiki-women from He-Wiki).

Shani.
On 16 Jun 2016 23:29, "Alex Wang"  wrote:

> Thanks, Asaf!
>
> This is very useful and interesting. There is much work to be done.
>
> Cheers,
>
> Alex
>
> On Thu, Jun 16, 2016 at 1:19 PM, Anna Stillwell 
> wrote:
>
> > Thank you for this, Asaf. Very useful.
> > /a
> >
> > On Thu, Jun 16, 2016 at 12:14 PM, Asaf Bartov 
> > wrote:
> >
> > > Hullo everyone.
> > >
> > > I was asked by a volunteer for help getting stats on the gender gap in
> > > content on a certain Wikipedia, and came up with simple Wikidata Query
> > > Service[1] queries that pulled the total number of articles on a given
> > > Wikipedia about men and about women, to calculate *the proportion of
> > > articles about women out of all articles about humans*.
> > >
> > > Then I was curious about how that wiki compared to other wikis, so I
> ran
> > > the queries on a bunch of languages, and gathered the results into a
> > table,
> > > here:
> > > https://meta.wikimedia.org/wiki/User:Ijon/Content_gap
> > >
> > > (please see the *caveat* there.)
> > >
> > > I don't have time to fully write-up everything I find interesting in
> > those
> > > results, but I will quickly point out the following:
> > >
> > > 1. The Nepali statistic is simply astonishing! There must be a story
> > > there.  I'm keen on learning more about this, if anyone can shed light.
> > >
> > > 2. Evidently, ~13%-17% seems like a robust average of the proportion of
> > > articles about women among all biographies.
> > >
> > > 3. among the top 10 largest wikis, Japanese is the least imbalanced.
> > Good
> > > job, Japanese Wikipedians!  I wonder if you have a good sense of what
> > > drives this relatively better balance. (my instinctive guess is pop
> > culture
> > > coverage.)
> > >
> > > 4. among the top 10 largest wikis, Russian is the most imbalanced.
> > >
> > > 5. I intend to re-generate these stats every two months or so, to
> > > eventually have some sense of trends and changes.
> > >
> > > 6. Your efforts, particularly on small-to-medium wikis, can really
> make a
> > > dent in these numbers!  For example, it seems I am personally
> > > responsible[2] for almost 1% of the coverage of women on Hebrew
> > Wikipedia!
> > > :)
> > >
> > > 7. I encourage you to share these numbers with your communities.
> Perhaps
> > > you'd like to overtake the wiki just above yours? :)
> > >
> > > 8. I'm happy to add additional languages to the table, by request.  Or
> > you
> > > can do it yourself, too. :)
> > >
> > >A.
> > >
> > > [1] https://query.wikidata.org/
> > > [2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays
> > > --
> > > Asaf Bartov
> > > Wikimedia Foundation 
> > >
> > > Imagine a world in which every single human being can freely share in
> the
> > > sum of all knowledge. Help us make it a reality!
> > > https://donate.wikimedia.org
> > > ___
> > > Wikimedia-l mailing list, guidelines at:
> > > https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> > > New messages to: Wikimedia-l@lists.wikimedia.org
> > > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> > > 
> >
> >
> >
> >
> > --
> > Anna Stillwell
> > Director of Culture and Collaboration
> > Wikimedia Foundation
> > 415.806.1536
> > *www.wikimediafoundation.org *
> > ___
> > Wikimedia-l mailing list, guidelines at:
> > https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> > New messages to: Wikimedia-l@lists.wikimedia.org
> > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> > 
> >
>
>
>
> --
> Alexandra Wang
> Program Officer
> Community Resources
> Wikimedia Foundation 
> +1 415-839-6885
> Skype: alexvwang
> ___
> Wikimedia-l mailing list, guidelines at:
> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> New messages to: Wikimedia-l@lists.wikimedia.org
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> 
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-16 Thread Alex Wang
Thanks, Asaf!

This is very useful and interesting. There is much work to be done.

Cheers,

Alex

On Thu, Jun 16, 2016 at 1:19 PM, Anna Stillwell 
wrote:

> Thank you for this, Asaf. Very useful.
> /a
>
> On Thu, Jun 16, 2016 at 12:14 PM, Asaf Bartov 
> wrote:
>
> > Hullo everyone.
> >
> > I was asked by a volunteer for help getting stats on the gender gap in
> > content on a certain Wikipedia, and came up with simple Wikidata Query
> > Service[1] queries that pulled the total number of articles on a given
> > Wikipedia about men and about women, to calculate *the proportion of
> > articles about women out of all articles about humans*.
> >
> > Then I was curious about how that wiki compared to other wikis, so I ran
> > the queries on a bunch of languages, and gathered the results into a
> table,
> > here:
> > https://meta.wikimedia.org/wiki/User:Ijon/Content_gap
> >
> > (please see the *caveat* there.)
> >
> > I don't have time to fully write-up everything I find interesting in
> those
> > results, but I will quickly point out the following:
> >
> > 1. The Nepali statistic is simply astonishing! There must be a story
> > there.  I'm keen on learning more about this, if anyone can shed light.
> >
> > 2. Evidently, ~13%-17% seems like a robust average of the proportion of
> > articles about women among all biographies.
> >
> > 3. among the top 10 largest wikis, Japanese is the least imbalanced.
> Good
> > job, Japanese Wikipedians!  I wonder if you have a good sense of what
> > drives this relatively better balance. (my instinctive guess is pop
> culture
> > coverage.)
> >
> > 4. among the top 10 largest wikis, Russian is the most imbalanced.
> >
> > 5. I intend to re-generate these stats every two months or so, to
> > eventually have some sense of trends and changes.
> >
> > 6. Your efforts, particularly on small-to-medium wikis, can really make a
> > dent in these numbers!  For example, it seems I am personally
> > responsible[2] for almost 1% of the coverage of women on Hebrew
> Wikipedia!
> > :)
> >
> > 7. I encourage you to share these numbers with your communities.  Perhaps
> > you'd like to overtake the wiki just above yours? :)
> >
> > 8. I'm happy to add additional languages to the table, by request.  Or
> you
> > can do it yourself, too. :)
> >
> >A.
> >
> > [1] https://query.wikidata.org/
> > [2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays
> > --
> > Asaf Bartov
> > Wikimedia Foundation 
> >
> > Imagine a world in which every single human being can freely share in the
> > sum of all knowledge. Help us make it a reality!
> > https://donate.wikimedia.org
> > ___
> > Wikimedia-l mailing list, guidelines at:
> > https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> > New messages to: Wikimedia-l@lists.wikimedia.org
> > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> > 
>
>
>
>
> --
> Anna Stillwell
> Director of Culture and Collaboration
> Wikimedia Foundation
> 415.806.1536
> *www.wikimediafoundation.org *
> ___
> Wikimedia-l mailing list, guidelines at:
> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> New messages to: Wikimedia-l@lists.wikimedia.org
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> 
>



-- 
Alexandra Wang
Program Officer
Community Resources
Wikimedia Foundation 
+1 415-839-6885
Skype: alexvwang
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


Re: [Wikimedia-l] Fresh data on the gender gap in content

2016-06-16 Thread Anna Stillwell
Thank you for this, Asaf. Very useful.
/a

On Thu, Jun 16, 2016 at 12:14 PM, Asaf Bartov  wrote:

> Hullo everyone.
>
> I was asked by a volunteer for help getting stats on the gender gap in
> content on a certain Wikipedia, and came up with simple Wikidata Query
> Service[1] queries that pulled the total number of articles on a given
> Wikipedia about men and about women, to calculate *the proportion of
> articles about women out of all articles about humans*.
>
> Then I was curious about how that wiki compared to other wikis, so I ran
> the queries on a bunch of languages, and gathered the results into a table,
> here:
> https://meta.wikimedia.org/wiki/User:Ijon/Content_gap
>
> (please see the *caveat* there.)
>
> I don't have time to fully write-up everything I find interesting in those
> results, but I will quickly point out the following:
>
> 1. The Nepali statistic is simply astonishing! There must be a story
> there.  I'm keen on learning more about this, if anyone can shed light.
>
> 2. Evidently, ~13%-17% seems like a robust average of the proportion of
> articles about women among all biographies.
>
> 3. among the top 10 largest wikis, Japanese is the least imbalanced.  Good
> job, Japanese Wikipedians!  I wonder if you have a good sense of what
> drives this relatively better balance. (my instinctive guess is pop culture
> coverage.)
>
> 4. among the top 10 largest wikis, Russian is the most imbalanced.
>
> 5. I intend to re-generate these stats every two months or so, to
> eventually have some sense of trends and changes.
>
> 6. Your efforts, particularly on small-to-medium wikis, can really make a
> dent in these numbers!  For example, it seems I am personally
> responsible[2] for almost 1% of the coverage of women on Hebrew Wikipedia!
> :)
>
> 7. I encourage you to share these numbers with your communities.  Perhaps
> you'd like to overtake the wiki just above yours? :)
>
> 8. I'm happy to add additional languages to the table, by request.  Or you
> can do it yourself, too. :)
>
>A.
>
> [1] https://query.wikidata.org/
> [2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays
> --
> Asaf Bartov
> Wikimedia Foundation 
>
> Imagine a world in which every single human being can freely share in the
> sum of all knowledge. Help us make it a reality!
> https://donate.wikimedia.org
> ___
> Wikimedia-l mailing list, guidelines at:
> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> New messages to: Wikimedia-l@lists.wikimedia.org
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> 




-- 
Anna Stillwell
Director of Culture and Collaboration
Wikimedia Foundation
415.806.1536
*www.wikimediafoundation.org *
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


[Wikimedia-l] Fresh data on the gender gap in content

2016-06-16 Thread Asaf Bartov
Hullo everyone.

I was asked by a volunteer for help getting stats on the gender gap in
content on a certain Wikipedia, and came up with simple Wikidata Query
Service[1] queries that pulled the total number of articles on a given
Wikipedia about men and about women, to calculate *the proportion of
articles about women out of all articles about humans*.

Then I was curious about how that wiki compared to other wikis, so I ran
the queries on a bunch of languages, and gathered the results into a table,
here:
https://meta.wikimedia.org/wiki/User:Ijon/Content_gap

(please see the *caveat* there.)

I don't have time to fully write-up everything I find interesting in those
results, but I will quickly point out the following:

1. The Nepali statistic is simply astonishing! There must be a story
there.  I'm keen on learning more about this, if anyone can shed light.

2. Evidently, ~13%-17% seems like a robust average of the proportion of
articles about women among all biographies.

3. among the top 10 largest wikis, Japanese is the least imbalanced.  Good
job, Japanese Wikipedians!  I wonder if you have a good sense of what
drives this relatively better balance. (my instinctive guess is pop culture
coverage.)

4. among the top 10 largest wikis, Russian is the most imbalanced.

5. I intend to re-generate these stats every two months or so, to
eventually have some sense of trends and changes.

6. Your efforts, particularly on small-to-medium wikis, can really make a
dent in these numbers!  For example, it seems I am personally
responsible[2] for almost 1% of the coverage of women on Hebrew Wikipedia!
:)

7. I encourage you to share these numbers with your communities.  Perhaps
you'd like to overtake the wiki just above yours? :)

8. I'm happy to add additional languages to the table, by request.  Or you
can do it yourself, too. :)

   A.

[1] https://query.wikidata.org/
[2] Yay #100wikidays :) https://meta.wikimedia.org/wiki/100wikidays
-- 
Asaf Bartov
Wikimedia Foundation 

Imagine a world in which every single human being can freely share in the
sum of all knowledge. Help us make it a reality!
https://donate.wikimedia.org
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,