[Wikimedia-l] Re: Results of the Universal Code of Conduct Enforcement Guidelines Vote

2023-02-15 Thread F. Xavier Dengra i Grau via Wikimedia-l
Thank you, Adel, for adding very valuable input and thoughts on my initial 
concern. You explained the turnout, participation and evaluation issues of 
such an enforcement much better than I did in my first email.

Kind regards/Salutacions,

Xavier Dengra

On Wed, Feb 15, 2023 at 18:19,  wrote:

> Hello, I am not objecting to the results or the policy; on the contrary, it
> is a good thing from an organizational standpoint and for combating
> harassment and abuse. But I have some observations about the process, not
> the content:
> * The turnout is very low: 3097 voters participated out of 68745 eligible
> voters
> * The number of voters approving the enforcement was 2,290, meaning less
> than 4% of the active global community
> * Only 3 communities accounted for more than half of the votes, and these
> communities are mostly from Western Europe and North America
> And the questions:
> * Why are these statistics not mentioned and analyzed, and their impact on
> the future inferred? The remaining communities did not participate, or
> their participation was minimal; either they are not interested, or there
> was no strong campaign to attract attention, or there were other reasons,
> which may ultimately lead to the policy not being adopted at all, or to
> them treating it as an imposed law
> * Doesn't the participation of only the strong communities reinforce the
> Western conception of how enforcement should work?
> * Is the other communities' lack of interest a warning sign of large gaps
> that must be fixed, or do we carry on as if the matter were of no
> importance?
> I wish there had been an in-depth reading from which we could draw
> conclusions for the future development of the movement and to push it
> forward, instead of tables and numbers that many of us may not understand
> and that may give a wrong impression
> Regards
___
Wikimedia-l mailing list -- wikimedia-l@lists.wikimedia.org, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines and 
https://meta.wikimedia.org/wiki/Wikimedia-l
Public archives at 
https://lists.wikimedia.org/hyperkitty/list/wikimedia-l@lists.wikimedia.org/message/PXM3A3MVDC7YGGQBCDJ5UFOX77DJTHYF/
To unsubscribe send an email to wikimedia-l-le...@lists.wikimedia.org

[Wikimedia-l] Re: Chat GPT

2023-02-15 Thread Eduardo Testart
Hi,

This podcast might be interesting for some on this thread:
https://www.nytimes.com/2023/02/15/podcasts/the-daily/chat-gpt-microsoft-bing-artificial-intelligence.html

There might be a chance that something different or new is happening.

Who knows...


Best,

On Mon, Feb 6, 2023, 07:26 Peter Southwood 
wrote:

> It would depend on whether it uses the text or the information/data. My
> guess is that the more it uses its own words, the more drift in meaning
> there will be, and the less reliable the result, but I have no way to test
> this hypothesis.
>
>  Cheers, Peter
>
>
>
> *From:* Ilario Valdelli [mailto:valde...@gmail.com]
> *Sent:* 06 February 2023 09:38
> *To:* Wikimedia Mailing List
> *Subject:* [Wikimedia-l] Re: Chat GPT
>
>
>
> And this is a problem.
>
>
>
> If ChatGPT uses open content, there is an infringement of license.
>
>
>
> Specifically the CC-by-sa if it uses Wikipedia. In this case the
> attribution must be present.
>
>
>
> Kind regards
>
>
>
> On Sun, 5 Feb 2023, 08:12 Peter Southwood, 
> wrote:
>
> “Not citing sources is probably a conscious design choice, as citing
> sources would mean sharing the sources used to train the language models”
> This may be a choice that comes back to bite them. Without citing their
> sources, they are unreliable as a source for anything one does not know
> already. Someone will have a bad consequence from relying on the
> information and will sue the publisher. It will be interesting to see how
> they plan to weasel their way out of legal responsibility while retaining
> any credibility. My guess is there will be a requirement to state that the
> information is AI generated and of entirely unknown and untested
> reliability. How soon to the first class action, I wonder. Lots of money
> for the lawyers. Cheers, Peter.
>
>
>
> *From:* Subhashish [mailto:psubhash...@gmail.com]
> *Sent:* 05 February 2023 06:37
> *To:* Wikimedia Mailing List
> *Subject:* [Wikimedia-l] Re: Chat GPT
>
>
>
> Just to clarify, my point was not about Getty to begin with. Whether Getty
> would win and whether a big corporation should own such a large amount of
> visual content are questions outside this particular thread. It would
> certainly be interesting to see how things roll.
>
>
>
> But AI/ML is way more than just looking. Training with large models is a
> very sophisticated and technical process. Data annotation, among many other
> forms of labour, is done by real people. The article I had linked earlier
> tells a lot about the real-world consequences of AI. I'm certain AI/ML,
> especially when we're talking about language models like ChatGPT, is far
> from innocent looking/reading. For starters, derivatives of works, except
> Public Domain ones, must attribute the authors. Any provision for
> attribution is deliberately removed from systems like ChatGPT and that only
> gives corporations like OpenAI a free ride sans accountability.
>
>
>
> Subhashish
>
>
>
>
>
> On Sat, Feb 4, 2023, 4:41 PM Todd Allen  wrote:
>
> I'm not so sure Getty's got a case, though. If the images are on the Web,
> is using them to train an AI something copyright would cover? That to me
> seems more equivalent to just looking at the images, and there's no
> copyright problem in going to Getty's site and just looking at a bunch of
> their pictures.
>
>
>
> But it will be interesting to see how that one shakes out.
>
>
>
> Todd
>
>
>
> On Sat, Feb 4, 2023 at 11:47 AM Subhashish  wrote:
>
> Not citing sources is probably a conscious design choice, as citing
> sources would mean sharing the sources used to train the language models.
> Getty has just sued Stability AI, alleging the use of 12 million
> photographs without permission or compensation. Imagine if Stability had to
> purchase from Getty through a legal process. For starters, Getty might not
> have agreed in the first place. Bulk-scraping publicly visible text in
> text-based AIs like ChatGPT would mean scraping text with copyright. But
> even reusing CC BY-SA content would require attribution. None of the AI
> platforms attributes their sources because they did not acquire content in
> legal and ethical ways [1]. Large language models won't be large and
> releases won't happen fast if they actually start acquiring content
> gradually from trustworthy sources. It took so many years for hundreds and
> thousands of Wikimedians to take Wikipedias in different languages to where
> they are for a reason.
>
>
>
> 1. https://time.com/6247678/openai-chatgpt-kenya-workers/
>
>
> Subhashish
>
>
>
>
>
> On Sat, Feb 4, 2023 at 1:06 PM Peter Southwood <
> peter.southw...@telkomsa.net> wrote:
>
> From what I have seen the AIs are not great on citing sources. If they
> start citing reliable sources, their contributions can be verified, or not.
> If they produce verifiable, adequately sourced, well written information,
> are they a problem or a solution?
>
> Cheers,
>
> Peter
>
>
>
> *From:* Gnangarra [mailto:gnanga...@gmail.com]
> *Sent:* 04 February 2023 

[Wikimedia-l] Re: Information about Wikimedia Hackathon 2023 satellite events

2023-02-15 Thread Ali Kia
Hi.
Thank you for your cooperation.

On Thu, Feb 16, 2023 at 5:13, Srishti Sethi 
wrote:

> Hello everyone,
>
> If you are interested in organizing or joining a hackathon event, but
> cannot attend the in-person Hackathon event in May in Athens, Greece, this
> email is for you!
>
> We encourage communities, user groups or chapters to organize satellite
> events connected to the in-person Hackathon. These events are to be
> organized autonomously and share the hackathon's purpose: bringing the
> global technical community together to connect, hack, run technical
> discussions, and explore new ideas.
>
> You can work with your wiki community to organize these events before,
> during, or after the main event to onboard newcomers to the technical
> aspects of the Wikimedia movement, or to host watch parties or meetups in
> your region as an alternative for people who cannot join the in-person
> event in Athens.
>
> To obtain help with organizing an event, you can apply for funds via the
> *Rapid Grants* maintained by the Wikimedia Foundation. The deadline to apply for
> funding is *March 20*. When preparing for your event, you can reach out
> to the Hackathon organizing team for support with resources, designing the
> program, and guidance on getting involved in the global event.
>
> Learn more about the satellite events, funding process, and a checklist
> for organizing on the wiki page: <
> https://www.mediawiki.org/wiki/Wikimedia_Hackathon_2023/Satellite_events>
> [1]
>
> Cheers,
> Srishti
>
> On behalf of the Hackathon organizing team
>
> [1]
> https://www.mediawiki.org/wiki/Wikimedia_Hackathon_2023/Satellite_events
>
> *Srishti Sethi*
> Senior Developer Advocate
> Wikimedia Foundation 
>

[Wikimedia-l] Re: Results of the Universal Code of Conduct Enforcement Guidelines Vote

2023-02-15 Thread Adel . nehaoua . wiki
Hello, I am not objecting to the results or the policy; on the contrary, it
is a good thing from an organizational standpoint and for combating
harassment and abuse. But I have some observations about the process, not
the content:
* The turnout is very low: 3097 voters participated out of 68745 eligible
voters
* The number of voters approving the enforcement was 2,290, meaning less
than 4% of the active global community
* Only 3 communities accounted for more than half of the votes, and these
communities are mostly from Western Europe and North America
And the questions:
* Why are these statistics not mentioned and analyzed, and their impact on
the future inferred? The remaining communities did not participate, or
their participation was minimal; either they are not interested, or there
was no strong campaign to attract attention, or there were other reasons,
which may ultimately lead to the policy not being adopted at all, or to
them treating it as an imposed law
* Doesn't the participation of only the strong communities reinforce the
Western conception of how enforcement should work?
* Is the other communities' lack of interest a warning sign of large gaps
that must be fixed, or do we carry on as if the matter were of no
importance?
I wish there had been an in-depth reading from which we could draw
conclusions for the future development of the movement and to push it
forward, instead of tables and numbers that many of us may not understand
and that may give a wrong impression
Regards

[Wikimedia-l] Re: [Wikimedia Research Showcase] February 15 at 9:30AM PT, 17:30 UTC

2023-02-15 Thread Ali Kia
Hi.
Thank you for your cooperation.

On Wed, Feb 15, 2023 at 20:02, Emily Lescak 
wrote:

> A reminder that this is starting in about an hour! We hope you can join us!
>
> Best,
> Emily
>
> On Wed, Feb 8, 2023 at 2:27 PM Emily Lescak  wrote:
>
>> Hello everyone,
>>
>> The next Research Showcase will be livestreamed next Wednesday, February
>> 15 at 9:30AM PT / 17:30 UTC. The theme is The Free Knowledge Ecosystem.
>>
>> YouTube stream: https://www.youtube.com/watch?v=8VJmR-3lTac
>>
>> We welcome you to join the conversation on IRC at #wikimedia-research.
>> You can also watch our past research showcases:
>> https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
>>
>> This month's presentations:
>>
>> The evolution of humanitarian mapping in OpenStreetMap (OSM) and how it
>> affects map completeness and inequalities in OSM
>> By *Benjamin Herfort, Heidelberg Institute for Geoinformation Technology*
>>
>> Mapping efforts of
>> communities in OpenStreetMap (OSM) over the previous decade have created a
>> unique global geographic database, which is accessible to all with no
>> licensing costs. The collaborative maps of OSM have been used to support
>> humanitarian efforts around the world as well as to fill important data
>> gaps for implementing major development frameworks such as the Sustainable
>> Development Goals (SDGs). Besides the well-examined Global North - Global
>> South bias in OSM, the OSM data as of 2023 shows a much more spatially
>> diverse spread pattern than previously considered, which was shaped by
>> regional, socio-economic and demographic factors across several scales.
>> Humanitarian mapping efforts of the previous decade have already made OSM
>> more inclusive, contributing to diversify and expand the spatial footprint
>> of the areas mapped. However, methods to quantify and account for the
>> remaining biases in OSM’s coverage are needed so that researchers and
>> practitioners will be able to draw the right conclusions, e.g. about
>> progress towards the SDGs in cities.
>>
>>
>> Dataset reuse: Toward translating principles to practice
>> By *Laura Koesten, University of Vienna*
>>
>> The web provides access to millions of
>> datasets. These data can have additional impact when used beyond the
>> context for which they were originally created. But using a dataset beyond
>> the context in which it originated remains challenging. Simply making data
>> available does not mean it will be or can be easily used by others. At the
>> same time, we have little empirical insight into what makes a dataset
>> reusable and which of the existing guidelines and frameworks have an
>> impact. In this talk, I will discuss our research on what makes data
>> reusable in practice. This is informed by a synthesis of literature on the
>> topic, our studies on how people evaluate and make sense of data, and a
>> case study on datasets on GitHub. In the case study, we describe a corpus
>> of more than 1.4 million data files from over 65,000 repositories. Building
>> on reuse features from the literature, we use GitHub’s engagement metrics
>> as proxies for dataset reuse and devise an initial model, using deep neural
>> networks, to predict a dataset’s reusability. This demonstrates the
>> practical gap between principles and actionable insights that might allow
>> data publishers and tool designers to implement functionalities that
>> facilitate reuse.
>> We hope you can join us!
>>
>> Warm regards,
>> Emily
>>
>>
>> --
>> Emily Lescak (she / her)
>> Senior Research Community Officer
>> The Wikimedia Foundation
>>

[Wikimedia-l] Information about Wikimedia Hackathon 2023 satellite events

2023-02-15 Thread Srishti Sethi
Hello everyone,

If you are interested in organizing or joining a hackathon event, but
cannot attend the in-person Hackathon event in May in Athens, Greece, this
email is for you!

We encourage communities, user groups or chapters to organize satellite
events connected to the in-person Hackathon. These events are to be
organized autonomously and share the hackathon's purpose: bringing the
global technical community together to connect, hack, run technical
discussions, and explore new ideas.

You can work with your wiki community to organize these events before,
during, or after the main event to onboard newcomers to the technical
aspects of the Wikimedia movement, or to host watch parties or meetups in
your region as an alternative for people who cannot join the in-person
event in Athens.

To obtain help with organizing an event, you can apply for funds via the
*Rapid Grants* maintained by the Wikimedia Foundation. The deadline to apply for
funding is *March 20*. When preparing for your event, you can reach out to
the Hackathon organizing team for support with resources, designing the
program, and guidance on getting involved in the global event.

Learn more about the satellite events, funding process, and a checklist for
organizing on the wiki page: <
https://www.mediawiki.org/wiki/Wikimedia_Hackathon_2023/Satellite_events>
[1]

Cheers,
Srishti

On behalf of the Hackathon organizing team

[1] https://www.mediawiki.org/wiki/Wikimedia_Hackathon_2023/Satellite_events

*Srishti Sethi*
Senior Developer Advocate
Wikimedia Foundation 

[Wikimedia-l] Re: [Wikimedia Research Showcase] February 15 at 9:30AM PT, 17:30 UTC

2023-02-15 Thread Emily Lescak
A reminder that this is starting in about an hour! We hope you can join us!

Best,
Emily

On Wed, Feb 8, 2023 at 2:27 PM Emily Lescak  wrote:

> Hello everyone,
>
> The next Research Showcase will be livestreamed next Wednesday, February
> 15 at 9:30AM PT / 17:30 UTC. The theme is The Free Knowledge Ecosystem.
>
> YouTube stream: https://www.youtube.com/watch?v=8VJmR-3lTac
>
> We welcome you to join the conversation on IRC at #wikimedia-research. You
> can also watch our past research showcases:
> https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
>
> This month's presentations:
>
> The evolution of humanitarian mapping in OpenStreetMap (OSM) and how it
> affects map completeness and inequalities in OSM
> By *Benjamin Herfort, Heidelberg Institute for Geoinformation Technology*
>
> Mapping efforts of
> communities in OpenStreetMap (OSM) over the previous decade have created a
> unique global geographic database, which is accessible to all with no
> licensing costs. The collaborative maps of OSM have been used to support
> humanitarian efforts around the world as well as to fill important data
> gaps for implementing major development frameworks such as the Sustainable
> Development Goals (SDGs). Besides the well-examined Global North - Global
> South bias in OSM, the OSM data as of 2023 shows a much more spatially
> diverse spread pattern than previously considered, which was shaped by
> regional, socio-economic and demographic factors across several scales.
> Humanitarian mapping efforts of the previous decade have already made OSM
> more inclusive, contributing to diversify and expand the spatial footprint
> of the areas mapped. However, methods to quantify and account for the
> remaining biases in OSM’s coverage are needed so that researchers and
> practitioners will be able to draw the right conclusions, e.g. about
> progress towards the SDGs in cities.
>
>
> Dataset reuse: Toward translating principles to practice
> By *Laura Koesten, University of Vienna*
>
> The web provides access to millions of
> datasets. These data can have additional impact when used beyond the
> context for which they were originally created. But using a dataset beyond
> the context in which it originated remains challenging. Simply making data
> available does not mean it will be or can be easily used by others. At the
> same time, we have little empirical insight into what makes a dataset
> reusable and which of the existing guidelines and frameworks have an
> impact. In this talk, I will discuss our research on what makes data
> reusable in practice. This is informed by a synthesis of literature on the
> topic, our studies on how people evaluate and make sense of data, and a
> case study on datasets on GitHub. In the case study, we describe a corpus
> of more than 1.4 million data files from over 65,000 repositories. Building
> on reuse features from the literature, we use GitHub’s engagement metrics
> as proxies for dataset reuse and devise an initial model, using deep neural
> networks, to predict a dataset’s reusability. This demonstrates the
> practical gap between principles and actionable insights that might allow
> data publishers and tool designers to implement functionalities that
> facilitate reuse.
> We hope you can join us!
>
> Warm regards,
> Emily
>
>
> --
> Emily Lescak (she / her)
> Senior Research Community Officer
> The Wikimedia Foundation
>