Re: [Wikidata] Edit history-revisions

2020-09-11 Thread Sumit Asthana
Hi Elisavet,

You can identify reverts using the sha1 checksum of revisions You can use
the mwreverts library[0] to do that in the dump. Editquality[1] repository
has such a use case for detecting reverts. You will not be able to detect
partial reverts but it will detect identity reverts which form majority of
the reverts.

- Regards
Sumit Asthana

[0] - https://pythonhosted.org/mwreverts/
[1] -
https://github.com/wikimedia/editquality/blob/master/editquality/utilities/extract_damaging.py#L160


On Fri, Sep 11, 2020 at 2:55 AM Elisavet Koutsiana <
elisavetkoutsi...@gmail.com> wrote:

> Hello,
>
> I wanted to ask if there is any canonical way to identify deletion,
> reverts etc in the edit history xml files. I can understand that the action
> of every revision is described in the "comment" element of the xml format,
> but is there a code name or number or anything else that will help me to
> identify one revision for example as deletion?
>
> Thank you,
> Elisavet
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Evolution of the community communications roles for Wikidata

2020-09-11 Thread Samuel Klein
Fantastic.  Thanks for maintaining the weekly newsletter + space to
annotate the roadmap!

An infobridge in every box,  SJ

On Fri, Sep 11, 2020 at 4:38 AM Léa Lacroix 
wrote:

> Hello all,
>
> For the past four years, I’ve been working in the software department at
> Wikimedia Germany, taking care of the communication between the Wikidata
> development team  and the
> community, announcing new features, collecting bug reports and feature
> requests from you. On top of that, I’ve been coordinating various projects,
> bringing the WikidataCon
> 
>  to
> life, coordinating the Wikidata decentralized birthday
> , creating a
> prototype for Wikidata Train the Trainers
> , and taking
> care of various onsite or online workshops, meetups and other events.
>
> Over the past years, with the Wikidata community growing, the development
> team growing as well, more and more events happening, and the ecosystem of
> Wikibase users forming a distinct group with different needs, it became
> pretty clear that one person was not enough to keep track of everything and
> provide the best support for the Wikidata editors. That’s the reason why,
> earlier this year, we had the pleasure to announce the arrival of a new
> colleague who you already know from being an active Wikidata editor, Mohammed
> Sadat (WMDE) .
>
> We already started a smooth transition of our roles: while Mohammed will
> become the main person in charge of community communications for the
> Wikidata and Wikibase communities, I will focus more on organizing
> Wikidata-related events and supporting community members with their own
> events and projects. As you may have noticed, Mohammed already took over
> editing the weekly newsletter
> , monitoring
> the social media, and various announcements for Wikidata and Wikibase. As
> for myself, I will not disappear completely from the Wikidata channels: I
> will keep supporting Mohammed on community communication, for example with
> projects like the Wikidata Bridge
> , in which I’ve been
> involved since the start.
>
> During this transition phase, we will review and improve our existing
> communication processes, and you can for example give feedback on the
> experience you had while reporting bugs or feature requests
> .
> Feel free to reach out to Mohammed if you have any questions regarding 
> Wikidata’s
> development roadmap
> .
>
> I’m looking forward to continuing working with you on various projects:
> feel free to contact me if you want to discuss Wikidata-related events,
> training, online events, or any other ideas you have in mind to gather the
> Wikidata community and onboard new editors.
>
> Cheers,
> --
> Léa Lacroix
> Community Engagement Coordinator
>
> Wikimedia Deutschland e.V.
> Tempelhofer Ufer 23-24
> 10963 Berlin
> www.wikimedia.de
>
> Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
>
> Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
> unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt
> für Körperschaften I Berlin, Steuernummer 27/029/42207.
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>


-- 
Samuel Klein  @metasj   w:user:sj  +1 617 529 4266
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


[Wikidata] [ANN] DBpedia Autumn Hackathon, starting Sept 21st

2020-09-11 Thread Sebastian Hellmann

Apologies for cross-posting


Dear DBpedians, Linked Data savvies and Ontologists,


We would like to invite you to join the DBpedia Autumn Hackathon 2020 as 
a new format to contribute to DBpedia, gain fame, win small prizes and 
experience the latest technology provided by DBpedia Association 
members. The hackathon is part of the Knowledge Graphs in Action 
conference on October 6, 2020. Please check here: 
https://wiki.dbpedia.org/meetings/KnowledgeGraphsInAction



# Timeline

 *

   Registration of participants - main communication channel will be
   the #hackathon channel in DBpedia Slack (sign up
   https://dbpedia-slack.herokuapp.com/, then add yourself to the
   channel). If you wish to receive a reminder email on Sep 21st, you
   can leave your email address in this form: https://tinyurl.com/y24ps5jt

 *

   Until September 14th - preparation phase, participating
   organisations prepare details, track formation, additional tracks
   can be proposed, please contact dbpedia-eve...@infai.org
   

 *

   September 21st - Announcement of details for each track, including
   prizes, participating data, demos, tools and tasks. Check updates on
   hackathon website
   https://wiki.dbpedia.org/events/dbpedia-autumn-hackathon-2020

 *

   September 21st to October 1st - hacking period, coordinated via
   DBpedia slack

 *

   October 1st, 23:59 Hawaii Time -  Submission of hacking result (3
   min video and 2-3 paragraph summary with links, if not stated
   otherwise in the track)

 *

   October 5th, 16:00 CEST - Final Event, each track chair presents a
   short recap of the track, announces prizes or summarizes the result
   of hacking.

 *

   October 6th, 9:50 - 15:30 CEST - Knowledge Graphs in Action Event

 *

   Results and videos are documented on the DBpedia Website and the
   DBpedia Youtube channel.


# Member Tracks

The member tracks are hosted by DBpedia Association members, who are 
technology leaders in the area of Knowledge Engineering. Additional 
tracks can be proposed until Sep 14th, please contact 
dbpedia-eve...@infai.org .



 *

   timbr SQL Knowledge Graph: Learn how to model, map and query
   ontologies in timbr and then model an ontology of GDELT, map it to
   the GDELT database, and answer a number of questions that currently
   are quite impossible to get from the BigQuery GDELT database. Cash
   prizes planned. https://www.timbr.ai/

 *

   GNOSS Knowledge Graph Builder: Give meaning to your organisation’s
   documents and data with a Knowledge Graph.
   https://www.gnoss.com/en/products/semantic-framework

 *

   ImageSnippets: Labeling images with semantic descriptions. Use
   DBpedia spotlight and an entity matching lookup to select DBpedia
   terms to describe images. Then explore the resulting dataset through
   searches over inference graphs and explore the ImageSnippets dataset
   through our SPARQL endpoint. Prizes planned.
   http://www.imagesnippets.com

 *

   Diffbot: Build Your Own Knowledge Graph! Use the Natural Language
   API to extract triples from natural language text and expand these
   triples with data from the Diffbot Knowledge Graph (10+ billion
   entities, 1+ trillion facts). Check out the demo
   http://demo.nl.diffbot.com/. All participants will receive access to
   the Diffbot KG and tools for (non-commercial) research for one year
   ($10,000 value).


# Dutch National Knowledge Graph Track

Following the DBpedia FlexiFusion approach, we are currently 
flexi-fusing a huge, dbpedia-style knowledge graph that will connect 
many Linked Data sources and data silos relevant to the country of the 
Netherlands. We hope that this will eventually crystallize a 
well-connected sub-community linked open data (LOD) cloud in the same 
manner as DBpedia crystallized the original LOD cloud with some 
improvements (you could call it LOD Mark II). Data and hackathon details 
will be announced on 21st of September.



# Improve DBpedia Track

A community track, where everybody can participate and contribute in 
improving existing DBpedia components, in particular the extraction 
framework, the mappings, the ontology, data quality test cases, new 
extractors, links and other extensions. Best individual contributions 
will be acknowledged on the DBpedia website by anointing the WebID/Foaf 
profile.


(chaired by Milan Dojchinovski and Marvin Hofer from the DBpedia 
Association & InfAI and the DBpedia Hacking Committee)



# DBpedia Open Innovation Track

(not part of the hackathon, pre-announcement)

For the DBpedia Spring Event 2021, we are planning an Open Innovation 
Track, where DBpedians can showcase their applications. This endeavour 
will not be part of the hackathon as we are looking for significant 
showcases with development effort of months & years built on the core 
infrastructure of DBpedia such as the SPARQL endpoint, the data, lookup, 
spotlight, DBpedia Live, etc. Details will be an

Re: [Wikidata] Evolution of the community communications roles for Wikidata

2020-09-11 Thread Gerard Meijssen
Hoi,
Congratulations to you both.. Good to see how Wikidata matures :)
Thanks,
  GerardM

On Fri, 11 Sep 2020 at 10:38, Léa Lacroix  wrote:

> Hello all,
>
> For the past four years, I’ve been working in the software department at
> Wikimedia Germany, taking care of the communication between the Wikidata
> development team  and the
> community, announcing new features, collecting bug reports and feature
> requests from you. On top of that, I’ve been coordinating various projects,
> bringing the WikidataCon
> 
>  to
> life, coordinating the Wikidata decentralized birthday
> , creating a
> prototype for Wikidata Train the Trainers
> , and taking
> care of various onsite or online workshops, meetups and other events.
>
> Over the past years, with the Wikidata community growing, the development
> team growing as well, more and more events happening, and the ecosystem of
> Wikibase users forming a distinct group with different needs, it became
> pretty clear that one person was not enough to keep track of everything and
> provide the best support for the Wikidata editors. That’s the reason why,
> earlier this year, we had the pleasure to announce the arrival of a new
> colleague who you already know from being an active Wikidata editor, Mohammed
> Sadat (WMDE) .
>
> We already started a smooth transition of our roles: while Mohammed will
> become the main person in charge of community communications for the
> Wikidata and Wikibase communities, I will focus more on organizing
> Wikidata-related events and supporting community members with their own
> events and projects. As you may have noticed, Mohammed already took over
> editing the weekly newsletter
> , monitoring
> the social media, and various announcements for Wikidata and Wikibase. As
> for myself, I will not disappear completely from the Wikidata channels: I
> will keep supporting Mohammed on community communication, for example with
> projects like the Wikidata Bridge
> , in which I’ve been
> involved since the start.
>
> During this transition phase, we will review and improve our existing
> communication processes, and you can for example give feedback on the
> experience you had while reporting bugs or feature requests
> .
> Feel free to reach out to Mohammed if you have any questions regarding 
> Wikidata’s
> development roadmap
> .
>
> I’m looking forward to continuing working with you on various projects:
> feel free to contact me if you want to discuss Wikidata-related events,
> training, online events, or any other ideas you have in mind to gather the
> Wikidata community and onboard new editors.
>
> Cheers,
> --
> Léa Lacroix
> Community Engagement Coordinator
>
> Wikimedia Deutschland e.V.
> Tempelhofer Ufer 23-24
> 10963 Berlin
> www.wikimedia.de
>
> Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
>
> Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
> unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt
> für Körperschaften I Berlin, Steuernummer 27/029/42207.
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


[Wikidata] Evolution of the community communications roles for Wikidata

2020-09-11 Thread Léa Lacroix
Hello all,

For the past four years, I’ve been working in the software department at
Wikimedia Germany, taking care of the communication between the Wikidata
development team  and the
community, announcing new features, collecting bug reports and feature
requests from you. On top of that, I’ve been coordinating various projects,
bringing the WikidataCon

to
life, coordinating the Wikidata decentralized birthday
, creating a
prototype for Wikidata Train the Trainers
, and taking
care of various onsite or online workshops, meetups and other events.

Over the past years, with the Wikidata community growing, the development
team growing as well, more and more events happening, and the ecosystem of
Wikibase users forming a distinct group with different needs, it became
pretty clear that one person was not enough to keep track of everything and
provide the best support for the Wikidata editors. That’s the reason why,
earlier this year, we had the pleasure to announce the arrival of a new
colleague who you already know from being an active Wikidata editor, Mohammed
Sadat (WMDE) .

We already started a smooth transition of our roles: while Mohammed will
become the main person in charge of community communications for the
Wikidata and Wikibase communities, I will focus more on organizing
Wikidata-related events and supporting community members with their own
events and projects. As you may have noticed, Mohammed already took over
editing the weekly newsletter
, monitoring
the social media, and various announcements for Wikidata and Wikibase. As
for myself, I will not disappear completely from the Wikidata channels: I
will keep supporting Mohammed on community communication, for example with
projects like the Wikidata Bridge
, in which I’ve been
involved since the start.

During this transition phase, we will review and improve our existing
communication processes, and you can for example give feedback on the
experience you had while reporting bugs or feature requests
.
Feel free to reach out to Mohammed if you have any questions regarding
Wikidata’s
development roadmap
.

I’m looking forward to continuing working with you on various projects:
feel free to contact me if you want to discuss Wikidata-related events,
training, online events, or any other ideas you have in mind to gather the
Wikidata community and onboard new editors.

Cheers,
-- 
Léa Lacroix
Community Engagement Coordinator

Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de

Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/029/42207.
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata