Hey Mohammed,
thanks for sharing! I've used the Item Quality Evaluator and it seems
very useful indeed. I like the fact that the lowest ranking items are
put on top, so they can get the TLC they deserve. Also, i didn't
realise that the full query editor was integrated in the SPARQL field,
a pleasant surprise! I also really like how in the source many parts
are split into Vue components, which will make reuse in other projects
easier.

I did notice a very large query (all paintings) timed out, but maybe
that is to be expected with such a large set.

Kind regards,
-- Hay

On Tue, Apr 20, 2021 at 6:29 PM Mohammed Sadat Abdulai
<[email protected]> wrote:
>
> Hi everyone,
>
>
> This is to announce that over the past month we started to look at ways to 
> help us all get a better understanding of the quality of Wikidata's data in a 
> specific area of interest. For this purpose we worked on building two tools; 
> an Item Quality Evaluator and a Constraint Violation Checker  - both of these 
> tools are now available at:
>
>
> Item Quality Evaluator
>
> Constraint Violation Checker
>
>
> Data quality on Wikidata has many aspects. The constraint violations and ORES 
> quality scores that these tools use are two helpful indicators of certain 
> aspects of quality that we hope will be helpful for you.
>
>
> As you may know, Wikidata’s data quality is very unevenly distributed - some 
> areas are very well maintained and others not so much. We only currently 
> provide ORES quality scores on a global and per-Item level. This has two 
> effects, however:
>
> Editors taking care of a specific area of Wikidata want to improve that area 
> but currently don’t have an easy way to find the Items with the lowest 
> quality they can focus their time on in order to raise the quality of that 
> area.
>
> Re-user of Wikidata’s data are usually only interested in a subset of 
> Wikidata’s Items and by extension the quality of that subset. It is currently 
> hard for them to know what quality level they are getting for their subset of 
> interest.
>
>
> To address this issue we put together two small tools. The Item Quality 
> Evaluator is a simple website that provides ORES quality scores for a list of 
> Items in Wikidata. The Constraint Violation Checker is a small command-line 
> script that retrieves the number of constraint violations and ORES scores for 
> a list of Items for further analysis.
>
>
> How does the Item Quality Evaluator tool work?
>
> You provide it with a list of Item IDs or a SPARQL query and then it'll get 
> the ORES score for each of them as well as the average score over all the 
> Items you
>
> provided in a nice webpage. This way, you can more easily identify the Items 
> in an area you are interested in that have the lowest quality and improve 
> them.
>
>
> How does the Constraint Violation Checker script work?
>
> When you run it, it outputs a CSV file with the number of statements, the 
> number of constraint violations for each severity level, the number of 
> sitelinks to all projects and to Wikipedia and the ORES score for each of 
> those Items.
>
>
> Why didn't we integrate the constraint violations data into the Item Quality 
> Evaluator?
>
> We want to do that in the long-term but right now it is not possible because 
> the constraint violation data is not easily accessible and retrieving it 
> takes several hours to run for a large list of items.
>
>
> Please try these tools and let us know if you encounter any issues. If you 
> want to provide general feedback, feel free to let us know.
>
>
> Cheers,
>
>
> --
> Mohammed Sadat
> Community Communications Manager for Wikidata/Wikibase
>
> Wikimedia Deutschland e. V. | Tempelhofer Ufer 23-24 | 10963 Berlin
> Phone: +49 (0)30 219 158 26-0
> https://wikimedia.de
>
> Keep up to date! Current news and exciting stories about Wikimedia, Wikipedia 
> and Free Knowledge in our newsletter (in German): Subscribe now.
>
> Imagine a world in which every single human being can freely share in the sum 
> of all knowledge. Help us to achieve our vision!
> https://spenden.wikimedia.de
>
> Wikimedia Deutschland – Gesellschaft zur Förderung Freien Wissens e. V. 
> Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter 
> der Nummer 23855 B. Als gemeinnützig anerkannt durch das Finanzamt für 
> Körperschaften I Berlin, Steuernummer 27/029/42207.
> _______________________________________________
> Wikidata mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wikidata

_______________________________________________
Wikidata mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata

Reply via email to