Nikki added a comment.

I don't have any tools but I have used the wb_terms table in Quarry a number of times. I am usually trying to select all terms for a particular language, all terms which match a particular regex or to count how many terms there are. Things like finding labels containing disambiguation information, finding descriptions written like sentences, finding terms containing HTML entities, finding labels which have namespace prefixes when they shouldn't or vice versa, finding misspelt words, listing the most common descriptions for a language...

I use the columns term_full_entity_id (or term_entity_id in older queries), term_entity_type, term_language, term_type and term_text.

It is usually not possible to use SPARQL because the queries are too slow and the timeout for queries in the query service is much lower than for Quarry. In particular, querying for all terms in a particular language is very slow which I already created a ticket for - T167361 .


TASK DETAIL
https://phabricator.wikimedia.org/T197161

EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Nikki
Cc: Nikki, Magnus, Mahir256, Nikerabbit, GoranSMilovanovic, Lea_Lacroix_WMDE, WMDE-leszek, Aklapper, Lahi, Gq86, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to