On Tue, 18 Feb 2025 at 07:51, Jamie Willoughby <[email protected]> wrote:

> I've spent a long time searching, but the volume of available data is 
> overwhelming. Can you recommend a specific Wiki dataset?

Using the Wikidata Query Service:

   
https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/Wikidata_Query_Help

you can extract any subset of data from Wikidata, according to your
chosen criteria.

For example, you could query for all the items about museums in a
given country, with their opening dates (where given) and coordinates;
all the cities in the world with a female mayor, and the date they
were elected; the birth and death dates and burial places of everyone
born in New York in 1901, and any awards that they won; all the art
galleries with a painting by van Gogh, the size of the painting, and
the year it was painted. The permutations are endless.

You can then download that data in a number of formats, such as CSV or JSON.

Perhaps you can start by thinking of the number of records you need in
your data set, then extracting something appropriate?

-- 
Andy Mabbett
http://pigsonthewing.org.uk
_______________________________________________
Wikidata mailing list -- [email protected]
Public archives at 
https://lists.wikimedia.org/hyperkitty/list/[email protected]/message/YCSICH4C6SZF7DLY3MKEJ5DXSW6PAOBT/
To unsubscribe send an email to [email protected]

Reply via email to