On Tue, 18 Feb 2025 at 07:51, Jamie Willoughby <[email protected]> wrote:
> I've spent a long time searching, but the volume of available data is
> overwhelming. Can you recommend a specific Wiki dataset?

Using the Wikidata Query Service:

   https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/Wikidata_Query_Help

you can extract any subset of data from Wikidata, according to your
chosen criteria.

For example, you could query for all the items about museums in a given
country, with their opening dates (where given) and coordinates; all
the cities in the world with a female mayor, and the date they were
elected; the birth and death dates and burial places of everyone born
in New York in 1901, and any awards that they won; all the art
galleries with a painting by van Gogh, the size of the painting, and
the year it was painted.

The permutations are endless.

You can then download that data in a number of formats, such as CSV or
JSON.

Perhaps you can start by thinking of the number of records you need in
your data set, then extracting something appropriate?

-- 
Andy Mabbett
http://pigsonthewing.org.uk
_______________________________________________
Wikidata mailing list -- [email protected]
Public archives at https://lists.wikimedia.org/hyperkitty/list/[email protected]/message/YCSICH4C6SZF7DLY3MKEJ5DXSW6PAOBT/
To unsubscribe send an email to [email protected]
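
To make the first example above concrete (museums in a given country, with opening dates and coordinates), here is a rough Python sketch. It is not part of the original message and rests on several assumptions: that the public endpoint is https://query.wikidata.org/sparql, that Q33506 is "museum", P17 "country", P1619 "date of official opening" and P625 "coordinate location", and that Q145 (United Kingdom) is the country of interest. The function names and the User-Agent string are made up for illustration; please check the identifiers on Wikidata before relying on them.

```python
import urllib.parse
import urllib.request

# Assumed public SPARQL endpoint of the Wikidata Query Service.
ENDPOINT = "https://query.wikidata.org/sparql"


def build_museum_query(country_qid: str, limit: int = 100) -> str:
    """Build a SPARQL query for museums in one country (hypothetical helper).

    The item and property IDs below are assumptions to verify on Wikidata.
    """
    return f"""
SELECT ?museum ?museumLabel ?opened ?coord WHERE {{
  ?museum wdt:P31/wdt:P279* wd:Q33506 ;
          wdt:P17 wd:{country_qid} .
  OPTIONAL {{ ?museum wdt:P1619 ?opened . }}
  OPTIONAL {{ ?museum wdt:P625 ?coord . }}
  SERVICE wikibase:label {{ bd:serviceParam wikibase:language "en" . }}
}}
LIMIT {limit}
""".strip()


def build_request(query: str, accept: str = "text/csv") -> urllib.request.Request:
    """Prepare a GET request; the Accept header selects the download format."""
    url = ENDPOINT + "?" + urllib.parse.urlencode({"query": query})
    headers = {
        "Accept": accept,
        # Placeholder User-Agent; replace with your own contact details.
        "User-Agent": "example-dataset-script/0.1 (contact: you@example.org)",
    }
    return urllib.request.Request(url, headers=headers)


def fetch(query: str, accept: str = "text/csv") -> str:
    """Run the query against the live service and return the response text."""
    with urllib.request.urlopen(build_request(query, accept), timeout=60) as resp:
        return resp.read().decode("utf-8")
```

Calling fetch(build_museum_query("Q145", limit=500)) should return CSV text that can be written straight to a file; passing accept="application/sparql-results+json" asks for JSON instead. The limit parameter is one way to control the number of records in the resulting data set.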
