On Mon, 27 Jul 2020 at 14:54, Wolfgang Fahl <[email protected]> wrote:
> Dear Jena Users, > > at > > http://wiki.bitplan.com/index.php/Get_your_own_copy_of_WikiData > > i have documented several attempts to import WikiData into Jena. After > Jonas Sourlier reported a success using a 4 TB SSD i tried > the same approach. It is documented at: > > http://wiki.bitplan.com/index.php/WikiData_Import_2020-07-15 > > The attempt was limited to the truthy statements but still gives full > access to the full content of WikiData. This is especially important for > longer running > queries. One of the followup questions would be how queries can be sped > up in this environment. > > E.g. i tried: > > SELECT (COUNT(*) as ?Triples) WHERE { ?s ?p ?o} > > > which took 5516 secs to answer that 5.250.681.892 triples are in the data > set. > Thanks for sharing this work! You might also be interested in https://yago-knowledge.org/downloads/yago-4 which maps Wikidata to Schema.org and then offers a couple of subsets. There are a bunch of other discussions around Wikidata subsetting that might be relevant too, see https://www.wikidata.org/wiki/Wikidata:WikiProject_Schemas/Subsetting https://docs.google.com/document/d/1MmrpEQ9O7xA6frNk6gceu_IbQrUiEYGI9vcQjDvTL9c/edit#heading=h.7xg3cywpkgfq and also https://addshore.com/2019/10/your-own-wikidata-query-service-with-no-limits-part-1/ cheers, Dan > Yours > > Wolfgang > > > -- > > Wolfgang Fahl > Pater-Delp-Str. 1, D-47877 Willich Schiefbahn > Tel. +49 2154 811-480, Fax +49 2154 811-481 > Web: http://www.bitplan.de > >
