Re: [Wikidata] Wikidata ontology
Hi! > The best you can get in terms of "downloading the wikidata ontology" would be > to > download all properties and all the items representing classes. We currently > don't have a separate dump for these. Also, do not expect this to be a concise > or consistent model that can be used for reasoning. You are bound to find > contradictions and lose ends. Also, Wikidata Toolkit (https://github.com/Wikidata/Wikidata-Toolkit) can be used to generate something like taxonomy - see e.g. http://tools.wmflabs.org/wikidata-exports/rdf/exports/20160801/dump_download.html But one has to be careful with it as Wikidata may not (and frequently does not) follow assumptions that are true for proper OWL models - there are no limits on what can be considered a class, a subclass, an instance, etc. Same entity can be treated both as class and individual, and there may be some weird structures, including even outright errors such as cycles in subclass graph, etc. And, of course, it changes all the time :) -- Stas Malyshev smalys...@wikimedia.org ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Re: [Wikidata] Wikidata ontology
Hi Rüdiger, Daniel refers to several independent aspects of Wikidata: (1) The ontology is not separated from the data. Schematic information is mostly managed by encoding it in data as well. Therefore, if you want some of it (but not the rest), then some extraction will be necessary. The Wikidata SPARQL service is your friend for not-too-big (up to some 100K triples) on-the-fly data exports, enough to get the whole class hierarchy, for example. We also have created some ontology-like excerpts in the past [1]. These have been done offline by processing the data dump using Wikidata Toolkit. (2) The ontology is very lightweight. Wikidata mostly encodes properties and their types, some hierarchical information on properties and classes, and some "weak" hints on things like domain and range for some properties. So there are no complex OWL axioms there. This is also the reason why the ontology should not contain any logical contradictions -- when Daniel refers to "contradictions" I guess he means incoherences in the overall modelling (which contradict human intuition). (3) The ontology may change at any time. This is a consequence of (1) and the fact that Wikidata is controlled by a global community. For all of these reasons, there cannot be one "Wikidata ontology" but there might still be many useful ontological things you can get without too much effort. If you are interested in learning about the classes and properties used in Wikidata to get an informal idea of its current schema and content, then you could also browse this data in SQID [2]. Best regards, Markus [1] http://tools.wmflabs.org/wikidata-exports/rdf/exports/20160801/dump_download.html [2] https://tools.wmflabs.org/sqid/#/browse?type=properties On 05.01.2017 16:15, Daniel Kinzler wrote: Am 04.01.2017 um 11:00 schrieb Léa Lacroix: Hello, You can find it here: http://wikiba.se/ontology-1.0.owl If you have questions regarding the ontology, feel free to ask. Please note that this is the *wikibase* ontology, which thefines the meta-model for the information on Wikidata. It defines models statements, sitelinks, source references, etc. This ontology does not model "real world" concepts or properties like location or color or children, etc. Modeling on this level is done on Wikidata itself, there is no fixed RDF or OWL schema or ontology. The best you can get in terms of "downloading the wikidata ontology" would be to download all properties and all the items representing classes. We currently don't have a separate dump for these. Also, do not expect this to be a concise or consistent model that can be used for reasoning. You are bound to find contradictions and lose ends. ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Re: [Wikidata] Wikidata ontology
Am 04.01.2017 um 11:00 schrieb Léa Lacroix: > Hello, > > You can find it here: http://wikiba.se/ontology-1.0.owl > > If you have questions regarding the ontology, feel free to ask. Please note that this is the *wikibase* ontology, which thefines the meta-model for the information on Wikidata. It defines models statements, sitelinks, source references, etc. This ontology does not model "real world" concepts or properties like location or color or children, etc. Modeling on this level is done on Wikidata itself, there is no fixed RDF or OWL schema or ontology. The best you can get in terms of "downloading the wikidata ontology" would be to download all properties and all the items representing classes. We currently don't have a separate dump for these. Also, do not expect this to be a concise or consistent model that can be used for reasoning. You are bound to find contradictions and lose ends. -- Daniel Kinzler Senior Software Developer Wikimedia Deutschland Gesellschaft zur Förderung Freien Wissens e.V. ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Re: [Wikidata] Greater than 400 char limit for Wikidata string data types
Hey folks :) Andy and Pasleim just brought this topic to my attention again. Sorry for having dropped the ball a bit. I've created https://phabricator.wikimedia.org/T154660 with a strawman proposal for the still open question of which length it should be. Please add your arguments there. Cheers Lydia -- Lydia Pintscher - http://about.me/lydia.pintscher Product Manager for Wikidata Wikimedia Deutschland e.V. Tempelhofer Ufer 23-24 10963 Berlin www.wikimedia.de Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V. Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für Körperschaften I Berlin, Steuernummer 27/029/42207. ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Re: [Wikidata] next IRC office hour on January 5th
Hello all, Just a quick reminder, our IRC office hour is today, check the information below :) Bests, Léa On 27 December 2016 at 16:54, Lydia Pintscher wrote: > Hey folks :) > > We'll do the next office hour on IRC on the 5th of January at 19:00 > Berlin time in #wikimedia-office. See > https://www.timeanddate.com/worldclock/fixedtime.html? > hour=18&min=00&sec=0&day=05&month=01&year=2017 > for your time. > As usual we'll take a look back at the last quarter and see what's > coming up next. Please let me know if there are any other topics you'd > like to put on the agenda. > > > Cheers > Lydia > > -- > Lydia Pintscher - http://about.me/lydia.pintscher > Product Manager for Wikidata > > Wikimedia Deutschland e.V. > Tempelhofer Ufer 23-24 > 10963 Berlin > www.wikimedia.de > > Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V. > > Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg > unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das > Finanzamt für Körperschaften I Berlin, Steuernummer 27/029/42207. > > ___ > Wikidata mailing list > Wikidata@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikidata > -- Léa Lacroix Project Manager Community Communication for Wikidata Wikimedia Deutschland e.V. Tempelhofer Ufer 23-24 10963 Berlin www.wikimedia.de Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V. Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für Körperschaften I Berlin, Steuernummer 27/029/42207. ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata