Re: [Wikidata] Wikidata ontology

2017-01-05 Thread Stas Malyshev
Hi!

> The best you can get in terms of "downloading the wikidata ontology" would be 
> to
> download all properties and all the items representing classes. We currently
> don't have a separate dump for these. Also, do not expect this to be a concise
> or consistent model that can be used for reasoning. You are bound to find
> contradictions and lose ends.

Also, Wikidata Toolkit (https://github.com/Wikidata/Wikidata-Toolkit)
can be used to generate something like taxonomy - see e.g.
http://tools.wmflabs.org/wikidata-exports/rdf/exports/20160801/dump_download.html

But one has to be careful with it as Wikidata may not (and frequently
does not) follow assumptions that are true for proper OWL models - there
are no limits on what can be considered a class, a subclass, an
instance, etc. Same entity can be treated both as class and individual,
and there may be some weird structures, including even outright errors
such as cycles in subclass graph, etc. And, of course, it changes all
the time :)

-- 
Stas Malyshev
smalys...@wikimedia.org

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Wikidata ontology

2017-01-05 Thread Markus Kroetzsch

Hi Rüdiger,

Daniel refers to several independent aspects of Wikidata:

(1) The ontology is not separated from the data. Schematic information 
is mostly managed by encoding it in data as well. Therefore, if you want 
some of it (but not the rest), then some extraction will be necessary. 
The Wikidata SPARQL service is your friend for not-too-big (up to some 
100K triples) on-the-fly data exports, enough to get the whole class 
hierarchy, for example. We also have created some ontology-like excerpts 
in the past [1]. These have been done offline by processing the data 
dump using Wikidata Toolkit.


(2) The ontology is very lightweight. Wikidata mostly encodes properties 
and their types, some hierarchical information on properties and 
classes, and some "weak" hints on things like domain and range for some 
properties. So there are no complex OWL axioms there. This is also the 
reason why the ontology should not contain any logical contradictions -- 
when Daniel refers to "contradictions" I guess he means incoherences in 
the overall modelling (which contradict human intuition).


(3) The ontology may change at any time. This is a consequence of (1) 
and the fact that Wikidata is controlled by a global community.


For all of these reasons, there cannot be one "Wikidata ontology" but 
there might still be many useful ontological things you can get without 
too much effort.


If you are interested in learning about the classes and properties used 
in Wikidata to get an informal idea of its current schema and content, 
then you could also browse this data in SQID [2].


Best regards,

Markus

[1] 
http://tools.wmflabs.org/wikidata-exports/rdf/exports/20160801/dump_download.html

[2] https://tools.wmflabs.org/sqid/#/browse?type=properties

On 05.01.2017 16:15, Daniel Kinzler wrote:

Am 04.01.2017 um 11:00 schrieb Léa Lacroix:

Hello,

You can find it here: http://wikiba.se/ontology-1.0.owl

If you have questions regarding the ontology, feel free to ask.



Please note that this is the *wikibase* ontology, which thefines the meta-model
for the information on Wikidata. It defines models statements, sitelinks, source
references, etc.

This ontology does not model "real world" concepts or properties like location
or color or children, etc. Modeling on this level is done on Wikidata itself,
there is no fixed RDF or OWL schema or ontology.

The best you can get in terms of "downloading the wikidata ontology" would be to
download all properties and all the items representing classes. We currently
don't have a separate dump for these. Also, do not expect this to be a concise
or consistent model that can be used for reasoning. You are bound to find
contradictions and lose ends.




___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Wikidata ontology

2017-01-05 Thread Daniel Kinzler
Am 04.01.2017 um 11:00 schrieb Léa Lacroix:
> Hello,
> 
> You can find it here: http://wikiba.se/ontology-1.0.owl
> 
> If you have questions regarding the ontology, feel free to ask.


Please note that this is the *wikibase* ontology, which thefines the meta-model
for the information on Wikidata. It defines models statements, sitelinks, source
references, etc.

This ontology does not model "real world" concepts or properties like location
or color or children, etc. Modeling on this level is done on Wikidata itself,
there is no fixed RDF or OWL schema or ontology.

The best you can get in terms of "downloading the wikidata ontology" would be to
download all properties and all the items representing classes. We currently
don't have a separate dump for these. Also, do not expect this to be a concise
or consistent model that can be used for reasoning. You are bound to find
contradictions and lose ends.


-- 
Daniel Kinzler
Senior Software Developer

Wikimedia Deutschland
Gesellschaft zur Förderung Freien Wissens e.V.

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2017-01-05 Thread Lydia Pintscher
Hey folks :)

Andy and Pasleim just brought this topic to my attention again. Sorry
for having dropped the ball a bit.
I've created https://phabricator.wikimedia.org/T154660 with a strawman
proposal for the still open question of which length it should be.
Please add your arguments there.


Cheers
Lydia

-- 
Lydia Pintscher - http://about.me/lydia.pintscher
Product Manager for Wikidata

Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de

Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/029/42207.

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] next IRC office hour on January 5th

2017-01-05 Thread Léa Lacroix
Hello all,

Just a quick reminder, our IRC office hour is today, check the information
below :)

Bests, Léa

On 27 December 2016 at 16:54, Lydia Pintscher 
wrote:

> Hey folks :)
>
> We'll do the next office hour on IRC on the 5th of January at 19:00
> Berlin time in #wikimedia-office. See
> https://www.timeanddate.com/worldclock/fixedtime.html?
> hour=18&min=00&sec=0&day=05&month=01&year=2017
> for your time.
> As usual we'll take a look back at the last quarter and see what's
> coming up next. Please let me know if there are any other topics you'd
> like to put on the agenda.
>
>
> Cheers
> Lydia
>
> --
> Lydia Pintscher - http://about.me/lydia.pintscher
> Product Manager for Wikidata
>
> Wikimedia Deutschland e.V.
> Tempelhofer Ufer 23-24
> 10963 Berlin
> www.wikimedia.de
>
> Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
>
> Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
> unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
> Finanzamt für Körperschaften I Berlin, Steuernummer 27/029/42207.
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>



-- 
Léa Lacroix
Project Manager Community Communication for Wikidata

Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de

Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/029/42207.
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata