Re: Pushing back into Wikipedia? Re: ANN: DBpedia 3.2 release, including DBpedia Ontology and RDF links to Freebase

2008-11-18 Thread Sören Auer


Tim Berners-Lee wrote:
 Now that so much clean-up work has been done, has there been any
 discussion of pushing the cleanliness back into the Wikipedia pages
 themselves, so that Wikipedia gains in consistency?

Yes, we have been thinking about this for quite a while. The first step
will be to set up some kind of live synchronization between Wikipedia and
DBpedia. For this we have already been given access to the live stream of
Wikipedia updates by Wikimedia's Brion Vibber. As a second step, the
DBpedia additions will be integrated back into Wikipedia pages as
annotations. As a result, some kind of round-trip engineering between
both would be possible: if people see an error or mistake, they can
correct it in Wikipedia and the correction will show up in DBpedia.
However, we have to be careful not to overstrain Wikipedians, since they
are usually more interested in texts than structure ;-)


Best,

Sören



Re: ANN: DBpedia 3.2 release, including DBpedia Ontology and RDF links to Freebase

2008-11-17 Thread Kingsley Idehen


Chris Bizer wrote:

Hi all,

we are happy to announce the release of DBpedia version 3.2.  


The new knowledge base has been extracted from the October 2008 Wikipedia
dumps. Compared to the last release, the new knowledge base provides three
major improvements:


1. DBpedia Ontology

DBpedia now features a shallow, cross-domain ontology, which has been
manually created based on the most commonly used infoboxes within Wikipedia.
The ontology currently covers over 170 classes, which form a subsumption
hierarchy and have 940 properties. The ontology is instantiated by a new
infobox data extraction method which is based on hand-generated mappings of
Wikipedia infoboxes to the DBpedia ontology. The mappings define
fine-grained rules on how to parse infobox values. The mappings also
compensate for weaknesses in the Wikipedia infobox system, like having
different infoboxes for the same class (currently 350 Wikipedia templates
are mapped to 170 ontology classes), using different property names for the
same property (currently 2350 Wikipedia template properties are mapped to
940 ontology properties), and not having clearly defined datatypes for
property values. Therefore, the instance data within the infobox ontology
is much cleaner and better structured than the infobox data within the
DBpedia infobox dataset that is generated using the old infobox extraction
code. The DBpedia ontology currently contains about 882,000 instances.
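
To make the mapping idea above concrete, here is a minimal Python sketch.
It is not the DBpedia extraction framework: the template names, property
names, and parsing rules are all hypothetical and only illustrate how
several infobox templates and property spellings could be mapped onto one
ontology class and one typed ontology property.

```python
from datetime import date

# Hypothetical template-to-class table, for illustration only
# (these are not DBpedia's real mappings).
TEMPLATE_TO_CLASS = {
    "Infobox actor": "Actor",
    "Infobox film actor": "Actor",
    "Infobox city": "City",
}


def parse_int(raw):
    # Drop thousands separators such as "3,416,255" or "3.416.255".
    return int(raw.replace(",", "").replace(".", "").strip())


def parse_date(raw):
    # Assumes ISO "YYYY-MM-DD"; a real parser must handle many more formats.
    year, month, day = (int(part) for part in raw.strip().split("-"))
    return date(year, month, day)


# Hypothetical table: template property name -> (ontology property, parser).
PROPERTY_MAP = {
    "birth_date": ("birthDate", parse_date),
    "born": ("birthDate", parse_date),
    "population": ("populationTotal", parse_int),
    "pop": ("populationTotal", parse_int),
}


def extract(template, raw_properties):
    """Map one infobox to (ontology class, {ontology property: typed value})."""
    onto_class = TEMPLATE_TO_CLASS.get(template)
    values = {}
    for name, raw in raw_properties.items():
        if name in PROPERTY_MAP:  # unmapped properties are skipped
            onto_prop, parser = PROPERTY_MAP[name]
            values[onto_prop] = parser(raw)
    return onto_class, values
```

In this sketch, two differently named templates end up in the same class,
and "born" and "birth_date" both yield a typed birthDate value.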

More information about the ontology can be found at:
http://wiki.dbpedia.org/Ontology 



2. RDF Links to Freebase

Freebase is an open-license database which provides data about millions of
things from various domains. Freebase has recently released a Linked Data
interface to their content. As there is a big overlap between DBpedia and
Freebase, we have added 2.4 million RDF links to DBpedia pointing at the
corresponding things in Freebase. These links can be used to smush and fuse
data about a thing from DBpedia and Freebase.
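
As a rough illustration of what "smushing" along such links could look
like, here is a minimal Python sketch. It assumes the links are expressed
as owl:sameAs statements and that the data is already available as
(subject, predicate, object) tuples. The function name smush and the
choice of canonical URI are assumptions made for this example only.

```python
from collections import defaultdict

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def smush(triples):
    """Merge descriptions of URIs that are connected by owl:sameAs links."""
    parent = {}  # union-find forest over URIs

    def find(uri):
        parent.setdefault(uri, uri)
        while parent[uri] != uri:
            parent[uri] = parent[parent[uri]]  # path halving
            uri = parent[uri]
        return uri

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # Keep the lexicographically smaller URI as the canonical one.
            keep, drop = sorted((root_a, root_b))
            parent[drop] = keep

    # First pass: group URIs that are declared identical.
    for s, p, o in triples:
        if p == OWL_SAME_AS:
            union(s, o)

    # Second pass: attach every other statement to the canonical URI.
    merged = defaultdict(set)
    for s, p, o in triples:
        if p != OWL_SAME_AS:
            merged[find(s)].add((p, o))
    return dict(merged)
```

Statements about the DBpedia URI and the linked Freebase URI then appear
under a single key, which is the "fused" view of the thing.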

For more information about the Freebase links see:
http://blog.dbpedia.org/2008/11/15/dbpedia-is-now-interlinked-with-freebase-links-to-opencyc-updated/


3. Cleaner Abstracts

In the old DBpedia dataset, the abstracts for different languages
sometimes contained Wikipedia markup and other stray characters. For the
3.2 release, we have improved DBpedia's abstract extraction code, which
results in much cleaner abstracts that can safely be displayed in user
interfaces.



The new DBpedia release can be downloaded from:

http://wiki.dbpedia.org/Downloads32

and is also available via the DBpedia SPARQL endpoint at

http://dbpedia.org/sparql

and via DBpedia's Linked Data interface. Example URIs: 


http://dbpedia.org/resource/Berlin
http://dbpedia.org/page/Oliver_Stone
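
For readers who want to try the endpoint, here is a minimal Python sketch
that prepares a query for the rdf:type values of one of the example
resources. It only builds the HTTP request and does not send it. The
helper names are made up for this example, and it assumes the endpoint
accepts a standard SPARQL protocol GET request with a "query" parameter
and honours the Accept header.

```python
from urllib.parse import urlencode
from urllib.request import Request

ENDPOINT = "http://dbpedia.org/sparql"  # endpoint named in the announcement


def build_type_query(resource_uri):
    """Return a SPARQL query listing the rdf:type values of one resource."""
    return (
        "SELECT ?type WHERE { "
        f"<{resource_uri}> "
        "<http://www.w3.org/1999/02/22-rdf-syntax-ns#type> ?type }"
    )


def build_request(resource_uri):
    """Prepare (but do not send) an HTTP GET request for the endpoint."""
    url = ENDPOINT + "?" + urlencode({"query": build_type_query(resource_uri)})
    # Ask for the SPARQL JSON results format via content negotiation.
    return Request(url, headers={"Accept": "application/sparql-results+json"})
```

Passing the returned object to urllib.request.urlopen would send the
query; whether and how the server answers is outside this sketch.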

More information about DBpedia in general can be found at:

http://wiki.dbpedia.org/About


Many thanks to everybody who contributed to the DBpedia 3.2 release!


Especially:

1. Georgi Kobilarov (Freie Universität Berlin) who designed and implemented
the new infobox extraction framework.
2. Anja Jentsch (Freie Universität Berlin) who contributed to implementing
the new extraction framework and wrote the infobox to ontology class
mappings.
3. Paul Kreis (Freie Universität Berlin) who improved the datatype
extraction code.
4. Andreas Schultz (Freie Universität Berlin) for generating the Freebase to
DBpedia RDF links.
5. Everybody at OpenLink Software for hosting DBpedia on a Virtuoso server
and for providing the statistics about the new DBpedia knowledge base.

Have fun with the new DBpedia knowledge base!

Cheers

Chris


--
Prof. Dr. Christian Bizer
Web-based Systems Group
Freie Universität Berlin
+49 30 838 55509
http://www.bizer.de
[EMAIL PROTECTED]




  

All,

A few additional items re. the new DBpedia 3.2 release:

1. Integration with OpenCyc [1] has been updated (the owl:sameAs links
from DBpedia now point to the latest OpenCyc URIs for identical
individuals).
2. UMBEL [2] ontology links and inference rules are in place, enabling
reasoning over DBpedia that combines the OpenCyc upper ontology with
externally linked ontologies (e.g. FOAF, SIOC, Music Ontology).


So, looking at the DBpedia data space resource
http://dbpedia.org/resource/Oliver_Stone, you now have the combined
richness of three ontologies with regard to resource type, as shown
below:


Yago:

   * yago:LivingPeople
   * yago:AmericanScreenwriters
   * yago:YaleUniversityAlumni
   * yago:UnitedStatesArmySoldiers
   * yago:AmericanFilmDirectors
   * yago:PeopleFromNewYorkCity
   * yago:Actor109765278
   * yago:FilmMaker110088390
   * yago:FilmDirector110088200
   * yago:Person17846
   * yago:Screenwriter110564400

New DBpedia Ontology:

   * dbpedia-owl:Artist
   * dbpedia-owl:Actor
   * dbpedia-owl:Resource
   * dbpedia-owl:Person

UMBEL (which is OpenCyc plus a collection of established, publicly shared
ontologies, also referred to as external ontologies relative to UMBEL's
data space):


   * umbel-sc:Actor
   * 

Re: ANN: DBpedia 3.2 release, including DBpedia Ontology and RDF links to Freebase

2008-11-17 Thread Chris Bizer

Hi Andreas,

we definitely want to do this, but we also did not want to postpone the
DBpedia 3.2 release any further.

So rest assured that the upcoming public user interface for defining the
infobox-to-ontology mappings will include the possibility to reuse existing
classes and properties, and that external classes and properties will be
used within the 3.3 release.

Defining the infobox-to-ontology mappings that we currently have was already
a lot of work (Anja, thanks again), so please be patient with the
mappings/reuse of external ontologies.

Cheers

Chris

 

 -----Original Message-----
 From: Andreas Harth [mailto:[EMAIL PROTECTED]
 Sent: Monday, 17 November 2008 16:55
 To: Chris Bizer
 Cc: public-lod@w3.org; 'Semantic Web';
 dbpedia-[EMAIL PROTECTED]; dbpedia-[EMAIL PROTECTED]
 Subject: Re: ANN: DBpedia 3.2 release, including DBpedia Ontology and
 RDF links to Freebase
 
 Hi Chris,
 
 Chris Bizer wrote:
  1. DBpedia Ontology
 
  DBpedia now features a shallow, cross-domain ontology, which has been
  manually created based on the most commonly used infoboxes within
  Wikipedia
 
 Great work!
 
 One thing: what's the reason for creating your own classes rather
 than re-using or sub-classing existing ones (foaf:Person,
 geonames:Feature...)?  Same for properties (foaf:name, dc:date...).
 
 Regards,
 Andreas.
 
 --
 http://swse.deri.org/




Re: [Dbpedia-announcements] ANN: DBpedia 3.2 release, including DBpedia Ontology and RDF links to Freebase

2008-11-17 Thread Kingsley Idehen


Chris Bizer wrote:

Hi Andreas,

we definitely want to do this, but we also did not want to postpone the
DBpedia 3.2 release any further.

So rest assured that the upcoming public user interface for defining the
infobox-to-ontology mappings will include the possibility to reuse existing
classes and properties, and that external classes and properties will be
used within the 3.3 release.

Defining the infobox-to-ontology mappings that we currently have was already
a lot of work (Anja, thanks again), so please be patient with the
mappings/reuse of external ontologies.

Cheers

Chris
  

Andreas,

Apropos Chris' comments above, it's coming.

DBpedia ontology to UMBEL ontology mapping is one of a number of 
mappings that will emerge in due course. That said -- by the very nature 
of this project and RDF based Linked Data in general -- anyone with a 
vested interest in the ontology mapping efforts can step in and 
accelerate matters. Thus, I encourage others to participate bearing in 
mind DBpedia 3.2's increased compatibility with such endeavors.


Kingsley
 

  





-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
___
Dbpedia-announcements mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/dbpedia-announcements

  



--


Regards,

Kingsley Idehen   Weblog: http://www.openlinksw.com/blog/~kidehen
President & CEO
OpenLink Software Web: http://www.openlinksw.com








Re: [Dbpedia-discussion] ANN: DBpedia 3.2 release, including DBpedia Ontology and RDF links to Freebase

2008-11-17 Thread Kingsley Idehen


John Muth wrote:

Congrats all, and big thanks for your continuing great work.

The YAGO Classes and YAGO Links links are not working for me just now -- are
the URLs wrong or are the files yet to be published?

http://downloads.dbpedia.org/3.2/links/yago_en.nt.bz2
http://downloads.dbpedia.org/3.2/links/yagolink_en.nt.bz2

Thanks again,
John Muth
  


John,

Hmm..

Will have a look and get it sorted if it's actually missing.

Kingsley



