As I am just in the middle of trying to put together a high-quality
information base using DBpedia 2016-10, I'll vote strongly for more-frequent
extractions at the expense of dropping some of the datasets.

The problem that I have run into is that there are some major errors in at
least the Italian, Spanish, and Russian DBpedias.   If there were going to be
new versions of this information coming along shortly I would be very
motivated to spend the extra time required to try to fix these problems and
wait for the next extraction run.  As things stand now, however, any changes
that I make to extraction rules will not show up for months, so what
motivation do I have to fix them?  I'm going to just ignore the parts of
DBpedia where these errors come from, which isn't ideal but at least gets me
some useful information.

So I would be much happier with a DBpedia that has a single namespace
(particularly if it is the Wikidata one).  This would allow for jettisoning a
lot of the fluff triples.   Now there may be some need to keep the different
language DBpedias separate, but this can be done by putting them in different
documents.  One issue is just how to handle identifiers that don't nicely
correspond to Wikipedia pages (or Wikidata items) - I'm in favour of using
blank nodes (or if blank nodes are still too controversial using fake blank
nodes, i.e., something like Skolem names).

peter




On 09/25/2017 09:31 AM, Sebastian Hellmann wrote:
> Hi all,
> 
> we are currently having quite a discussion about a stretch at DBpedia.
> 
> The main point is that we spent a lot of time maintaining the current way
> DBpedia is done, i.e. bi-annual releases. Personally, I feel that our time
> would be spent better on innovating it, i.e.
> 
> - having links, mappings and ontology maintained at GitHub
> 
> https://github.com/dbpedia/links ,
> https://github.com/dbpedia/ontology-tracker/ ,
> https://github.com/dbpedia/mappings-tracker
> 
> - letting the extraction run more frequently (maybe 15 day interval), but then
> producing less datasets, i.e. focusing on the ones that are the most used
> 
> - update the endpoint more often
> 
> 
> Also we would really benefit in terms of data quality and completeness, if we
> were to go for a fused version, where we pick the best data from all available
> sources (Wikidata+all DBpedia versions for now, later more sources like yago,
> VIAF and other datasets). I mean the data is there, why not fuse it into
> something consistent and usable.
> 
> This becomes especially obvious, if you look at the
> https://meta.wikimedia.org/wiki/Grants:Project/DBpedia/CrossWikiFact proposal
> and also at the display of the prototype:
> 
> http://downloads.dbpedia.org/temporary/crosswikifact/results/q64.html
> 
> 
> The pickle is a bit to decide, whether we should just stop releasing DBpedia
> in its current form and put all efforts into a more live version that also
> allows more contributions in a timely manner.
> 
> Any opinions on this? I have the feeling that we are only marginally getting
> better with the current way of running things.
> 
> 
> -- 
> All the best,
> Sebastian Hellmann
> 
> Director of Knowledge Integration and Linked Data Technologies (KILT)
> Competence Center
> at the Institute for Applied Informatics (InfAI) at Leipzig University
> Executive Director of the DBpedia Association
> Projects: http://dbpedia.org, http://nlp2rdf.org, http://linguistics.okfn.org,
> https://www.w3.org/community/ld4lt <http://www.w3.org/community/ld4lt>
> Homepage: http://aksw.org/SebastianHellmann
> Research Group: http://aksw.org
> 
> 
> ------------------------------------------------------------------------------
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, Slashdot.org! http://sdm.link/slashdot
> 
> 
> 
> _______________________________________________
> DBpedia-discussion mailing list
> DBpedia-discussion@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
> 

------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
DBpedia-discussion mailing list
DBpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to