Great work, David!

Thank you for the links and the detailed report.

Are you planning to open source the code that generates the links?
We could try and integrate it into the DBpedia release publishing workflow.

Cheers,
Dimitris


On Tue, Oct 18, 2016 at 2:28 PM, Δαβίδ Ναζαριάν <dnazar...@csd.auth.gr>
wrote:

> Hello,
>
>
>
> Concerning the DBpedia citations & references challenge, we report on a
> project that aims to map DBpedia's citations to existing bibliographical
> data. Even though the deadline for the challenge has passed, we would be
> grateful for your feedback on the project.
>
>
>
> More specifically, a number of properties of the
> *enwiki-20160305-citation-data.ttl* file have been used to link the
> triples' subjects (found in the file) to URIs from other bibliographical
> sources. As a result, a total of 402,354 links were discovered, covering
> 379,835 distinct subjects. Emphasis has been given to properties that
> represent identifiers, which can be found in other data sources and are
> relatively common. In particular, the properties *isbn*, *isbn13*,
> *issn*, *doi*, *journal*, *series*, *periodical*, *magazine*, *oclc*,
> *pmid* and *arxiv* have been used in combination with *title* and
> *year*. The linking is based on a number of LOD dumps that are available
> for download and on bibliographical websites that provide their metadata
> through APIs. The project consists of an application written in Java that
> processes and links the data, and a triplestore that stores the original
> and the processed data.
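
To make the identifier-based linking described above concrete, here is a minimal sketch in Python. It is not the Java application from the report: the record layout (dicts with `uri`, `isbn`, `title`, `year`), the helper names `normalize_isbn`, `same_title` and `link_by_isbn`, and the `example.org` URIs are all assumptions made for illustration. It normalizes ISBN-10 and ISBN-13 values to a common 13-digit key and accepts a link only when title and year also agree.

```python
import re


def normalize_isbn(raw):
    """Return a 13-digit ISBN string, or None if the input is not a valid ISBN."""
    s = re.sub(r"[\s-]", "", raw).upper()
    if re.fullmatch(r"\d{9}[\dX]", s):
        # ISBN-10: weights 10..1, checksum must be divisible by 11.
        total = sum((10 - i) * (10 if c == "X" else int(c)) for i, c in enumerate(s))
        if total % 11 != 0:
            return None
        # Convert to ISBN-13: prefix 978, then recompute the check digit.
        core = "978" + s[:9]
        weighted = sum(int(c) * (1 if i % 2 == 0 else 3) for i, c in enumerate(core))
        return core + str((10 - weighted % 10) % 10)
    if re.fullmatch(r"\d{13}", s):
        # ISBN-13: alternating weights 1 and 3, checksum must be divisible by 10.
        weighted = sum(int(c) * (1 if i % 2 == 0 else 3) for i, c in enumerate(s))
        return s if weighted % 10 == 0 else None
    return None


def same_title(a, b):
    """Crude title comparison: ignore case, punctuation and extra whitespace."""
    def norm(t):
        return re.sub(r"\W+", " ", t.casefold()).strip()
    return norm(a) == norm(b)


def link_by_isbn(citations, catalogue):
    """Link records that share an ISBN and also agree on title and year."""
    index = {}
    for rec in catalogue:
        isbn = normalize_isbn(rec.get("isbn", ""))
        if isbn:
            index.setdefault(isbn, []).append(rec)
    links = []
    for cit in citations:
        isbn = normalize_isbn(cit.get("isbn", ""))
        for rec in index.get(isbn, []):
            if same_title(cit["title"], rec["title"]) and cit.get("year") == rec.get("year"):
                links.append((cit["uri"], rec["uri"]))
    return links


# Hypothetical sample records; the URIs are placeholders, not real data.
citations = [
    {"uri": "http://example.org/citation/1", "isbn": "0-306-40615-2",
     "title": "An Example Title", "year": "1980"},
    {"uri": "http://example.org/citation/2", "isbn": "0-306-40615-2",
     "title": "An Example Title", "year": "1999"},
]
catalogue = [
    {"uri": "http://example.org/catalogue/A", "isbn": "9780306406157",
     "title": "An example title.", "year": "1980"},
]
links = link_by_isbn(citations, catalogue)
```

In this sample only the first citation is linked, because the second one shares the ISBN but disagrees on the year. The same pattern would apply to the other identifier properties (for example *doi* or *pmid*) with a different normalization step.
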
>
>
>
> The following data sources have been used in the project:
>
> *Data source* | *Type* | *Unique triples in local data dump*
>
> DBpedia citations | Data dump <http://downloads.dbpedia.org/temporary/citations/> | 76.2M
>
> DBLP - Digital Bibliography & Library Project <http://dblp.uni-trier.de/> | Data dump <http://dblp.l3s.de/dblp++.php> | 88.1M
>
> BNB - British National Bibliography <http://bnb.bl.uk/> | Data dump <http://www.bl.uk/bibliographic/download.html#lodbnb> | 111M
>
> DNB - Deutsche Nationalbibliografie <http://www.dnb.de/EN/Service/DigitaleDienste/DNBBibliografie/dnbbibliografie_node.html> | Data dump <http://datendienst.dnb.de/cgi-bin/mabit.pl?userID=opendata&pass=opendata&cmd=login> | 414.2M
>
> BNE - Biblioteca Nacional de España <http://www.bne.es/> | Data dump <http://www.bne.es/en/Inicio/Perfiles/Bibliotecarios/DatosEnlazados/DescargaFicheros/> | 68.7M
>
> Springer <http://www.springer.com/> | Data dump <http://lod.springer.com/data/dumps> | 3.3M
>
> WorldCat <https://www.worldcat.org/> | API <https://www.oclc.org/data/data-sets-services.en.html> | 2.1M
>
> PubMed <https://www.ncbi.nlm.nih.gov/pubmed> | API <https://www.ncbi.nlm.nih.gov/books/NBK25497/> | 0.629M
>
> arXiv <https://arxiv.org/> | API <https://arxiv.org/help/api/user-manual> | 0.021M
>
>
>
> The *enwiki-20160305-citation-data.ttl* file contains 76,223,926 unique
> triples with 12,391,363 distinct subjects. The results found in the project
> correspond to 379,835 / 999,679 = 38% of the distinct subjects extracted
> and to 379,835 / 12,391,363 = 3% of all distinct subjects in the file.
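
The coverage figures quoted above can be rechecked with a few lines of Python. The numbers are taken from the email; the variable names and the `pct` helper are only illustrative.

```python
links_total = 402_354          # links discovered
linked_subjects = 379_835      # distinct subjects with at least one link
extracted_subjects = 999_679   # distinct subjects extracted, per the email
all_subjects = 12_391_363      # distinct subjects in the citation data file


def pct(part, whole):
    """Percentage of `whole` represented by `part`."""
    return 100.0 * part / whole


coverage_extracted = pct(linked_subjects, extracted_subjects)  # about 38%
coverage_all = pct(linked_subjects, all_subjects)              # about 3%
links_per_subject = links_total / linked_subjects              # about 1.06

print(f"{coverage_extracted:.1f}% of extracted subjects linked")
print(f"{coverage_all:.1f}% of all subjects linked")
print(f"{links_per_subject:.2f} links per linked subject")
```

The last ratio reflects that some subjects were linked to more than one external URI, which is why the number of links exceeds the number of distinct subjects.
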
>
>
>
> The links found are contained in the *dbpedia_combined_links.nt.zip
> <https://dl.dropboxusercontent.com/s/9dm9qotlgzumcqc/dbpedia_combined_links.nt.zip>*
> file and can also be queried from the following GraphDB Free SPARQL
> endpoint: <http://lod.csd.auth.gr:7200/sparql>
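
For readers who want to try the endpoint, the following is a rough sketch of a query over the standard SPARQL protocol (HTTP GET with a `query` parameter) using only the Python standard library. The sample query is deliberately generic because the predicates used in the links file are not stated in the email; the helper names `build_request` and `fetch_bindings` are assumptions, and the endpoint may no longer be reachable.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

ENDPOINT = "http://lod.csd.auth.gr:7200/sparql"  # endpoint quoted in the email

# Generic query: the actual link predicates are not given in the email.
SAMPLE_QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 10"


def build_request(endpoint, query):
    """Build a SPARQL protocol GET request that asks for JSON results."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


def fetch_bindings(endpoint, query, timeout=30):
    """Send the query and return the list of result bindings."""
    with urlopen(build_request(endpoint, query), timeout=timeout) as resp:
        return json.load(resp)["results"]["bindings"]
```

Calling `fetch_bindings(ENDPOINT, SAMPLE_QUERY)` would return one dict per result row, keyed by variable name, provided the server is up and answers in the SPARQL JSON results format.
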
>
>
>
> A more detailed report about the project can be found at:
>
> <https://dl.dropboxusercontent.com/s/botmb4ax8d7ixug/Report_citation-challenge.pdf>
>
>
>
>
>
> Respectfully,
>
> David Nazarian
>
> ------------------------------------------------------------
> ------------------
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, SlashDot.org! http://sdm.link/slashdot
> _______________________________________________
> DBpedia-discussion mailing list
> DBpedia-discussion@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>


-- 
Kontokostas Dimitris
