Hi all,

I'm working on my master thesis and my work concerns to understand the
syncronization process between Wikipedia and DBpedia live-uptades
(changesets). In the following I describe some of the problems I came
across and I would like to have an answer:

First, according to the changes made in Wikipedia and the ones reported in
DBpedia, I cannot identify a corrispondence one to one. In other words, I
found that there are a lot of added and removed triples for a resource in
DBpedia than the changes of the same resource shown in Wikipedia history
page. How does it come? I was expecting that a change in change in the
Wikipedia infobox of an article is mapped in DBpedia as an added/removed
triple for the same article/resource.


Second, based on the structure of live-updates of DBpedia there is an
incompatibility between a folder and its correspondent zip folder, e.g., if
we consider zip folder 2012-09-01.tar.gz and the folder 2012-09-01, we find
that there are triples that are present in the former folder and missing in
the latter one. Is it caused because the system is down sometimes? In case
of a positive answer, which folder should we take in consideration for our
analysis?

Last point but not less relevant regards to the last modified field
associated with the added/removed file. I want to understand if the last
modified value corresponds either to the effective time of the change
carried out in a DBpedia resource or to the uploading time of the
added/removed file in the changeset?


Best regards,
Andrea Giacomini
------------------------------------------------------------------------------
Don't let slow site performance ruin your business. Deploy New Relic APM
Deploy New Relic app performance management and know exactly
what is happening inside your Ruby, Python, PHP, Java, and .NET app
Try New Relic at no cost today and get our sweet Data Nerd shirt too!
http://p.sf.net/sfu/newrelic-dev2dev
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to