On Thu, Oct 25, 2012 at 8:33 PM, Jörn Hees <[email protected]> wrote:
> Hi,
>
> i'm currently deciding on what of the DBpedia 3.8 dumps to load into our 
> local mirror… (and updating 
> http://joernhees.de/blog/2012/05/25/setting-up-a-local-dbpedia-3-7-mirror-with-virtuoso-6-1-5/
>  ).
> I'm a bit clueless about three files on the download that aren't explained, 
> so maybe someone could shed some light on them … (I already checked 
> http://dbpedia.org/Downloads38, http://wiki.dbpedia.org/Datasets, 
> http://wiki.dbpedia.org/Datasets/Properties and 
> http://wiki.dbpedia.org/DatasetsLoaded):
>
>
> - http://downloads.dbpedia.org/3.8/en/infobox_test_en.ttl.gz :
> Probably just some leftover from testing?
> ### snip ###
> <http://dbpedia.org/resource/Autism> 
> <http://dbpedia.org/resource/Template:Infobox_disease> "Name"@en .
> <http://dbpedia.org/resource/Autism> 
> <http://dbpedia.org/resource/Template:Infobox_disease> "Alt"@en .
> …
> <http://dbpedia.org/resource/Anarchism> 
> <http://dbpedia.org/resource/Template:Sister_project_links> "n"@en .
> <http://dbpedia.org/resource/Anarchism> 
> <http://dbpedia.org/resource/Template:Sister_project_links> "v"@en .
> <http://dbpedia.org/resource/Agricultural_science> 
> <http://dbpedia.org/resource/Template:Infobox_Occupation> "name"@en .
> ### /snip ###
>

These datasets are needed for the mapping statistics [1]. The name
"test" has historical reasons and doesn't really make sense anymore.
The RDF triples aren't really subject-predicate-object, they just list
the templates and properties that are used on Wikipedia pages.

Cheers,
JC

[1] http://mappings.dbpedia.org/server/statistics/


>
> - http://downloads.dbpedia.org/3.8/en/topical_concepts_en.ttl.bz2 and
> - 
> http://downloads.dbpedia.org/3.8/en/topical_concepts_unredirected_en.ttl.bz2 :
> It seems they contain category pages which have an associated main article. 
> As such very interesting, but it seems they use the deprecated skos:subject 
> while all other files use dcterms:subject, and the "redirect resolving" 
> wasn't done for the subjects:
>
> http://downloads.dbpedia.org/3.8/en/topical_concepts_unredirected_en.ttl.bz2 :
> ### snip ###
> <http://dbpedia.org/resource/Category:Futurama> 
> <http://www.w3.org/2004/02/skos/core#subject> 
> <http://dbpedia.org/resource/Futurama_(TV_series)> .
> <http://dbpedia.org/resource/Futurama_(TV_series)> 
> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> 
> <http://www.w3.org/2004/02/skos/core#Concept> .
> ### /snip ###
>
> And now with redirects resolved (watch the "_(TV_series)" disappear in the 
> object position but not the subject):
> http://downloads.dbpedia.org/3.8/en/topical_concepts_en.ttl.bz2 :
> ### snip ###
> <http://dbpedia.org/resource/Category:Futurama> 
> <http://www.w3.org/2004/02/skos/core#subject> 
> <http://dbpedia.org/resource/Futurama> .
> <http://dbpedia.org/resource/Futurama_(TV_series)> 
> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> 
> <http://www.w3.org/2004/02/skos/core#Concept> .
> ### /snip ###
>
> Are there other places where this might cause "dangling subjects"?
>
> Despite these small issues it's actually quite an interesting dataset, any 
> other reasons why it wasn't loaded? Why not map it to a "dpo:categoryMain" 
> property. (dcterms:subject is already provided by article_categories and 
> foaf:isPrimaryTopciOf wouldn't be correct as it would imply one of them being 
> a document and it would cause problems as in older versions where people used 
> it to get the corresponding wikipedia page for a topic and unintentionally 
> get the category back.)
>
>
> Cheers,
> Jörn
>
>
> ------------------------------------------------------------------------------
> Everyone hates slow websites. So do we.
> Make your web apps faster with AppDynamics
> Download AppDynamics Lite for free today:
> http://p.sf.net/sfu/appdyn_sfd2d_oct
> _______________________________________________
> Dbpedia-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_sfd2d_oct
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to