Hi all,
I didn't run the extraction framework for sometime, and I used the wikidata
extractors for the first time.
These work fine, I followed
http://jplu.developpez.com/tutoriels/web-semantique/dbpedia-extraction-framework/#LVIII-B
and got the output files :
./wikidatawiki/[YYYYMMDD]/wikidatawiki-[YYYYMMDD]-wikidata-labels.ttl.gz
./wikidatawiki/[YYYYMMDD]/wikidatawiki-[YYYYMMDD]-wikidata-ll.ttl.gz
./wikidatawiki/[YYYYMMDD]/wikidatawiki-[YYYYMMDD]-wikidata-mapped.ttl.gz
./wikidatawiki/[YYYYMMDD]/wikidatawiki-[YYYYMMDD]-wikidata-namespace-sameas.ttl.gz
./wikidatawiki/[YYYYMMDD]/wikidatawiki-[YYYYMMDD]-wikidata.ttl.gz
I wrote some scripts to extract the sameas links with the resources of a
particular DBpedia :
https://github.com/JulienCojan/extraction-framework/blob/wp_fr/dump/src/main/bash/import_external_data.sh
But it is not very efficient as it needs to sort wikidata dumps before
performing a join on DBpedia resources.
Is there already be something in the extraction framework to do that more
efficiently ?
Cheers,
Julien Cojan
------------------------------------------------------------------------------
Slashdot TV.
Video for Nerds. Stuff that matters.
http://tv.slashdot.org/
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion