Hi Elmahdi,
@Robert: could you have a look at this email?
We established to do the first release of the "Generic" DBpedia Core
module yesterday, it contains all the files you can find here:
http://downloads.dbpedia.org/repo/lts/generic-spark/
Since we are establishing more frequent releases now, we split up the
publishing into LTS for long term and then "dev" for things we will
eventually delete.
The most important change is:
- clearer release and versioning methodology
- metadata provided
We would hope that you would also join in with some datasets.
By the way, we moved most of the communications in the "#releasea"
channel on slack
All the best,
Sebastian
On 24.08.2018 16:54, Elmahdi Korfed wrote:
Hi everyone,
I'm working on an updated version of DBpediaFR chapter 2018 and I just
want to know where the extracted files are stored.
Some explications from beginning:
I downloaded :
- dbpedia/extraction-framework from github
<https://github.com/dbpedia/extraction-framework/>
- commons + fr + wiki dump 2018-08's version from the dumps wikimedia
website <http://dumps.wikimedia.your.org/frwiki/20180801/> (source:
**-pages-articles.xml.bz2)
Now I would like to extract, first, commonswiki.
To do that, I configured 2 files:
=> "extraction.commons.properties" (content of file):
source=pages-articles.xml.bz2
require-download-complete=false
languages=commons
extractors=
extractors.commons=.MappingExtractor,.ContributorExtractor,.TemplateParameterExtractor,.FileTypeExtractor,.GalleryExtractor,.ImageAnnotationExtractor,.CommonsKMLExtractor,.DBpediaResourceExtractor
copyrightCheck=false
=> "universal.properties" (content of file):
dbpedia-version=2018-08
base-dir=/Users/macbookpro/Documents/web_pro/github/dbpedia/extraction-framework/dump/extraction-dump/2018-08
log-dir=/Users/macbookpro/Documents/web_pro/github/dbpedia/extraction-framework/dump/extraction-data/2018-08
wiki-name=wiki
source=pages-articles.xml.bz2
parallel-processes=4
ontology=../ontology.xml
mappings=../mappings
uri-policy.iri=generic:en
format.ttl.bz2=turtle-triples
After that, I launched these command:
- cd extraction-frameworkd/dump
- ../clean-install-run extraction extraction.commons.properties
Now I have some messages like this:
- AVERTISSEMENT: Language not found: cbk-zam. To extract this
language, please edit the addonLanguage.json in core.
- INFOS: Will extract redirects from source for commons wiki, could
not load cache file
'/Users/macbookpro/Documents/web_pro/github/dbpedia/extraction-framework/dump/extraction-dump/2018-08/commonswiki/20180801/commonswiki-20180801-template-redirects.obj':
java.io.FileNotFoundException:
/Users/macbookpro/Documents/web_pro/github/dbpedia/extraction-framework/dump/extraction-dump/2018-08/commonswiki/20180801/commonswiki-20180801-template-redirects.obj
(No such file or directory)
- AVERTISSEMENT: wrong redirect. page:
[title=UNC;ns=0/Main/;language:wiki=commons,locale=en].
- found by dbpedia: [title=University of North Carolina at Chapel
Hill;ns=0/Main/;language:wiki=commons,locale=en].
- found by wikipedia: [title=University of North Carolina at Chapel
Hill;ns=0/Main/;language:wiki=commons,locale=en]
It's seem ok right?
Do you know if I just have to wait for the extraction to finish to see
the extracted files? Because I need to storage files in VirtuosoDB
Thank you for your help
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
DBpedia-developers mailing list
DBpedia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-developers
--
All the best,
Sebastian Hellmann
Director of Knowledge Integration and Linked Data Technologies (KILT)
Competence Center
at the Institute for Applied Informatics (InfAI) at Leipzig University
Executive Director of the DBpedia Association
Projects: http://dbpedia.org, http://nlp2rdf.org,
http://linguistics.okfn.org, https://www.w3.org/community/ld4lt
<http://www.w3.org/community/ld4lt>
Homepage: http://aksw.org/SebastianHellmann
Research Group: http://aksw.org
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
DBpedia-developers mailing list
DBpedia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-developers