Hello everyone,
Today I tried to use the extraction framework to extract data for the
Arabic language. The step that looks for the dump file in the download
directory failed. After a while I found that the relevant part of
Import.scala is written as follows:
try {
  for (language <- languages) {
    val finder = new Finder[File](baseDir, language, "wiki")
    val tagFile = if (requireComplete) Download.Complete else "pages-articles.xml"
    val date = finder.dates(tagFile).last
    val file = finder.file(date, "pages-articles.xml")
I changed the name to "pages-articles.xml.bz2" and the extraction got
past this point.
Don't you think we should make this change in the code? The dump file
we download has ".bz2" in its name, so a lookup for the bare
"pages-articles.xml" name does not find it.
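Instead of hard-coding a single name, the lookup could try the plain name first and then the compressed variants. The following is only a minimal sketch of that idea, not the framework's actual code: the object DumpFileLookup, the method findDump and the suffix list are hypothetical names made up for illustration, and it uses plain java.io.File rather than the framework's Finder class.

```scala
import java.io.File

// Hypothetical helper, not part of the extraction framework.
object DumpFileLookup {
  // Suffixes to try, in order of preference. This list is an assumption;
  // it should match whatever formats the download step actually produces.
  val suffixes: Seq[String] = Seq("", ".bz2", ".gz")

  /** Returns the first existing regular file named baseName + suffix in dir, if any. */
  def findDump(dir: File, baseName: String): Option[File] =
    suffixes.iterator
      .map(suffix => new File(dir, baseName + suffix))
      .find(_.isFile)
}
```

With a helper like this, a call such as DumpFileLookup.findDump(dateDir, fileName), where dateDir and fileName stand for the dated download directory and the expected uncompressed file name, would return the ".bz2" file when only the compressed dump is present, and None when nothing matches.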
Best regards,
Ahmed.
--
Ahmed Ktob
Dr. Taher Moulay University
Department of Computer Science
Saida, Algeria
Tel: +213 554 811 151
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion