Hi,
I am loading dbpedia version 3.8 into Virtuoso 6.1. I unzipped the files
into .gz format and ran ld_dir method in isql mode to load all files in a
folder (whole dump). Then I am running rdf_loader_run() to process the files. I
have two questions,
1. the server is having 32 GB of ram and 8 cores. But still it has taken 2
days to load half the files. Why is it taking this much time? is it normal?
2. When I check the status using SELECT * FROM DB.DBA.LOAD_LIST; one file has
a start timestamp but no end timestamp. Following is the relevant row for that
file in the output,
/home/kalpa/Virtuoso/data/datasets/dbpedia/3.8/3.8/en/infobox_properties_en.nt.gz
http://dbpedia.org
1 2012.11.17 2:12.5 0 NULL 0 NULL
NULL
If I re-load that file only after the other files are processed, will it add
duplicate tuples or I have to remove that file and then load that file only? If
so, what is the process to do that?
If I commit after processing all files are finished (I assume I have to commit
using command commit WORK; to make changes permanent) and only load that
infobox_properties_en.nt.gz file only again, is it fine? Please let me know
some details about these two problems and make the local dbpedia server working
without any errors. Thank you.
regards,
kalpa
------------------------------------------------------------------------------
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion