Hi,
   I am loading dbpedia version 3.8 into Virtuoso 6.1. I unzipped the files 
into .gz format and ran ld_dir method in isql mode to load all files in a 
folder (whole dump). Then I am running rdf_loader_run() to process the files. I 
have two questions,

  1. the server is having 32 GB of ram and 8 cores. But still it has taken 2 
days to load half the files. Why is it taking this much time? is it normal?


  2. When I check the status using SELECT * FROM DB.DBA.LOAD_LIST; one file has 
a start timestamp but no end timestamp. Following is the relevant row for that 
file in the output,

/home/kalpa/Virtuoso/data/datasets/dbpedia/3.8/3.8/en/infobox_properties_en.nt.gz
  http://dbpedia.org                                                            
    1           2012.11.17 2:12.5 0  NULL                 0           NULL      
  NULL

   If I re-load that file only after the other files are processed, will it add 
duplicate tuples or I have to remove that file and then load that file only? If 
so, what is the process to do that?
If I commit after processing all files are finished (I assume I have to commit 
using command commit WORK; to make changes permanent) and only load that 
infobox_properties_en.nt.gz file only again, is it fine? Please let me know 
some details about these two problems and make the local dbpedia server working 
without any errors. Thank you.

regards,
kalpa


------------------------------------------------------------------------------
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
  • [Dbpedia-discussion]... Gunaratna, Dalkandura Arachchige Kalpa Shashika Silva

Reply via email to