Hi Paul,

If there are still rdf_loader_run() processes running then the "status();" 
would show this to be the case or whatever other process might be running, but 
if all files are in ll_state=2 in the load_list table then no rdf_loader_run() 
processes should be in use.

Best Regards
Hugh Williams
Professional Services
OpenLink Software, Inc.      //              http://www.openlinksw.com/
Weblog   -- http://www.openlinksw.com/blogs/
LinkedIn -- http://www.linkedin.com/company/openlink-software/
Twitter  -- http://twitter.com/OpenLink
Google+  -- http://plus.google.com/100570109519069333827/
Facebook -- http://www.facebook.com/OpenLinkSoftware
Universal Data Access, Integration, and Management Technology Providers

On 22 Apr 2014, at 03:56, Paul Houle <ontolo...@gmail.com> wrote:

> Well I am watching my Virtuoso instance right now and even though all
> the rows have been in state 2 for at least an hour I am still
> observing a load average of 1.14 thanks to Virtuoso CPU and disk
> activity.  So something is still going on although the load is "done".
> I've seen this behavior on and off in the past and I think that one
> contribution factor could be having more than one rdf_loader_run()
> running.
> 
> On Mon, Apr 21, 2014 at 10:52 PM, Hugh Williams
> <hwilli...@openlinksw.com> wrote:
>> Hi Paul,
>> 
>> Using the Virtuoso RDF Bulk Loader (rdf_loader_run) as you appear to be, the
>> load is complete when all files in the load_list table have an "ll_state" =
>> 2  ,  0 means still to be loaded, 1 means load is in progress.
>> 
>> What does the "status()" command report when the load rate slows as I
>> suspect all the "buffers" (NumberOfBuffers in INI) are being consumed at
>> that point and will thus slow  load rates.
>> 
>> Note you can also monitor the load rates using the Virtuoso LDMeter
>> functions as detailed at:
>> 
>> http://virtuoso.openlinksw.com/dataspace/doc/dav/wiki/Main/VirtTipsAndTricksGuideLDMeterUtility
>> 
>> Best Regards
>> Hugh Williams
>> Professional Services
>> OpenLink Software, Inc.      //              http://www.openlinksw.com/
>> Weblog   -- http://www.openlinksw.com/blogs/
>> LinkedIn -- http://www.linkedin.com/company/openlink-software/
>> Twitter  -- http://twitter.com/OpenLink
>> Google+  -- http://plus.google.com/100570109519069333827/
>> Facebook -- http://www.facebook.com/OpenLinkSoftware
>> Universal Data Access, Integration, and Management Technology Providers
>> 
>> On 21 Apr 2014, at 23:37, Paul Houle <ontolo...@gmail.com> wrote:
>> 
>> The image below is a trace of the CPU consumption of a bulk load I did
>> in Virtuoso.
>> 
>> https://raw.githubusercontent.com/paulhoule/images/master/CpuForVirtuosoLoad.PNG
>> 
>> Here I am loading half a billion or so triples in Amazon EC2 using an
>> r3.xlarge,  with Virtuoso running on the instance storage SSD.  In
>> terms of time and cost,  this is a great chart,  but there is one
>> detail that I'm concerned about.
>> 
>> At first I queued up the 'a' dataset and started one rdf_loader_run.
>> That used around 50% of system resources on a four core machine,  so
>> when I started a second loader,  the rate sped up until it reached
>> 100%.
>> 
>> It falls to zero because it finished the 'a' data,  then I queued the
>> rest of the data and the system ran at 100%.  Then there is a glitch
>> at one point,  then the CPU consumption falls in two plateaus,  with
>> another glitch at the end.
>> 
>> Definitely in the second plateau was looking at the machine and
>> noticed that the  db.dba.load_list said the loading was done but that
>> the CPU was still running hard and,  more to the point,  I/O was still
>> being done to the SSD.  I don't see anything in the log that
>> correlates with the end of this phase.
>> 
>> What is Virtuoso doing in this phase after the loading is said to be
>> complete?  Would it be safe to shut Virtuoso down at this time?  I can
>> watch the load_list and the CPU meter to get a pretty good guess of
>> when it is done,  but is there some other way I know Virtuoso is done
>> with this activity?
>> 
>> 
>> 
>> --
>> Paul Houle
>> Expert on Freebase, DBpedia, Hadoop and RDF
>> (607) 539 6254    paul.houle on Skype   ontolo...@gmail.com
>> 
>> ------------------------------------------------------------------------------
>> Start Your Social Network Today - Download eXo Platform
>> Build your Enterprise Intranet with eXo Platform Software
>> Java Based Open Source Intranet - Social, Extensible, Cloud Ready
>> Get Started Now And Turn Your Intranet Into A Collaboration Platform
>> http://p.sf.net/sfu/ExoPlatform
>> _______________________________________________
>> Virtuoso-users mailing list
>> Virtuoso-users@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/virtuoso-users
>> 
>> 
> 
> 
> 
> -- 
> Paul Houle
> Expert on Freebase, DBpedia, Hadoop and RDF
> (607) 539 6254    paul.houle on Skype   ontolo...@gmail.com
> 
> ------------------------------------------------------------------------------
> Start Your Social Network Today - Download eXo Platform
> Build your Enterprise Intranet with eXo Platform Software
> Java Based Open Source Intranet - Social, Extensible, Cloud Ready
> Get Started Now And Turn Your Intranet Into A Collaboration Platform
> http://p.sf.net/sfu/ExoPlatform
> _______________________________________________
> Virtuoso-users mailing list
> Virtuoso-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/virtuoso-users

------------------------------------------------------------------------------
Start Your Social Network Today - Download eXo Platform
Build your Enterprise Intranet with eXo Platform Software
Java Based Open Source Intranet - Social, Extensible, Cloud Ready
Get Started Now And Turn Your Intranet Into A Collaboration Platform
http://p.sf.net/sfu/ExoPlatform
_______________________________________________
Virtuoso-users mailing list
Virtuoso-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/virtuoso-users

Reply via email to