[Wikidata-tech] lagging runUpdate.sh on wikidata stand-alone

Eric Scott Fri, 28 Oct 2016 07:06:49 -0700

Hi all -

We've been using a locally installed wikidata stand-alone service (https://www.mediawiki.org/wiki/Wikidata_query_service/User_Manual#Standalone_service) for several months now. Recently the service went down for a significant amount of time, and when we ran runUpdate.sh -n wdq, instead of catching up to real time as it usually does, the update process lagged, failing even to keep parity with real time.


Example output from the log:

09:30:39.805 [main] INFO org.wikidata.query.rdf.tool.Update - Polled up to 2016-10-24T23:01:05Z at (0.0, 0.0, 0.0) updates per second and (271.8, 56.2, 18.8) milliseconds per second

This is normal when starting the update, of course, but the system never seems to find its feet, and continues to stumble and lag. Restarting both the blazegraph process and the update process has no lasting effect.


From time to time, a message like this will appear:

INFO org.wikidata.query.rdf.tool.RdfRepository - HTTP request failed: org.apache.http.NoHttpResponseException: wikidata.cb.ntent.com:9999 failed to respond, retrying in 2175 ms.

I have experienced this effect in the past, and had success replacing an old journal, which was the product of a long update process, with a new journal rebuilt from the latest dump. This time that strategy did not work. I also tried rebuilding the service from the latest git pull from origin and then rebuilding the journal, again with no effect.

This problem started about 3 days ago, and we're now polling up to a point in time 18 hours earlier than real time.
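
For anyone who wants to track this lag over time, here is a minimal Python sketch that pulls the "Polled up to" timestamp out of an updater log line like the one above and compares it with a reference time. The function names (polled_timestamp, lag_hours) are made up for illustration and are not part of the query service tooling; the only assumption about the log format is the "Polled up to <UTC timestamp>Z" fragment shown in the example output.

```python
import re
from datetime import datetime, timezone

# Matches the "Polled up to <timestamp>Z" fragment of an updater log line.
# The optional whitespace tolerates a line wrap that fused "up" and "to".
POLLED_RE = re.compile(
    r"Polled up\s*to\s+(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2})Z"
)


def polled_timestamp(line):
    """Return the polled-up-to time as an aware UTC datetime, or None."""
    match = POLLED_RE.search(line)
    if match is None:
        return None
    parsed = datetime.strptime(match.group(1), "%Y-%m-%dT%H:%M:%S")
    return parsed.replace(tzinfo=timezone.utc)


def lag_hours(line, now=None):
    """Return how many hours the polled timestamp trails `now` (UTC)."""
    polled = polled_timestamp(line)
    if polled is None:
        return None
    if now is None:
        now = datetime.now(timezone.utc)
    return (now - polled).total_seconds() / 3600.0
```

Calling lag_hours on each "Polled up to" line as it is logged, and watching whether the result shrinks or grows between calls, shows whether the updater is gaining on real time or falling further behind.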


I would appreciate any guidance.

Also: is this an appropriate list to write to with such problems? Are there more appropriate places?


Thanks,

Eric Scott


_______________________________________________
Wikidata-tech mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-tech
