We recently enacted a workflow to clean up old jobs to improve author instance performance. We are using TarMK for persistence, but our repo is on the order of 20GB in size. We have noticed we need to run OAK offline compaction jobs with AEM6.1 because the repository does appear to grow faster than under CQ5.6.1.
From: <Roll>, Kevin <[email protected]<mailto:[email protected]>> Reply-To: "[email protected]<mailto:[email protected]>" <[email protected]<mailto:[email protected]>> Date: Tuesday, December 15, 2015 at 10:27 To: "[email protected]<mailto:[email protected]>" <[email protected]<mailto:[email protected]>> Subject: Old jobs in repository Our application depends on a lot of lightweight jobs that run quickly. During field trials we have encountered several instances where the backing database for Sling (MS SQL Server) filled up to its maximum capacity of 4Gb. I have one of these machines in front of me and I've discovered that the repository is clogged up with old jobs under the /var/eventing/jobs node. The number of jobs is practically uncountable but is potentially in the millions given the number of rows in the DEFAULT_BUNDLE table. The majority of jobs are under the 'cancelled' node but there are also some under the 'unassigned' node. This raises many questions: * Why are CANCELLED jobs persisted in the repository? Shouldn't they just go away? * Why would unassigned jobs from many days ago be sitting there? Could anything block the job queue from processing them? * The jobs are stored under a node structure that uses the year/month/day/hour/etc. Does this ever get cleaned up? This node structure would continue to grow indefinitely, consuming more space. * Is there anything I can do to clean up all these old jobs without destroying the rest of my repository? I should also mention that despite the presence of all these nodes, the Sling Jobs status page shows zero for all counters. Thanks as always, everyone! ________________________________ This E-mail and any of its attachments may contain Time Warner Cable proprietary information, which is privileged, confidential, or subject to copyright belonging to Time Warner Cable. This E-mail is intended solely for the use of the individual or entity to which it is addressed. If you are not the intended recipient of this E-mail, you are hereby notified that any dissemination, distribution, copying, or action taken in relation to the contents of and attachments to this E-mail is strictly prohibited and may be unlawful. If you have received this E-mail in error, please notify the sender immediately and permanently delete the original and any copy of this E-mail and any printout.
