First off, thanks to writers of this great little description of how to do garbage collection and Fabian for pointing it out. http://wiki.apache.org/jackrabbit/DataStore#Data_Store_Garbage_Collection
My next question concerns running garbage collection in a cluster. If had a number of identical nodes running in a cluster, each of them periodically running a garbage collection task, where the periods may overlap... say nodes 1 starts and then in the middle of either the mark or the sweep, node 2 starts it's mark or perhaps even overlaps it's sweep.... what will the consequences be? Will they "collide", i.e. will their be unexpected errors (explicit exception based errors) or mis-behaviors (implicit non-identified errors)? Of course, the alternative is to guarantee that only one node in the cluster is responsible for the periodic mark and sweep. Thanks in advance for any pointers or insights. This community has been GREAT at responding to questions with very helpful solutions and bug fixes. -- Langley
