Hello yall!

This is half-topic but I think everyone will benefit from the results.

Here at my company we have 18 Gentoo servers spread over 3 data centers and
our office. We have well defined and time-proven (one of the servers was
installed in 2004 and it's still the same Gentoo) processes on monitoring,
backup, applying security fixes, and maintenance. We have almost 100% of
high availability and some services even have high availability across
different data centers in Florida and California. We are using catalyst with
cfengine to save us a few hours of work every week and everything is working
great.

I would like to move forward to some other projects but most of the
knowledge (60%) required to do everything resides on my head alone. I'm hit
by a car in the streets and something might go bad, like the required
monthly database partition maintenance.

I would like to hear from the list what you are using for infrastructure,
software, processes, hardware documentation. I think I need a system with a
good user access control, an all-in-one solution to document everything.
Using Wiki+UML would solve the issue (I have 30% already documented in
wikis) but (a) none of them were designed for this specific task and (b)
they don't integrate, people would have to use two systems that knows
nothing about each other.

What are you gurus doing to be replaceable?

Thank you very much in advance!

Best regards,
Daniel Colchete

Reply via email to