On Monday, 30 October 2023 13:42:12 AEDT Jade Ambrose wrote: > As for what we're planning technology wise, I'm genuinely not sure. Current > technologies used include terraform and ansible. I don't have any idea yet > for how we manage hardware lifecycle, or virtual machines (presuming we use > them). I have no plan yet for monitoring, and very little plan for backups. > So, lots of interesting problems to solve, and I think it'll be more > interesting to try and build a cohort of volunteers to learn about each > area and solve the problems (with guidance from mentors) than it will be > for me to have a solution ready.
For monitoring there are heavyweight monitoring systems like Nagios and derivatives that have a single console for all data. They are good for what they are but can trend towards the "dozens of things are red so let's not worry about it" scenario. For my servers I use etbemon (my fork of Trocki Mon) and have two main servers monitoring each other that monitor other servers and also both run Jabber for sending notifications on. So if one misbehaves then the other will report on it. As a general rule for monitoring you either need infrastructure that's significantly more reliable than what you are monitoring (telco networks are reliable but how you connect to them probably isn't) or multiple monitoring systems. -- My Main Blog http://etbe.coker.com.au/ My Documents Blog http://doc.coker.com.au/ _______________________________________________ luv-main mailing list -- [email protected] To unsubscribe send an email to [email protected]
