Re: [openstack-dev] [Fuel] fuel master monitoring

Przemyslaw Kaminski Thu, 06 Nov 2014 03:21:46 -0800

I didn't mean a robust monitoring system, just something simpler.Notifications is a good idea for FuelWeb.

P.


On 11/06/2014 09:59 AM, Anton Zemlyanov wrote:

We can add a notification to FuelWeb, no additional software or useractions are required. I would not overestimate this method though, itis in no way the robust monitoring system. Forcing user to dosomething on a regular basis is unlikely to work.


Anton

On Thu, Nov 6, 2014 at 11:55 AM, Przemyslaw Kaminski<pkamin...@mirantis.com <mailto:pkamin...@mirantis.com>> wrote:


    I think we're missing the point here. What I meant adding a simple
    monitoring system that informed the user via UI/CLI/email/whatever
    of low resources on fuel master node. That's it. HA here is not an
    option -- if, despite of warnings, the user still continues to use
    fuel and disk becomes full, it's the user's fault. By adding these
    warnings we have a way of saying "We told you so!" Without
    warnings we get bugs like [1] I mentioned in the first post.

    Of course user can check disk space by hand but since we do have a
    full-blown UI telling the user to periodically log in to the
    console and check disks by hand seems a bit of a burden.

    We can even implement such monitoring functionality as a Nailgun
    plugin -- installing it would be optional and at the same time we
    would grow our plugin ecosystem.

    P.


    On 11/05/2014 08:42 PM, Dmitry Borodaenko wrote:

    Even one additional hardware node required to host the Fuel
    master is seen by many users as excessive. Unless you can come up
    with an architecture that adds HA capability to Fuel without
    increasing its hardware footprint by 2 more nodes, it's just not
    worth it.

    The only operational aspect of the Fuel master node that you
    don't want to lose even for a short while is logging. You'd be
    better off redirecting OpenStack environments' logs to a
    dedicated highly available logging server (which, of course, you
    already have in your environment), and deal with Fuel master node
    failures by restoring it from backups.

    On Wed, Nov 5, 2014 at 8:26 AM, Anton Zemlyanov
    <azemlya...@mirantis.com <mailto:azemlya...@mirantis.com>> wrote:

        Monitoring of the Fuel master's disk space is the special
        case. I really wonder why Fuel master have no HA option, disk
        overflow can be predicted but many other failures cannot. HA
        is a solution of the 'single point of failure' problem.

        The current monitoring recommendations
        
(http://docs.openstack.org/openstack-ops/content/logging_monitoring.html)
        are based on analyzing logs and manual checks, that are
        rather reactive way of fixing problems. Zabbix is quite good
        for preventing failures that are predictable but for the
        abrupt problems Zabbix just reports them 'post mortem'.

        The only way to remove the single failure point is to
        implement redundancy/HA

        Anton

        On Tue, Nov 4, 2014 at 6:26 PM, Przemyslaw Kaminski
        <pkamin...@mirantis.com <mailto:pkamin...@mirantis.com>> wrote:

            Hello,

            In extension to my comment in this bug [1] I'd like to
            discuss the possibility of adding Fuel master node
            monitoring. As I wrote in the comment, when disk is full
            it might be already too late to perform any action since
            for example Nailgun could be down because DB shut itself
            down. So we should somehow warn the user that disk is
            running low (in the UI and fuel CLI on stderr for
            example) before it actually happens.

            For now the only meaningful value to monitor would be
            disk usage -- do you have other suggestions? If not then
            probably a simple API endpoint with statvfs calls would
            suffice. If you see other usages of this then maybe it
            would be better to have some daemon collecting the stats
            we want.

            If we opted for a daemon, then I'm aware that the user
            can optionally install Zabbix server although looking at
            blueprints in [2] I don't see anything about monitoring
            Fuel master itself -- is it possible to do? Though the
            installation of Zabbix though is not mandatory so it
            still doesn't completely solve the problem.

            [1] https://bugs.launchpad.net/fuel/+bug/1371757
            [2]
            https://blueprints.launchpad.net/fuel/+spec/monitoring-system

            Przemek

            _______________________________________________
            OpenStack-dev mailing list
            OpenStack-dev@lists.openstack.org
            <mailto:OpenStack-dev@lists.openstack.org>
            http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev



        _______________________________________________
        OpenStack-dev mailing list
        OpenStack-dev@lists.openstack.org
        <mailto:OpenStack-dev@lists.openstack.org>
        http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

--Dmitry Borodaenko



    _______________________________________________
    OpenStack-dev mailing list
    OpenStack-dev@lists.openstack.org  
<mailto:OpenStack-dev@lists.openstack.org>
    http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev



    _______________________________________________
    OpenStack-dev mailing list
    OpenStack-dev@lists.openstack.org
    <mailto:OpenStack-dev@lists.openstack.org>
    http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev




_______________________________________________
OpenStack-dev mailing list
OpenStack-dev@lists.openstack.org
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

_______________________________________________
OpenStack-dev mailing list
OpenStack-dev@lists.openstack.org
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

Re: [openstack-dev] [Fuel] fuel master monitoring

Reply via email to