Jobs/tasks wait in framework schedulers, not mesos master. Autoscaling
triggers must come from schedulers, not only because that's who knows the
pending task set size, but, also because it knows how many of them need to
be launched right away, on what kind of machines.

We built such an autoscaling capability in our framework schedulers. The
autoscaling is achieved by our library Fenzo
<https://github.com/Netflix/Fenzo> which we open sourced recently. Also
read about Fenzo autoscaling here
<https://github.com/Netflix/Fenzo/wiki/Autoscaling>. You should look into
using that if you are developing your own scheduler. Or, have your
scheduler team pick up Fenzo for autoscaling.

Also, note that scaling up is temptingly easy by watching the pending task
queue. But, scaling down requires bin packing, etc. Other issues pop up as
well, for example:

- what if a user submits tasks that cannot be satisfied? Will autoscale
keep increasing the cluster size unbounded?
- what if you would like to have a heterogeneous mix of hosts and tasks?
which kind of hosts do you need to autoscale based on which tasks are
pending?

These are automatically addressed in Fenzo.

Sharma


On Wed, Sep 23, 2015 at 4:56 AM, Aaron Carey <[email protected]> wrote:

> No, I basically had the same question as Jim (but maybe didn't word it so
> well ;))
>
> I'll have a look at your response there :)
>
> ------------------------------
> *From:* haosdent [[email protected]]
> *Sent:* 23 September 2015 10:12
> *To:* [email protected]
> *Subject:* Re: Metric for tasks queued/waiting?
>
> Does /metrics/snapshot not satisfy your requirement?
>
> On Wed, Sep 23, 2015 at 4:50 PM, Aaron Carey <[email protected]> wrote:
>
>> Hi all,
>>
>> Is there any way to get a metric of all tasks currently waiting/queued in
>> Mesos (across all schedulers)? The snapshot metrics seem to cover ever
>> other kind of task state? This would be quite useful for auto-scaling
>> purposes..
>>
>> Thanks,
>> Aaron
>>
>
>
>
> --
> Best Regards,
> Haosdent Huang
>

Reply via email to