Re: Resource offers - DRF - Mesos

2018-05-22 Thread Thodoris Zois
been met. > >> On Tue, May 22, 2018 at 2:56 AM, Thodoris Zois wrote: >> Hello list, >> >> I have some questions about resource offers for Mesos and I am >> experiencing some problems that I hope somebody will be able to help. >> >> 1) The allocation mod

Re: Resource offers - DRF - Mesos

2018-05-22 Thread Joseph Wu
o the offers (i.e. not accepting nor declining them) until your necessary conditions have been met. On Tue, May 22, 2018 at 2:56 AM, Thodoris Zois wrote: > Hello list, > > I have some questions about resource offers for Mesos and I am > experiencing some problems that I hope somebod

Resource offers - DRF - Mesos

2018-05-22 Thread Thodoris Zois
Hello list, I have some questions about resource offers for Mesos and I am experiencing some problems that I hope somebody will be able to help. 1) The allocation module of Mesos master uses DRF (according to previous allocation history) and decides which framework will get an offer, and how

Re: resource offers after task failure

2017-02-26 Thread Benjamin Mahler
t to forward the status immediately and as a result there can be intermediate offers for resources on the agent before the reserved resources are re-offered. Further more it looks like I also sometimes receive a resource offers > request that only lists my reserved resources but no resourc

resource offers after task failure

2017-02-24 Thread Hendrik Haddorp
Hi, I'm using Mesos 0.28.2 and my own framework, which uses dynamic reservations. A task failed and shortly after that I received a resource offers call that did not contain my reserved resources. I had expected that the request would already contain my reserved resources that are no

Re: Framework is registered, but never receives resource offers

2016-09-27 Thread Eli Jordan
running on the >>> same node as the master. Things gets slightly more tricky if your scheduler >>> is running in a docker container. >>> >>> regards, >>> Hendrik >>> >>>> On 27.09.2016 14:34, Eli Jordan wrote: >>>> Y

Re: Framework is registered, but never receives resource offers

2016-09-27 Thread Hendrik Haddorp
ordan wrote: Yes, it appears in the mesos ui, and stays there. I log all messages from the mesos master, including resource offers and disconnected. I don't receive offers or disconnected. I know I need to accept or decline the offers, the problem is that I never receive the resource offe

Re: Framework is registered, but never receives resource offers

2016-09-27 Thread Gmail
ainer. > > regards, > Hendrik > >> On 27.09.2016 14:34, Eli Jordan wrote: >> Yes, it appears in the mesos ui, and stays there. I log all messages from >> the mesos master, including resource offers and disconnected. I don't >> receive offers or disconnecte

Re: Framework is registered, but never receives resource offers

2016-09-27 Thread Hendrik Haddorp
on the same node as the master. Things gets slightly more tricky if your scheduler is running in a docker container. regards, Hendrik On 27.09.2016 14:34, Eli Jordan wrote: Yes, it appears in the mesos ui, and stays there. I log all messages from the mesos master, including resource offers and

Re: Framework is registered, but never receives resource offers

2016-09-27 Thread Eli Jordan
Yes, it appears in the mesos ui, and stays there. I log all messages from the mesos master, including resource offers and disconnected. I don't receive offers or disconnected. I know I need to accept or decline the offers, the problem is that I never receive the resource offer, but the m

Re: Framework is registered, but never receives resource offers

2016-09-27 Thread Olivier Sallou
On 09/27/2016 02:08 PM, Gmail wrote: > Hi > > I am implementing a mesos framework, and have hit a strange issue that I > can't make sense of. Intermittently, my framework will receive the registered > message, and is shown as registered in the mesos ui. > > I never see any resource offer messa

Framework is registered, but never receives resource offers

2016-09-27 Thread Gmail
Hi I am implementing a mesos framework, and have hit a strange issue that I can't make sense of. Intermittently, my framework will receive the registered message, and is shown as registered in the mesos ui. I never see any resource offer messages being processed by the framework, however, the

Re: resource offers

2016-09-26 Thread Hendrik Haddorp
<mailto:hendrik.hadd...@gmx.net> > <mailto:hendrik.hadd...@gmx.net > <mailto:hendrik.hadd...@gmx.net>>>> wrote: > > Hi, > > I have three Mesos cl

Re: resource offers

2016-09-26 Thread Guangya Liu
cline offer? If you are enabling GLOG_v=2 for >>> mesos master, you will get some log as "Framework xxx filtered >>> agent for " >>> >>> On Mon, Sep 26, 2016 at 2:47 PM, Hendrik Haddorp >>> mailto:hendrik.hadd...@gmx.net

Re: resource offers

2016-09-26 Thread Hendrik Haddorp
hadd...@gmx.net> <mailto:hendrik.hadd...@gmx.net <mailto:hendrik.hadd...@gmx.net>>> wrote: Hi, I have three Mesos cluster test setups. On two my frameworks gets the resource offers from all slaves in one "resourceOffe

Re: resource offers

2016-09-26 Thread Hendrik Haddorp
, I have three Mesos cluster test setups. On two my frameworks gets the resource offers from all slaves in one "resourceOffers" call. In one three node setup I do however sometimes get offers for all slaves but most of t

Re: resource offers

2016-09-26 Thread Guangya Liu
ered agent for " >> >> On Mon, Sep 26, 2016 at 2:47 PM, Hendrik Haddorp > <mailto:hendrik.hadd...@gmx.net>> wrote: >> >> Hi, >> >> I have three Mesos cluster test setups. On two my frameworks gets >> the resource offers from all

Re: resource offers

2016-09-26 Thread Hendrik Haddorp
rks gets the resource offers from all slaves in one "resourceOffers" call. In one three node setup I do however sometimes get offers for all slaves but most of the time I get first two offers and then the third in a separate call. Should I get all offers in one call or do I

Re: resource offers

2016-09-26 Thread Guangya Liu
k xxx filtered agent for " On Mon, Sep 26, 2016 at 2:47 PM, Hendrik Haddorp wrote: > Hi, > > I have three Mesos cluster test setups. On two my frameworks gets the > resource offers from all slaves in one "resourceOffers" call. In one three > node setup I do however some

resource offers

2016-09-25 Thread Hendrik Haddorp
Hi, I have three Mesos cluster test setups. On two my frameworks gets the resource offers from all slaves in one "resourceOffers" call. In one three node setup I do however sometimes get offers for all slaves but most of the time I get first two offers and then the third in a sepa

Re: Reconnected slaves not sending resource offers?

2016-04-25 Thread Thomas Petr
Ah, thanks for the clarification. I can't find any logs from the framework indicating that we got the initial offer, so it looks like it could have been dropped. We haven't set --offer-timeout on our masters, so your explanation makes sense. Thanks! On Mon, Apr 25, 2016 at 4:17 PM, Vinod Kone wro

Re: Reconnected slaves not sending resource offers?

2016-04-25 Thread Vinod Kone
> I0421 21:03:32.014999 17071 master.cpp:4290] Sending 1 offers to > framework sy3x4 (sy3x4) at > scheduler-6bb2bcf0-d060-4072-a25b-917d8007fb1c@172.16.13.243:56861 > This shows that the slaves resources were sent to a framework. Looks like the framework is holding on to the offer for a long time?

Re: Reconnected slaves not sending resource offers?

2016-04-25 Thread Thomas Petr
I0421 21:03:32.014533 17073 hierarchical.hpp:528] Added slave 20151116-203437-35000492-5050-17068-S70 (lively-rice) with mem(*):217609; cpus(*):210; ports(*):[2048-3048]; disk(*):639829 (allocated: ) I0421 21:03:32.014529 17072 master.cpp:3395] Registered slave 20151116-203437-35000492-5050-17068-S

Re: Reconnected slaves not sending resource offers?

2016-04-25 Thread Vinod Kone
On Mon, Apr 25, 2016 at 8:40 AM, Thomas Petr wrote: > The only thing that ended up fixing the situation was bouncing our > scheduler (~10 minutes after the restarted slaves joined the cluster) -- > the act of failing over the framework appeared to "recover" the missing > resources: > What do the

Reconnected slaves not sending resource offers?

2016-04-25 Thread Thomas Petr
03437-35000492-5050-17068-S70 I0421 21:03:32.016317 53215 status_update_manager.cpp:183] Resuming sending status updates Everything makes perfect sense up to this point. The slaves appear to be online and connected to the cluster, but we quickly noticed that these slaves were not sending resource of

Re: Port Resource Offers

2016-03-29 Thread Pradeep Chhetri
eview :). >>> >>> >>> Da (Klaus), Ma (马达) | PMP® | Advisory Software Engineer >>> Platform OpenSource Technology, STG, IBM GCG >>> +86-10-8245 4084 | klaus1982...@gmail.com | http://k82.me >>> >>> >>>

Re: Port Resource Offers

2016-03-29 Thread Erik Weathers
:). >> >> >> Da (Klaus), Ma (马达) | PMP® | Advisory Software Engineer >> Platform OpenSource Technology, STG, IBM GCG >> +86-10-8245 4084 | klaus1982...@gmail.com | http://k82.me >> >> >> -- >> Date: Tue, 29 Mar 201

Re: Port Resource Offers

2016-03-29 Thread Pradeep Chhetri
gt; > > Da (Klaus), Ma (马达) | PMP® | Advisory Software Engineer > Platform OpenSource Technology, STG, IBM GCG > +86-10-8245 4084 | klaus1982...@gmail.com | http://k82.me > > > -- > Date: Tue, 29 Mar 2016 10:51:44 +0100 > Subject: P

RE: Port Resource Offers

2016-03-29 Thread Klaus Ma
+86-10-8245 4084 | klaus1982...@gmail.com | http://k82.me Date: Tue, 29 Mar 2016 10:51:44 +0100 Subject: Port Resource Offers From: pradeep.chhetr...@gmail.com To: user@mesos.apache.org Hello, I am running mesos slaves with the modified port announcement. $ cat /etc/mesos-slave/resourcesports

Port Resource Offers

2016-03-29 Thread Pradeep Chhetri
Hello, I am running mesos slaves with the modified port announcement. $ cat /etc/mesos-slave/resources ports(*):[6379, 9200, 9300, 27017, 31000-35000] I can that this is being picked up when starting the mesos slaves in ps output: --resources=ports(*):[6379, 9200, 9300, 27017, 31000-35000] How

Re: Not getting resource offers for 20 min

2015-08-25 Thread Hans van den Bogert
Wanted to add that, even if there wasn’t a preview package, you can clone from GIT, and checkout a tag, where in this case v1.5.0-rc1 is tagged. Then proceeded normally as you would’ve had a source distro as described in the already mentioned http://spark.apache.org/docs/latest/building-spark.ht

Re: Not getting resource offers for 20 min

2015-08-25 Thread CCAAT
THANKS, as I have not kept up on the spark lists James On 08/25/2015 04:28 AM, Iulian Dragoș wrote: On Mon, Aug 24, 2015 at 7:16 PM, CCAAT mailto:cc...@tampabay.rr.com>> wrote: On 08/24/2015 05:33 AM, Iulian Dragoș wrote: Hello Iulian, Ok, so I eventually build spark from

Re: Not getting resource offers for 20 min

2015-08-25 Thread Iulian Dragoș
On Mon, Aug 24, 2015 at 7:16 PM, CCAAT wrote: > On 08/24/2015 05:33 AM, Iulian Dragoș wrote: > > > Hello Iulian, > > Ok, so I eventually build spark from 100% sources, after some intermediate > builds on gentoo. Gentoo is not the best platform for Java development, > but those issues related to

Re: Not getting resource offers for 20 min

2015-08-24 Thread CCAAT
On 08/24/2015 05:33 AM, Iulian Dragoș wrote: Hello Iulian, Ok, so I eventually build spark from 100% sources, after some intermediate builds on gentoo. Gentoo is not the best platform for Java development, but those issues related to spark builds are slowly being fixed on gentoo. Where (ho

Re: Not getting resource offers for 20 min

2015-08-19 Thread Iulian Dragoș
chance of getting the same amount of > memory, but as their dominant resource is lower (memory) they more often > get CPU resources compared to that first instance. Counter intuitively, the > first instance finishes last. > > On 19 Aug 2015, at 14:07, Iulian Dragoș > wrote: > &

Re: Not getting resource offers for 20 min

2015-08-19 Thread Hans van den Bogert
t getting any resource offers for > 15-20 minutes, while other frameworks (8-9 of them) continuously get offers. > > The framework is Spark (running in fine-grained mode), and is launched with > Chronos. After a few tasks successfully executed, it stops getting offers, > though lo

Not getting resource offers for 20 min

2015-08-19 Thread Iulian Dragoș
I am facing a problem with a framework not getting any resource offers for 15-20 minutes, while other frameworks (8-9 of them) continuously get offers. The framework is Spark (running in fine-grained mode), and is launched with Chronos. After a few tasks successfully executed, it stops getting

Re: Setting Rate of Resource Offers

2015-06-19 Thread Christopher Ketchum
rs, it won't be re-offered the >>>> same resources for some period of time. >>>> >>>> On Sat, Jun 13, 2015 at 8:30 PM, Ondrej Smola >>>> wrote: >>>> Hi Christopher, >>>> >>>> i dont know about any way way how

Re: Setting Rate of Resource Offers

2015-06-18 Thread Alex Rukletsov
mola >>>> wrote: >>>> >>>>> Hi Christopher, >>>>> >>>>> i dont know about any way way how to speed up first resource offer - >>>>> in my experience new offers arrive almost immediately after framework >>>&

Re: Setting Rate of Resource Offers

2015-06-17 Thread Christopher Ketchum
registration. It depends on the infrastructure you are testing your >>> framework on - are there any >>> other frameworks running? As is discussed in an another thread offers >>> should be send to multiple frameworks at once. There may be small >>> delay based on i

Re: Setting Rate of Resource Offers

2015-06-17 Thread Vinod Kone
ng your >>>> framework on - are there any >>>> other frameworks running? As is discussed in an another thread offers >>>> should be send to multiple frameworks at once. There may be small >>>> delay based on initial registration and network delay. If

Re: Setting Rate of Resource Offers

2015-06-17 Thread Christopher Ketchum
multiple frameworks at once. There may be small >> delay based on initial registration and network delay. If you speak >> about "reoffers" - reoffering >> decline offers - there should param to set interval for reoffer. For >> example in Go you can decline offer

Re: Setting Rate of Resource Offers

2015-06-17 Thread Alexander Gallego
rs >>>> should be send to multiple frameworks at once. There may be small >>>> delay based on initial registration and network delay. If you speak >>>> about "reoffers" - reoffering >>>> decline offers - there should param to set interval for

Re: Setting Rate of Resource Offers

2015-06-17 Thread Vinod Kone
ld param to set interval for reoffer. For >>> example in Go you can decline offer this way (it is also important to >>> decline every non used offer): >>> >>> driver.DeclineOffer(offer.Id, &mesos.Filters{RefuseSeconds: >>> proto.Float64(5)}) >>&g

Re: Setting Rate of Resource Offers

2015-06-17 Thread Christopher Ketchum
p;mesos.Filters{RefuseSeconds: proto.Float64(5)}) > > Look to mesos UI - it shoud give you information abou what offers are > offered to which frameworks, mesos master logs also give you this > information. > > > 2015-06-13 18:23 GMT+02:00 Christopher Ketchum <mailto:cketc...

Re: Setting Rate of Resource Offers

2015-06-14 Thread Alex Gaudio
UI - it shoud give you information abou what offers are >> offered to which frameworks, mesos master logs also give you this >> information. >> >> >> 2015-06-13 18:23 GMT+02:00 Christopher Ketchum : >> > Hi, >> > >> > I was wondering if there

Re: Setting Rate of Resource Offers

2015-06-14 Thread Alex Rukletsov
d give you information abou what offers are > offered to which frameworks, mesos master logs also give you this > information. > > > 2015-06-13 18:23 GMT+02:00 Christopher Ketchum : > > Hi, > > > > I was wondering if there was any way to adjust the rate of resource &g

Re: Setting Rate of Resource Offers

2015-06-13 Thread Ondrej Smola
ion. 2015-06-13 18:23 GMT+02:00 Christopher Ketchum : > Hi, > > I was wondering if there was any way to adjust the rate of resource offers to > the framework. I am writing a mesos framework, and when I am testing it I am > noticing a slight pause were the framework seems to be wa

Setting Rate of Resource Offers

2015-06-13 Thread Christopher Ketchum
Hi, I was wondering if there was any way to adjust the rate of resource offers to the framework. I am writing a mesos framework, and when I am testing it I am noticing a slight pause were the framework seems to be waiting for another resource offer. I would like to know if there is any way to

Re: implementing data locality via mesos resource offers

2015-01-20 Thread Adam Bordelon
flags when you launch a mesos slave. And when this slave's resources is >>>> being offered, it will also include all the attributes you've tagged. >>>> >>>> This currently is static information on launch, and I believe there is >>>> JIRA tic

Re: implementing data locality via mesos resource offers

2015-01-16 Thread Sharma Podila
static information on launch, and I believe there is >>> JIRA tickets to make this dynamic (updatable at runtime). >>> >>> Tim >>> >>> On Thu, Jan 15, 2015 at 7:23 PM, Douglas Voet >>> wrote: >>> >>>> Hello, >>>>

Re: implementing data locality via mesos resource offers

2015-01-16 Thread Douglas Voet
lude all the attributes you've tagged. >>> >>> This currently is static information on launch, and I believe there is >>> JIRA tickets to make this dynamic (updatable at runtime). >>> >>> Tim >>> >>> On Thu, Jan 15, 2015 at 7:23 PM, D

Re: implementing data locality via mesos resource offers

2015-01-16 Thread Tim Chen
t; Tim >> >> On Thu, Jan 15, 2015 at 7:23 PM, Douglas Voet >> wrote: >> >>> Hello, >>> >>> I am evaluating mesos in the context of running analyses of many large >>> files. I only want to download a file to a small subset of my nodes and >&

Re: implementing data locality via mesos resource offers

2015-01-16 Thread Sharma Podila
file to a small subset of my nodes and >> route the related processing there. The mesos paper talks about using >> resource offers as a mechanism to achieve data locality but I can't find >> any reference to how one might do this in the documentation. How would a >> meso

Re: implementing data locality via mesos resource offers

2015-01-16 Thread Tim Chen
nd > route the related processing there. The mesos paper talks about using > resource offers as a mechanism to achieve data locality but I can't find > any reference to how one might do this in the documentation. How would a > mesos slave know what data is available keeping in mind that t

implementing data locality via mesos resource offers

2015-01-15 Thread Douglas Voet
Hello, I am evaluating mesos in the context of running analyses of many large files. I only want to download a file to a small subset of my nodes and route the related processing there. The mesos paper talks about using resource offers as a mechanism to achieve data locality but I can't fin

Re: A problem with resource offers

2014-11-07 Thread Sharma Podila
ded >> framework 20141106-193147-16842879-5050-10406- >> I1106 19:32:29.647886 10423 http.cpp:391] HTTP request for >> '/master/state.json' >> >> >> On Thu, Nov 6, 2014 at 6:53 PM, Benjamin Mahler < >> benjamin.mah...@gmail.com>

Re: A problem with resource offers

2014-11-07 Thread Adam Bordelon
h version of the master are you using and do you have the logs? The >> fact that no offers were coming back sounds like a bug! >> >> As for using O1 after a disconnection, all offers are invalid once a >> disconnection occurs. The scheduler driver does not automatically rescind

Re: A problem with resource offers

2014-11-06 Thread Timothy Chen
ing back sounds like a bug! >> >> As for using O1 after a disconnection, all offers are invalid once a >> disconnection occurs. The scheduler driver does not automatically rescind >> offers upon disconnection, so I'd recommend clearing all cached offers when >&

Re: A problem with resource offers

2014-11-06 Thread Sharma Podila
ST > updates. > > On Thu, Nov 6, 2014 at 6:25 PM, Sharma Podila wrote: > >> We had an interesting problem with resource offers today and I would like >> to confirm this problem and request an enhancement. Here's the summary in >> the right sequence of events: >&g

Re: A problem with resource offers

2014-11-06 Thread Benjamin Mahler
, so I'd recommend clearing all cached offers when your scheduler gets disconnected, to avoid the unnecessary TASK_LOST updates. On Thu, Nov 6, 2014 at 6:25 PM, Sharma Podila wrote: > We had an interesting problem with resource offers today and I would like > to confirm this problem and

A problem with resource offers

2014-11-06 Thread Sharma Podila
We had an interesting problem with resource offers today and I would like to confirm this problem and request an enhancement. Here's the summary in the right sequence of events: 1. resource offer O1 for slave A arrives 2. mesos disconnects 3. mesos reregisters 4. mesos offer O2 for sl

Re: Question on resource offers and framework failover

2014-05-16 Thread Sharma Podila
> > I'm not sure these two cases are any different. The TASK_INVALID_OFFER > would model a terminal state for the task. Afterwards, one still has to > generate a new "TaskInfo" in so far as the TaskID should not be re-used > across launch requests. I was expecting to reuse the TaskID. If it can't

Re: Question on resource offers and framework failover

2014-05-16 Thread Sharma Podila
> > > (1) If the slave is unknown, we send TASK_LOST. > (2) If the task is missing on the slave, we send TASK_LOST. > (3) If the task state differs, we send the latest state. > In the absence of bugs or data loss, (1) is the only one that is strictly > necessary for correctness. In your case,

Re: Question on resource offers and framework failover

2014-05-16 Thread Benjamin Mahler
Thanks for providing more details! I'm not sure these two cases are any different. The TASK_INVALID_OFFER would model a terminal state for the task. Afterwards, one still has to generate a new "TaskInfo" in so far as the TaskID should not be re-used across launch requests. *For example, what if r

Re: Question on resource offers and framework failover

2014-05-16 Thread Benjamin Mahler
> > Where as, a TASK_LOST will make me (unnecessarily, in this case) try to > ensure that the task is actually lost, not running away on the slave that > got disconnected from Mesos master. Not all environments may need the > distinction, but at least some do. To be clear, are you still planning

Re: Question on resource offers and framework failover

2014-05-15 Thread Sharma Podila
TASK_LOST is a good thing. I expect to deal with it now and in the future. I was trying to distinguish this: - case TASK_LOST: - persist state update to TASK_LOST - create new task submission request - schedule with next available offer - case TASK_INVALID_OFFER: - pe

Re: Question on resource offers and framework failover

2014-05-13 Thread Sharma Podila
​Thanks for confirming that, Adam. ​ > , but it would be a good Mesos FAQ topic. I was thinking it might be good to also add to doc in code, either in mesos.proto or MesosSchedulerDriver (mesos.proto already refers to the latter for failover at FrameworkID definition). If you were to try to pers

Re: Question on resource offers and framework failover

2014-05-13 Thread Adam Bordelon
Correct, Sharma. I don't think this is documented anywhere yet, but it would be a good Mesos FAQ topic. When the master notices that the framework has exited or is deactivated, it disables the framework in the allocator so no new offers will be made to that framework, and removes any outstanding of

Question on resource offers and framework failover

2014-05-12 Thread Sharma Podila
My understanding is that when a framework fails over (either new instance starts after previous one fails, or the same instance restarts), Mesos master would automatically cancel any unused offers it had given to the previous framework instance. This is a good thing. Can someone confirm this to be