Re: [Alchemi-users] Re: [Alchemi-developers] Grid-based Malware

Krishna Sat, 25 Feb 2006 19:04:23 -0800

Hi John,

Yes, this is getting quite long - but hopefully useful.


So, regarding (1) below:

We are currently using a custom logger class which basically closelyechoes the log4Net logger, as in there are logger.debug, warn, infomethods... All it does is fire up an event so that the eventhandler candeal with the actual logging. So, in the executor for example, whenrunning in service-mode or normal mode, the actual service-code /application GUI code handles this log event and does somethingappropriate. In both cases, we just use the log4Net library to write themessage to a log file. Now, we could have simply used the log4Netlibrary inside the Alchemi core dll as well. However I though it isbetter to have this extra abstraction, so that if we decide to use someother logging library, it would be easy to change...just need to changeit at one place..in the GUI app / service-app, instead of all over theplace in the core library. Also, this means the Alchemi core library isactually independent of any external logging library. So John, any helpwould be useful :)


At the risk of repeating myself:

The Gridbus broker, is an interesting project. In fact, part of it islike the Alchemi manager, but a bit more advanced. It has 3 or 4different scheduling algorithms, which are the results of our researchat GRIDS lab, at Melbourne University. It includes an economy-basedscheduler, which I mentioned a while back...and also a data-awarescheduler which tries to optimise data transfer by choosing acompute-server (or executor) close to a datahost (which hosts the dataneeded by a grid app), based on their network proximity among otherthings. (Of course, this data scheduler is aimed at grid apps that needdata in the order of gigabytes....). For more info (just in case someonehas missed it) you may want to check out: http://www.gridbus.org/brokerWe are thinking of borrowing some ideas from the broker to incorporateinto new schedulers in Alchemi's manager.(And yes, both the broker and alchemi are actually part of my day job;). (And John, congrats and all the best with MS...:)


Regarding (2):

Yes, I think if a Manager goes down, when a GThread is executing on anexecutor, then the Executor does not know what to do with the GThread.In fact, I am not really sure what would happen. I guess it would justthrow up an exception, and perhaps even bring down the entire Executorapplication. (hmm...need to check the code to try and guess what couldhappen...havent really tested that one....)Also, just wanted to clarify that, an executor can only be connected toone manager at any point of time.And I am not really sure the term "executor" in the idea you mentionedbelow, has the same meaning as the executor in Alchemi at present. Butyes, we could have that seperate layer, to handle appdomain life times.


Cheers
Krishna.

Wow, this thread is starting to get long but I think a lot of gooddetails are coming to light and are being recorded for posterity inthe mailing list. Now if we could just compile it into on resource. :D
If I put my comments inline they will be hard to read so I will try toquery/respond by providing section number.
1.) Krishna I think that I may be able to help with this. I have raninto similiar issues with logging in secondary appdomains. I don'thave the code setting in front of me so excuse my ignorance but whatare you using for logging currently? No sweat about not having timefor Alchemi, we all have day jobs and understand. I'm getting readyto start at Microsoft out in Redmond at the end of March so I will befairly busy the next couple of months. I'll have to check out thework you are doing with Grid Broker, sounds interesting.
2.) So Krishna what is the behavior if we have a Manager that goesbelly up or communications with the net is severed. Does all workernodes of that manager leave the appdomain hanging until the Executoris shutdown? If an executor is connected to several managers andtheir GApplication that is being run on that worker node on theManagers behalf is larger which may very well happen with the type ofapplications that are suitable for grid enablement this could become aprominent issue. What I was proposing is as follows. TheServiceManager/ExecutorController is what a Manager communicates withon a worker node.. This Controller will fire up and manage appdomainsbased upon a number of Managers and then start an executor in thatappdomain. These 'executors' objects are based upon MarshalByRefobjects that have configurable lease lifetime on them. The Controlleracts as a bridge between the 'Manager' and the 'Executor' routing allcalls to the 'Executor'. Everytime communications happen between thetwo the lease lifetime is extended. If a Manager drops off line orcommunications is cut for whatever reason the lease lifetime willexpire for that 'Executor' and the Controller will be notified by adelegate and then clean up the 'abandoned' appdomain. The can also beinitiated by the Manager when it is done executing is work. As I seeit it is just another layer of abstraction between the Manager andExecutor that allows for a little more robustness.
I think I helped Tibor out with the threading issues he was facing.Tibor, did that work for you?
3.) Krishna, I was thinking of a little longer caching lifetime fordlls. So lets say one day a Manager1 needs AppA to be executed on thegrid. You have to push the dlls of that AppA down to each workernode. You finish your work for that day, the manager notifies theworker nodes that it no longer needs their services and they clean upany executable payload that was pushed to them. Next day Manager2needs AppA to be executed on the grid. Follow the same exact steps asthe first day. Now shorten the time to 12 hours, 1 hour, 1 minute. Alot of redundant bytes could be flying around the grids topology. Ifwe could cache the Apps being pushed around on the worker nodes andhave a manager check to see if it is on a worker node before pushingit it would make it less network intensive. Basically all managerspush an app to the controller on a worker node if it doesn't alreadyreside there. Then when the 'Executor' is loading up the App for the'Manager' it is pulled from this central repository, which is ineffect mutliple folders, one per app, and copies them to a shadowdirectory which the 'Executors' appdomains path points to. This alowsfor multiple versions of the same app to be run side by side indifferent app domains.
4.) Security. Krishna I agree with everything you said. You wouldwant exactly that level of control of security.
5.) Krishna, if I am reading between the lines correctly it wouldalmost seem that you are talking about some sort of P2P overlaytopology for the grid. If this correct I think that it is a fantasticidea. I have been involved in a couple of P2P apps and I would beglad to lend a hand with implementation for Alchemi. This would allowfor clustering of resources. It would also allow for pushing throughfirewalls and all manner of network nastiness that can happen. But,like I alluded to above, I'll be busy until probably mid-May gettingsettled in with my new employer. After that I would be happy tocontribute to the project.
Have a great day,
John




-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
alchemi-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/alchemi-users

Re: [Alchemi-users] Re: [Alchemi-developers] Grid-based Malware

Reply via email to