Hi everyone,

We are working on designing a build cluster for the OpenOffice
BuildBot. Our goal is to have a farm of machines that community
contributors can use to quickly do (distributed) builds of OpenOffice.

I've heard that Sun Release Engineering has a build cluster in use.
We would love to know more about the cluster to help us design the
BuildBot one.

So here are a few questions:
- How many machines are there in the cluster?
- What hardware/OS are they running?
- How does the network infrastructure work? What is the design and capacity?
- How is the shared disk space handled? What type of server/software
are you using?
- How do you monitor the cluster? What loads (disk, CPU, network) are
you measuring? How do you measure them?
- What is the bottleneck? Is the cluster CPU, disk, RAM or network bound?
- How does the task distribution software work?
- How does the cluster handle nodes dying during a build?
- How many nodes can the build be parallelized on? 50? 100?
- What is the utilization of the cluster? I.e., how much parallelism are
you able to extract from the build? 75%? 80%? How does the parallelism
scale, and what is the optimum number of machines for a build?
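
To make the last question concrete, one common way to frame it is
Amdahl's law. The sketch below is only an illustration: it assumes the
build splits cleanly into a serial part and a perfectly parallel part,
and the names amdahl_speedup, efficiency and parallel_fraction are
made up for this example. The 75% and 80% figures are the guesses from
the question above, not measurements.

```python
def amdahl_speedup(parallel_fraction, nodes):
    """Ideal speedup on `nodes` machines when `parallel_fraction`
    of the single-machine build time can run in parallel."""
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / nodes)


def efficiency(parallel_fraction, nodes):
    """Fraction of the cluster's capacity doing useful work."""
    return amdahl_speedup(parallel_fraction, nodes) / nodes


# Hypothetical parallel fractions and cluster sizes, for illustration only.
for p in (0.75, 0.80):
    for n in (10, 50, 100):
        print(f"parallel={p:.0%} nodes={n:3d} "
              f"speedup={amdahl_speedup(p, n):5.2f} "
              f"efficiency={efficiency(p, n):6.1%}")
```

Under this model the speedup can never exceed 1 / (1 - parallel_fraction),
so a build that is 80% parallel tops out below 5x no matter how many
nodes are added. That is why the real parallel fraction matters so much
for sizing the cluster.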

Is there anything I'm not asking about that I should?

Thanks in advance for your answers and for helping out with this!

Take care,

Kai

--
Kai Backman, Software Engineer, [EMAIL PROTECTED]

