Hi Richard,

We've been running DUCC in a cloud environment for almost a year now. The
DUCC master and glusterfs servers run on bare metal, and all of the
workstations and worker machines run on VMs. Cluster users add VMs to the
cluster as needed. A job can be started on one or more workers, and when
additional VMs are added dynamically the job automatically scales out to
use them. A common system image is maintained on all VMs via an
LDAP server and shared filesystem data. Users belong to groups and share
machines allocated by members of the group.

A DUCC VM-image is used to automatically connect new VMs to the DUCC master
and glusterfs. The DUCC master configuration may be updated at any time, for
example to add new groups or even update the master software. VMs automatically
sync DUCC software and configuration each time they start their DUCC agent.
The VM image supports three different machine types: a graphical
workstation, a CPU worker and a GPU worker. DUCC spawns work on specified
worker machine types and even specific machines. Workstations are optional
as DUCC requests can be submitted from worker machines. Docker images are
supported using Podman. Podman runs rootless and allows access to mounted
file systems only with the user's credentials.

In order to keep some level of data security, a group directory is only
mounted on the VMs created by members of the group. Individual users
maintain file permissions as desired, but since anyone who creates a VM has
root access, they could become any other user and access data from other
group members. There is a self-service glusterfs webapp that is used to
export group data to new VMs and manage quotas.

The VM-image builder and glusterfs webapp are not yet part of Apache DUCC.
I am not sure about running the DUCC master and agent components in Docker
containers. Can Kubernetes master and agent components run this way?

Regards,
Eddie




On Wed, Mar 18, 2020 at 7:03 AM Richard Eckart de Castilho <[email protected]>
wrote:

> Hi all,
>
> does anybody have experience to share of running UIMA DUCC on a
> container-based and/or cloud-based infrastructure?
>
> I found a third-party project which helps setting up a DUCC cluster with
> docker:
>
>   https://github.com/aleksey-hariton/uima-ducc-docker
>
> Are there more relevant resources or experiences you can share?
>
> Is anybody running DUCC on AWS or Kubernetes or similar platforms?
>
> Does DUCC run there "out of the box" or is customization/plumbing
> required? If so, how much?
>
> Looking forward to your stories!
>
> -- Richard
