On Fri, 10 Jun 2011 08:56:57 -0400, Brad Chapman wrote:
Ketil, Christian and Antoine;

> sure, it should be no problem talking about this. I have, however, no
> experience with cloud-based computing -- I prefer local clusters,
> where we have greater control over what is going on.

I currently use local machinery as well, for practical and political
reasons. I do think cloud (that is infrastructure-as-a-service) would be
great for a lot of stuff, though.

Despite the name there isn't much cloud specific about
CloudBioLinux. It is a framework to install packages and will work
on local machines as well as on Amazon instances. What the cloud
provides is a lot of bare bone machines and need bootstrapping, and
a nice way to make the results of that bootstrapping and
installation available.

With the framework you can do:

fab -f fabfile.py -H localhost install_libraries:haskell

to install all of the defined Haskell packages on a local machine.


Actually, that would make this /much/ more useful, not being cloud-bound. I'll forward it to some of our guys as well.

> An idea for big Haskell installations is to proxy hackage. Almost
> everything comes from hackage... but feel free to talk about anything

All you need is a network connection, a working, recentish ghc, and a
working cabal-install - everything else is then available on-demand.

Perfect. I just re-did the Haskell part of the install to have a
working cabal, which had fell by the wayside at some point.
Hopefully the version that ships with Ubuntu is recent enough:

$ ghc --version
The Glorious Glasgow Haskell Compilation System, version 6.12.3


My comment towards a proxy hackage is mainly because of scaling. If you only compile on one machine and have static binaries -- which you typically have; you do not need it. But several dozen or hundred machines requiring hackage access would benefit. Though I guess this is not the case here; but see below for more comments.

> I regard both my and Ketils libraries as "too unstable" to pre-include > them into a ready-to-use package. "Unstable" here basically means that
> we change things too often; though Ketils is already nicely
> stable. But if a tool like cabal-dev is available, that is of no big
> concern anyway.

I do have a bunch of tools that are reasonably stable, and I run weekly
tests to ensure that they compile against current hackage.

"My" library is a little bit of everything - some stuff is used all over
the place, some has little use but extensive testing, some is just
experimental. One option - part of the "biocore" effort - could be to
factor out the stable and well tested stuff.

What I'd really like to get is just a set of the most useful
packages a Haskell bioinformatics/statistical programmer would want
around. For instance, for Python I'd be disappointed if I worked on
a system that claimed to have support for doing Python in
bioinformatics and it didn't have Numpy or Biopython. Here's my
initial start:


https://github.com/chapmanb/cloudbiolinux/blob/master/config/haskell-libs.yaml


I still propose to install the haskell-platform -- and only the Haskell platform. This is basically what I mean with stable, packages included in the platform are "stable" everything else is not. For "bio", Ketil has to say what he thinks, but "Biobase" should never ever be part of such a pre-installed system. I change something almost every day and break backward- and forward-compatibility one or twice per week.

My workflow is basically like this:

- have the Haskell platform installed (everywhere, not that important right now as haskell builds big static blobs) - on the development machine "cabal-dev" provides a localized build environment which uses the per-system haskell-platform but has per-project additional dependencies
- compiled binaries are started on the cluster

Having faster-changing libraries pre-installed would be /extremely/ problematic due to multiple versions clashing, etc...

But then again, a Haskell developer can be much more lenient with what he needs. A simple "cabal-dev install-deps" will grab everything I need for my current project with all dependencies correctly solved.

Thanks much for all the ideas,
Brad
_______________________________________________
Biohaskell mailing list
Biohaskell@biohaskell.org
http://malde.org/cgi-bin/mailman/listinfo/biohaskell

_______________________________________________
Biohaskell mailing list
Biohaskell@biohaskell.org
http://malde.org/cgi-bin/mailman/listinfo/biohaskell

Reply via email to