We are looking into ways to integrate galaxy into the workflows of users at our cluster, with lots of NGS users running all and any kind of analyses on their typically huge amounts of data. For this we use a parallel file system, available on all compute nodes.

This file system, although approx 1PB in size, is constantly filling up, and thus we are not very attracted by the idea of copying files into/out of galaxy for each analysis.

Thus, we would be interested to know what are the options for working with existing/external(to galaxy) file systems?

Eg. would it be possible to link files into some kind of galaxy file system (I'm not totally clear about how galaxy stores it's data, although I found out that stuff is created in database/files), from outside?

... or is there any work going on for selecting any file system path as input in galaxy workflows?

... or any other hints?

As said, I'm quite new to galaxy, trying to grok my head around how we can use it, so all hints are welcome.

Best Regards
// Samuel

Developer at SNIC-UPPMAX www.uppmax.uu.se
Developer at Dept of Pharm Biosciences www.farmbio.uu.se
Twitter - twitter.com/samuellampa
Blog - saml.rilspace.org
G+ - gplus.to/saml

Please keep all replies on the list by using "reply all"
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:


Reply via email to