[galaxy-dev] Relabeling dataset pairs in 'list:paired' collection

Peter Briggs Mon, 13 Feb 2017 08:12:11 -0800

Dear Developers

Is there an existing tool or mechanism that can be used to duplicate a "list of pairs" dataset collection, keeping the paired datasets the same but relabeling each pair with a new identifier taken from a user-supplied file or list?


I've cobbled together my own tool to try and do something like this:

https://github.com/pjbriggs/Amplicon_analysis-galaxy/blob/77340d8bb2470a646deba4933625413fc70985d1/relabel_samples.xml

and while it works, it doesn't feel like a good solution as it creates duplicates of the datasets from the first collection and consumes additional disk/quota space unnecessarily. (This is particularly undesirable as we expect that the input collections might be relatively large numbers of FASTQ pairs e.g. 30 or more.)

Looking at some of the 'Collection Operations' tools that come with Galaxy, it appears that these are able to create new collections without making duplicate datasets, which seems much better. But these tools work by directly invoking Python classes from the Galaxy core, so I don't know if a similar approach could be used in a non-core tool.
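
To make the requested operation concrete, here is a minimal sketch in plain Python. It is not Galaxy code and uses no Galaxy API: the function names, the two-column tab-separated label file, and the sample identifiers are all assumptions made for illustration. It only shows the intended behaviour, which is that identifiers change while the paired datasets are reused rather than copied.

```python
import csv
import io


def read_label_map(handle):
    """Read old -> new identifiers from a two-column tab-separated file.

    The file layout is an assumption for this sketch: one pair per line,
    old identifier first, new identifier second; '#' lines are skipped.
    """
    mapping = {}
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) < 2 or row[0].startswith("#"):
            continue
        mapping[row[0].strip()] = row[1].strip()
    return mapping


def relabel_pairs(collection, mapping):
    """Return a new list of (identifier, pair) tuples with new identifiers.

    Each pair object is passed through unchanged (same object, no copy);
    identifiers missing from the mapping are kept as they are.
    """
    return [(mapping.get(identifier, identifier), pair)
            for identifier, pair in collection]


# Hypothetical stand-ins for a label file and a 'list:paired' collection.
labels = io.StringIO("sample1\tcontrol_A\nsample2\ttreated_B\n")
collection = [
    ("sample1", {"forward": "s1_R1.fastq", "reverse": "s1_R2.fastq"}),
    ("sample2", {"forward": "s2_R1.fastq", "reverse": "s2_R2.fastq"}),
    ("sample3", {"forward": "s3_R1.fastq", "reverse": "s3_R2.fastq"}),
]

new_collection = relabel_pairs(collection, read_label_map(labels))
for identifier, pair in new_collection:
    print(identifier, pair["forward"], pair["reverse"])
```

In this sketch the third pair keeps its original identifier because the label file has no entry for it; whether a real tool should do that or report an error is a separate design question.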


Any advice or suggestions are very welcome! Thanks

Best wishes

Peter

--
Peter Briggs [email protected]
Bioinformatics Core Facility University of Manchester
B.1083 Michael Smith Bldg Tel: (0161) 2751482
