[galaxy-dev] Re: How to include a large datafile in a bioconda package?

2019-07-24 Thread Peter Cock via galaxy-dev
That seems a good compromise within conda, since BioConda wouldn't want the binary package itself to be too big. (I'm doing something similar with some real sample data for a tool, putting it up on Zenodo. Of course, this is optional for my tool - your use case is different.) The Galaxy Data Mana

[galaxy-dev] Re: How to include a large datafile in a bioconda package?

2019-07-24 Thread Jin Li
Hi Bjoern,

Thank you for your direction and information. The post-link script is exactly what I was looking for. I am glad I asked the question here. Thank you.

Best regards,
Jin

On Wed, Jul 24, 2019 at 3:26 PM Björn Grüning wrote:
> Hi Jin,
>
> you can use a post-link script in conda.
>
> Li

[galaxy-dev] Re: How to include a large datafile in a bioconda package?

2019-07-24 Thread Björn Grüning
Hi Jin, you can use a post-link script in conda. Like here: https://github.com/bioconda/bioconda-recipes/blob/master/recipes/picrust2/post-link.sh This way the data can be fetched during tool installation. See more information here: https://docs.conda.io/projects/conda-build/en/latest/resourc
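
For readers of the archive, here is a minimal sketch of what such a post-link script might look like. It is not the picrust2 script linked above. The URL, file name, and checksum are placeholders that a recipe author would replace, and the install location under `$PREFIX/share/$PKG_NAME` is only an assumption. The checksum step is one way to address the reproducibility concern raised elsewhere in this thread. conda sets `PREFIX` and `PKG_NAME` when it runs the script, and messages for the user go to `$PREFIX/.messages.txt`.

```shell
#!/bin/sh
# Sketch of a conda post-link.sh that fetches a large data file at install time.
# DATA_URL, DATA_FILE and DATA_SHA256 are hypothetical placeholders.
set -eu

DATA_URL="https://zenodo.org/record/0000000/files/example-data.tar.gz"  # placeholder
DATA_FILE="example-data.tar.gz"                                         # placeholder
DATA_SHA256="replace-with-the-real-sha256-hex-digest"                   # placeholder

# Print the SHA-256 hex digest of a file (sha256sum on Linux, shasum on macOS).
sha256_of() {
    if command -v sha256sum >/dev/null 2>&1; then
        sha256sum "$1" | cut -d' ' -f1
    else
        shasum -a 256 "$1" | cut -d' ' -f1
    fi
}

# Succeed only if the digest of file $1 equals the expected digest $2.
verify_sha256() {
    [ "$(sha256_of "$1")" = "$2" ]
}

# Download the data file into the environment and check its integrity.
fetch_data() {
    dest_dir="$PREFIX/share/$PKG_NAME"   # assumed install location
    dest="$dest_dir/$DATA_FILE"
    mkdir -p "$dest_dir"
    if ! curl -L --fail --silent --show-error -o "$dest" "$DATA_URL"; then
        echo "Download of $DATA_URL failed." >> "$PREFIX/.messages.txt"
        exit 1
    fi
    if ! verify_sha256 "$dest" "$DATA_SHA256"; then
        rm -f "$dest"
        echo "Checksum mismatch for $DATA_URL." >> "$PREFIX/.messages.txt"
        exit 1
    fi
}

# Only download when conda has provided its environment variables.
if [ -n "${PREFIX:-}" ] && [ -n "${PKG_NAME:-}" ]; then
    fetch_data
fi
```

Pinning the digest in the recipe means an altered or replaced upstream file makes the installation fail loudly instead of silently changing results.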

[galaxy-dev] Re: How to include a large datafile in a bioconda package?

2019-07-24 Thread Jin Li
Hi Brad, Thank you for your quick reply. I can put the data file on Zenodo so that it has a permanent location. As for re-computing the data file locally, that may take several days to run, so it would be quite inefficient. I am expecting an automatic download of the d

[galaxy-dev] Re: How to include a large datafile in a bioconda package?

2019-07-24 Thread Langhorst, Brad
Hi: I’d be concerned about that file changing or disappearing and causing irreproducibility. If the URL pointed to a permanent location (e.g. NCBI or Zenodo), it may be OK. Could it be re-computed locally if necessary (like a genome index)? Maybe others know of examples where this is done. Brad