evel@r-project.org
Subject: Re: [Bioc-devel] Query regarding size limit and including external
datasets
Any TCGA MAFs released to the public were considered deidentified. That
wouldn't be the part I would worry about. It's a nice idea, and a data package
or packages seems like the idiomatic way to do it, as you noted. Personally I
think it would indeed benefit a lot of people (vs, say, GDC).
I cannot speak for the core team.
You should separate the data from the software methods and provide a data
package containing the MAFs. This has the additional advantage of
separating versioning of the mutation data from your software. As a data
package this does not sound extensive; the