Hi Vinny,
One option is to filter for a single representative transcript in your
BED file from UCSC as a first step or to use that sort of list as a
filter for your final result (if the data is still labeled by
transcriptIDs). If using the "UCSC Genes" track, the table is called
"knownCanonical".
Another option is to consider the tools in "Operate on Genomic
Intervals" and to if any meet your criteria.
https://bitbucket.org/galaxy/galaxy-central/wiki/GopsDesc
Merge or Cluster may be what you want. Note: this can result in gene
models that are not represented by a single transcript in the primary
query species.
If you have more questions, please let us know, and kindly keep the cc
to galaxy-user so that the Galaxy team and community can offer input,
Best,
Jen
Galaxy team
On 6/9/11 10:17 AM, Vincent Joseph Lynch wrote:
To Whom It May Concern,
Sorry to bother you with what is likely a fairly simple problem, but I have
trying to figure this out myself for several days and just can't figure out how
to do it.
I have a set of 8766 genes that I would like to test for positive selection in using various
other programs (HyPhy for example). To do this I obviously need an alignment of these genes
across various species, but I just can't figure out how to get the alignment in a fasta
format. For example, I have a BED12 file from UCSC with the data for the 8766 genes, I thought
the easiest way was to use the "Stitch Gene blocks" option and then select locally
cached alignments as the MAF source for the species I care about. However, because these 8766
genes have multiple transcripts I end up with 23,581 regions. Is there a way to merge the
multiple regions for each gene into a single region for the longest transcript? Then I should
have 8766 regions and can use Stitch Gene blocks". (Unless there is a more economical way
to do this.)\
Thanks
Vinny
Vincent J. Lynch, Associate Research Scientist
Department of Ecology and Evolutionary Biology& Yale Systems Biology Institute
Yale University
http://pantheon.yale.edu/~vjl4/profpage/
"There is a grandeur in this view of life, with its several powers,
having been originally breathed into a few forms or into one; and that
whilst this planet has gone on cycling according to the fixed laws of
gravity, from so simple a beginning endless forms most beautiful and most
wonderful have been, and are being, evolved." -C. Darwin, 1859
(Walker, Wisconsin, Madison, Maddow, Tea Party, Obama, global warming)
___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using "reply all" in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists,
please use the interface at:
http://lists.bx.psu.edu/
--
Jennifer Jackson
http://usegalaxy.org
http://galaxyproject.org
___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using "reply all" in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists,
please use the interface at:
http://lists.bx.psu.edu/