Hello - thank you for your patience.  We've pored through archives with no luck 
on these.

Our goal is to isolate the sequence that codes for any protein, along with 
flanking genomic sequence.  It looks like the Table should make this easy, but 
we've run into 2 distinct problems.

1. What are "5' UTR exons" and "3' UTR exons"?  We had hoped that these were 
purely non-coding sequence, but BLATting some of them matched exons in the 
middle of genes.  Are they sequences that CAN be alternatively spliced but are 
not always, and so sometimes code for protein?  If so, are some of them also 
represented in the "CDS" file?

2. This one is embarrassing - when returning a file of sequences, how do we get 
gene names instead of merely UCSC ID's?

Thank you!

ray





_______________________________________________
Genome maillist  -  [email protected]
http://www.soe.ucsc.edu/mailman/listinfo/genome

Reply via email to