Hello - thank you for your patience. We've pored over the archives with no luck on these questions.
Our goal is to isolate the sequence that codes for any protein, along with flanking genomic sequence. It looks like the Table Browser should make this easy, but we've run into two distinct problems.

1. What are "5' UTR exons" and "3' UTR exons"? We had hoped that these were purely non-coding sequence, but BLATting some of them matched exons in the middle of genes. Are they sequences that CAN be alternatively spliced but are not always, and so sometimes code for protein? If so, are some of them also represented in the "CDS" file?

2. This one is embarrassing: when returning a file of sequences, how do we get gene names instead of merely UCSC IDs?

Thank you!
ray

_______________________________________________
Genome maillist - [email protected]
http://www.soe.ucsc.edu/mailman/listinfo/genome
