Dear Vinayak, For human and mouse, the knownGenes data is pretty close to what you want. For each transcript isoform that is found with a threshold level of evidence, knownGene indicates the start and end coordinate of each exon. You'd need to do a bit of filtering, because the same exon can (and often does) appear in multiple knownGene transcripts. If you'd like the length of the gene specifically, you can also infer that from knownGene, which specifies the start and end coordinates of each transcript. You can obtain this data in bulk through the table browser (where you'll be looking for UCSC Genes), or in the download site under the Annotation section. Let us know if you have any further questions.
- Melissa > Subject: [Genome] Length of a gene > Date: Tue, 29 Jun 2010 10:56:13 -0700 > From: Vinayak Kulkarni <[email protected]> > To: [email protected] > > Hi UCSC folks, > I am wondering if you guys have a precomputed file /table in your repository > which for a given gene , has the length of all the unique exon blocks > together which could be downloaded from your FTP site? > Please let me know. > Thanks, > Vinayak _______________________________________________ Genome maillist - [email protected] https://lists.soe.ucsc.edu/mailman/listinfo/genome
