Hello

I tried to generate a file like "upstream1000.fa.gz - Sequences 1000 bases 
upstream of annotated transcription starts for RefSeq genes with annotated 5' 
UTRs". By using "refGene.txt" to locate different refGene and sequence file of 
chromosome "chromFaMasked.tar.gz", I can get a file very similar to 
"upstream1000.fa", but I found some NM names show in "refGene.txt" do no 
contain in "upstream1000.fa", such as "NM_001166752,NM_053230....." -- why this 
would happen?

Would you please give me some instructions on after locating each refgene and 
cut their sequence from chromosome what criterion you have used to select 
refGene?

Best,

Yunfei Li
--------------------------------------------------------------------------------------
Research Assistant
Department of Statistics &
School of Molecular Biosciences
Biotechnology Life Sciences Building 427
Washington State University
Pullman, WA 99164-7520
Phone: 509-339-5096
http://www.wsu.edu/~ye_lab/people.html

_______________________________________________
Genome maillist  -  [email protected]
https://lists.soe.ucsc.edu/mailman/listinfo/genome

Reply via email to