All of the fasta sequence is in the file: ftp://hgdownload.cse.ucsc.edu/goldenPath/danRer7/bigZips/danRer7.fa.gz
There are 1,133 scaffold sequences in that file. We don't want to create 1,133 files. You can split this sequence into individual files if you would like. Use the faSplit program from the kent source tree. You can also get all sequence out of the more efficient .2bit file: ftp://hgdownload.cse.ucsc.edu/goldenPath/danRer7/bigZips/danRer7.2bit Use the kent source tree program twoBitToFa to extract sequence. Use the twoBitInfo file to see what is in the .2bit file. --Hiram _______________________________________________ Genome maillist - [email protected] https://lists.soe.ucsc.edu/mailman/listinfo/genome
