I needed to parse the upstream5000.maf (28 species) file from the hg18 series. I was encountering blocks and blocks of empty space within the file, with the size dropping to 200 MB from nearly 2 GB. As a computer scientist, I see it as essentially an enormous text file. What is the significance of the empty space within this file?
--
Thanks and Regards,
Harsh
_______________________________________________
Genome maillist - [email protected]
http://www.soe.ucsc.edu/mailman/listinfo/genome
