Hi All, I work with microarrays and get huge tab delimited files as outputs from the software that analysis these microarrays. The result is a tab-delimted Excel type of file that has 160000 rows and about 20 columns.
Every 44K rows make one unit within the data. These units are identified by the Second data column, called meta arrow. So the first 44K rows have the value 1 on Meta row, the next 44K have the value 2 and so for. I would like to be able to separate these files into 4 different files, each one containing each unit of data. So all the rows that have meta row 1 would go to one file, and the ones with meta row 2 would go to another file and so forth. I have been reading beginning perl to tried to figure this out, but I haven't be able to come up with anything. I have many questions: I know I can use a filhandle to connect to the file, but how would I store the data to begin with? Is there a way to iteratively read through the rows and then copy them to a variable as long as their metarow column read let's say 1? and then out put that as a new file? Any help would be greatly appriciated, even if is just hints on how to get started. Cheers, Tiago -- "Education is not to be used to promote obscurantism." - Theodonius Dobzhansky. "Gracias a la vida que me ha dado tanto Me ha dado el sonido y el abecedario Con él, las palabras que pienso y declaro Madre, amigo, hermano Y luz alumbrando la ruta del alma del que estoy amando Gracias a la vida que me ha dado tanto Me ha dado la marcha de mis pies cansados Con ellos anduve ciudades y charcos Playas y desiertos, montañas y llanos Y la casa tuya, tu calle y tu patio" Violeta Parra - Gracias a la Vida Tiago S. F. Hori PhD Candidate - Ocean Science Center-Memorial University of Newfoundland