Help Parsing a Tab delimited file

Tiago Hori Wed, 13 Jul 2011 10:00:18 -0700

Hi All,

I work with microarrays and get huge tab delimited files as outputs from the
software that analysis these microarrays. The result is a tab-delimted Excel
type of file that has 160000 rows and about 20 columns.


Every 44K rows make one unit within the data. These units are identified by
the Second data column, called meta arrow. So the first 44K rows have the
value 1 on Meta row, the next 44K have the value 2 and so for.

I would like to be able to separate these files into 4 different files, each
one containing each unit of data. So all the rows that have meta row 1 would
go to one file, and the ones with meta row 2 would go to another file and so
forth.

I have been reading beginning perl to tried to figure this out, but I
haven't be able to come up with anything.

I have many questions: I know I can use a filhandle to connect to the file,
but how would I store the data to begin with?

Is there a way to iteratively read through the rows and then copy them to a
variable as long as their metarow column read let's say 1? and then out put
that as a new file?

Any help would be greatly appriciated, even if is just hints on how to get
started.

Cheers,

Tiago


-- 
"Education is not to be used to promote obscurantism." - Theodonius
Dobzhansky.

"Gracias a la vida que me ha dado tanto
Me ha dado el sonido y el abecedario
Con él, las palabras que pienso y declaro
Madre, amigo, hermano
Y luz alumbrando la ruta del alma del que estoy amando

Gracias a la vida que me ha dado tanto
Me ha dado la marcha de mis pies cansados
Con ellos anduve ciudades y charcos
Playas y desiertos, montañas y llanos
Y la casa tuya, tu calle y tu patio"

Violeta Parra - Gracias a la Vida

Tiago S. F. Hori
PhD Candidate - Ocean Science Center-Memorial University of Newfoundland

Help Parsing a Tab delimited file

Reply via email to