Hello all,
I am struggling hard for doing thsi but still not able to get. Its like i have a xml file whose excerpt is given below as :
<document> <sentence>IL-2 gen _expression_ and NF-kappa B activation
through CD28 requires reactive oxygen production by 5-lipoxygenase .
<sentence>...........................
.....................................<document>
<document>................................................................<docuemnt>
.
.
.
.so on.
In the file i have atleast 100 documents to parse through along.
Now what i have to do is to form a matrix as
Word1 word 2 word 3 word4
D1 1 1 .
D2 0 1 ;
D3 0 0
D4 1 1 so on
.
.
.
D(1 2 3 4) are the documents
For the 4 words given i have to find them in each of the document if i find one in the docuemnt i have to place 1 correponding to that row and column value.
I am just not bale to get how to keep track of the number of documents and then update the matrix every time.
Can anybody help me with some suggestions.
Thanks
_______________________________________________ ActivePerl mailing list [email protected] To unsubscribe: http://listserv.ActiveState.com/mailman/mysubs
