HI,

        I have the data in the following format. My aim is to determine the 
concordance in genotype calls (SNP1, 2, 3,etc) at two centers.
         
        DNA center snp1 snp2
        NA07019 1 A A
        NA07348 1 M G
        NA10830 1 A G
        NA10851 1 M G
        NA10857 1 A G
        NA10860 1 A G
        NA10861 1 A G
        NA12761 1 ? R
        NA07019 2 A A
        NA07348 2 M G
        NA10830 2 A G
        NA10851 2 M G
        NA10857 2 A G
        NA10860 2 A G
        NA10861 2 A G
        NA12761 2 A A
         
        In SPSS, I would create individual crosstabulation tables and any 
positive integer values (counts) falling outside of the diagonal are 
disagreements in calls by the 2 centers (SNP2).

        At SNP1, there are total of 7 genotype counts to compare (one DNA did 
not have genotype readout). And 7/7 are in agreement.
         
        At SNP2, all 8 DNA samples were successfully typed by both centers. 
There is one discrepant call between centers.

        Q1: how do I create the same crosstabs in R?
         
        Eventually, I would summarize the data as follows:
         
        SNP N_to_compare N_agreement %Concordance
        snp1 7  7  100
        snp2 8  7  87.5
        etc
         
        Q2: From the individual crosstabs tables, can I automatically extract 
those numbers in columns 2 and 3?
         
         
        rgds,
        jenny

______________________________________________
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html

Reply via email to