Hello, all,

I would like to use imputation, given the following scenario:

The data I possess are not independent as they quantify the degree of genetic 
relatedness for PAIRS of individuals sampled from a population.  

For example:

individual_1    individual_2    relatedness

     1              1               1.00
     1              2               0.34
     1              3               0.00
     1              4               0.79 
     2              2               1.00
     2              3                 ?
     3              4               0.50
     4              4               1.00

The data collected for the variables "individual_1" and "individual_2" are 
categorical in nature and are complete, thus I do not need to impute these 
data.  The data I need to impute are the continuous genetic relatedness data; 
however, I am concerned about the pairwise nature of these data.  

Intuitively, if we know the degree of relatedness for individuals 1 and 2 as 
well as for individuals 1 and 3, then we should be able to deduce the degree of 
relatedness for individuals 2 and 3 (the missing datum in the above example).  
Conversely, knowing the relatedness for individuals 1 and 4 do not help in 
deducing the relatedness between individuals 2 and 3.

I have tried to use Joe Schafer's NORM software, declaring the variables 
"individual_1" and "individual_2" as dummy variables.  In doing so, am I using 
the appropriate model?  Alternatively, should I be stratifying the 
categorical variables?

Help would be greatly appreciated.

Best regards,
Kelly
_______________________________

Kelly Gallagher
Institut des Sciences de l'Evolution (ISEM)
G?n?tique et Environnement CC065
Universit? de Montpellier 2
Place Eug?ne Bataillon
F-34095 Montpellier Cedex 05
France
tel: 33 (0) 4 67 14 47 18
fax: 33 (0) 4 67 14 36 22
e-mail: [email protected]


Reply via email to