Hi All,

I have some data where the dependent variable is a score, low (1:3) or high (8:9), and the independent variables are 21 genotypic markers. I'm fitting a logistic regression on the whole dataset after transforming the score to 0/1, and an ordinary linear regression on each of the high and low subsets.

In all cases I have a number of data 'duplications', i.e. different individuals with the same score and the same genotype at the 21 markers.

When I do:

mod$fitted.values

I get a number of fitted values corresponding to the number of unique lines in the dataset. Is there a way to have the fitted values match the observations, one per individual, even though duplicated individuals share the same fitted value? I could do it by hand, but it's laborious and I'd venture there is a better way.
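For concreteness, here is a minimal sketch of the kind of setup I mean, using simulated data rather than my real dataset. The names dat, m1, m2, m3, y and fit are made up for illustration, and I use 3 markers instead of 21. One thing that might work, assuming the full data frame is still available, is to call predict() with newdata set to that data frame, since predict() returns one value per row of newdata:

```r
set.seed(1)

# Simulated stand-in for the real data: 3 markers coded 0/1/2 and a 0/1 score
# (all names and values here are hypothetical, not the real dataset)
dat <- data.frame(
  m1 = sample(0:2, 40, replace = TRUE),
  m2 = sample(0:2, 40, replace = TRUE),
  m3 = sample(0:2, 40, replace = TRUE)
)
dat$y <- rbinom(40, 1, plogis(-1 + 0.8 * dat$m1))

# Append copies of the first 10 rows to force duplicated observations
dat <- rbind(dat, dat[1:10, ])

# Logistic regression on the full (duplicated) data
mod <- glm(y ~ m1 + m2 + m3, data = dat, family = binomial)

# predict() with newdata gives one fitted probability per row of newdata,
# so duplicated individuals each get their own (identical) fitted value
dat$fit <- predict(mod, newdata = dat, type = "response")

nrow(dat)        # number of observations
length(dat$fit)  # number of fitted values, one per observation
```

The same call without type = "response" should apply to the lm() fits on the high and low subsets, passing the corresponding subset as newdata.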

Best,

Federico


--
Federico C. F. Calboli
Department of Epidemiology and Public Health
Imperial College, St. Mary's Campus
Norfolk Place, London W2 1PG

Tel +44 (0)20 75941602   Fax +44 (0)20 75943193

f.calboli [.a.t] imperial.ac.uk
f.calboli [.a.t] gmail.com

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
