Re: [R] problem with predict(mboost,...)

Benjamin Hofner Wed, 20 Oct 2010 07:38:36 -0700

Hi Tim,

you have two "problems" at the same time:

1.) The warning you get means that you predictor (e.g. predictor1) hasanother range in the training set than in the test set. In this case youhave data in you test set that lies outside of the range of the trainingset (for predictor1). This is only a problem if the ranges are REALLYdifferent. However, this doesn't lead to your second problem! So I thinkyou can just ignore the warning (especially as you write both trainingand test set have the same range).

2.) The second problem you describe (negative prediction for a positiveoutcome) has nothing to do with boosting or mboost. This results fromthe fact that you estimate a model for a positive outcome but theprediction might be ANY number.You can avoid this by, for example, considering log-transformed outcomesand / or using another family (depending on the type of your outcome).Please consult literatur on generalized linear models (GLMs) for furtherhelp.


Hope that helps
  Benjamin


On 20.10.2010 12:00, r-help-requ...@r-project.org wrote:

Message: 129
Date: Wed, 20 Oct 2010 11:08:44 +0200
From: H?ring, Tim (LWF)<tim.haer...@lwf.bayern.de>
To:<r-help@r-project.org>
Subject: [R] problem with predict(mboost,...)
Message-ID:
        <70fc67c4a585d1489e66225a4e0238bab36...@rzs-exc-cl06.rz-sued.bayern.de>

Content-Type: text/plain;       charset="iso-8859-1"

Hi,

I use a mboost model to predict my dependent variable on new data. I get the 
following warning message:
In bs(mf[[i]], knots = args$knots[[i]]$knots, degree = args$degree,  :
    some 'x' values beyond boundary knots may cause ill-conditioned bases

The new predicted values are partly negative although the variable in the 
training data ranges from 3 to 8 on a numeric scale. In order to restrict the 
predicted values to the value range from 3 to 8 I limit the feature space of 
the prediction data on the minima and maxima of the training data for every 
predictor variable before applying the model on the new data.
As baselearner in mboost I use splines ("bbs"):

mod<- mboost(MF ~ bbs(predictor1) + bbs(predictor2) + bbs(...), data = train)

I wonder why there are negative values when applying the model on new data, 
because both, training and prediction data have the same value ranges in the 
predictor variables.

Did somebody get the same warning message? Can someone help me please?

TIM

------------------------------------------
Tim H?ring
Bavarian State Institute of Forestry
Department of Forest Ecology
Hans-Carl-von-Carlowitz-Platz 1
D-85354 Freising

E-Mail:tim.haer...@lwf.bayern.de
http://www.lwf.bayern.de


--
******************************************************************************
Dipl.-Stat. Benjamin Hofner

Institut für Medizininformatik, Biometrie und Epidemiologie
Friedrich-Alexander-Universität Erlangen-Nürnberg
Waldstr. 6 - 91054 Erlangen - Germany

Tel: +49-9131-85-22707
Fax: +49-9131-85-25740

Office:
  Room 3.036
  Universitätsstraße 22
  (Entrance at the left side of the building)

benjamin.hof...@imbe.med.uni-erlangen.de

http://www.imbe.med.uni-erlangen.de/~hofnerb/
http://www.benjaminhofner.de

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] problem with predict(mboost,...)

Reply via email to