Re: [R] Caret Internal Data Representation

2015-11-06 Thread Max Kuhn
Providing a reproducible example and the output of `sessionInfo()` will help get your question answered. For example, did you use the formula or non-formula interface to `train`, and so on?

On Thu, Nov 5, 2015 at 1:10 PM, Bert Gunter wrote:
> I am not familiar with
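
To make the distinction between the two `train` interfaces concrete, here is a minimal sketch of the kind of reproducible example being asked for. It assumes the caret and Cubist packages are installed. The toy data, the column names (`grp`, `x1`, `y`), and the single-model tuning grid are made up for illustration and are not from the original post. As far as I understand caret, the formula interface expands factors into dummy columns before the model sees them, while the non-formula interface hands the data frame over as given, so the two sets of Cubist rules may refer to the categorical predictor differently.

```r
library(caret)   # assumes caret and Cubist are installed

set.seed(1)
# Hypothetical toy data: one factor and one numeric predictor.
n <- 60
d <- data.frame(
  grp = factor(sample(c("a", "b", "c"), n, replace = TRUE)),
  x1  = runif(n)
)
d$y <- 2 * d$x1 + ifelse(d$grp == "c", 1, 0) + rnorm(n, sd = 0.1)

# Fit a single model without resampling to keep the sketch quick.
ctrl <- trainControl(method = "none")
grid <- data.frame(committees = 1, neighbors = 0)

# Formula interface: factors go through the usual formula machinery.
fit_formula <- train(y ~ ., data = d, method = "cubist",
                     trControl = ctrl, tuneGrid = grid)

# Non-formula interface: the predictors are passed as a data frame.
fit_xy <- train(x = d[, c("grp", "x1")], y = d$y, method = "cubist",
                trControl = ctrl, tuneGrid = grid)

# Compare how the rules refer to the categorical predictor.
summary(fit_formula$finalModel)
summary(fit_xy$finalModel)

# Package versions, as requested in the reply.
sessionInfo()
```

Comparing the two summaries side by side should show whether the rules mention dummy columns or the original factor.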

[R] Caret Internal Data Representation

2015-11-05 Thread Lorenzo Isella
Dear All, I have a data set containing both categorical and numerical variables, which I analyze using Cubist within the caret framework. From the generated rules, it is clear that Cubist does something to the categorical variables, probably some form of dummy coding. However, I cannot

Re: [R] Caret Internal Data Representation

2015-11-05 Thread Bert Gunter
I am not familiar with caret/Cubist, but assuming they follow the usual R procedures that encode categorical factors for conditional fitting, you need to do some homework on your own by reading up on the use of contrasts in regression. See ?factor and ?contrasts (and other linked Help as
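
As a small illustration of the contrast coding mentioned above, the following base R sketch shows how a factor is turned into indicator columns by the standard formula machinery. The data frame and its column names (`colour`, `size`) are hypothetical, and the output assumes R's default treatment contrasts. Whether caret or Cubist uses exactly this encoding depends on how the model was called.

```r
# Hypothetical toy data; column names are illustrative only.
d <- data.frame(
  colour = factor(c("red", "green", "blue", "green")),
  size   = c(1.5, 2.0, 3.2, 2.7)
)

# Contrast matrix for the factor (treatment contrasts by default):
# the first level, "blue", is the reference level.
contrasts(d$colour)

# Design matrix built from a formula: the non-reference levels
# become 0/1 indicator columns alongside the numeric predictor.
mm <- model.matrix(~ colour + size, data = d)
colnames(mm)
```

See `?contrasts` and `?model.matrix` for how to choose a different coding.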