Hi, say I want to use the GLM package to run a regression that contains several categorical variables. My understanding is that I need the equivalent of an R factor, which is a PooledDataArray. Suppose I take this dataset
using RDatasets
*data = dataset("Ecdat","Benefits")*
and I want columns 6-14 as PooledDataArray
I tried
data[,:Sex] = convert(PooledDataArray,data[,:Sex])
and
pool!(data[,:Sex]) # I can see that this needs a second column with the
numeric code
thanks!
