Thanks for the response Ted! *Brian Krebs* CIO and Co-founder TapHeaven Mobile:443.866.2137 Email: [email protected] Twitter: @BKrebsTH LinkedIn: www.linkedin.com/in/briankrebs tapheaven.com The static word encoder is appropriate for categorical variables with an unknown number of values.
On Sun, Aug 3, 2014 at 9:16 PM, Brian Krebs <[email protected]> wrote: > Hi everyone, > > I have a very basic question on the Apache SGD implementation. My training > set has about 50 features, most of which are categorical. Some of these > categories are binary, but others can have an unknown number of discrete > values (countries, cities, etc.). > > Should I be encoding these with the ConstantValueEncoder? The > StaticWordValueEncoder? > > Thanks, > > *Brian Krebs* > CIO and Co-founder > TapHeaven > Mobile: 443.866.2137 > Email: [email protected] > Twitter: @BKrebsTH > LinkedIn: www.linkedin.com/in/briankrebs > tapheaven.com >
