Are there some examples of use of ModelSerializer? Can I use it without fixing mahout from the trunk? I see that ModelSerializer uses json too, isn't it?
-----Messaggio originale----- Da: Ted Dunning [mailto:[email protected]] Inviato: mercoledì 26 gennaio 2011 19.38 A: [email protected] Oggetto: Re: problems saving and loading SGD classifications This is a known problem that should be fixed in trunk. While you are at it, the LogisticModelParameters approach may not be as useful as the ModelSerializer approach. Here is a comparison of pros and cons: LogisticModelParameters + incorporates lots of CSV parsing info + serializes the whole lot including model and data representation + somewhat simpler to use + matches chapter 13 of MiA examples -- uses json to serialize model - pretty much assumes CSV input by implication - has a bug in many recent versions ModelSerializer ++ allows binary serialization + makes no assumptions about how feature vectors are encoded - requires that you make your own arrangements for vector encoding The bit about binary serialization is (for me) a real show-stopper for LMP for big models. Almost as important is the issue about vector encoding since real Mahout applications tend to have large sparse text-like input variables. On Wed, Jan 26, 2011 at 9:44 AM, Claudia Grieco <[email protected]>wrote: > What do you think can be the problem? >
