On Tue, Jun 22, 2010 at 9:44 AM, Robin Anil <[email protected]> wrote:
> > > On Mon, Jun 21, 2010 at 8:35 PM, Robin Anil <[email protected]> > > wrote: > > > > > >> A Classifier Training Job will take a Trainer, and a Vector location > and > > > > > > produce a Model > > > > How about A tranform layer which converts ondisk data into vectors > seamlessly? That should solve the issue This is what I meant that the Classifier Training Job should accept a Vectorizer. This transform layer needs to be close to the object doing the training to avoid data transfer and copy costs. It should also be flexible in that different kinds of transforms should be injectable into the job.
