On Tue, Jul 28, 2009 at 12:18 AM, Robin Anil <[email protected]> wrote:
> ... We need modules > to convert data in databases (Flatfiles, XMLdumps, MySQL, Different > formats on HDFS, Hbase) into intermediate form(say vector). Yes. We do need that. > Ever considered having a Workflow where we select InputformatReader Job and > an algorithm to perform (classification, clustering , itemset mining). > where the first process breaks different sources into the vector > format. and then launches the algorithms. That is an intriguing thought. How many algorithms have the same shape? (as in, one input, one output, one algorithm, one input format)?
