If there are only that few data points, you should just use R. On Mon, Dec 7, 2009 at 3:29 PM, Rajat Banerjee <[email protected]> wrote:
> Dear Ted, Thanks for your prompt reply. > > There are 16,000 rows of data. There are only four significant > variables in my hypothesis. The regression shouldn't be too nasty. > I've looked at some non-distributed libraries and they seem capable, > but would love to get it started in hadoop since that's my end goal. > > single-threaded : > http://www.ee.ucl.ac.uk/~mflanaga/java/Regression.html#sumgl<http://www.ee.ucl.ac.uk/%7Emflanaga/java/Regression.html#sumgl> > > >
