Re: unit testing with hadoop

Adrian Woodhead Thu, 22 Nov 2007 11:03:06 -0800

We have done something like this, where we want our unit tests to run against a 1-machine "cluster". As a starting point we took HadoopTestCase and wrote our own modified version of it, which we configure to run in either "local fs" or "mini mapreduce" mode. The former is slightly faster but doesn't catch all bugs, so we have a continuous integration machine set up to run the tests in both modes. We have also made some changes so that it only starts and stops the cluster when the class is loaded instead of between tests, which also improves the speed.
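To illustrate the "start the cluster once, not between tests" idea, here is a minimal sketch in plain Java. It is not our actual code and it does not use any Hadoop classes: SharedClusterFixture, its Cluster stand-in, the Mode enum and the "test.cluster.mode" system property are all hypothetical names made up for this example. In a real setup the start() and stop() methods would wrap whatever HadoopTestCase does to bring the cluster up and down.

```java
// Hypothetical sketch: one shared "cluster" per JVM, started lazily on first use.
final class SharedClusterFixture {

    // The two modes described above; the names are assumptions for this sketch.
    enum Mode { LOCAL_FS, MINI_MAPREDUCE }

    // Stand-in for the real cluster wrapper; a real version would delegate to Hadoop.
    static final class Cluster {
        private final Mode mode;
        private boolean running;

        private Cluster(Mode mode) { this.mode = mode; }

        private void start() { running = true; }   // real code: bring the cluster up
        private void stop()  { running = false; }  // real code: shut the cluster down

        Mode mode() { return mode; }
        boolean isRunning() { return running; }
    }

    private static Cluster shared;
    private static int startCount;

    private SharedClusterFixture() { }

    // Returns the shared cluster, starting it only on the first call.
    static synchronized Cluster get() {
        if (shared == null) {
            // Mode comes from a (hypothetical) system property so that a CI
            // machine can run the same tests once in each mode.
            String name = System.getProperty("test.cluster.mode", "LOCAL_FS");
            shared = new Cluster(Mode.valueOf(name));
            shared.start();
            startCount++;
            // Stop the cluster once, when the JVM exits, not after every test.
            Runtime.getRuntime().addShutdownHook(new Thread(shared::stop));
        }
        return shared;
    }

    // How many times the cluster was actually started in this JVM.
    static synchronized int startCount() { return startCount; }
}
```

Each test's setUp() would then call SharedClusterFixture.get() instead of starting its own cluster, and the CI machine would run the suite twice, once per value of the property.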

Anyway, I would say that having a look at the source code for HadoopTestCase is probably your best starting point, and hopefully you can take it from there!


Regards,

Adrian

Eugeny N Dzhurinsky wrote:

Hello there, we would like to make some tests with hadoop.

For the tests we would like to have a hadoop filesystem up and configured, so
using stubs and some mocks of core interfaces we can test the overall storage
functionality we're about to develop (which would be a part of map/reduce jobs
later).

As far as we learned from documentation/tutorials, to run hadoop we need to
use its own startup scripts, which looks like overhead and is a bit
cumbersome to integrate into maven testing.

Could you please advise?
