I happened to run into this interesting scenario:

I had some mahout seq2sparse jobs, originally i run them in parallel using
the distributed mode. but because the input files are so small, running
them locally actually is much faster. so I truned them to local mode.

but I run 10 of these jobs in parallel, so when 10 mahout jobs are run
together, everyone became very slow.

is there an existing code that takes a desired shell script, and possibly
some archive files (could contain the jar file, or C++ --generated
executable code). I understand that I could use yarn API to code such a
thing, but it would be nice if I could just take it and run in shell..

Thanks
Yang

Reply via email to