I have a very similar problem as Anveshi Charuvaka is mailing about. What I found additionally when I set task logging to DEBUG mode, is that the DEBUG logs get interrupted at same point and replaced with the "INFO bsp.BSPJobClient: Job failed." message. My program works in local, distributed and pseudo mode, so that's probably not the issue.
The only case the program does run, is when I use the maximum number of machines (i.e. 7 machines, with 12 cores, 128GB ram..). I set the maximum number of tasks to 12 per node, thus 84. But when I force the program to run with 60 tasks, the "Job Failed" comes up with no additional info. Last note: I'm running an Inverted Indexing algorithm with a data set of approximately 17 GB. Could someone help me with this? Regards, Steven
