Gangadhar, After running TrainClassifier again, the map task just failed with the same exception and I am pretty sure it is an issue with disk space. As the map was progressing, I was monitoring my free disk space dropping from 81GB. It came down to 0 after almost 66% through the map task and then the exception happened. After the exception, another map task was resuming at 33% and I got close to 15GB free space (i guess the first map task freed up some space) and I am sure they would drop down to zero again and throw the same exception. I am going to modify the country.txt to just 1 country and recreate wikipediainput and run TrainClassifier. Will let you know how it goes..
Do we have any benchmarks / system requirements for running this example ? Has anyone else had success running this example anytime. Would appreciate your inputs / thots. Should we look at tuning the code for handling these situations ? Any quick suggestions on where to start looking at ? regards, Joe.
