Hi, All. I'm trying to run Data analytic on my x86, Ubuntu machine. I found that when I divided 30GB Wikipedia input data into small chunks of 64MB, CPU usage was really low. It was checked by /usr/bin/time command. Most of execution time was idle and waiting. User cpu time was only 13% of total running time.
Is it because I'm running Data analytic with single node? Or does it have something to do with following warning message..? WARN driver.MahoutDriver: No wikipediaXMLSplitter.props found on classpath, will use command-line arguments only I don't understand why user cpu time is so low while it takes 2.5 hours to finish splitting Wikipedia inputs. Thanks! -- *Jinchun Kim*
