Would you please give the output of the "top" command? at least to show that the HDFS process did use that much of CPU;
On Mon, Sep 1, 2014 at 2:19 PM, Shiyuan Xiao <[email protected]> wrote: > Hi > > > We have written a MapReduce application based on Hadoop 2.4 which keeps > reading data from HDFS(Pseudo-distributed mode in one node). And we > found the CPU system time and user time of the application keeps increasing > when it is running. If we changed the application to read data from local > disk without changing any other business logic, the CPU utilization kept > stable. So we have conclusion that the CPU utilization is related to HDFS. We > want to know whether this issue is really related to HDFS and is there any > solution to fix it? > > > > > > Thanks a lot! > > > > BR/Shiyuan > -- Regards, *Stanley Shi,*
