With 1.0.2, only one checkpoint process runs at a time. When the namenode receives an overlapping checkpoint request, it checks for edits.new in its storage directories. If all of them contain this file, the namenode concludes that the previous checkpoint has not finished yet and prints the warning message you've seen.
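To see whether you are in that state, you could run the same kind of check by hand. The following is only a rough sketch of the logic described above, not the namenode's actual code: the function name is made up for illustration, the current/edits.new layout is taken from the path in your log, and it treats every directory you pass in as healthy.

```python
from pathlib import Path
from typing import Iterable


def previous_checkpoint_pending(storage_dirs: Iterable[str]) -> bool:
    """Return True if every given storage directory contains current/edits.new.

    Hypothetical helper: mirrors the check described above, assuming the
    <storage dir>/current/edits.new layout seen in the log.
    """
    dirs = [Path(d) for d in storage_dirs]
    if not dirs:
        # No directories to inspect, so nothing suggests a pending checkpoint.
        return False
    return all((d / "current" / "edits.new").is_file() for d in dirs)


if __name__ == "__main__":
    import sys

    # Pass the namenode storage directories as command-line arguments.
    if previous_checkpoint_pending(sys.argv[1:]):
        print("edits.new present in all directories: previous checkpoint looks unfinished")
    else:
        print("edits.new missing from at least one directory")
```

Run it on the namenode host with the directories listed in dfs.name.dir as arguments.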

Brandon

On Tue, Jul 3, 2012 at 10:56 AM, Terry Healy <the...@bnl.gov> wrote:
> Running Apache 1.0.2.
>
> The NN log is reporting that it cannot "roll the edit log" from the
> secondary NN. The SecondaryNameNode is running on the system referred to
> as xxx.yyy.254.238 in the log snippet below.
>
> From the NN, I can connect to the Secondary via ssh as the user. Any
> suggestions what have I got wrong here?
>
> thanks,
>
> Terry
>
> INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Roll Edit Log
> from xxx.yyy.254.238
> WARN org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Cannot roll
> edit log, edits.new files already exists in all healthy directories:
> /home/[user]/hdfs/name/current/edits.new
>
> ERROR org.apache.hadoop.security.UserGroupInformation:
> PriviledgedActionException as:[user] cause:java.net.ConnectException:
> Connection refused
> ERROR org.apache.hadoop.security.UserGroupInformation:
> PriviledgedActionException as:[user] cause:java.net.ConnectException:
> Connection refused
>
> WARN org.mortbay.log: /getimage: java.io.IOException: GetImage failed.
> java.net.ConnectException: Connection refused
>     at java.net.PlainSocketImpl.socketConnect(Native Method)
>     at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:351)
>     ....
>
> --
> Terry Healy / the...@bnl.gov
> Cyber Security Operations
> Brookhaven National Laboratory
> Building 515, Upton N.Y. 11973