[ 
https://issues.apache.org/jira/browse/HDFS-3049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13278317#comment-13278317
 ] 

Todd Lipcon commented on HDFS-3049:
-----------------------------------

{code}
+      } catch (RuntimeException e) {
+        LOG.error("caught exception initializing " + this, e);
+        state = State.CLOSED;
+        return null;
+      } catch (IOException e) {
+        LOG.error("caught exception initializing " + this, e);
+        state = State.CLOSED;
+        return null;
+      }
{code}

Why not simply catch Throwable here?
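
Roughly what I mean — the two identical handlers collapse into one. This is just a sketch with stand-in names ({{LazyInitSketch}}, {{State}}), not the actual patch classes:

```java
import java.io.IOException;

// Sketch: collapsing the duplicated RuntimeException/IOException handlers
// into a single catch (Throwable). Names here are illustrative stand-ins
// for the fields in the patch, not real HDFS classes.
public class LazyInitSketch {
  enum State { UNINIT, OPEN, CLOSED }
  State state = State.UNINIT;

  Object init() throws IOException {
    throw new IOException("simulated failure opening the edit log");
  }

  Object nextOp() {
    try {
      return init();
    } catch (Throwable t) {
      // One handler covers IOException, RuntimeException, and Errors alike.
      System.err.println("caught exception initializing " + this + ": " + t);
      state = State.CLOSED;
      return null;
    }
  }
}
```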

Also, I'm not sure why the exception is swallowed instead of rethrown. If it 
fails to open the edit log, shouldn't that surface as an exception from 
init()? Should we make init() part of the public interface (e.g. {{open()}}) 
instead, so that the caller is cognizant of the flow, rather than doing it 
lazily? I think that would also simplify the other functions, which could just 
do a {{Preconditions.checkState(state == State.OPEN)}} instead of always 
handling lazy-init.
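
Something along these lines (a sketch only; the {{checkState}} helper below stands in for Guava's {{Preconditions.checkState}}, and the method bodies are placeholders):

```java
import java.io.IOException;

// Sketch of the suggested explicit open() flow: the caller opens the
// stream up front, failures propagate as exceptions, and every other
// method just asserts OPEN state instead of handling lazy-init.
public class ExplicitOpenSketch {
  enum State { UNINIT, OPEN, CLOSED }
  private State state = State.UNINIT;

  // Stand-in for Guava's Preconditions.checkState.
  private static void checkState(boolean ok, String msg) {
    if (!ok) throw new IllegalStateException(msg);
  }

  public void open() throws IOException {
    checkState(state == State.UNINIT, "already opened: " + state);
    // ... open the underlying edit log here; failures propagate to caller ...
    state = State.OPEN;
  }

  public Object readOp() {
    // No lazy-init branches needed anymore:
    checkState(state == State.OPEN, "stream not open: " + state);
    return null; // placeholder for the real read
  }
}
```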

----
{code}
+          LOG.info("skipping the rest of " + this + " since we " +
+              "reached txId " + txId);
+          close();
{code}
Why do you close() here but not in the normal case where you reach the end of 
the log? It seems it should be up to the caller to close upon hitting the 
"eof" (null txn) either way.

----
{code}
     try {
-      return reader.readOp(true);
+      return nextOpImpl(true);
     } catch (IOException e) {
+      LOG.error("nextValidOp: got exception while reading " + this, e);
+      return null;
+    } catch (RuntimeException e) {
+      LOG.error("nextValidOp: got exception while reading " + this, e);
       return null;
     }
{code}
Again, why not just catch Throwable?

----
{code}
+    if (!streams.isEmpty()) {
+      String error = String.format("Cannot start writing at txid %s " +
+        "when there is a stream available for read: %s",
+        segmentTxId, streams.get(0));
+      IOUtils.cleanup(LOG, streams.toArray(new EditLogInputStream[0]));
+      throw new RuntimeException(error);
     }
{code}
{{IllegalStateException}} would be more appropriate here.
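
I.e., something like this (sketch; the {{IOUtils.cleanup}} call and the stream types from the patch are elided):

```java
import java.util.List;

// The same guard rewritten to throw IllegalStateException, since a
// readable stream at this point is a programming-logic violation,
// not a generic runtime failure.
public class WriteGuardSketch {
  static void checkNoReadableStreams(long segmentTxId, List<?> streams) {
    if (!streams.isEmpty()) {
      String error = String.format("Cannot start writing at txid %s " +
          "when there is a stream available for read: %s",
          segmentTxId, streams.get(0));
      // IOUtils.cleanup(LOG, ...) would go here, as in the patch.
      throw new IllegalStateException(error);
    }
  }
}
```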

----

Changes to PositionTrackingInputStream: can you refactor out a function like 
{{checkLimit(int amountToRead);}} here? Lots of duplicate code.
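
The extraction I'm picturing looks roughly like this (field names {{curPos}}/{{limitPos}} are illustrative, not necessarily the actual ones):

```java
import java.io.IOException;

// Sketch of the suggested checkLimit(long) extraction: each read method
// calls one shared bounds check instead of repeating the limit arithmetic.
public class PositionTrackerSketch {
  private long curPos = 0;
  private long limitPos = Long.MAX_VALUE;

  private void checkLimit(long amountToRead) throws IOException {
    if (curPos + amountToRead > limitPos) {
      throw new IOException("Tried to read " + amountToRead +
          " byte(s) past the limit at offset " + limitPos);
    }
  }

  public void read(int len) throws IOException {
    checkLimit(len);   // was: duplicated limit logic in each read method
    curPos += len;     // placeholder for the real read
  }

  public void setLimit(long limit) { limitPos = limit; }
}
```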

Why is the opcode size changed from 100MB to 1.5MB? Didn't you just change it 
to 100MB recently?

Also, why is this change to the limiting behavior lumped in here? It's hard to 
review when the patch has a lot of distinct changes put together.

StreamLimiter needs an interface annotation, or should be made package-private.
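
For the annotation option, I mean Hadoop's audience annotations; sketch below (the nested {{@Private}} stubs {{org.apache.hadoop.classification.InterfaceAudience.Private}} so the snippet stands alone, and the method names are illustrative):

```java
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;

// Sketch: marking StreamLimiter as internal-only. The @Private annotation
// here is a stub for Hadoop's InterfaceAudience.Private, used so this
// compiles without Hadoop on the classpath.
public class StreamLimiterSketch {
  @Retention(RetentionPolicy.RUNTIME)
  @interface Private {}

  @Private   // not part of the public API
  interface StreamLimiter {
    void setLimit(long limit);
    void clearLimit();
  }
}
```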

----

There are a lot of new LOG.info messages that look like they should be at 
debug level. I don't think an operator would be able to make sense of all 
this output.


----

How hard would it be to separate this into three patches?
1) Bug fix which uses the new StreamLimiter to fix the issue you mentioned 
higher up in the comments (and seems distinct from the rest)
2) Change the API to get rid of getInputStream() and fix the O(n^2) behavior
3) Introduce RedundantInputStream to solve the issue described in this JIRA

I think there really are three separate things going on here and the 120KB 
patch is difficult to digest.
                
> During the normal loading NN startup process, fall back on a different 
> EditLog if we see one that is corrupt
> ------------------------------------------------------------------------------------------------------------
>
>                 Key: HDFS-3049
>                 URL: https://issues.apache.org/jira/browse/HDFS-3049
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>          Components: name-node
>    Affects Versions: 0.23.0
>            Reporter: Colin Patrick McCabe
>            Assignee: Colin Patrick McCabe
>            Priority: Minor
>         Attachments: HDFS-3049.001.patch, HDFS-3049.002.patch, 
> HDFS-3049.003.patch, HDFS-3049.005.against3335.patch, 
> HDFS-3049.006.against3335.patch, HDFS-3049.007.against3335.patch, 
> HDFS-3049.010.patch, HDFS-3049.011.patch, HDFS-3049.012.patch, 
> HDFS-3049.013.patch, HDFS-3049.015.patch, HDFS-3049.017.patch, 
> HDFS-3049.018.patch
>
>
> During the NameNode startup process, we load an image, and then apply edit 
> logs to it until we believe that we have all the latest changes.  
> Unfortunately, if there is an I/O error while reading any of these files, in 
> most cases, we simply abort the startup process.  We should try harder to 
> locate a readable edit log and/or image file.
> *There are three main use cases for this feature:*
> 1. If the operating system does not honor fsync (usually due to a 
> misconfiguration), a file may end up in an inconsistent state.
> 2. In certain older releases where we did not use fallocate() or similar to 
> pre-reserve blocks, a disk full condition may cause a truncated log in one 
> edit directory.
> 3. There may be a bug in HDFS which results in some of the data directories 
> receiving corrupt data, but not all.  This is the least likely use case.
> *Proposed changes to normal NN startup*
> * We should try a different FSImage if we can't load the first one we try.
> * We should examine other FSEditLogs if we can't load the first one(s) we try.
> * We should fail if we can't find EditLogs that would bring us up to what we 
> believe is the latest transaction ID.
> *Proposed changes to recovery mode NN startup*
> We should list out all the available storage directories and allow the 
> operator to select which one to use.
> Something like this:
> {code}
> Multiple storage directories found.
> 1. /foo/bar
>     edits__current__XYZ          size:213421345       md5:2345345
>     image                                  size:213421345       md5:2345345
> 2. /foo/baz
>     edits__current__XYZ          size:213421345       md5:2345345345
>     image                                  size:213421345       md5:2345345
> Which one would you like to use? (1/2)
> {code}
> As usual in recovery mode, we want to be flexible about error handling.  In 
> this case, this means that we should NOT fail if we can't find EditLogs that 
> would bring us up to what we believe is the latest transaction ID.
> *Not addressed by this feature*
> This feature will not address the case where an attempt to access the 
> NameNode name directory or directories hangs because of an I/O error.  This 
> may happen, for example, when trying to load an image from a hard-mounted NFS 
> directory, when the NFS server has gone away.  Just as now, the operator will 
> have to notice this problem and take steps to correct it.
