steveloughran commented on issue #951: HADOOP-15183. S3Guard store becomes 
inconsistent after partial failure of rename
URL: https://github.com/apache/hadoop/pull/951#issuecomment-504052506
 
 
   @bgaborg thanks for those results, we need to look at them to see if they 
are related.
   
   ## 
`test_040_get(org.apache.hadoop.fs.s3a.s3guard.ITestDynamoDBMetadataStoreScale)`
   
   That FileNotFoundException wraps a ResourceNotFoundException: the DDB isn't 
there. What happens on a rerun
   
   ##  terasort.
   The tests are in an ordered chain: they only run if the previous test stage 
completed, which is inferred from the _SUCCESS marker in the previous dir
   
   The only one to worry about (at least at first) is: `test_110_teragen`, 
where the exec'd operation returned a non-zero value: it failed. But we don't 
know why.
   
   One thing I have never worked out is where in the miniyarn cluster the logs 
from the MR job actually collect. We have those of the JUnit process, but not 
the forked processes which are actually logging what's going on. If you have 
any insight here, that'd help us debug. Otherwise, what happens when you rereun 
this?

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to