[ https://issues.apache.org/jira/browse/HADOOP-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12542648 ]
dhruba borthakur commented on HADOOP-1989: ------------------------------------------ I still get a merge failure on DataChecksum.java. I managed to merge it by hand. I am running the unit test and saw a test failure on TestCrcCorruption: Testcase: testCrcCorruption took 12.692 sec Caused an ERROR Java heap space java.lang.OutOfMemoryError: Java heap space at org.apache.hadoop.fs.FSInputChecker.set(FSInputChecker.java:396) at org.apache.hadoop.fs.FSInputChecker.<init>(FSInputChecker.java:71) at org.apache.hadoop.dfs.DFSClient$BlockReader.<init>(DFSClient.java:697) at org.apache.hadoop.dfs.DFSClient$BlockReader.newBlockReader(DFSClient.java:755) at org.apache.hadoop.dfs.DFSClient$DFSInputStream.fetchBlockByteRange(DFSClient.java:1144) at org.apache.hadoop.dfs.DFSClient$DFSInputStream.read(DFSClient.java:1211) at org.apache.hadoop.fs.FSInputStream.readFully(FSInputStream.java:66) at org.apache.hadoop.fs.FSDataInputStream.readFully(FSDataInputStream.java:56) at org.apache.hadoop.dfs.DFSTestUtil.checkFiles(DFSTestUtil.java:150) at org.apache.hadoop.dfs.TestCrcCorruption.thistest(TestCrcCorruption.java:181) at org.apache.hadoop.dfs.TestCrcCorruption.testCrcCorruption(TestCrcCorruption.java:223) > Add support for simulated Data Nodes - helpful for testing and performance > benchmarking of the Name Node without having a large cluster > ---------------------------------------------------------------------------------------------------------------------------------------- > > Key: HADOOP-1989 > URL: https://issues.apache.org/jira/browse/HADOOP-1989 > Project: Hadoop > Issue Type: Improvement > Components: dfs > Affects Versions: 0.16.0 > Reporter: Sanjay Radia > Assignee: Sanjay Radia > Priority: Minor > Fix For: 0.16.0 > > Attachments: SimulatedStoragePatchSubmit.txt, > SimulatedStoragePatchSubmit5.txt, SimulatedStoragePatchSubmit6.txt, > SimulatedStoragePatchSubmit7.txt, SimulatedStoragePatchSubmit8.txt > > > Proposal is to add an implementation for a Simulated Data Node. > This will > - allow one to test certain parts of the system (especially the Name Node, > protocols) much more easily and efficiently. > - allow one to run performance benchmarks on the Name node without having a > large cluster. > - Inject faults for testing (e.g. one can add random faults based > probability parameters). > The idea is that the Simulated Data Node will > - discard any data written to blocks (but remember the blocks and their > sizes) > - generate fixed data on the fly when blocks are read (e.g. block is fixed > set of bytes or repeated sequence of strings). > The Simulated Data Node can also be used for fault injection. > The data node can be parameterized with probabilities that allow one to > control: > - Delays on reads and writes, creates, etc > - IO Exceptions > - Loss of blocks > - Failures -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.