milleruntime opened a new issue #196:
URL: https://github.com/apache/accumulo-testing/issues/196


   I was running 4 threads for the MultiTable randomwalk test locally on my 
machine using Uno. I had 2 tservers running. After some time, one of the 
threads died while trying to do copyTable. Typically, I see this error when I 
kill the test myself. But this time the tserver reported the error on its own. 
   <pre>
   Caused by: java.io.IOException: Connection reset by peer
           at sun.nio.ch.FileDispatcherImpl.read0(Native Method) ~[?:?]
           at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39) ~[?:?]
           at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:276) ~[?:?]
           at sun.nio.ch.IOUtil.read(IOUtil.java:245) ~[?:?]
           at sun.nio.ch.IOUtil.read(IOUtil.java:223) ~[?:?]
           at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:356) 
~[?:?]
           at 
org.apache.thrift.transport.TNonblockingSocket.read(TNonblockingSocket.java:149)
 ~[libthrift-0.15.0.jar:0.15.0]
           ... 5 more
   </pre>
   
   From the MultiTable walker thread:
   <pre>
   2022-03-15T08:47:12,162 [randomwalk.multitable.CopyTable] DEBUG: copying 
multi_ip_10_113_12_25_13 to multi_ip_10_113_12_25_17
   2022-03-15T08:52:12,132 [testing.randomwalk.Module] WARN : Node 
org.apache.accumulo.testing.randomwalk.multitable.CopyTable has been running 
for 300.022 seconds. You may want to look into it.
   </pre>
   
   I am not sure what happened but the Hadoop MR job is still running.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to