Eric Payne created HDFS-9634:
--------------------------------
Summary: webhdfs client side exceptions don't provide enough
details
Key: HDFS-9634
URL: https://issues.apache.org/jira/browse/HDFS-9634
Project: Hadoop HDFS
Issue Type: Bug
Components: webhdfs
Affects Versions: 2.7.1, 3.0.0, 2.8.0
Reporter: Eric Payne
Assignee: Eric Payne
When a WebHDFS client side exception (for example, read timeout) occurs there
are no details beyond the fact that a timeout occurred. Ideally it should say
which node is responsible for the timeout, but failing that it should at least
say which node we're talking to so we can examine that node's logs to further
investigate.
{noformat}
java.net.SocketTimeoutException: Read timed out
at java.net.SocketInputStream.socketRead0(Native Method)
at java.net.SocketInputStream.read(SocketInputStream.java:150)
at java.net.SocketInputStream.read(SocketInputStream.java:121)
at java.io.BufferedInputStream.read1(BufferedInputStream.java:273)
at java.io.BufferedInputStream.read(BufferedInputStream.java:334)
at sun.net.www.MeteredStream.read(MeteredStream.java:134)
at java.io.FilterInputStream.read(FilterInputStream.java:133)
at
sun.net.www.protocol.http.HttpURLConnection$HttpInputStream.read(HttpURLConnection.java:3035)
at
org.apache.commons.io.input.BoundedInputStream.read(BoundedInputStream.java:121)
at
org.apache.hadoop.hdfs.web.ByteRangeInputStream.read(ByteRangeInputStream.java:188)
at java.io.DataInputStream.read(DataInputStream.java:149)
at java.io.BufferedInputStream.read1(BufferedInputStream.java:273)
at java.io.BufferedInputStream.read(BufferedInputStream.java:334)
at
com.yahoo.grid.tools.util.io.ThrottledBufferedInputStream.read(ThrottledBufferedInputStream.java:58)
at java.io.FilterInputStream.read(FilterInputStream.java:107)
at
com.yahoo.grid.replication.distcopy.tasklet.HFTPDistributedCopy.copyBytes(HFTPDistributedCopy.java:495)
at
com.yahoo.grid.replication.distcopy.tasklet.HFTPDistributedCopy.doCopy(HFTPDistributedCopy.java:440)
at
com.yahoo.grid.replication.distcopy.tasklet.HFTPDistributedCopy.access$200(HFTPDistributedCopy.java:57)
at
com.yahoo.grid.replication.distcopy.tasklet.HFTPDistributedCopy$1.doExecute(HFTPDistributedCopy.java:387)
... 12 more
{noformat}
There are no clues as to which datanode we're talking to nor which datanode was
responsible for the timeout.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)