[ 
https://issues.apache.org/jira/browse/HADOOP-12949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15338785#comment-15338785
 ] 

Steve Loughran commented on HADOOP-12949:
-----------------------------------------

+ We'll want the HTrace context ID to go all the way down to S3 by way of 
the HADOOP-13122 User-Agent header. That lets your storage infrastructure 
provider know which queries are causing problems and, if the traffic goes via 
a proxy capable of reading the HTTP requests, lets them sample requests and 
correlate them with network load.
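
A minimal sketch of the idea, not the actual s3a or HTrace code: the trace ID 
is appended to the User-Agent string as an extra token, so a proxy or the 
storage provider can pull it back out of the request headers. The class name 
TraceUserAgent, its methods, the "htrace/<id>" token format, and the use of a 
64-bit long for the ID are all assumptions made for illustration; how the 
resulting string is handed to the S3 client is left out.

```java
import java.util.Locale;

/** Hypothetical helper; not part of Hadoop, HTrace, or the AWS SDK. */
public final class TraceUserAgent {

    // Assumed token prefix; the real header layout may differ.
    private static final String PREFIX = "htrace/";

    private TraceUserAgent() {
    }

    /** Appends the trace ID to a base User-Agent value as a hex token. */
    public static String withTraceId(String baseUserAgent, long traceId) {
        // Fixed-width lowercase hex keeps the token easy to match in logs.
        String token = PREFIX + String.format(Locale.ROOT, "%016x", traceId);
        if (baseUserAgent == null || baseUserAgent.trim().isEmpty()) {
            return token;
        }
        return baseUserAgent.trim() + " " + token;
    }

    /** Recovers the trace ID from a User-Agent value, or null if absent. */
    public static Long extractTraceId(String userAgent) {
        if (userAgent == null) {
            return null;
        }
        for (String part : userAgent.trim().split("\\s+")) {
            if (part.startsWith(PREFIX)) {
                try {
                    String hex = part.substring(PREFIX.length());
                    return Long.parseUnsignedLong(hex, 16);
                } catch (NumberFormatException e) {
                    return null; // malformed token, treat as untraced
                }
            }
        }
        return null;
    }

    public static void main(String[] args) {
        // "Hadoop 2.8.0" is a placeholder base User-Agent value.
        String ua = withTraceId("Hadoop 2.8.0", 0x1234abcdL);
        System.out.println(ua);
        System.out.println(Long.toHexString(extractTraceId(ua)));
    }
}
```

The extractTraceId side is what a log-sampling proxy would do: scan the 
User-Agent tokens and group requests by the ID it finds.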

> Add HTrace to the s3a connector
> -------------------------------
>
>                 Key: HADOOP-12949
>                 URL: https://issues.apache.org/jira/browse/HADOOP-12949
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: fs/s3
>            Reporter: Madhawa Gunasekara
>            Assignee: Madhawa Gunasekara
>
> Hi All, 
> S3, GCS, WASB, and other cloud blob stores are becoming increasingly 
> important in Hadoop, but we don't have distributed tracing for these yet. It 
> would be interesting to add distributed tracing here. It would enable 
> collecting really interesting data like probability distributions of PUT and 
> GET requests to S3 and their impact on MR jobs, etc.
> I would like to implement this feature. Please shed some light on this.
> Thanks,
> Madhawa



