[ 
https://issues.apache.org/jira/browse/HADOOP-17511?focusedWorklogId=556501&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-556501
 ]

ASF GitHub Bot logged work on HADOOP-17511:
-------------------------------------------

                Author: ASF GitHub Bot
            Created on: 23/Feb/21 18:20
            Start Date: 23/Feb/21 18:20
    Worklog Time Spent: 10m 
      Work Description: steveloughran commented on pull request #2675:
URL: https://github.com/apache/hadoop/pull/2675#issuecomment-784410906


   This hits the big milestone of *this has got complicated*.
   
   For best-effort tracking of what is the FS operation this thread initiated, 
we are functionally complete.
   
   For anything more rigorous where the lifespan of operations is tracked: no.
   
   I also think the attempts to preserve and restore the previous span get too 
complex/unreliable once you start considering spans with more complex 
lifecycles (what if you the previous span has already been closed?)
   
   I'm going to take a couple of days off from this and do other stuff for the 
3.3.1 release (which this is too big/complex to make). 
   
   Then think about how better to do this.
   
   What do we want?
   
   * Maybe differentiate bounded and long-lived spans? with different API to 
create?
   * Or declare that close() must be callable > once and do nothing but log in 
an audit service
   * and that its not an error to call it on a closed span.
   
   The S3A FS code will be slightly more sophisticated in that bounded 
try-with-resources spans
   will, when close()d, switch the current thread to the unbonded span (i.e. no 
attempt to restore
   to previous one). And when we handoff() a span, we get a new span and the 
active one is made inactive
   immediately.
   
   Like I said: I need to think of what we are trying to achieve here. 
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 556501)
    Time Spent: 8h  (was: 7h 50m)

> Add an Audit plugin point for S3A auditing/context
> --------------------------------------------------
>
>                 Key: HADOOP-17511
>                 URL: https://issues.apache.org/jira/browse/HADOOP-17511
>             Project: Hadoop Common
>          Issue Type: Sub-task
>    Affects Versions: 3.3.1
>            Reporter: Steve Loughran
>            Assignee: Steve Loughran
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 8h
>  Remaining Estimate: 0h
>
> Add a way for auditing tools to correlate S3 object calls with Hadoop FS API 
> calls.
> Initially just to log/forward to an auditing service.
> Later: let us attach them as parameters in S3 requests, such as opentrace 
> headeers or (my initial idea: http referrer header -where it will get into 
> the log)
> Challenges
> * ensuring the audit span is created for every public entry point. That will 
> have to include those used in s3guard tools, some defacto public APIs
> * and not re-entered for active spans. s3A code must not call back into the 
> FS API points
> * Propagation across worker threads



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org

Reply via email to