[jira] [Updated] (HADOOP-18190) s3a prefetching streams to collect iostats on prefetching operations

Daniel Carl Jones (Jira) Wed, 18 May 2022 02:02:39 -0700


     [ 
https://issues.apache.org/jira/browse/HADOOP-18190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Daniel Carl Jones updated HADOOP-18190:
---------------------------------------
    Description: 
There is a lot more happening in reads, so there's a lot more data to collect 
and publish in IO stats for us to view in a summary at the end of processes as 
well as get from the stream while it is active.

Some useful ones would seem to be:

counters
 * is in memory. using 0 or 1 here lets aggregation reports count total #of 
memory cached files.
 * prefetching operations executed
 * errors during prefetching

gauges
 * number of blocks in cache
 * total size of blocks
 * active prefetches
+ active memory used

duration tracking count/min/max/ave
 * time to fetch a block
 * time queued before the actual fetch begins
 * time a reader is blocked waiting for a block fetch to complete

and some info on cache use itself
 * number of blocks discarded unread
 * number of prefetched blocks later used
 * number of backward seeks to a prefetched block
 * number of forward seeks to a prefetched block

the key ones I care about are
 # memory consumption
 # can we determine if cache is working (reads with cache hit) and when it is 
not (misses, wasted prefetches)
 # time blocked on executors

The stats need to be accessible on a stream even when closed, and aggregated 
into the FS. once we get per-thread stats contexts we can publish there too and 
collect in worker threads for reporting in task commits

  was:


There is a lot more happening in reads, so lot of more to collect and publish 
in IO stats for us to view in a summary at the end of processes as well as get 
from the stream while it is active

Some useful ones would seem to be

counters
* is in memory. using 0 or 1 here lets aggregation reports count total #of 
memory cached files.
* prefetching operations executed
* errors during prefetching


gauges
* number of blocks in cache
* total size of blocks
* active prefetches
+ active memory used

duration tracking count/min/max/ave

* time to fetch a block 
* time queued before the actual fetch begins
* time a reader is blocked waiting for a block fetch to complete


and some info on cache use itself

* number of blocks discarded unread
* number of prefetched blocks later used
* number of backward seeks to a prefetched block
* number of forward seeks to a prefetched block

the key ones I care about are 
# memory consumption
# can we determine if cache is working (reads with cache hit) and when it is 
not (misses, wasted prefetches)
# time blocked on executors

The stats need to be accessible on a stream even when closed, and aggregated 
into the FS. once we get per-thread stats contexts we can publish there too and 
collect in worker threads for reporting in task commits




> s3a prefetching streams to collect iostats on prefetching operations
> --------------------------------------------------------------------
>
>                 Key: HADOOP-18190
>                 URL: https://issues.apache.org/jira/browse/HADOOP-18190
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/s3
>    Affects Versions: 3.4.0
>            Reporter: Steve Loughran
>            Priority: Major
>
> There is a lot more happening in reads, so there's a lot more data to collect 
> and publish in IO stats for us to view in a summary at the end of processes 
> as well as get from the stream while it is active.
> Some useful ones would seem to be:
> counters
>  * is in memory. using 0 or 1 here lets aggregation reports count total #of 
> memory cached files.
>  * prefetching operations executed
>  * errors during prefetching
> gauges
>  * number of blocks in cache
>  * total size of blocks
>  * active prefetches
> + active memory used
> duration tracking count/min/max/ave
>  * time to fetch a block
>  * time queued before the actual fetch begins
>  * time a reader is blocked waiting for a block fetch to complete
> and some info on cache use itself
>  * number of blocks discarded unread
>  * number of prefetched blocks later used
>  * number of backward seeks to a prefetched block
>  * number of forward seeks to a prefetched block
> the key ones I care about are
>  # memory consumption
>  # can we determine if cache is working (reads with cache hit) and when it is 
> not (misses, wasted prefetches)
>  # time blocked on executors
> The stats need to be accessible on a stream even when closed, and aggregated 
> into the FS. once we get per-thread stats contexts we can publish there too 
> and collect in worker threads for reporting in task commits



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Updated] (HADOOP-18190) s3a prefetching streams to collect iostats on prefetching operations

Reply via email to