asf-tooling commented on issue #816:
URL: 
https://github.com/apache/tooling-trusted-releases/issues/816#issuecomment-4410092938

   <!-- gofannon-issue-triage-bot v2 -->
   
   **Automated triage** — analyzed at `main@2da7807a`
   
   **Type:** `documentation`  •  **Classification:** `actionable`  •  
**Confidence:** `medium`
   **Application domain(s):** `shared_infrastructure`, `admin_operations`
   
   ### Summary
   This issue requests new documentation describing ATR's disk layout and size 
requirements, covering svn:dist areas, the main release store, state/log 
directories, and /tmp usage. The goal is to support production VM sizing and 
monitoring script design. No prior discussion exists. The relevant code that 
would inform this documentation (atr/paths.py, atr/storage/__init__.py) was not 
provided in the source files I can see, but from the architecture inventory 
these files clearly exist and define the storage structure.
   
   ### Where new code would go
   - `docs/disk-layout.md` — new file
     A new markdown file documenting disk layout, size requirements, growth 
expectations, log rotation, and /tmp usage patterns. This would live alongside 
other project documentation.
   
   ### Proposed approach
   To properly document the disk layout, someone with access to the production 
system (or deep knowledge of atr/paths.py and atr/storage/__init__.py) needs to 
enumerate: (1) the svn:dist checkout paths for 'release' and 'dev' areas with 
existing SVN size policies, (2) the main ATR release store structure showing 
how artifacts are organized and how storage grows per release, (3) the state 
directory structure including log files and their rotation policy, and (4) 
temporary file usage during archive extraction (atr/tasks/quarantine.py) and 
SBOM generation. The documentation should include concrete size estimates and 
growth projections that can inform VM disk allocation and monitoring thresholds.
   
   A monitoring script (or at least its design) should also be documented or 
scaffolded, checking disk usage against defined thresholds for each area. This 
could live in a scripts/ directory given the existing labeler.yml 
categorization for that path.
   
   ### Open questions
   - What are the actual paths defined in atr/paths.py for state, releases, svn 
checkouts, and tmp?
   - What are the current SVN size enforcement policies referenced in the issue?
   - What is the expected release volume (releases per month/year) to estimate 
growth?
   - What log rotation mechanism is currently in place (logrotate, 
application-level, etc.)?
   - How large are typical quarantine extractions in /tmp (depends on archive 
sizes)?
   - Should the monitoring script be a standalone script in scripts/ or 
integrated into the application?
   
   ### Files examined
   - `.asf.yaml`
   - `.github/PULL_REQUEST_TEMPLATE.md`
   - `.github/dependabot.yml`
   - `.github/labeler.yml`
   - `.github/linters/.markdown-lint.yml`
   - `.github/workflows/allowlistchecker.yml`
   - `.github/workflows/analyze.yml`
   - `.github/workflows/build.yml`
   
   ---
   *Draft from a triage agent. A human reviewer should validate before merging 
any change. The agent did not run tests or verify diffs apply.*


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to