justinmclean opened a new pull request, #253:
URL: https://github.com/apache/airflow-steward/pull/253

   > **Generated by the spec-driven build loop.** This page was produced by an
   > autonomous run of `tools/spec-loop` (`./loop.sh` — one work item, one 
branch,
   > one PR). Authored by Claude (see the `Generated-by` commit trailer) and
   > reviewed + tested by a human before submission.
   
   ## What
   
   Adds `docs/mode-economics.md` — an indicative per-skill / per-mode / 
per-model
   token-cost page, so a maintainer evaluating adoption can see the cost shape 
up
   front rather than discovering it after the fact. Covers Triage, Mentoring,
   Drafting, Pairing and Auto-merge; model-class trade-offs; local/self-hosted
   inference; cost-reduction levers; and the long-term ASF inference-endpoint
   roadmap item. Adds a reciprocal cross-reference from `docs/modes.md`.
   
   ## Why
   
   `MISSION.md § Affordability` commits to documenting mode economics honestly.
   This page is that documentation, and its data is the quantitative input for
   future ASF inference-endpoint capacity planning.
   
   ## Changes
   
   - `docs/mode-economics.md` — new page.
   - `docs/modes.md` — cross-reference back to it.
   
   ## Testing — and a limitation it shipped with
   
   The doc gates runnable locally pass: the doctoc TOC matches all 15 headings, 
every
   internal link/anchor resolves (including the MISSION § Affordability 
deep-link and
   the reciprocal `modes.md` cross-ref), file-hygiene is clean, and 12 of 13 
referenced
   skills exist on `main` (the 13th, `pairing-self-review`, is a sibling 
unmerged
   branch — a name in a table, not a link). `markdownlint`/`typos`/`lychee` 
should be
   confirmed by CI.
   
   **Limitation found in testing — the loop did not detect it.** The page's 
headline
   anchor claimed "one skill file (SKILL.md) ≈ 3,000–6,000 tokens" and that this
   overhead "applies" generally. Tokenizing the real catalogue with 
`cl100k_base`
   showed that holds only for the *median* skill: of 26 skills, **9** fall in 
3k–6k,
   **6** are below, and **11** are above — the large multi-step security skills 
run far
   higher (`security-issue-import` ~22,000, `security-issue-sync` ~36,000, i.e. 
4–6×
   the stated ceiling). The anchor is corrected here to measured, class-specific
   ranges (small ~1k–3k, typical ~3.5k–9k, large security ~11k–36k). The build 
loop
   shipped a plausible but unverified figure; the test-before-PR step is what 
caught it.
   
   ## Notes
   
   - Indicative cost guidance; the counts in the "what tokens means here" 
anchor are
     now measured (`cl100k_base`), not estimated.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to