andreahlert opened a new pull request, #326:
URL: https://github.com/apache/airflow-steward/pull/326

   ## Summary
   
   Three small corrections to `docs/mode-economics.md`, all confined to the 
existing **Reducing costs** and **Local and self-hosted inference** sections. 
No new claims, no methodology changes, no new rows — only tightening claims 
already on the page.
   
   1. **Cache section referenced a pre-correction anchor.** Item 3 ("Cache 
skill context") said _"The skill file (3 000–6 000 tokens) and stable project 
configuration files are ideal cache candidates"_. That `3 000–6 000` figure is 
the same anchor the PR description of #253 explicitly flagged as wrong and 
corrected elsewhere on the page — the **What \"tokens\" means here** table now 
reports measured ranges:
   
      | Skill class | Range |
      |---|---|
      | small setup/utility | ~1 000–3 000 |
      | typical workflow | ~3 500–9 000 (median ~5 300) |
      | large multi-step security | ~11 000–36 000 |
   
      The inline figure in the cache item was never updated, so the page 
contradicted itself. Replace the figure with a pointer to the corrected anchor.
   
   2. **Cache TTL caveat (same paragraph).** The pattern \"first invocation 
pays; subsequent invocations cheap\" is real for bursty same-session workloads 
but typically misses for the periodic / day-spaced workloads listed in the 
per-mode tables above (Anthropic prompt cache TTL: 5 min default, 1 h extended 
at higher write cost). One sentence flagging the constraint avoids the footgun.
   
   3. **Local-inference table: amortisation needed a denominator.** The 
Consumer-GPU row listed `~\$0.10–0.50/hr amortised` with no basis. 
\"Amortised\" without capex / lifespan / utilisation is uninterpretable. Inline 
the assumption (`capex amortised over ~3 yr lifespan × moderate utilisation`) 
so a reader can sanity-check the range against their own hardware.
   
   ## Out of scope (intentionally)
   
   The page still has broader gaps that I think deserve discussion, not a 
unilateral edit: a measurement-date / coverage / tokenizer banner, splitting 
measured-vs-planned rows in the per-mode tables, p50 anchors on the wide 
ranges, a date stamp on the cross-class multipliers. I'll open a separate issue 
for those so #281's eval harness can inform the answers rather than my picking 
them in isolation.
   
   ## Test plan
   
   - [x] `markdownlint` / `doctoc` / `typos` clean locally (only edits are 
in-paragraph text; no new anchors, ToC unchanged).
   - [x] Internal anchor `#what-tokens-means-here` resolves to the existing 
section heading on the same page.
   - [x] Diff is 9 lines added / 4 removed, scoped to a single file.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to