zclllyybb opened a new pull request, #64329: URL: https://github.com/apache/doris/pull/64329
## Proposed changes The Codex review workflow can produce large JSONL traces. Sending every Litefuse observation in a single legacy ingestion request can exceed Langfuse Cloud's per-request payload limit and fail with HTTP 413. This patch: - splits Litefuse ingestion into size-bounded requests while preserving original event order; - keeps a single generated trace id per run and uses that same trace id for all observations across chunks; - keeps the previous per-field truncation defaults (`max-json-chars=40000`) unless a single ingestion event itself exceeds the configured request-size budget; - uses compact JSON encoding for ingestion payloads; - shrinks only oversized single ingestion events as a last-resort guard before posting; - passes an explicit 4MB payload cap from the workflow to stay below Langfuse Cloud's 5MB request limit. ## Validation - `python3 -m py_compile .github/scripts/emit_litefuse_otel_io.py` - `git diff --check` - Parsed both related workflow YAML files and ran `bash -n` over every `run:` block. - Unit-style chunking check: splitting into multiple chunks preserved event bodies, event order, and trace id. - Synthetic dry-run with 267 Codex events and default context truncation: split into 24 requests, max request size 395324 bytes under a 400000-byte test cap. - Synthetic dry-run with a single huge observation and default context truncation: max request size 341060 bytes under a 400000-byte test cap. - Pushed the candidate to `zclllyybb/doris:master` and ran a real large fork PR review: https://github.com/zclllyybb/doris/pull/32 - Origin workflow run succeeded on candidate commit `fbd8306700428be56b6c25d9056dd24fef4e8c20`: https://github.com/zclllyybb/doris/actions/runs/27213314652/job/80348074816 - The workflow posted a real PR review from `github-actions` on commit `02d6ef37d49490ff8d72d28c6a1d22684b7686f0`. - Litefuse trace uploaded and verified: `647a0be2f7c451f9271d39d2e27707d0`, 78 observations, 21 agent messages, no observations missing I/O, agent-message context windows/events present. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
