fix(llm): preflight compact oversized requests by golemcore1 · Pull Request #284 · alexk-dev/golemcore-bot

golemcore1 · 2026-04-14T04:55:46Z

Summary

This PR fixes a long-chat failure mode where the agent could build an LLM request larger than the selected model can accept and only discover the problem after sending the request to the provider.
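To make the failure mode concrete, here is a minimal sketch of a preflight check: estimate the request size locally and trim it before the provider ever sees it. Everything in it is an assumption for illustration and not the PR's code: the class and method names are hypothetical, the four-characters-per-token heuristic is a stand-in for the real estimator, and dropping the oldest messages is a stand-in for the real compaction step.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch; names and heuristics are assumptions, not the PR's implementation. */
final class PreflightSketch {

    private PreflightSketch() {
    }

    /** Rough estimate: about one token per four characters, rounded up. */
    static long estimateTokens(String systemPrompt, List<String> messages) {
        long chars = systemPrompt.length();
        for (String message : messages) {
            chars += message.length();
        }
        return (chars + 3) / 4;
    }

    /**
     * Drops the oldest messages until the estimate fits the budget
     * or only one message remains. The input list is not modified.
     */
    static List<String> compactToBudget(String systemPrompt, List<String> messages, long maxInputTokens) {
        List<String> kept = new ArrayList<>(messages);
        while (kept.size() > 1 && estimateTokens(systemPrompt, kept) > maxInputTokens) {
            kept.remove(0);
        }
        return kept;
    }
}
```

The point of the sketch is the ordering: the size check runs after the request is fully assembled and before any network call, so an oversized request is handled locally instead of surfacing as a provider error.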

Changes included:

Add a provider-agnostic ContextTokenEstimator for estimating full LLM request size, including system prompt, conversation view, tools, and tool results.
Add request preflight compaction in LlmCallPhase after context assembly and model selection, immediately before the provider call.
Emit preflight diagnostics via the canonical ContextAttributes.LLM_REQUEST_PREFLIGHT key.
Add REQUEST_PREFLIGHT as a distinct compaction reason for observability.
Allow context-overflow recovery to retry after fallback compaction even when LLM summary generation was unavailable.
Expand context-overflow error classification for common token/context messages.
Avoid treating obvious context-overflow messages such as too_many_tokens as rate-limit retry signals in the LangChain4j adapter.
Reuse the shared estimator in AutoCompactionSystem so pre-context estimates are less ad hoc.
Add/adjust tests for preflight compaction, fallback overflow recovery, classifier coverage, adapter classification, and architecture compliance.
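The classification changes above come down to checking for context-overflow wording before rate-limit wording, so that a message such as too_many_tokens triggers compaction instead of a plain retry. The sketch below illustrates that ordering only. The class name, the enum, and every marker string other than too_many_tokens are hypothetical and are not taken from the PR or from LangChain4j.

```java
import java.util.List;
import java.util.Locale;

/** Hypothetical sketch; the marker lists and names are assumptions, not the PR's classifier. */
final class ProviderErrorSketch {

    enum Kind { CONTEXT_OVERFLOW, RATE_LIMIT, OTHER }

    // Only "too_many_tokens" comes from the PR summary; the rest are illustrative.
    private static final List<String> OVERFLOW_MARKERS = List.of(
            "too_many_tokens", "context length", "context window", "maximum context");

    private static final List<String> RATE_LIMIT_MARKERS = List.of(
            "rate limit", "rate_limit", "too many requests");

    private ProviderErrorSketch() {
    }

    static Kind classify(String message) {
        if (message == null) {
            return Kind.OTHER;
        }
        String text = message.toLowerCase(Locale.ROOT);
        // Overflow is checked first: retrying an oversized request unchanged cannot succeed.
        for (String marker : OVERFLOW_MARKERS) {
            if (text.contains(marker)) {
                return Kind.CONTEXT_OVERFLOW;
            }
        }
        for (String marker : RATE_LIMIT_MARKERS) {
            if (text.contains(marker)) {
                return Kind.RATE_LIMIT;
            }
        }
        return Kind.OTHER;
    }
}
```

With this ordering, a message that mentions both a rate limit and too_many_tokens is classified as a context overflow, which is the case the adapter change is meant to cover.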

Add targeted tests that exercise reachable defensive null-check arms: an explicit messages(null) override on the AgentSession builder to bypass Lombok @Builder.Default, a degenerate coordinator with a null orchestration service, and an empty-messages preflight attempt.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

sonarqubecloud · 2026-04-15T01:49:31Z

Quality Gate passed

Issues
23 New issues
0 Accepted issues

Measures
0 Security Hotspots
94.9% Coverage on New Code
0.0% Duplication on New Code


golemcore1 force-pushed the fix/llm-context-preflight branch from 89984f6 to 4193fd3 on April 14, 2026 05:15

fix(llm): preflight compact oversized requests

832f75f

golemcore1 force-pushed the fix/llm-context-preflight branch from 4193fd3 to 832f75f on April 14, 2026 05:42

alexk-dev and others added 11 commits April 14, 2026 14:04

fix(llm): harden context budget handling

1730b5e

test(toolloop): split default tool loop system tests

69b5242

fix(toolloop): harden context compaction flow

e545c75

Refine context compaction architecture

936a7b2

Fix context overflow classification and wiring

83b764e

Harden preflight diagnostics

db903e9

Harden preflight diagnostics and token estimation

49f4f0a

fix(llm): harden compaction diagnostics and estimation

7a5497f

fix(llm): prevent char array token overflow

2af4da8

Merge branch 'main' into fix/llm-context-preflight

d2d464c

alexk-dev merged commit d533cd9 into main Apr 15, 2026
18 checks passed

alexk-dev deleted the fix/llm-context-preflight branch April 15, 2026 01:47

alexk-dev merged 12 commits into main from fix/llm-context-preflight
