feat(group): add a force fast mode toggle for codex groups (bypasses the client anthropic-beta limitation) by HaoYan-A · Pull Request #1654 · Wei-Shaw/sub2api

HaoYan-A · 2026-04-14T16:37:56Z

Background

The project already maps the Claude protocol to Codex Fast automatically: when a Claude Code client sends a request to /v1/messages with the header anthropic-beta: fast-mode-2026-02-01, ForwardAsAnthropic sets responsesReq.ServiceTier to "priority" (openai_gateway_messages.go:58-61).
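The existing mapping can be illustrated with a minimal sketch. This is not the code in openai_gateway_messages.go: the names hasFastModeBeta and serviceTierFor are hypothetical, and the sketch assumes the anthropic-beta header may carry several comma-separated tokens.

```go
package main

import (
	"fmt"
	"strings"
)

// fastModeBeta is the beta token named in the PR description.
const fastModeBeta = "fast-mode-2026-02-01"

// hasFastModeBeta (hypothetical name) reports whether any anthropic-beta
// header value contains the fast-mode token. It assumes each value may hold
// several comma-separated tokens.
func hasFastModeBeta(headerValues []string) bool {
	for _, value := range headerValues {
		for _, token := range strings.Split(value, ",") {
			if strings.TrimSpace(token) == fastModeBeta {
				return true
			}
		}
	}
	return false
}

// serviceTierFor returns "priority" when the beta token is present and an
// empty string (meaning the default tier) otherwise.
func serviceTierFor(headerValues []string) string {
	if hasFastModeBeta(headerValues) {
		return "priority"
	}
	return ""
}

func main() {
	fmt.Printf("%q\n", serviceTierFor([]string{"other-beta, fast-mode-2026-02-01"}))
	fmt.Printf("%q\n", serviceTierFor(nil))
}
```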

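In practice, however, users run into the following problems:

The official Claude Code client currently has no way to enable the fast-mode beta, so users cannot make the client send this header
The codex plugin for Claude Code likewise cannot pass the fast-mode header

As a result, even when users want the fast tier (the priority tier), the existing path can never be triggered, so the whole "Claude → Codex Fast" feature is unusable in real-world scenarios.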

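Approach

Add an admin-level toggle, force_fast_mode, to groups of type openai. When it is enabled, every request handled by the group (whether it arrives via /v1/responses, /v1/chat/completions, or /v1/messages) is forced to service_tier="priority", unconditionally overriding both the client-supplied service_tier and the anthropic-beta: fast-mode-* header.

This is a group-level override fallback: the administrator configures it once, and requests from every API Key in the group automatically use the fast tier, regardless of whether the client can send the header.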

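Design notes

Single injection entry point, no function signature changes

All three forward paths (Forward / ForwardAsAnthropic / ForwardAsChatCompletions) call applyCodexOAuthTransform in their OAuth branch. To avoid changing the signature of applyCodexOAuthTransform (which would touch 20+ existing test stubs), a new helper is added: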

// openai_gateway_service.go (outline only; the body is elided in this description)
func getForceFastModeFromContext(c *gin.Context) bool {
    // Reads apiKey.Group.ForceFastMode from c.Get("api_key").
    // Takes effect only for groups with platform=openai.
}

The helper reuses the apiKey.Group that the middleware has already eager-loaded into the context, so the handler layer needs no changes at all.
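A self-contained sketch of how such a helper could behave is shown below. It does not depend on gin: valueGetter and mapContext are stand-ins for the context lookup, and the Group and APIKey structs are trimmed, hypothetical versions of the project's models. Storing a *APIKey under "api_key" is also an assumption; the branches follow the subtest list in this description.

```go
package main

import "fmt"

// Group and APIKey are trimmed stand-ins for the project's models; only the
// fields this sketch needs are declared.
type Group struct {
	Platform      string
	ForceFastMode bool
}

type APIKey struct {
	Group *Group
}

// valueGetter is a minimal lookup interface standing in for the gin context.
type valueGetter interface {
	Get(key string) (any, bool)
}

// mapContext is a toy context used to exercise the helper.
type mapContext map[string]any

func (m mapContext) Get(key string) (any, bool) {
	v, ok := m[key]
	return v, ok
}

// getForceFastMode returns true only when the context holds an API key whose
// group is on the openai platform and has ForceFastMode enabled. Every other
// case (nil context, missing key, wrong type, nil pointers) returns false.
func getForceFastMode(c valueGetter) bool {
	if c == nil {
		return false
	}
	v, ok := c.Get("api_key")
	if !ok {
		return false
	}
	apiKey, ok := v.(*APIKey) // assumption: a pointer is stored in the context
	if !ok || apiKey == nil || apiKey.Group == nil {
		return false
	}
	return apiKey.Group.Platform == "openai" && apiKey.Group.ForceFastMode
}

func main() {
	on := mapContext{"api_key": &APIKey{Group: &Group{Platform: "openai", ForceFastMode: true}}}
	other := mapContext{"api_key": &APIKey{Group: &Group{Platform: "anthropic", ForceFastMode: true}}}
	fmt.Println(getForceFastMode(on), getForceFastMode(other), getForceFastMode(nil))
}
```

Returning false on every malformed input keeps the toggle strictly opt-in: a request can only be upgraded to the priority tier when the group was loaded correctly and explicitly enables it.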

Three single-point injections

openai_gateway_service.go Forward (/v1/responses, including the passthrough and WSv2 branches): sets reqBody["service_tier"] = "priority" after applyCodexOAuthTransform
openai_gateway_messages.go ForwardAsAnthropic (Claude /v1/messages): extends the existing BetaFastMode check to force_fast_mode || beta
openai_gateway_chat_completions.go ForwardAsChatCompletions (/v1/chat/completions): sets both responsesReq.ServiceTier and the body bytes just before responsesBody is finally marshaled
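The override itself is small in both forms mentioned above: a decoded body map and already-marshaled bytes. The sketch below is an illustration under assumptions, not the PR's code; forceServiceTier and forceServiceTierJSON are hypothetical helper names.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// forceServiceTier (hypothetical helper) overwrites service_tier in a decoded
// request body when force is true, discarding whatever the client sent.
func forceServiceTier(reqBody map[string]any, force bool) {
	if force {
		reqBody["service_tier"] = "priority"
	}
}

// forceServiceTierJSON (hypothetical helper) applies the same override to a
// body that has already been marshaled to JSON bytes.
func forceServiceTierJSON(body []byte, force bool) ([]byte, error) {
	if !force {
		return body, nil
	}
	var fields map[string]json.RawMessage
	if err := json.Unmarshal(body, &fields); err != nil {
		return nil, err
	}
	if fields == nil { // the body was JSON null
		fields = map[string]json.RawMessage{}
	}
	fields["service_tier"] = json.RawMessage(`"priority"`)
	return json.Marshal(fields)
}

func main() {
	reqBody := map[string]any{"model": "gpt-5.4", "service_tier": "default"}
	forceServiceTier(reqBody, true)
	fmt.Println(reqBody["service_tier"])

	out, err := forceServiceTierJSON([]byte(`{"model":"gpt-5.4","service_tier":"default"}`), true)
	if err != nil {
		panic(err)
	}
	fmt.Println(string(out))
}
```

Decoding into json.RawMessage values leaves every other field of the body untouched, so only service_tier changes.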

Billing follows automatically

No billing code needs to change. extractOpenAIServiceTier(reqBody) already reads service_tier from the final body, and CalculateCostWithServiceTier bills at the priority tier (already covered by billing_service_test.go:503).
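Why billing needs no change can be seen in a toy model: because the tier is read from the final request body, any override written before that point is picked up automatically. The function names below are hypothetical and do not reproduce the project's signatures, and the 1.5 multiplier is purely illustrative (the commit message reports roughly 1.5x for gpt-5.4 priority).

```go
package main

import "fmt"

// serviceTierFromBody mirrors the idea of reading service_tier from the final
// request body. The real extractOpenAIServiceTier signature is not shown in
// this PR description.
func serviceTierFromBody(reqBody map[string]any) string {
	tier, _ := reqBody["service_tier"].(string)
	return tier
}

// priorityMultiplier is an illustrative value only, not project pricing.
const priorityMultiplier = 1.5

// costWithTier scales the base cost when the request ran on the priority tier.
func costWithTier(baseCost float64, tier string) float64 {
	if tier == "priority" {
		return baseCost * priorityMultiplier
	}
	return baseCost
}

func main() {
	body := map[string]any{"model": "gpt-5.4"}
	fmt.Println(costWithTier(2.0, serviceTierFromBody(body)))

	// The forced override is written into the body before billing reads it.
	body["service_tier"] = "priority"
	fmt.Println(costWithTier(2.0, serviceTierFromBody(body)))
}
```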

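api_key_auth_cache updated in sync

APIKeyAuthGroupSnapshot (api_key_auth_cache.go) must serialize the new field as well; otherwise the Group read back from the cache would always carry the default false. The field uses omitempty, and a reader that finds the field missing defaults to false, which keeps the change backward compatible.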
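The omitempty behavior can be checked with a small encoding/json sketch. groupSnapshot and oldGroupSnapshot are trimmed, hypothetical stand-ins for APIKeyAuthGroupSnapshot before and after the change, and the JSON key names are assumptions.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// groupSnapshot stands in for the new snapshot DTO (key names are assumed).
type groupSnapshot struct {
	Platform      string `json:"platform"`
	ForceFastMode bool   `json:"force_fast_mode,omitempty"`
}

// oldGroupSnapshot stands in for the DTO as an older version would define it.
type oldGroupSnapshot struct {
	Platform string `json:"platform"`
}

// encodeSnapshot marshals a snapshot; false is omitted because of omitempty.
func encodeSnapshot(s groupSnapshot) string {
	b, err := json.Marshal(s)
	if err != nil {
		panic(err)
	}
	return string(b)
}

// decodeSnapshot reads a payload with the new DTO; a missing key stays false.
func decodeSnapshot(raw string) (groupSnapshot, error) {
	var s groupSnapshot
	err := json.Unmarshal([]byte(raw), &s)
	return s, err
}

// decodeOldSnapshot reads a payload with the old DTO; encoding/json ignores
// the unknown force_fast_mode key by default.
func decodeOldSnapshot(raw string) (oldGroupSnapshot, error) {
	var s oldGroupSnapshot
	err := json.Unmarshal([]byte(raw), &s)
	return s, err
}

func main() {
	fmt.Println(encodeSnapshot(groupSnapshot{Platform: "openai"}))
	fmt.Println(encodeSnapshot(groupSnapshot{Platform: "openai", ForceFastMode: true}))

	fromOld, _ := decodeSnapshot(`{"platform":"openai"}`)
	fmt.Println(fromOld.ForceFastMode)

	_, err := decodeOldSnapshot(`{"platform":"openai","force_fast_mode":true}`)
	fmt.Println(err == nil)
}
```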

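api_key_repo Select allowlist

GetByKeyForAuth in api_key_repo.go uses an explicit q.Select(...) field allowlist, so group.FieldForceFastMode must be added to it; otherwise ent does not SELECT the column and the struct field reads back as false (a pitfall already hit during development).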

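Backward compatibility

DB schema: migration 107 adds the column force_fast_mode BOOLEAN NOT NULL DEFAULT false. Old code does not crash on this column; it simply does not read it.
Redis cache: the APIKeyAuthGroupSnapshot JSON field uses omitempty. When old and new versions are deployed side by side (sharing the same Redis), reads and writes are safe in both directions.
Clients that already send the fast-mode header: behavior is unchanged (with force_fast_mode off, the original path is fully preserved). When the toggle is on, it overrides the client's explicit value, which is the intended behavior.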

Tests

8 new subtests: TestGetForceFastModeFromContext covers every branch: nil context, missing api_key, wrong type, nil APIKey, nil Group, toggle off, non-openai platform, and openai with the toggle on
go test -tags=unit ./internal/service/... ./internal/handler/... ./internal/repository/... all pass
golangci-lint run ./... --new-from-rev=upstream/main 0 issues
pnpm run typecheck passes

Checklist

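go test -tags=unit ./... passes
go test -tags=integration ./... : postgres/redis were not started locally, so this relies on CI (the change has no impact on integration tests: it only adds one field plus the matching allowlist update, and changes no existing interface)
golangci-lint run ./... reports no new issues
pnpm-lock.yaml in sync (package.json not modified)
No interface changes, so no stubs need to be added
Generated Ent code is committed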

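Usage

When editing an openai-platform group in the admin UI, a "Force fast mode" toggle is shown (below "Allow /v1/messages scheduling"). Once enabled, it takes effect immediately for every API Key in the group (the auth cache is reloaded on the next request or invalidated automatically).

If force_fast_mode=true is combined with a client-sent anthropic-beta: fast-mode header, the behavior is the same (both use priority). With force_fast_mode=false and the header sent by the client, the request follows the original path to priority. With force_fast_mode=false and no header from the client, the request uses the default tier.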
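The combinations above reduce to a logical OR of the group toggle and the client header. A minimal sketch follows; resolveServiceTier is a hypothetical name, and the empty string stands for the default tier.

```go
package main

import "fmt"

// resolveServiceTier (hypothetical) returns "priority" when either the group
// toggle is on or the client sent the fast-mode beta header, and an empty
// string (default tier) otherwise.
func resolveServiceTier(forceFastMode, clientSentFastModeBeta bool) string {
	if forceFastMode || clientSentFastModeBeta {
		return "priority"
	}
	return ""
}

func main() {
	// Print the full truth table for the two inputs.
	for _, force := range []bool{true, false} {
		for _, beta := range []bool{true, false} {
			fmt.Printf("force_fast_mode=%t header=%t -> %q\n",
				force, beta, resolveServiceTier(force, beta))
		}
	}
}
```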

Introduce a group-level "force fast mode" toggle that, when enabled on an openai (codex) group, unconditionally rewrites the upstream request body to service_tier="priority" regardless of the client's inbound value or anthropic-beta: fast-mode-* header. Applies to all three forward paths: /v1/responses, /v1/chat/completions and Claude /v1/messages (including the OpenAI passthrough branch).

The design reuses applyCodexOAuthTransform's call sites rather than changing its signature (which would touch 20+ existing tests). A single helper getForceFastModeFromContext reads the already-loaded apiKey.Group.ForceFastMode from the gin context, so handlers need no changes. Billing automatically flows through the existing service_tier path and charges at priority pricing.

Notable subtleties:
- The api_key eager-load in api_key_repo uses an explicit Select field allowlist, so the new column must be added there (otherwise ent reads false even when the DB column is true).
- api_key auth cache snapshot serializes/deserializes through a separate DTO, also updated with the new field.
- omitempty on the JSON field makes the addition backward-compatible: old sub2api versions sharing the same Redis will simply read false for the missing key, and new versions reading old cached snapshots behave correctly.

Verified end-to-end against a real codex group:
- off -> usage_logs.service_tier empty, baseline cost
- on -> usage_logs.service_tier=priority, ~1.5x cost (gpt-5.4 priority)
- Covers /v1/chat/completions and /v1/messages paths
- 8-case unit test in openai_gateway_service_codex_cli_only_test.go covers every nil/type-mismatch/non-openai/off/on branch of the helper.

Migration numbered 903 (test-space) to avoid colliding with main-branch additions in the 10x range; rename to an appropriate number before any upstream PR.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

HaoYan-A wants to merge 1 commit into Wei-Shaw:main from HaoYan-A:pr/group-force-fast-mode
