fix(ai_guard): record model output in LLMObs when blocked after model call by avara1986 · Pull Request #18585 · DataDog/dd-trace-py

avara1986 · 2026-06-11T10:06:36Z

Description

When AI Guard blocks a request after the model call completes, the OpenAI and Anthropic integrations dropped the model output from the LLM Observability span. Output extraction is gated on not span.error, and the AI Guard block errors the span — so even though the model response was produced (and is visible in the AI Guard UI), LLM Obs recorded an empty output.

Root causes (same symptom, two mechanisms):

Anthropic (contrib/internal/anthropic/patch.py): event.response = resp was only set after the .after dispatch, so on a block the response was never attached to the event and the ended-event handler recorded response=None.
OpenAI (llmobs/_integrations/utils.py): the response is available, but openai_set_meta_tags_from_chat / _from_response blank the output whenever span.error is set — which the block triggers.

Fix

Add a span ctx-item flag AI_GUARD_BLOCKED (llmobs/_constants.py).
The contrib patches set the flag when a DDBlockException is raised after a successful model call (Anthropic also now attaches the response to the request event before the after-hook).
The OpenAI (chat + responses) and Anthropic output extractors honour the flag: when the span is errored but a valid response exists due to an AI Guard block, the model output is still recorded. Behaviour is unchanged for genuine model/API errors (no response exists).

Testing

Before this PR

After this PR:

… call When AI Guard blocked a request AFTER the model call completed, the OpenAI and Anthropic integrations dropped the model output from the LLMObs span: output extraction is gated on `not span.error`, and the block errors the span. The response was already produced (and is visible in the AI Guard UI), but LLMObs recorded an empty output — APPSEC-68147. The contrib patches now flag the span with an `AI_GUARD_BLOCKED` ctx item when a `DDBlockException` is raised after a successful model call (Anthropic also attaches the response to the request event, which was previously only set after the after-hook). The OpenAI (chat + responses) and Anthropic output extractors honour the flag and still record the model output even though the span is errored by the block. Behaviour is unchanged for genuine model/API errors, where no response exists. Verified end-to-end via the AI Guard OpenAI dogfooding scenario: on an after-model block the LLMObs span output goes from empty (pre-fix) to the full model response (post-fix). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

cit-pr-commenter-54b7da · 2026-06-11T10:08:38Z

Codeowners resolved as

tests/appsec/ai_guard/anthropic/test_anthropic.py                       @DataDog/asm-python

datadog-prod-us1-6 · 2026-06-11T10:11:06Z

Tests

✨ Fix all issues with BitsAI

⚠️ Warnings

🚦 8 Pipeline jobs failed

DataDog/apm-reliability/dd-trace-py | build linux serverless: [amd64, cp315-cp315, v113741238-d2b8243-manylinux2014_x86_64, 1]

DataDog/apm-reliability/dd-trace-py | build linux serverless: [amd64, cp315-cp315, v113741491-d2b8243-musllinux_1_2_x86_64, 1]

DataDog/apm-reliability/dd-trace-py | build linux serverless: [arm64, cp315-cp315, v113741357-d2b8243-manylinux2014_aarch64, 1]

View all 8 failed jobs.

ℹ️ Info

No other issues found (see more)

🧪 All tests passed
❄️ No new flaky tests detected

Useful? React with 👍 / 👎

_{This comment will be updated automatically if new data arrives.

🔗 Commit SHA: 1c85f5c | Docs | Datadog PR Page | Give us feedback!}

avara1986 · 2026-06-11T13:49:48Z

@codex review

chatgpt-codex-connector · 2026-06-11T13:53:07Z

Codex Review: Didn't find any major issues. More of your lovely PRs please.

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Yun-Kim · 2026-06-12T18:46:19Z

+                    ctx.span._set_ctx_item(AI_GUARD_BLOCKED, True)
            ctx.dispatch_ended_event(*sys.exc_info())
            raise
        event.response = resp


Can we just always set the event.response = resp before we do the AI Guard after core dispatch? This way we can decouple AI Guard errors from non-none responses.

Yun-Kim · 2026-06-12T18:47:30Z

+        # Record output when a response exists. ``span.error`` normally
+        # suppresses output, but an AI Guard block after the model call errors
+        # the span while still having a valid response (APPSEC-68147).
+        if response is not None and (not span.error or span._get_ctx_item(AI_GUARD_BLOCKED)):


we should just gate this on if response is not None, it should be independent of span.error or ai guard blocked being true

Yun-Kim · 2026-06-12T18:48:29Z

-    if span.error or not messages:
+    # ``span.error`` normally suppresses output, but an AI Guard block after the
+    # model call errors the span while a valid response exists (APPSEC-68147).
+    if (span.error and not span._get_ctx_item(AI_GUARD_BLOCKED)) or not messages:


what's stopping us from just gating this as if not messages?

fix errors

4753eb1

avara1986 requested a review from christophe-papazian June 11, 2026 13:49

Merge branch 'main' into fix/ai-guard-llmobs-output-after-block

4ad25a0

christophe-papazian approved these changes Jun 11, 2026

View reviewed changes

Merge branch 'main' into fix/ai-guard-llmobs-output-after-block

1c85f5c

avara1986 marked this pull request as ready for review June 12, 2026 13:40

avara1986 requested review from a team as code owners June 12, 2026 13:40

avara1986 requested review from dubloom and sabrenner June 12, 2026 13:40

Yun-Kim reviewed Jun 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(ai_guard): record model output in LLMObs when blocked after model call#18585

fix(ai_guard): record model output in LLMObs when blocked after model call#18585
avara1986 wants to merge 4 commits into
mainfrom
fix/ai-guard-llmobs-output-after-block

avara1986 commented Jun 11, 2026 •

edited by atlassian Bot

Loading

Uh oh!

cit-pr-commenter-54b7da Bot commented Jun 11, 2026 •

edited

Loading

Uh oh!

datadog-prod-us1-6 Bot commented Jun 11, 2026 •

edited by datadog-datadog-prod-us1 Bot

Loading

Uh oh!

avara1986 commented Jun 11, 2026

Uh oh!

chatgpt-codex-connector Bot commented Jun 11, 2026

Uh oh!

Yun-Kim Jun 12, 2026 •

edited

Loading

Uh oh!

Yun-Kim Jun 12, 2026

Uh oh!

Yun-Kim Jun 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

avara1986 commented Jun 11, 2026 • edited by atlassian Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Fix

Testing

Before this PR

After this PR:

Uh oh!

cit-pr-commenter-54b7da Bot commented Jun 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codeowners resolved as

Uh oh!

datadog-prod-us1-6 Bot commented Jun 11, 2026 • edited by datadog-datadog-prod-us1 Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

⚠️ Warnings

ℹ️ Info

Uh oh!

avara1986 commented Jun 11, 2026

Uh oh!

chatgpt-codex-connector Bot commented Jun 11, 2026

Uh oh!

Yun-Kim Jun 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Yun-Kim Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

Yun-Kim Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

avara1986 commented Jun 11, 2026 •

edited by atlassian Bot

Loading

cit-pr-commenter-54b7da Bot commented Jun 11, 2026 •

edited

Loading

datadog-prod-us1-6 Bot commented Jun 11, 2026 •

edited by datadog-datadog-prod-us1 Bot

Loading

Yun-Kim Jun 12, 2026 •

edited

Loading