fix(check-collections): report runner scan FAILED with logs on abort re-raise (SAS-13001) by m1n0 · Pull Request #2779 · sodadata/soda-core

m1n0 · 2026-07-02T12:39:52Z

What & why

SAS-13001: a post-migration contract scan failed but produced no logs in Cloud, making it undiagnosable.

Root cause is a diagnosability hole in the engine. A single-contract (runner) scan runs execute_check_collections with abort_on_first_error=True, so the first exception during contract construction or verify re-raises immediately — before phase 3's combined upload, which is the only place that ships the engine's captured logs to Cloud and calls mark_scan_as_failed. The exception then reaches the CLI (exit 3) and the launcher only writes it to pod logs, so the Cloud scan record shows FAILED with an empty log payload. This turns any pre-upload crash into a silent, undiagnosable scan.

Fix

Before the abort_on_first_error re-raise, best-effort mark the still-PENDING runner scan FAILED with the captured logs (_report_runner_scan_failed_before_reraise). The shared Logs gatherer holds every construction/verify record (each impl is built with logs=logs), so its records are exactly what should reach Cloud. The historical re-raise contract is preserved and reporting never masks the original exception. No-op for ad-hoc runs (no scan id).

Tests

New test_uncaught_exception_during_verify_marks_scan_failed_with_logs: a scan that raises during verify now marks the scan FAILED with a non-empty log payload (fails on main, passes here).
Full soda-core unit suite: 978 passed, 2 skipped; pre-commit clean.

Note

This is the diagnosability fix. The underlying crash for M&S (most likely a migrated contract with an unsupported check type) is tracked separately — this change makes that (and any future pre-upload crash) self-diagnosing in Cloud.

🤖 Generated with Claude Code

…re-raise A single-contract (runner) scan runs with abort_on_first_error=True, so the first construction/verify exception re-raises out of execute_check_collections before phase 3's combined upload — the only place that otherwise ships the engine logs and marks the scan FAILED in Cloud. The exception then reaches the CLI (exit 3) and the launcher only writes it to pod logs, leaving the Cloud scan record FAILED with no logs and the failure undiagnosable. Before the abort re-raise, best-effort mark the still-PENDING runner scan FAILED with the captured logs (the shared Logs gatherer holds every construction/verify record). The re-raise contract is preserved and reporting never masks the original exception. No-op for ad-hoc runs (no scan id). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

sonarqubecloud · 2026-07-02T12:40:53Z

Quality Gate passed

Issues
1 New issue
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(check-collections): report runner scan FAILED with logs on abort re-raise (SAS-13001)#2779

fix(check-collections): report runner scan FAILED with logs on abort re-raise (SAS-13001)#2779
m1n0 wants to merge 1 commit into
mainfrom
SAS-13001-scan-failure-logs

m1n0 commented Jul 2, 2026 •

edited by atlassian Bot

Loading

Uh oh!

sonarqubecloud Bot commented Jul 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

m1n0 commented Jul 2, 2026 • edited by atlassian Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What & why

Fix

Tests

Note

Uh oh!

sonarqubecloud Bot commented Jul 2, 2026

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

m1n0 commented Jul 2, 2026 •

edited by atlassian Bot

Loading