Avoid redundant memory_breakdown() computation in device memory probe #21988

Open
Jessen-Li wants to merge 1 commit into ggml-org:master from Jessen-Li:pr-memory-breakdown

Conversation

@Jessen-Li

Overview

This PR removes a redundant computation of ctx->memory_breakdown() during device memory probing in llama_get_device_memory_data().

Additional information

Changes

  • Introduce llama_memory_breakdown_print_impl, which accepts a precomputed memory_breakdown object.
  • Reuse the existing memory_breakdown result from llama_get_device_memory_data() instead of recomputing it in
    llama_memory_breakdown_print().

Motivation

llama_get_device_memory_data() may be called multiple times during fit/probe workflows (e.g. baseline, context probing, repeated resource checks). Each call currently recomputes memory_breakdown, duplicating work unnecessarily.

This change ensures:

  • No redundant computation within a single probe
  • Consistent snapshot used for both computation and logging
  • Slight reduction in overhead in repeated probe scenarios

Behavior

No functional change is intended.
Output and memory accounting remain identical.

Requirements

No new requirements introduced by this change.

Add llama_memory_breakdown_print_impl that accepts a precomputed
memory_breakdown object, and reuse it in llama_get_device_memory_data
to avoid duplicate computation during fit/probe runs.
@Jessen-Li Jessen-Li requested a review from ggerganov as a code owner April 16, 2026 11:16
