
libs : rename libcommon -> libllama-common #21936

Merged
ggerganov merged 7 commits into master from gg/libcommon-shared on Apr 17, 2026

Conversation

@ggerganov
Member

@ggerganov ggerganov commented Apr 15, 2026

Overview

  • Rename libcommon -> libllama-common
  • Rename libbuild_info -> libllama-common-base
  • Allow libllama-common to be shared (based on BUILD_SHARED_LIBS)
  • Install alongside libllama

Additional information

This change reduces the size of the output binaries by making all binaries dynamically link to libllama-common.

Ideally, libllama-common should not be installed in the public lib path, as it is not meant for 3rd-party usage. However, I'm not sure what the canonical way to do that is. For now, it is installed in exactly the same way as libllama, using the same versioning scheme.
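For illustration, the CMake side of such a change might be sketched roughly as follows. This is a hedged sketch only: the target name, source files, and version variables are assumptions, not the actual llama.cpp build scripts.

```cmake
# Sketch only -- names are assumptions, not the real llama.cpp CMake.
# With no explicit STATIC/SHARED keyword, the library type follows
# the value of BUILD_SHARED_LIBS.
add_library(llama-common common.cpp sampling.cpp)

set_target_properties(llama-common PROPERTIES
    VERSION   ${LLAMA_INSTALL_VERSION}   # same versioning scheme as libllama
    SOVERSION ${LLAMA_SOVERSION})

# Install alongside libllama in the public lib dir (for now).
include(GNUInstallDirs)
install(TARGETS llama-common
    LIBRARY DESTINATION ${CMAKE_INSTALL_LIBDIR}
    ARCHIVE DESTINATION ${CMAKE_INSTALL_LIBDIR})
```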


@github-actions github-actions bot added build Compilation issues testing Everything test related examples server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels Apr 15, 2026
@ggerganov ggerganov requested review from angt and ckastner April 15, 2026 05:51
@ckastner
Collaborator

For now, installing it the exact way as libllama using the same versioning scheme.

I think it would be OK to get the first version out like this, if that keeps things simpler for you, and to improve upon this later.

Improvement: the canonical way to do this would be to

  1. Place this shared library in a non-public search path, typically /usr/lib/<package> or /usr/lib/<gnu-triplet>/<package>
  2. Set RPATH/RUNPATH on relevant binaries so they can find it

You can see this with systemd, for example.

$ ldd /usr/bin/journalctl | grep libsystemd-shared
libsystemd-shared-255.so => /usr/lib/x86_64-linux-gnu/systemd/libsystemd-shared-255.so (0x00007e3a8167f000)

$ readelf -d /usr/bin/journalctl | grep RUNPATH
 0x000000000000001d (RUNPATH)            Library runpath: [/usr/lib/x86_64-linux-gnu/systemd]

Since Debian requires libraries in public paths to have stable ABI, we maintain a downstream-only patch that currently makes all llama.cpp libraries private following the example above (see our libllama0 file list). And at build time, all we do is pass -DCMAKE_INSTALL_RPATH=<private-dir>.
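For reference, a minimal CMake sketch of the private-path approach described above might look like this. Directory and target names are assumptions; the real layout would follow the distro's conventions.

```cmake
# Sketch (names are assumptions): keep the internal library out of the
# default linker search path and let binaries find it via RUNPATH.
include(GNUInstallDirs)
set(LLAMA_PRIVATE_LIBDIR "${CMAKE_INSTALL_LIBDIR}/llama.cpp")

install(TARGETS llama-common
    LIBRARY DESTINATION ${LLAMA_PRIVATE_LIBDIR})

# Binaries installed to ${CMAKE_INSTALL_BINDIR} resolve the private
# directory relative to their own location via $ORIGIN.
set_target_properties(llama-cli PROPERTIES
    INSTALL_RPATH "\$ORIGIN/../${LLAMA_PRIVATE_LIBDIR}")
```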

If you intend to implement this improvement, perhaps we can find a general solution that makes our patch obsolete.

[We never intended to upstream our patch as it goes too far for you, by hiding libllama0 that users actually want. In Debian, users can easily un-hide it through the pkg-config and cmake configurations that we provide in (non-default) search paths.]

Contributor

@arthw arthw left a comment


I tested the SYCL backend and it works correctly.

Thank you!

@ggerganov ggerganov force-pushed the gg/libcommon-shared branch from b9c51fc to c4f5835 Compare April 15, 2026 09:19
@ngxson
Contributor

ngxson commented Apr 15, 2026

A bit related to this, I've been thinking about moving server-context to a new lib named libllama-engine, the idea will be similar to llamax that you brought up before. I can create a dedicated issue for discussion if it's necessary.

The current PR starts to introduce the naming scheme libllama-something which is aligned with my point above.

Re. your concern about not allowing 3rd parties to use libllama-common: a simple hack could be an #ifndef USE_INTERNAL_LIBCOMMON guard, where the C++ code needs to define the macro before including the header, so a user who includes it without the define gets an error. It's similar to mtmd-helper:

#ifdef MTMD_INTERNAL_HEADER
#error "mtmd-helper is a public library outside of mtmd. it must not include internal headers"
#endif

@ggerganov ggerganov force-pushed the gg/libcommon-shared branch from dac63e9 to 64c8c88 Compare April 15, 2026 10:24
@ggerganov
Member Author

Improvement: the canonical way to do this would be to

  1. Place this shared library in a non-public search path, typically /usr/lib/<package> or /usr/lib/<gnu-triplet>/<package>
  2. Set RPATH/RUNPATH on relevant binaries so they can find it

Got it, thanks. We can do that in the next iteration.

If you intend to implement this improvement, perhaps we can find a general solution that makes our patch obsolete.

I guess to do that we need to automate the ABI stability validation on our end. We basically need to add a workflow to llama.cpp repo that would compare the current ABI with the latest released ABI and raise an error if there is backwards incompatibility. Probably something that we should address soon.

A bit related to this, I've been thinking about moving server-context to a new lib named libllama-engine, the idea will be similar to llamax that you brought up before. I can create a dedicated issue for discussion if it's necessary.

Yes, we need to reorganize libllama-common and the server code in such a way that, for example, llama-cli does not need to link against httplib yet can still use the "server engine" underneath.

Re. your concern about not allowing 3rd-party to use the libllama-common

It's not a problem. The only annoyance is that users will see a libllama-common.so in /usr/lib without having a header file to interface with it, but we will fix that as @ckastner explained above.

@ckastner
Collaborator

I guess to do that we need to automate the ABI stability validation on our end. We basically need to add a workflow to llama.cpp repo that would compare the current ABI with the latest released ABI and raise an error if there is backwards incompatibility. Probably something that we should address soon.

I had this on my TODO list, actually in both directions:

  1. Find ABI break with abi-compliance-checker or libabigail
  2. Find new ABI features (backwards-compatible)

Point 2 would be a common occurrence: it's when llama.cpp needs a newer ggml for new symbols/features but doesn't break existing ones, which means a new ggml minor release is needed.
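As a sketch of what such an automated check might look like, here is a minimal GitHub Actions job using libabigail's abidiff, which exits non-zero when the new binary is ABI-incompatible with the baseline. The workflow name, build flags, and the source of the baseline .so are all assumptions.

```yaml
# .github/workflows/abi-check.yml -- sketch; names and paths are assumptions
name: abi-check
on: [pull_request]

jobs:
  abidiff:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Install libabigail
        run: sudo apt-get update && sudo apt-get install -y abigail-tools
      - name: Build current shared library
        run: |
          cmake -B build -DBUILD_SHARED_LIBS=ON -DCMAKE_BUILD_TYPE=RelWithDebInfo
          cmake --build build --target llama
      - name: Compare against latest released ABI
        run: |
          # baseline/libllama.so is assumed fetched from the latest release
          abidiff baseline/libllama.so build/bin/libllama.so
```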

@ngxson
Contributor

ngxson commented Apr 15, 2026

Yes, we need to reorganize libllama-common and the server code in such a way that, for example, llama-cli does not need to link against httplib yet can still use the "server engine" underneath.

Forgot to mention, we're having a discussion in #21674 (especially the comment #21674 (comment)) about the direction of moving the CLI to using HTTP instead of the native "server engine" version. Kindly asking if you can share your POV in the discussion when you have time. Thanks in advance!

@CMay

CMay commented Apr 15, 2026

This change reduces the output binaries size by making all binaries dynamically link to libllama-common.

Smaller binaries are appealing. Some of the larger backends may not even notice, but Vulkan might.

I guess to do that we need to automate the ABI stability validation on our end. We basically need to add a workflow to llama.cpp repo that would compare the current ABI with the latest released ABI and raise an error if there is backwards incompatibility. Probably something that we should address soon.

This is one of the most exciting things I've heard out of llama.cpp development in a while. llama.cpp interests me as a library more than as a client front end or inference web API host, but I've got old code that doesn't work with newer versions of the library and I haven't gotten around to fixing it.

It does make me wonder about the balance between the flexibility of exposing new things that newer models use versus abstracting those so that old code can use drop-in replacements of llama.cpp and benefit from newer models without code changes.

Like a basic default usage path and a more manual usage path that allows more tinkering. Is an approach like that even compatible with aiming for stability without leaving a graveyard of old public methods? Would you have to create a kind of feature router that people can use to indirectly call a subset of private methods by request until they're stabilized and made public?

@ggerganov ggerganov requested a review from CISC April 16, 2026 05:11
@ggerganov
Member Author

@ckastner If you have any suggestions on how to implement the workflow, let us know. I tried to prototype something that extracts the current ABI, but it boils down to parsing the header file, which seems a bit fragile. Maybe the tools that you mentioned would be better suited for that.

@ngxson Yes, generally decoupling the communication layer from the logic (both server and client) is a good idea in terms of the implementation. The server-side though is more important IMO atm.

@CMay I'm not sure I follow everything, but the bottom line is that when we transition to semantic versioning it should generally be easier for 3rd parties using llama.cpp. We just need an automation for that because it is difficult to keep track of all the changes.

Would you have to create a kind of feature router that people can use to indirectly call a subset of private methods by request until they're stabilized and made public?

No, this sounds too complicated. We want to expose only functionality that will likely remain stable long term. Experiments can be done in tools/examples in the repo or in forks.

@ggerganov ggerganov added the merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge. label Apr 16, 2026
@ggerganov ggerganov force-pushed the gg/libcommon-shared branch from aefc0d1 to 625c622 Compare April 16, 2026 17:49
@ggerganov ggerganov force-pushed the gg/libcommon-shared branch from 625c622 to fa86a92 Compare April 17, 2026 05:53
@ggerganov ggerganov merged commit 6990e2f into master Apr 17, 2026
48 of 51 checks passed
@ggerganov ggerganov deleted the gg/libcommon-shared branch April 17, 2026 08:11
