[Staging] Add SLES support for AMD gpu-operator by Priyankasaggu11929 · Pull Request #371 · ROCm/gpu-operator

Priyankasaggu11929 · 2025-10-28T04:41:37Z

(based on comment #365 (review) from the original PR)

Motivation

This PR aim at adding support for SUSE Linux Enterprise Server (SLES) 15 SP5+ to the AMD GPU operator.

Technical Details

781c5b5 - add support for detecting SLES nodes and automatically selecting appropriate AMD GPU driver versions
- add new slesCMNameMapper to parse SLES version strings like 'SUSE Linux Enterprise Server 15 SP6' to 'sles-15.6'
- add SLESDefaultDriverVersionsMapper to select driver versions
  - SLES 15 SP6/SP7 -> driver 7.0.2 (ref: https://repo.radeon.com/amdgpu-install/7.0.2/sle/)
  - SLES 15 SP5 -> driver 6.2.2 (ref: https://repo.radeon.com/amdgpu-install/6.2.2/sle/)
- register both 'sles' and 'suse' identifiers in mappers
0170a9a - add SLES Dockerfile template (DockerfileTemplate.sles) for building AMD GPU drivers on SLES (currently, I've skipped adding the GIM Dockerfile template for SLES, will tackle it once this goes through).
- also embed the template via go:embed and add SLES case logic
~~c2dce44 - docs: update example/deviceconfig_example.yaml~~ <- dropped
4da60d3 - use "registry.suse.com" as the default base image registry if OS == "sles"
- although, use-specified BaseImageRegistry still takes precedence
- also extend tests in internal/kmmodule/kmmodule_test.go to test above changes in resolveDockerfile func

Test Plan

b625441 - tests: update internal/utils_test.go for added support for SLES 15 SP*

Test Result

truncated output of make unit-test after new added tests in b625441

> make unit-test
...
...
=== RUN   TestSLESDefaultDriverVersionsMapper
=== RUN   TestSLESDefaultDriverVersionsMapper/SLES_15_SP6
=== RUN   TestSLESDefaultDriverVersionsMapper/SLES_15_SP7
=== RUN   TestSLESDefaultDriverVersionsMapper/SLES_15_SP5
=== RUN   TestSLESDefaultDriverVersionsMapper/SLES_15_SP4
=== RUN   TestSLESDefaultDriverVersionsMapper/SLES_15_base
=== RUN   TestSLESDefaultDriverVersionsMapper/SLES_15_with_dash_format
--- PASS: TestSLESDefaultDriverVersionsMapper (0.00s)
    --- PASS: TestSLESDefaultDriverVersionsMapper/SLES_15_SP6 (0.00s)
    --- PASS: TestSLESDefaultDriverVersionsMapper/SLES_15_SP7 (0.00s)
    --- PASS: TestSLESDefaultDriverVersionsMapper/SLES_15_SP5 (0.00s)
    --- PASS: TestSLESDefaultDriverVersionsMapper/SLES_15_SP4 (0.00s)
    --- PASS: TestSLESDefaultDriverVersionsMapper/SLES_15_base (0.00s)
    --- PASS: TestSLESDefaultDriverVersionsMapper/SLES_15_with_dash_format (0.00s)
PASS
coverage: 48.6% of statements
ok  	github.com/ROCm/gpu-operator/internal	0.019s	coverage: 48.6% of statements
=== RUN   TestAPIs
Running Suite: Controller Suite - /home/psaggu/work-suse/amd-gpu-operator-work/oct-23-pr/gpu-operator/internal/controllers
==========================================================================================================================
Random Seed: 1761223798

Will run 15 of 15 specs
•••••••••••••••

Ran 15 of 15 Specs in 0.008 seconds
SUCCESS! -- 15 Passed | 0 Failed | 0 Pending | 0 Skipped
--- PASS: TestAPIs (0.01s)
PASS
coverage: 7.9% of statements
ok  	github.com/ROCm/gpu-operator/internal/controllers	(cached)	coverage: 7.9% of statements
=== RUN   TestAPIs
Running Suite: KMMModule Suite - /home/psaggu/work-suse/amd-gpu-operator-work/oct-23-pr/gpu-operator/internal/kmmmodule
=======================================================================================================================
Random Seed: 1761223798

Will run 5 of 5 specs
testing multiple valid homogeneous nodes
testing multiple valid heterogeneous nodes
testing multiple valid heterogeneous nodes + one unsupported node
testing multiple unsupported nodes
testing empty node list
•<moduleName>
<amdgpu>
•<moduleName>
<amdgpu>
•••

Ran 5 of 5 Specs in 0.005 seconds
SUCCESS! -- 5 Passed | 0 Failed | 0 Pending | 0 Skipped
--- PASS: TestAPIs (0.01s)
PASS
coverage: 32.3% of statements
ok  	github.com/ROCm/gpu-operator/internal/kmmmodule	(cached)	coverage: 32.3% of statements

•••••••••••••••

Ran 15 of 15 Specs in 0.008 seconds
SUCCESS! -- 15 Passed | 0 Failed | 0 Pending | 0 Skipped

output from tests added as part of 4da60d3

❯ go test ./internal/kmmmodule/... -v -ginkgo.focus="resolveDockerfile" -ginkgo.v
=== RUN   TestAPIs
Running Suite: KMMModule Suite - /home/psaggu/work-suse/amd-gpu-operator-work/oct-23-pr/gpu-operator/internal/kmmmodule
=======================================================================================================================
Random Seed: 1761548380

Will run 3 of 8 specs
SSSS
------------------------------
resolveDockerfile should use correct default registry when not specified by user
/home/psaggu/work-suse/amd-gpu-operator-work/oct-23-pr/gpu-operator/internal/kmmmodule/kmmmodule_test.go:683
• [0.000 seconds]
------------------------------
resolveDockerfile should respect user-specified BaseImageRegistry for all OS types
/home/psaggu/work-suse/amd-gpu-operator-work/oct-23-pr/gpu-operator/internal/kmmmodule/kmmmodule_test.go:702
• [0.000 seconds]
------------------------------
resolveDockerfile should return error for unsupported OS
/home/psaggu/work-suse/amd-gpu-operator-work/oct-23-pr/gpu-operator/internal/kmmmodule/kmmmodule_test.go:727
• [0.000 seconds]
------------------------------
S

Ran 3 of 8 Specs in 0.000 seconds
SUCCESS! -- 3 Passed | 0 Failed | 0 Pending | 5 Skipped
--- PASS: TestAPIs (0.00s)
PASS
ok  	github.com/ROCm/gpu-operator/internal/kmmmodule	0.022s

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

…cs (#1021)

* add suspend and resume functionality for remediation workflows * minor updates to docs * minor refactoring to avoid duplicate k8s get calls * add default configmap * fix helm chart issues * address code review comments * move remediation configs and scripts into separate files * add jq package to utils_container

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

Co-authored-by: Yuva Shankar <11082310+yuva29@users.noreply.github.com>

… dashboard Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

…fter partitioning Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

…071) Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

(cherry picked from commit 42a7329)

* make max parallel workflows configurable for auto remediation * add zero value in default CR * address review comments (cherry picked from commit f023a5c)

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

(cherry picked from commit 3f1a1ee2ea08f7675a6aba6cd60ed2f06ca7bdc6)

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

(cherry picked from commit c8409b8)

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

* remediation e2e tests for suspend and resume actions * add e2e test for recoverypolicy cr * use init container image from dev.env (cherry picked from commit 3e0f7aa)

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

(cherry picked from commit 4c737d7)

* GPUOP-525 update auto node remediation documentation * address review comments (cherry picked from commit 8e3f3e0)

* customize auto node remediation options * address review comments * commit generated files * support custom labels and taints in workflow * handle custom drain policy * update documentation * fix e2e test (cherry picked from commit 8dd5196)

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

…219) (ROCm#494) * handle imagePullSecrets in ANR workflow test runner and kmmmodule * address review comments (cherry picked from commit e6d01a9) Co-authored-by: Uday Bhaskar <udayb@amd.com>

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

* upgrade Argo workflow CRDs and controller to v4.0.3 (#1235) * upgrade Argo workflow CRDs and controller to v4.0.3 * update controller image version to v4.0.3 (cherry picked from commit 155a669) * Update amd-gpu-operator.clusterserviceversion.yaml --------- Co-authored-by: Uday Bhaskar <udayb@amd.com> Co-authored-by: Praveen Kumar Shanmugam <58961022+spraveenio@users.noreply.github.com>

…OCm#500) * [Fix] GPUOP-607 fail the ANR workflow when imagePullBackOff * Update internal/controllers/remediation/scripts/test.sh * Update internal/controllers/remediation/scripts/test.sh --------- (cherry picked from commit 344e480) Signed-off-by: yansun1996 <Yan.Sun3@amd.com> Co-authored-by: Yan Sun <Yan.Sun3@amd.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

@biluriuday

…m#501) * GPUOP-618 fix helm upgrade issue with latest Argo CRDs (#1283) (cherry picked from commit fe9ec91) * Apply suggestion from @biluriuday --------- Co-authored-by: Uday Bhaskar <udayb@amd.com>

* anr - fixes for applylabels step * multiple anr fixes (cherry picked from commit b33e4c9) Co-authored-by: Uday Bhaskar <udayb@amd.com>

…ition (#1281) (ROCm#503) (cherry picked from commit 9314824) Signed-off-by: yansun1996 <Yan.Sun3@amd.com> Co-authored-by: Yan Sun <Yan.Sun3@amd.com>

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

* enable npd and anr e2e sims * increase validation check duration (cherry picked from commit 67defdf) Co-authored-by: Uday Bhaskar <udayb@amd.com>

Fix two bugs in DeviceConfig node assignment management: 1. buildNodeAssignments now logs and skips node assignment conflicts instead of returning a fatal error. A CR-level conflict should not block the entire operator — the runtime validateNodeAssignments check already handles this per-CR during reconciliation. 2. Remove premature updateNodeAssignments call during finalization that freed nodes from the in-memory map before the finalizer was removed. Node cleanup is now handled solely via the NotFound path after CR garbage collection, preventing other DeviceConfigs from claiming nodes mid-finalization. Also adds DRA driver DaemonSet cleanup to the finalization path, which was previously only handled during normal reconciliation. (cherry picked from commit a945553) Co-authored-by: Nitish Bhat <bhatnitish@gmail.com>

…d and it's E2Es (#1267) (ROCm#508) * DCM: mount default ConfigMap when spec.configManager.config is omitted When DeviceConfig.spec.configManager.config is nil or has an empty name, the DCM DaemonSet now always mounts a ConfigMap volume named default-dcm-config (configurable by setting spec.configManager.config.name). Add E2E coverage (TestDCMDefaultConfigMapWhenConfigOmitted), cluster_test helpers, SIM skips for GPU-only partition tests, and align E2E_DCM_IMAGE in dev.env with v1.4.1. * Helm default CM + operator EnsureDefaultDCMConfigMap + E2E/docs * changes * address comments * comments * dcm changes (cherry picked from commit e9c1e91) Co-authored-by: nikhilsk <47417007+nikhilsk@users.noreply.github.com>

…ation (#1295) (ROCm#509) (cherry picked from commit 65785ed) Co-authored-by: bhatturu <bhatturu@amd.com>

…4) (ROCm#526) (cherry picked from commit fa1328d092487fa7482c7d3166bbd5fd5fe6d74d) Co-authored-by: Srivatsa Sangli <58572624+sangli-pensando@users.noreply.github.com>

…nual test examples (#1364) (#1365) (ROCm#527) Add privileged SCC permissions to all ClusterRole definitions in manual/scheduled test documentation to support OpenShift deployments. (cherry picked from commit 9915f721319cd7bf8fcb2ac581092473c0c3dc56) Co-authored-by: Yan Sun <Yan.Sun3@amd.com> Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>

* GPUOP-640 update remediation documentation * fix argo helm chart version for openshift (cherry picked from commit 1f246648b635f53b937c8003aa15278a67b9a008) (cherry picked from commit a968197d9974e22087eefe3cad2b7a184bf848c9) Co-authored-by: Uday Bhaskar <udayb@amd.com>

* metricsclient cli change * add test/e2e dependency to e2e sim * increase timeout (cherry picked from commit 5b1e46a) Co-authored-by: Praveen Kumar Shanmugam <58961022+spraveenio@users.noreply.github.com>

…rt (#1337) (ROCm#522) * Add workflow and workflow-triggered pod collection to techsupport Enhance the techsupport_dump.sh script to collect workflow CRs and workflow-triggered pods when auto node remediation feature is enabled. This helps with debugging workflow-based node remediation issues. Changes: - Add WORKFLOW_RESOURCES variable for workflow CRs - Collect workflow CRs (get, describe, yaml/json output) - Collect workflow-triggered pods identified by workflows.argoproj.io/workflow label - Add per-node log collection for workflow-triggered pods - Include error resilience with || true for ephemeral workflow pods * Make pod_logs function resilient to ephemeral pod failures Add error handling (|| true) to kubectl logs commands in pod_logs function to prevent script termination when collecting logs from ephemeral/terminated workflow pods. With set -e enabled, failed log collection would previously abort the entire techsupport run before reaching error handlers. Changes: - Add '2>&1 || true' to current container logs command - Add '2>&1 || true' to previous container logs command - Ensures individual pod log failures don't terminate script execution - Critical for short-lived workflow pods that may be deleted during collection * Add workflow controller pod collection to techsupport Collect information and logs from the workflow controller pod (identified by label app=amd-gpu-operator-workflow-controller) in addition to workflow CRs and workflow-triggered pods. Changes: - Add workflow controller pod collection in cluster-wide section - kubectl get/describe output in both text and JSON/YAML format - Add workflow controller pod log collection per node - Maintains error resilience with || true for optional feature --------- (cherry picked from commit 70b0104) Co-authored-by: Yan Sun <Yan.Sun3@amd.com> Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>

…clude "useSourceImage" for example DeviceConfig (ROCm#523) * Include DeviceConfig driver.useSourceImage in OCP(olm) install docs Signed-off-by: Landon LaSmith <LLaSmith@redhat.com> * Airgapped: Sync driver version with OpenShift(OLM) documentation Signed-off-by: Landon LaSmith <LLaSmith@redhat.com> --------- Signed-off-by: Landon LaSmith <LLaSmith@redhat.com>

* Add DeviceConfig collection in testmonitor * Fix DRA Tests

Add ServiceAccount, ClusterRole, and ClusterRoleBinding for the DRA driver so it can run on OpenShift clusters. The ClusterRole grants: - privileged SCC (required for OpenShift) - resourceslices CRUD (to publish GPU resources) - resourceclaims get (to process allocation requests) - nodes get (to look up node info for ResourceSlice ownership) Also add the DRA driver service account to the OLM bundle's extra-service-accounts list so OLM-managed installs create the SA. # Conflicts: # bundle/manifests/amd-gpu-operator.clusterserviceversion.yaml

…d (#1388) * Create DeviceClass from operator code on OpenShift when DRA is enabled On OpenShift, operator-sdk cannot deploy DeviceClass resources via the OLM bundle. This adds handleDeviceClass to the reconciler which creates the gpu.amd.com DeviceClass using an unstructured client when running on OpenShift with DRA driver enabled. The DeviceClass is cluster-scoped and shared, so it is created once (AlreadyExists is handled gracefully) and never deleted on DeviceConfig finalization. * Use deviceClassName constant instead of hardcoded string Address review feedback: extract "gpu.amd.com" into a const and use it throughout handleDeviceClass.

…opriate AMD GPU driver versions * add new `slesCMNameMapper` to parse SLES version strings like 'SUSE Linux Enterprise Server 15 SP6' to 'sles-15.6' * add `SLESDefaultDriverVersionsMapper` to select driver versions - SLES 15 SP6/SP7 -> driver 7.0.2 (ref: https://repo.radeon.com/amdgpu-install/7.0.2/sle/) - SLES 15 SP5 -> driver 6.2.2 (ref: https://repo.radeon.com/amdgpu-install/6.2.2/sle/) * register both 'sles' and 'suse' identifiers in mappers Co-authored-by: alex-isv <alex.zacharow@suse.com>

… SUSE AMD GPU driver image

…sles" * although, use-specified `BaseImageRegistry` still takes precedence * also extend tests in `internal/kmmodule/kmmodule_test.go` to test above changes in `resolveDockerfile` func

Priyankasaggu11929 mentioned this pull request Oct 28, 2025

Add SLES support for AMD gpu-operator #365

Open

1 task

sajmera-pensando force-pushed the staging branch from a22e2e4 to 1138d8e Compare November 19, 2025 21:14

yansun1996 and others added 28 commits November 19, 2025 13:15

[DOC] Make RVS parallel execution in all test runner example YAML

d622357

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

[Feature] Support offline driver build for OpenShift (#1010)

d95fd0a

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

[Feature] Update driver build source image description and related do…

eccb82b

…cs (#1021)

update utils container (#1019)

961986e

[Fix] fix helm e2e and DeviceConfig field rendering (#1029)

0d3c53d

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

[Helm] Update NFD rules to label vnics along with nics (#968)

eeefd20

Co-authored-by: Yuva Shankar <11082310+yuva29@users.noreply.github.com>

[Feature] Add default PrometheusRule in OLM bundle to support OCP COO…

3ff0ddf

… dashboard Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

[DOC] do not add toleration to devicePlugin to make it auto-restart a…

891fd1f

…fter partitioning Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

[DOC] Add docs for ROCm 7.1.1 new test runner recipes (#1052)

9682a19

[DOC] Add notification for new amdgpu versioning scheme (#1065)

e0862cf

[Fix] Auto assign PROJECT_VERSION to be default operand image tag (#1…

1f16a2d

…071) Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

[Build] update helm lint command to match with helm v3/v4

cb7f2ae

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

remediation e2e tests and bug fixes (#1054)

33c86e2

(cherry picked from commit 42a7329)

make max parallel workflows configurable for auto remediation (#1058)

102ffbc

* make max parallel workflows configurable for auto remediation * add zero value in default CR * address review comments (cherry picked from commit f023a5c)

[DOC] Add known limitations and platform supported

24feaa4

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

[dcm] Documentation updates for DCM instinct docs (#1073)

63cc876

(cherry picked from commit 3f1a1ee2ea08f7675a6aba6cd60ed2f06ca7bdc6)

[Fix] fix blocked helm upgrade due to unexpected node status (#1085)

e23b156

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

Adding DCM to list of components in README (#1088)

60acce7

(cherry picked from commit c8409b8)

[Fix] fix base img registry template in dockerfile

58e4e39

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

remediation e2e tests for suspend and resume actions (#1068)

ce42b4b

* remediation e2e tests for suspend and resume actions * add e2e test for recoverypolicy cr * use init container image from dev.env (cherry picked from commit 3e0f7aa)

[DOC] Add know issue description for recent Device Plugin fix

b590e00

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

[CI] Setup hourly build for operator utils image

c121134

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

auto remediation - use public test runner image as default image (#1110)

b26eeb7

(cherry picked from commit 4c737d7)

GPUOP-525 update auto node remediation documentation (#1109)

90496de

* GPUOP-525 update auto node remediation documentation * address review comments (cherry picked from commit 8e3f3e0)

[DOC] Fix docs sanity

dc01fcc

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

[Build] Deprecate helm charts for OpenShift

422706e

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

ci-penbot-01 and others added 25 commits March 31, 2026 11:25

handle imagePullSecrets in ANR workflow test runner and kmmmodule (#1…

84b9a49

…219) (ROCm#494) * handle imagePullSecrets in ANR workflow test runner and kmmmodule * address review comments (cherry picked from commit e6d01a9) Co-authored-by: Uday Bhaskar <udayb@amd.com>

[Fix] Base image upgrade to reduce CVE (#1227) (ROCm#493)

bd0756d

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

update ignore file and dir list (ROCm#497)

a80ca2f

skip non rendering files and directories (ROCm#498)

ac2086b

[CP 1283] GPUOP-618 fix helm upgrade issue with latest Argo CRDs (ROC…

6705daa

…m#501) * GPUOP-618 fix helm upgrade issue with latest Argo CRDs (#1283) (cherry picked from commit fe9ec91) * Apply suggestion from @biluriuday --------- Co-authored-by: Uday Bhaskar <udayb@amd.com>

anr - fixes (#1268) (ROCm#502)

b18f660

* anr - fixes for applylabels step * multiple anr fixes (cherry picked from commit b33e4c9) Co-authored-by: Uday Bhaskar <udayb@amd.com>

[Fix] GPUOP-608 read workflow directly from apisrv to avoid race cond…

114969c

…ition (#1281) (ROCm#503) (cherry picked from commit 9314824) Signed-off-by: yansun1996 <Yan.Sun3@amd.com> Co-authored-by: Yan Sun <Yan.Sun3@amd.com>

[Fix] upgrade grpc to address CVE-2026-33186 (#1272) (ROCm#505)

8ccaa90

Signed-off-by: yansun1996 <Yan.Sun3@amd.com>

ignore file list update (ROCm#517)

1b9635b

enable npd and anr e2e sims (#1245) (ROCm#506)

2d94c88

* enable npd and anr e2e sims * increase validation check duration (cherry picked from commit 67defdf) Co-authored-by: Uday Bhaskar <udayb@amd.com>

feat(tests/k8s-e2e): add GPU Operator e2e test suite with DME verific…

9dd94e0

…ation (#1295) (ROCm#509) (cherry picked from commit 65785ed) Co-authored-by: bhatturu <bhatturu@amd.com>

disble upgrade jobs - will be running them from v1.5.0 throttle (#135…

498f0b4

…4) (ROCm#526) (cherry picked from commit fa1328d092487fa7482c7d3166bbd5fd5fe6d74d) Co-authored-by: Srivatsa Sangli <58572624+sangli-pensando@users.noreply.github.com>

metricsclient cli change (#1293) (ROCm#521)

d130246

* metricsclient cli change * add test/e2e dependency to e2e sim * increase timeout (cherry picked from commit 5b1e46a) Co-authored-by: Praveen Kumar Shanmugam <58961022+spraveenio@users.noreply.github.com>

e2e test infra enhancements (#1172)

e2d7a4c

* Add DeviceConfig collection in testmonitor * Fix DRA Tests

Priyankasaggu11929 force-pushed the enable-sles-support branch from 4da60d3 to 666be77 Compare May 4, 2026 07:17

Priyankasaggu11929 added 3 commits May 4, 2026 07:22

add SLES Dockerfile template (DockerfileTemplate.sles) using prebuilt…

db84bdf

… SUSE AMD GPU driver image

tests: update internal/utils_test.go for added support for SLES 15 SP*

f322b40

use "registry.suse.com" as the default base image registry if OS == "…

7dfec5e

…sles" * although, use-specified `BaseImageRegistry` still takes precedence * also extend tests in `internal/kmmodule/kmmodule_test.go` to test above changes in `resolveDockerfile` func

Priyankasaggu11929 force-pushed the enable-sles-support branch from 666be77 to 7dfec5e Compare May 4, 2026 07:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Staging] Add SLES support for AMD gpu-operator#371

[Staging] Add SLES support for AMD gpu-operator#371
Priyankasaggu11929 wants to merge 109 commits intoROCm:stagingfrom
Priyankasaggu11929:enable-sles-support

Priyankasaggu11929 commented Oct 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

12 participants

Conversation

Priyankasaggu11929 commented Oct 28, 2025

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

12 participants