Update alpaka to 2.0.0 "Concepts Ahead" #10103

Merged
cmsbuild merged 1 commit into cms-sw:IB/CMSSW_16_0_X/master from fwyzard:IB/CMSSW_16_0_X/master_alpaka_2.0
Oct 31, 2025
@fwyzard
Contributor

@fwyzard fwyzard commented Oct 1, 2025

This version introduces C++20 compatibility and removes the dependency on Boost.

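See https://github.com/alpaka-group/alpaka/releases/tag/2.0.0 for the release notes, and https://github.com/alpaka-group/alpaka/pulls?q=milestone%3A2.0.0 for the full list of changes in the 2.0.0 release.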

@cmsbuild
Contributor

cmsbuild commented Oct 1, 2025

A new Pull Request was created by @fwyzard for branch IB/CMSSW_16_0_X/master.

@akritkbehera, @cmsbuild, @iarspider, @smuzaffar can you please review it and eventually sign? Thanks.
@ftenchini, @mandrenguyen, @sextonkennedy you are the release manager for this.
cms-bot commands are listed here

@cmsbuild
Contributor

cmsbuild commented Oct 1, 2025

cms-bot internal usage

@fwyzard
Contributor Author

fwyzard commented Oct 1, 2025

enable gpu

@fwyzard
Contributor Author

fwyzard commented Oct 1, 2025

please test with cms-sw/cmssw#49042

@cmsbuild
Contributor

cmsbuild commented Oct 1, 2025

-1

Failed Tests: UnitTests RelVals-NVIDIA_T4 nvidia_t4UnitTests
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dc1302/48406/summary.html
COMMIT: 17b606b
CMSSW: CMSSW_16_0_X_2025-10-01-1100/el8_amd64_gcc13
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S,NVIDIA_T4
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/10103/48406/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dc1302/48406/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dc1302/48406/git-merge-result

Unit Tests

I found 2 errors in the following unit tests:

---> test alpakaTestOneHistoContainerSerialSync had ERRORS
---> test alpakaTestPrefixScanSerialSync had ERRORS

RelVals-NVIDIA_T4

  • 17034.402: 17034.402_TTbar_14TeV+2025PU_Patatrack_PixelOnlyAlpaka/step2_TTbar_14TeV+2025PU_Patatrack_PixelOnlyAlpaka.log
  • 17034.403: 17034.403_TTbar_14TeV+2025PU_Patatrack_PixelOnlyAlpaka_Validation/step2_TTbar_14TeV+2025PU_Patatrack_PixelOnlyAlpaka_Validation.log
  • 17034.412: 17034.412_TTbar_14TeV+2025PU_Patatrack_ECALOnlyAlpaka/step2_TTbar_14TeV+2025PU_Patatrack_ECALOnlyAlpaka.log
Expand to see more relval errors ...

Comparison Summary

Summary:

  • You potentially added 12 lines to the logs
  • Reco comparison results: 6 differences found in the comparisons
  • DQMHistoTests: Total files compared: 51
  • DQMHistoTests: Total histograms compared: 3923676
  • DQMHistoTests: Total failures: 29
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3923627
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 50 files compared)
  • Checked 218 log files, 188 edm output root files, 51 DQM output files
  • TriggerResults: no differences found

AMD_MI300X Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 242 differences found in the comparisons
  • DQMHistoTests: Total files compared: 11
  • DQMHistoTests: Total histograms compared: 146284
  • DQMHistoTests: Total failures: 27588
  • DQMHistoTests: Total nulls: 6
  • DQMHistoTests: Total successes: 118690
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 10 files compared)
  • Checked 42 log files, 45 edm output root files, 11 DQM output files
  • TriggerResults: no differences found

AMD_W7900 Comparison Summary

Summary:

NVIDIA_H100 Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 228 differences found in the comparisons
  • DQMHistoTests: Total files compared: 11
  • DQMHistoTests: Total histograms compared: 146284
  • DQMHistoTests: Total failures: 24688
  • DQMHistoTests: Total nulls: 5
  • DQMHistoTests: Total successes: 121591
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 10 files compared)
  • Checked 42 log files, 45 edm output root files, 11 DQM output files
  • TriggerResults: no differences found

NVIDIA_L40S Comparison Summary

Summary:

@fwyzard
Contributor Author

fwyzard commented Oct 1, 2025

please test with cms-sw/cmssw#49042

@cmsbuild
Contributor

cmsbuild commented Oct 3, 2025

-1

Failed Tests: UnitTests
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dc1302/48417/summary.html
COMMIT: 17b606b
CMSSW: CMSSW_16_0_X_2025-10-01-2300/el8_amd64_gcc13
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S,NVIDIA_T4
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/10103/48417/install.sh to create a dev area with all the needed externals and cmssw changes.

Unit Tests

I found 2 errors in the following unit tests:

---> test alpakaTestOneHistoContainerSerialSync had ERRORS
---> test alpakaTestPrefixScanSerialSync had ERRORS

Comparison Summary

Summary:

  • You potentially added 16 lines to the logs
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 51
  • DQMHistoTests: Total histograms compared: 3923676
  • DQMHistoTests: Total failures: 30
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3923626
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 50 files compared)
  • Checked 218 log files, 188 edm output root files, 51 DQM output files
  • TriggerResults: no differences found

AMD_MI300X Comparison Summary

Summary:

AMD_W7900 Comparison Summary

Summary:

NVIDIA_H100 Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 226 differences found in the comparisons
  • DQMHistoTests: Total files compared: 11
  • DQMHistoTests: Total histograms compared: 146284
  • DQMHistoTests: Total failures: 24985
  • DQMHistoTests: Total nulls: 4
  • DQMHistoTests: Total successes: 121295
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 10 files compared)
  • Checked 42 log files, 45 edm output root files, 11 DQM output files
  • TriggerResults: no differences found

NVIDIA_L40S Comparison Summary

Summary:

NVIDIA_T4 Comparison Summary

Summary:

@cmsbuild
Contributor

cmsbuild commented Oct 3, 2025

Pull request #10103 was updated.

@fwyzard
Contributor Author

fwyzard commented Oct 3, 2025

please test with cms-sw/cmssw#49042

@fwyzard
Contributor Author

fwyzard commented Oct 29, 2025

@smuzaffar cms-sw/cmssw#49042 is required to update CMSSW for alpaka 2.0

@cmsbuild
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dc1302/48883/summary.html
COMMIT: eaf0170
CMSSW: CMSSW_16_0_X_2025-10-28-2300/el8_amd64_gcc13
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S,NVIDIA_T4
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/10103/48883/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dc1302/48883/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dc1302/48883/git-merge-result

Comparison Summary

Summary:

  • You potentially removed 57 lines from the logs
  • Reco comparison results: 8 differences found in the comparisons
  • DQMHistoTests: Total files compared: 51
  • DQMHistoTests: Total histograms compared: 3939953
  • DQMHistoTests: Total failures: 15879
  • DQMHistoTests: Total nulls: 22
  • DQMHistoTests: Total successes: 3924032
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 50 files compared)
  • Checked 218 log files, 188 edm output root files, 51 DQM output files
  • TriggerResults: found differences in 4 / 49 workflows

AMD_W7900 Comparison Summary

Summary:

NVIDIA_L40S Comparison Summary

Summary:

NVIDIA_T4 Comparison Summary

Summary:

@mmusich
Contributor

mmusich commented Oct 30, 2025

@Parsifal-2045 (thanks!) ran timing measurements using the HLT:75e33_timing menu (with the alpaka modifier), with and without this update, on both GPU and CPU, on a "Milan" Run3 farm-like node and on the NGT farm equipped with L40S GPUs.
You can find the results here.
No significant differences were observed.

@fwyzard
Contributor Author

fwyzard commented Oct 30, 2025

Thanks for the check 👍🏻

@smuzaffar
Contributor

please test with cms-sw/cmssw#49042 for el8_aarch64_gcc13

@@ -1,5 +1,4 @@
<tool name="@TOOL@" version="@TOOL_VERSION@" revision="2">
<use name="boost"/>
Contributor

@fwyzard, can you please update the revision here?

Contributor Author

Can you explain how it works?

I understand updating the revision if there are only changes to the XML file, but if I already update the version, isn't that enough?

In fact, wouldn't it make sense to reset the revision to 1 when one updates the version?

Contributor

@smuzaffar smuzaffar Oct 30, 2025

The revision is a kind of version of the XML file itself, so whenever you change the contents of the XML file, update the revision too.

Contributor Author

OK.

I disagree, but will comply.
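
The rule stated in this thread (bump `revision` whenever the contents of the tool XML file change) could be scripted roughly as follows. This is only an illustrative sketch: the `bump_revision` helper is hypothetical, is not part of cmsdist or cms-bot, and simply operates on a string holding the `<tool ...>` line quoted in the review.

```python
import re


def bump_revision(tool_xml: str) -> str:
    """Return tool_xml with the revision attribute of the first <tool> tag incremented by one.

    Hypothetical helper for illustration only; text without a matching
    <tool ... revision="N"> tag is returned unchanged.
    """
    # Capture everything up to the opening quote, the digits, and the closing quote.
    pattern = re.compile(r'(<tool\b[^>]*\brevision=")(\d+)(")')

    def _increment(match: re.Match) -> str:
        return f"{match.group(1)}{int(match.group(2)) + 1}{match.group(3)}"

    return pattern.sub(_increment, tool_xml, count=1)


line = '<tool name="@TOOL@" version="@TOOL_VERSION@" revision="2">'
print(bump_revision(line))
```

Applied to the line under review, the sketch changes `revision="2"` to `revision="3"` and leaves the `name` and `version` attributes untouched.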

@smuzaffar
Contributor

The maximum commit status limit has been reached for this PR, so for now I have stopped the bot from processing it. Once a new commit is pushed to this branch, I will re-enable the bot so it can keep processing this PR.

@fwyzard
Contributor Author

fwyzard commented Oct 30, 2025

rebased without changes to get a new commit

@@ -1,5 +1,4 @@
<tool name="@TOOL@" version="@TOOL_VERSION@" revision="2">
Contributor

Suggested change
- <tool name="@TOOL@" version="@TOOL_VERSION@" revision="2">
+ <tool name="@TOOL@" version="@TOOL_VERSION@" revision="3">

@smuzaffar
Contributor

@fwyzard, because of the commit status limit, the bot was not able to run the last PR tests. Please apply the suggested change https://github.com/cms-sw/cmsdist/pull/10103/files#r2477981803 (which will create a new, different commit and reset the commit statuses) so that the bot can continue processing this PR.

This version introduces C++20 compatibility and removes the dependency on Boost.

See https://github.com/alpaka-group/alpaka/releases/tag/2.0.0 for the release notes,
and https://github.com/alpaka-group/alpaka/pulls?q=milestone%3A2.0.0 for the full
list of changes in the 2.0.0 release.
@fwyzard
Contributor Author

fwyzard commented Oct 30, 2025

please test

@cmsbuild
Contributor

Pull request #10103 was updated.

@smuzaffar
Contributor

test parameters:

@smuzaffar
Contributor

please test

@smuzaffar
Contributor

please test for el8_aarch64_gcc13

@cmsbuild
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dc1302/49102/summary.html
COMMIT: 69d6a42
CMSSW: CMSSW_16_0_X_2025-10-29-2300/el8_aarch64_gcc13
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S,NVIDIA_T4
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/10103/49102/install.sh to create a dev area with all the needed externals and cmssw changes.

@cmsbuild
Contributor

-1

Failed Tests: RelVals-NVIDIA_T4
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dc1302/49101/summary.html
COMMIT: 69d6a42
CMSSW: CMSSW_16_0_X_2025-10-30-1100/el8_amd64_gcc13
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S,NVIDIA_T4
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/10103/49101/install.sh to create a dev area with all the needed externals and cmssw changes.

RelVals-NVIDIA_T4

  • 29834.704: 29834.704_TTbar_14TeV+Run4D110PU_lstOnGPUIters01TrackingOnly/step2_TTbar_14TeV+Run4D110PU_lstOnGPUIters01TrackingOnly.log

Comparison Summary

Summary:

  • You potentially added 16 lines to the logs
  • Reco comparison results: 4 differences found in the comparisons
  • DQMHistoTests: Total files compared: 51
  • DQMHistoTests: Total histograms compared: 3939953
  • DQMHistoTests: Total failures: 48
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3939885
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 50 files compared)
  • Checked 218 log files, 188 edm output root files, 51 DQM output files
  • TriggerResults: no differences found

AMD_MI300X Comparison Summary

Summary:

AMD_W7900 Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 246 differences found in the comparisons
  • DQMHistoTests: Total files compared: 11
  • DQMHistoTests: Total histograms compared: 147869
  • DQMHistoTests: Total failures: 27724
  • DQMHistoTests: Total nulls: 11
  • DQMHistoTests: Total successes: 120134
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 10 files compared)
  • Checked 42 log files, 45 edm output root files, 11 DQM output files
  • TriggerResults: found differences in 1 / 10 workflows

NVIDIA_H100 Comparison Summary

Summary:

NVIDIA_L40S Comparison Summary

Summary:

@fwyzard
Contributor Author

fwyzard commented Oct 31, 2025

The test is already failing without changes on top of the IB, see for example https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-61b8db/49103/summary.html

@fwyzard
Contributor Author

fwyzard commented Oct 31, 2025

ignore tests-rejected with ib-failure

@fwyzard
Contributor Author

fwyzard commented Oct 31, 2025

@mandrenguyen @smuzaffar can we merge this from 16.0.0-pre2 ?

@mandrenguyen

@fwyzard If @cms-sw/externals-l2 signs then I'll merge.

@iarspider
Contributor

+externals

@cmsbuild
Contributor

This pull request is fully signed and it will be integrated in one of the next IB/CMSSW_16_0_X/master IBs (test failures were overridden). This pull request will now be reviewed by the release team before it's merged. @ftenchini, @mandrenguyen, @sextonkennedy (and backports should be raised in the release meeting by the corresponding L2)
Notice This PR was tested with additional Pull Request(s), please also merge them if necessary: cms-sw/cmssw#49042

@mandrenguyen

+1

@fwyzard
Contributor Author

fwyzard commented Oct 31, 2025

Thanks !
