Skip to content

Simplify Data Workflows in Tests (No JSON Check)#49020

Closed
AdrianoDee wants to merge 1 commit intocms-sw:masterfrom
AdrianoDee:nogolden_jenkins
Closed

Simplify Data Workflows in Tests (No JSON Check)#49020
AdrianoDee wants to merge 1 commit intocms-sw:masterfrom
AdrianoDee:nogolden_jenkins

Conversation

@AdrianoDee
Copy link
Copy Markdown
Contributor

PR description:

Lately (with #48490), spurious DAS errors appear frequently in the tests (see e.g. #49014 (comment)). This is due to random glitches in accessing https://cms-service-dqmdc.web.cern.ch/CAF/certification/ when running the tests (no access to EOS is available). This results in not finding any certification JSON files to use for skimming the datasets. One possibility could be to allow for a few retries in case of failures, but I find it anyway risky.

So this PR proposes simply to skip the golden JSON check when running the tests (so when JENKINS_PREFIX is defined).

I'm opening this as a draft just until the files are copied to CERN.

e286032666c249b4a4e76dadba0a3f68  pdmv_group  cms:/store/data/Run2022E/BTagMu/RAW/v1/000/359/037/00000/b101e8c5-4d7f-4853-896d-77967dff5ce6.root                                                                                      REPLICATING[0/1/0]      T2_CH_CERN        1         1.319 MB                     2025-09-29 09:19:12
d10878e0f790434c8ca656337a888d5e  pdmv_group  cms:/store/data/Run2023D/JetMET0/RAW/v1/000/369/869/00000/cf59ba02-cc61-4768-885f-b6b45baccfe2.root                                                                                     REPLICATING[0/1/0]      T2_CH_CERN        1         6.474 GB                     2025-09-29 10:06:22
09d38828b9b9419497e769ae5f9695b3  pdmv_group  cms:/store/data/Run2025B/ZeroBias/RAW/v1/000/391/531/00000/31f4860a-232d-43c0-aa51-e805866b0677.root                                                                                    REPLICATING[0/1/0]      T2_CH_CERN        1         70.210 MB                    2025-09-29 09:19:30
a289c7e9ddb247788c555d2c3222e47b  pdmv_group  cms:/store/data/Run2024C/JetMET0/RAW/v1/000/379/415/00000/41e6df86-f198-4b44-886e-5d68a8e8df1b.root                                                                                     REPLICATING[0/1/0]      T2_CH_CERN        1         6.010 GB                     2025-09-29 10:06:27
c62a2dc81ce1469ea7dda7c701c1a2b3  pdmv_group  cms:/store/data/Run2025C/JetMET0/RAW/v1/000/392/174/00000/f9fa84d2-905c-4ed2-8821-18a89675e4eb.root                                                                                     OK[1/0/0]               T2_CH_CERN        1         12.061 MB                    2025-09-29 09:19:32
699477364c664fcfa7a1389dc848024d  pdmv_group  cms:/store/data/Run2024E/DisplacedJet/RAW/v1/000/380/963/00000/7690f63b-c907-45b3-9ec9-fc5c9ad7c1ac.root                                                                                REPLICATING[0/1/0]      T2_CH_CERN        1         3.836 GB                     2025-09-29 10:06:36
bdb988b8473d4f19af41b44194b4fbb6  pdmv_group  cms:/store/data/Run2023D/JetMET0/RAW/v1/000/369/844/00000/333b74fc-a093-47dd-9d6c-a2373c690134.root                                                                                     REPLICATING[0/1/0]      T2_CH_CERN        1         1.544 MB                     2025-09-29 09:19:14
6db1bc89296a4ea4a9e4c587d7ec9c79  pdmv_group  cms:/store/data/Run2024E/DisplacedJet/RAW/v1/000/380/956/00000/140c01c0-1725-476e-adaa-37bbbde6a09d.root                                                                                REPLICATING[0/1/0]      T2_CH_CERN        1         1.343 MB                     2025-09-29 09:19:22
13a8ad84efec452495b7a9a6415a8d29  pdmv_group  cms:/store/data/Run2024I/Tau/RAW/v1/000/386/446/00000/b6577323-4ae5-45e7-b9d8-7f97fbc0cc15.root                                                                                         OK[1/0/0]               T2_CH_CERN        1         1.377 MB                     2025-09-29 09:19:28
6e3347c899894f44a53c5bb75244325e  pdmv_group  cms:/store/data/Run2024F/ParkingDoubleMuonLowMass0/RAW/v1/000/382/213/00000/13227a75-4eca-4f1a-8734-5c1e6bee9a82.root                                                                   OK[1/0/0]               T2_CH_CERN        1         3.201 GB                     2025-09-29 10:06:39
c2a08fcbcdff42c2b45df6c35c9314ea  pdmv_group  cms:/store/data/Run2024C/JetMET0/RAW/v1/000/379/413/00000/2c0fee08-8977-4e2a-b369-bfa7101aa2c8.root                                                                                     REPLICATING[0/1/0]      T2_CH_CERN        1         1.317 MB                     2025-09-29 09:19:19
779124754d9b42c2925605a9a77ffa1a  pdmv_group  cms:/store/data/Run2024B/ZeroBias/RAW/v1/000/378/981/00000/a27bb954-1259-470c-8874-4d94ff5df34a.root                                                                                    REPLICATING[0/1/0]      T2_CH_CERN        1         3.325 GB                     2025-09-29 09:19:17
520cf8dd550e4f5689113e5a1f6c485b  pdmv_group  cms:/store/data/Run2024F/ParkingDoubleMuonLowMass0/RAW/v1/000/382/209/00000/fa5a6a3c-6db8-42f7-baab-df1965111988.root                                                                   OK[1/0/0]               T2_CH_CERN        1         6.443 MB                     2025-09-29 09:19:24

PR validation:

Running the standard tests.

@cmsbuild
Copy link
Copy Markdown
Contributor

cmsbuild commented Sep 29, 2025

cms-bot internal usage

@cmsbuild
Copy link
Copy Markdown
Contributor

@AdrianoDee AdrianoDee marked this pull request as ready for review September 30, 2025 09:38
@AdrianoDee
Copy link
Copy Markdown
Contributor Author

please test

@cmsbuild
Copy link
Copy Markdown
Contributor

A new Pull Request was created by @AdrianoDee for master.

It involves the following packages:

  • Configuration/PyReleaseValidation (upgrade, pdmv)

@AdrianoDee, @DickyChant, @Moanwar, @antoniovagnerini, @miquork, @srimanob, @subirsarkar can you please review it and eventually sign? Thanks.
@Martin-Grunewald, @fabiocos, @makortel, @slomeo this is something you requested to watch as well.
@ftenchini, @mandrenguyen, @sextonkennedy you are the release manager for this.

cms-bot commands are listed here

@cmsbuild
Copy link
Copy Markdown
Contributor

-1

Failed Tests: RelVals-INPUT
Size: This PR adds an extra 16KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-35e700/48372/summary.html
COMMIT: dda7573
CMSSW: CMSSW_16_0_X_2025-09-29-2300/el8_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/49020/48372/install.sh to create a dev area with all the needed externals and cmssw changes.

RelVals-INPUT

  • 2023.00000012023.0000001_RunZeroBias2023B_10k/step2_RunZeroBias2023B_10k.log
  • 2022.00100012022.0010001_RunJetHT2022C_10k/step2_RunJetHT2022C_10k.log
  • 2022.00000012022.0000001_RunZeroBias2022B_10k/step2_RunZeroBias2022B_10k.log

Comparison Summary

The workflows 2025.0010001, 2025.0000001, 2024.0040001, 2024.0030001, 2024.0020001, 2024.0010001, 2024.0000001, 2023.0020001 have different files in step1_dasquery.log than the ones found in the baseline. You may want to check and retrigger the tests if necessary. You can check it in the "files" directory in the results of the comparisons

Summary:

  • You potentially added 1691 lines to the logs
  • ROOTFileChecks: Some differences in event products or their sizes found
  • Reco comparison results: 18726 differences found in the comparisons
  • DQMHistoTests: Total files compared: 51
  • DQMHistoTests: Total histograms compared: 3503512
  • DQMHistoTests: Total failures: 8536
  • DQMHistoTests: Total nulls: 46
  • DQMHistoTests: Total successes: 3494910
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 40.6 KiB( 44 files compared)
  • DQMHistoSizes: changed ( 2024.0020001 ): 2.363 KiB Hcal/DigiRunHarvesting
  • DQMHistoSizes: changed ( 2024.0020001 ): 2.320 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 2024.0020001 ): 0.059 KiB RPC/DCSInfo
  • DQMHistoSizes: changed ( 2024.0020001 ): 0.020 KiB JetMET/SUSYDQM
  • DQMHistoSizes: changed ( 2024.0030001 ): 27.582 KiB PixelPhase1/Tracks
  • DQMHistoSizes: changed ( 2024.0030001 ): 6.617 KiB Hcal/DigiRunHarvesting
  • DQMHistoSizes: changed ( 2024.0030001 ): 1.564 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 2024.0030001 ): 0.164 KiB RPC/DCSInfo
  • DQMHistoSizes: changed ( 2024.0030001 ): -0.090 KiB JetMET/SUSYDQM
  • Checked 218 log files, 188 edm output root files, 51 DQM output files
  • TriggerResults: no differences found

@AdrianoDee
Copy link
Copy Markdown
Contributor Author

7e73ab20d08f4b7daeed642eee78ce1f  pdmv_group  cms:/store/data/Run2023B/ZeroBias/RAW/v1/000/366/403/00000/a68ba598-9216-4e0a-81dd-e9b3f2a6a7df.root                                                                                    REPLICATING[0/1/0]      T2_CH_CERN        1         4.031 GB                     2025-09-30 12:26:14
b36619eea268434d8e3cb2d6b63d155e  pdmv_group  cms:/store/data/Run2022B/ZeroBias/RAW/v1/000/355/094/00000/d47379ec-8b37-491d-9772-c320dc85eb09.root                                                                                    REPLICATING[0/1/0]      T2_CH_CERN        1         27.066 MB                    2025-09-30 12:25:04
3c967bf8a1604219bb89ac0d4878ae52  pdmv_group  cms:/store/data/Run2022C/JetHT/RAW/v1/000/355/862/00000/e712efc3-ba98-484a-91c0-c567e2c8b336.root                                                                                       REPLICATING[0/1/0]      T2_CH_CERN        1         1.676 GB                     2025-09-30 12:25:49

@AdrianoDee
Copy link
Copy Markdown
Contributor Author

please test

@cmsbuild
Copy link
Copy Markdown
Contributor

cmsbuild commented Oct 1, 2025

-1

Failed Tests: RelVals-INPUT
Size: This PR adds an extra 16KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-35e700/48393/summary.html
COMMIT: dda7573
CMSSW: CMSSW_16_0_X_2025-09-30-2300/el8_amd64_gcc13
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/49020/48393/install.sh to create a dev area with all the needed externals and cmssw changes.

RelVals-INPUT

  • 2022.00000012022.0000001_RunZeroBias2022B_10k/step2_RunZeroBias2022B_10k.log

Comparison Summary

The workflows 2025.0010001, 2025.0000001, 2024.0070001, 2024.0050001, 2024.0040001, 2024.0030001, 2024.0020001, 2024.0010001, 2024.0000001, 2023.0020001 have different files in step1_dasquery.log than the ones found in the baseline. You may want to check and retrigger the tests if necessary. You can check it in the "files" directory in the results of the comparisons

Summary:

  • You potentially added 1584 lines to the logs
  • ROOTFileChecks: Some differences in event products or their sizes found
  • Reco comparison results: 28125 differences found in the comparisons
  • DQMHistoTests: Total files compared: 51
  • DQMHistoTests: Total histograms compared: 3504568
  • DQMHistoTests: Total failures: 40275
  • DQMHistoTests: Total nulls: 114
  • DQMHistoTests: Total successes: 3464159
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 77.366 KiB( 44 files compared)
  • DQMHistoSizes: changed ( 2024.0020001 ): 2.363 KiB Hcal/DigiRunHarvesting
  • DQMHistoSizes: changed ( 2024.0020001 ): 2.320 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 2024.0020001 ): 0.059 KiB RPC/DCSInfo
  • DQMHistoSizes: changed ( 2024.0020001 ): 0.020 KiB JetMET/SUSYDQM
  • DQMHistoSizes: changed ( 2024.0030001,... ): 27.582 KiB PixelPhase1/Tracks
  • DQMHistoSizes: changed ( 2024.0030001 ): 6.617 KiB Hcal/DigiRunHarvesting
  • DQMHistoSizes: changed ( 2024.0030001 ): 1.564 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 2024.0030001 ): 0.164 KiB RPC/DCSInfo
  • DQMHistoSizes: changed ( 2024.0030001 ): -0.090 KiB JetMET/SUSYDQM
  • DQMHistoSizes: changed ( 2024.0050001 ): 7.090 KiB Hcal/DigiRunHarvesting
  • DQMHistoSizes: changed ( 2024.0050001 ): ...
  • Checked 218 log files, 188 edm output root files, 51 DQM output files
  • TriggerResults: no differences found

@AdrianoDee
Copy link
Copy Markdown
Contributor Author

please test
(should be finally all staged)

@gabrielmscampos
Copy link
Copy Markdown
Member

The golden JSONs are also available through CVMFS: /cvmfs/cms-griddata.cern.ch/cat/metadata/DC/Collisions<year>/latest

If the bot has access to it, this could be a good retry option in case the EOS connection fails.

@AdrianoDee
Copy link
Copy Markdown
Contributor Author

AdrianoDee commented Oct 2, 2025

The golden JSONs are also available through CVMFS: /cvmfs/cms-griddata.cern.ch/cat/metadata/DC/Collisions<year>/latest

If the bot has access to it, this could be a good retry option in case the EOS connection fails.

Right! Nice, I think it does (@smuzaffar?). I would leave this fix to go in and then push the extra check on cvmfs.

@AdrianoDee
Copy link
Copy Markdown
Contributor Author

Also: are the tests stuck?

@cmsbuild
Copy link
Copy Markdown
Contributor

cmsbuild commented Oct 2, 2025

+1

Size: This PR adds an extra 16KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-35e700/48413/summary.html
COMMIT: dda7573
CMSSW: CMSSW_16_0_X_2025-10-01-2300/el8_amd64_gcc13
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/49020/48413/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

The workflows 2025.0010001, 2025.0000001, 2024.0070001, 2024.0050001, 2024.0040001, 2024.0030001, 2024.0020001, 2024.0010001, 2024.0000001, 2023.0020001 have different files in step1_dasquery.log than the ones found in the baseline. You may want to check and retrigger the tests if necessary. You can check it in the "files" directory in the results of the comparisons

Summary:

  • You potentially added 1582 lines to the logs
  • ROOTFileChecks: Some differences in event products or their sizes found
  • Reco comparison results: 28119 differences found in the comparisons
  • DQMHistoTests: Total files compared: 51
  • DQMHistoTests: Total histograms compared: 3504568
  • DQMHistoTests: Total failures: 40299
  • DQMHistoTests: Total nulls: 114
  • DQMHistoTests: Total successes: 3464135
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 77.366 KiB( 44 files compared)
  • DQMHistoSizes: changed ( 2024.0020001 ): 2.363 KiB Hcal/DigiRunHarvesting
  • DQMHistoSizes: changed ( 2024.0020001 ): 2.320 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 2024.0020001 ): 0.059 KiB RPC/DCSInfo
  • DQMHistoSizes: changed ( 2024.0020001 ): 0.020 KiB JetMET/SUSYDQM
  • DQMHistoSizes: changed ( 2024.0030001,... ): 27.582 KiB PixelPhase1/Tracks
  • DQMHistoSizes: changed ( 2024.0030001 ): 6.617 KiB Hcal/DigiRunHarvesting
  • DQMHistoSizes: changed ( 2024.0030001 ): 1.564 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 2024.0030001 ): 0.164 KiB RPC/DCSInfo
  • DQMHistoSizes: changed ( 2024.0030001 ): -0.090 KiB JetMET/SUSYDQM
  • DQMHistoSizes: changed ( 2024.0050001 ): 7.090 KiB Hcal/DigiRunHarvesting
  • DQMHistoSizes: changed ( 2024.0050001 ): ...
  • Checked 218 log files, 188 edm output root files, 51 DQM output files
  • TriggerResults: no differences found

@AdrianoDee
Copy link
Copy Markdown
Contributor Author

+pdmv

@cmsbuild
Copy link
Copy Markdown
Contributor

cmsbuild commented Oct 2, 2025

REMINDER @sextonkennedy, @mandrenguyen, @ftenchini: This PR was tested with #48999, please check if they should be merged together

@AdrianoDee
Copy link
Copy Markdown
Contributor Author

hold
(may be superseded by #49051)

@cmsbuild
Copy link
Copy Markdown
Contributor

cmsbuild commented Oct 2, 2025

Pull request has been put on hold by @AdrianoDee
They need to issue an unhold command to remove the hold state or L1 can unhold it for all

@cmsbuild cmsbuild added the hold label Oct 2, 2025
@AdrianoDee AdrianoDee closed this Oct 3, 2025
@AdrianoDee
Copy link
Copy Markdown
Contributor Author

Closed in favor of #49051

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants