
Script to do release comparison #27

Open · wants to merge 60 commits into dev
Conversation

@nothingface0 nothingface0 commented Dec 5, 2024

Introduction

This PR adds a new script (cmssw_release_comparison.py) which combines the functionality of cmssw_deploy.py, the execution of the DQM clients, and compareDQMOutput.py into a single script. The script is meant to be used on Point 5 machines.

ZhengChen/Praveen's scripts were used as a reference to make this script. The main difference is that this is a single script, written in Python, that takes all its arguments from the command line and does not require modifying the compareDQMOutput.py and compareHistograms.py scripts from CMSSW.

Tested with Python 3.8. No extra dependencies are introduced.

Functionality

The script works like so:

  1. It creates two CMSSW releases (base and comp), optionally with PRs applied to them, using cmssw_deploy.py,
  2. It changes their Global Tags and builds them, then
  3. Runs the DQM clients for each of the two releases, storing their output files. If any client returns a non-zero exit code, the script raises an exception.
  4. Then, compareDQMOutput.py is executed to compare the output files of the two releases, upload the comparison results to the target DQMGUI, and generate an HTML report of the changes found, with links to the DQMGUI showing them.
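The client-execution part of step 3 can be sketched as a small orchestration loop. This is illustrative only: the function name, arguments, and log layout below are assumptions, not the script's actual internals.

```python
import subprocess
from pathlib import Path

def run_dqm_clients(client_cfgs, release_dir, log_dir, runner=subprocess.run):
    """Run each DQM client via cmsRun, writing one log file per client.

    Raises RuntimeError on the first non-zero exit code: a broken client
    makes the comparison meaningless.
    """
    log_dir = Path(log_dir)
    log_dir.mkdir(parents=True, exist_ok=True)
    for cfg in client_cfgs:
        # One log file per client, named after the config file.
        log_file = log_dir / (Path(cfg).stem + ".log")
        with open(log_file, "w") as log:
            result = runner(["cmsRun", cfg], cwd=release_dir,
                            stdout=log, stderr=subprocess.STDOUT)
        if result.returncode != 0:
            raise RuntimeError(f"Client {cfg} exited with {result.returncode}")
```

Injecting `runner` keeps the sketch testable without a real CMSSW environment.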

The aim is to make the script as user-friendly as possible, with minimal changes to compareDQMOutput.py, while still retaining high configurability.

The script allows:

  • Selecting a different CMSSW version for the base and for the comp release.
  • Applying different PRs on the base or comp CMSSW.
  • Specifying Global Tags to be used for each release.
  • Optionally re-using existing CMSSW releases.
  • Specifying which DQMGUI to upload the comparison results to (this requires [DQM] Update compareDQMOutput.py cms-sw/cmssw#46854 to be applied to the base CMSSW release, if it's < 15_0_0).

Logs generated:

  • Logs are stored for each client executed, under the logs dir of the CMSSW release they belong to.
  • Logs of the comparison script are stored in the output_dir subdirectory.

Requirements

  • A machine with cvmfs mounted, with access to CMSSW.
  • Streamer files for a run to be used as input for the DQM clients.
  • fff_dqmtools installed.

Example usage

python3 cmssw_release_comparison.py CMSSW_14_1_5_patch1 CMSSW_14_1_5_patch1 \
--pull-requests-base +46854 --pull-requests-comp +46622 \
--gt-express-base 141X_dataRun3_Express_v3 --gt-hlt-base 141X_dataRun3_HLT_v1 \
--gt-express-comp 141X_dataRun3_Express_v3 --gt-hlt-comp 141X_dataRun3_HLT_v1 \
--input-dir /eos/home-d/dpapagia/data/run384377 \
--run-key pp_run \
--output-dir /tmp/comparison_results \
--dqmgui-url http://localhost:8060/dqm/dev

In this example, the resulting comparison files are uploaded to a local DQMGUI (http://localhost:8060/dqm/dev), and the resulting HTML report will be found in /tmp/comparison_results/<datetime>_<base release>_vs_<comp release>/dqm-histo-comparison-summary.html.
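For illustration, the report path from the example above could be derived like this. The timestamp format is an assumption (the script only documents `<datetime>`), so treat this as a sketch:

```python
from datetime import datetime
from pathlib import Path

def report_path(output_dir, base_release, comp_release, now=None):
    # <datetime>_<base release>_vs_<comp release>/dqm-histo-comparison-summary.html
    now = now or datetime.now()
    subdir = f"{now:%Y%m%d_%H%M%S}_{base_release}_vs_{comp_release}"
    return Path(output_dir) / subdir / "dqm-histo-comparison-summary.html"
```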

Notes and caveats

Warning

Some clients' results are non-deterministic, and multiple runs of the same clients on the same CMSSW releases produce slightly different results.

Most of them seem to be harmless, e.g. differences between strings in the EventInfo directory.

Other examples are under the HcalCalib/LEDTask directory (TimingMean, TimingRMS) in the ROOT file created by the Hcal client. Those look harmless too, as they probably relate to timing measurements of the client, which may vary per execution depending on the load of the machine.
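When eyeballing results, a small filter over known noisy directories can help separate real regressions from this jitter. The prefix list below is only a guess based on the examples above, not something the script itself does:

```python
# Directories whose differences are expected and (probably) harmless.
# These prefixes are an assumption drawn from the observations above.
NOISY_PREFIXES = (
    "EventInfo/",
    "HcalCalib/LEDTask/",
)

def significant_differences(diff_paths):
    """Drop differences located under known non-deterministic directories."""
    return [p for p in diff_paths
            if not any(prefix in p for prefix in NOISY_PREFIXES)]
```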

Note

Several DQM clients seem to have special requirements to run and are not currently taken into account:

  • beam_dqm_sourceclient-live_cfg.py: Requires a db.key
  • beamhlt_dqm_sourceclient-live_cfg.py: Same as above.
  • beamspotdip_dqm_sourceclient-live_cfg.py: Must be run on P5 to access a DIP server
  • visualization-live_cfg.py: It doesn't seem to produce any ROOT file we can use for comparison.
  • visualization-live-secondInstance_cfg.py: Same as above
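One way to handle these (purely a sketch; the script may or may not do it this way) is a skip list keyed on the config file name:

```python
# Clients with special requirements, per the list above; skipping them is
# an illustrative workaround, not necessarily the script's behavior.
SKIP_CLIENTS = {
    "beam_dqm_sourceclient-live_cfg.py",         # requires a db.key
    "beamhlt_dqm_sourceclient-live_cfg.py",      # requires a db.key
    "beamspotdip_dqm_sourceclient-live_cfg.py",  # needs a DIP server at P5
    "visualization-live_cfg.py",                 # no ROOT output to compare
    "visualization-live-secondInstance_cfg.py",  # no ROOT output to compare
}

def runnable_clients(all_cfgs):
    """Keep only configs whose file name is not in the skip list."""
    return [c for c in all_cfgs if c.rsplit("/", 1)[-1] not in SKIP_CLIENTS]
```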

Note

cms-sw/cmssw#46854 needs to be applied to the base release if a CMSSW version < 15_0_0 is used.

Dummy workflows

compareDQMOutput.py was made to compare .root files generated from different "workflows". For this reason, it expects each .root file to be under a directory which is named after the workflow (in the format <int>.<int>_<description>).

To make the DQM clients' output compatible with compareDQMOutput.py, a "random" workflow number is assigned to each generated .root file by placing it in a dedicated directory named after the dummy workflow id. The downside is that it is not immediately apparent which "workflow" corresponds to which client, so some effort is put into replacing the workflow ids in the generated HTML post-execution.
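The dummy-workflow layout described above can be sketched as follows; the `<int>.<int>` numbering scheme and helper name are illustrative, not the script's actual code:

```python
import shutil
from pathlib import Path

def assign_dummy_workflows(root_files, dest_dir):
    """Move each .root file into a "<int>.<int>_<description>" directory.

    Returns {workflow_id: client_name}, which can later be used to swap
    the dummy ids back to client names in the generated HTML.
    """
    mapping = {}
    for i, root_file in enumerate(sorted(root_files), start=1):
        client = Path(root_file).stem
        workflow_id = f"{i}.{i}"  # dummy "<int>.<int>" id
        wf_dir = Path(dest_dir) / f"{workflow_id}_{client}"
        wf_dir.mkdir(parents=True, exist_ok=True)
        shutil.move(str(root_file), wf_dir / Path(root_file).name)
        mapping[workflow_id] = client
    return mapping
```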

For reference, the resulting HTML will look something like this:
[screenshot of the comparison HTML report]

Tested comparisons

All the comparisons were run from within the dqm-dev-vm VM.

  1. CMSSW_14_1_5_patch1 + 46854 (required) vs CMSSW_14_1_5_patch1 + 46854, i.e. the exact same release against itself. No changes were expected, yet at least 70 differences are found. The number of differences varies from execution to execution, ranging from 70 to 100; almost all come from the EventInfo directory, so at least this many differences are always expected to be present. The command was:
python3 ~/workspace/fff_dqmtools/utils/cmssw_release_comparison.py CMSSW_14_1_5_patch1 CMSSW_14_1_5_patch1 -v -w /tmp --pull-requests-comp +46854 --pull-requests-base +46854 --gt-express-base 141X_dataRun3_Express_v3 --gt-hlt-base 141X_dataRun3_HLT_v1 --gt-express-comp 141X_dataRun3_Express_v3 --gt-hlt-comp 141X_dataRun3_HLT_v1 -c  ~/workspace/fff_dqmtools/utils/cmssw_deploy.py -i /eos/home-d/dpapagia/data/run384377 --reuse-releases -k pp_run --output-dir /tmp/comparison_results --dqmgui-url http://localhost:8060/dqm/dev
  2. CMSSW_14_2_0 + 46854 (required) vs CMSSW_14_2_0 + 41932. No GTs applied, as there are none for 14_2. Command:
     python3 ~/workspace/fff_dqmtools/utils/cmssw_release_comparison.py CMSSW_14_2_0 CMSSW_14_2_0 -v -w /tmp --pull-requests-comp +41932 --pull-requests-base +46854 -c  ~/workspace/fff_dqmtools/utils/cmssw_deploy.py -i /eos/home-d/dpapagia/data/run384377 --reuse-releases -k pp_run --output-dir /tmp/comparison_results --dqmgui-url http://localhost:8060/dqm/dev

Commit messages (from the commits in this PR):

  • baby steps: get args, validate, create the releases.
  • Instead of assuming that we want to compare a PR against a base release only.
  • Also create the "upload" dir before execution. Also keep track of exit codes, to be used with the actual comparison.
  • I think it's cleaner like that. Also use pathlib instead of os for consistency.
  • ...so that the resulting files conform with compareDQMOutput.py and visDQMUpload.
  • For some extra peace of mind, in case the renaming of the output directories didn't go as planned.
  • So that multiple invocations of the comparison don't overwrite files in DQMGUI.
  • Instead of storing them in the "logs" dir under the release, create a subdir named after the common_label.
  • Mostly machete for special cases.
  • To make the comparison HTML more understandable, replace the (dummy) workflow ids with the clients' names. Also add comments describing what will probably go wrong when they run.
  • Exit early if any of the clients returns a non-zero exit code for any reason. No point in doing a comparison if the clients don't work properly.
@nothingface0 nothingface0 marked this pull request as ready for review December 19, 2024 15:04
@nothingface0 nothingface0 changed the title [WIP] Script to do release comparison Script to do release comparison Dec 19, 2024