-
Notifications
You must be signed in to change notification settings - Fork 18
Analysis checkout scripts
Checkout the recommended version of CMSSW and all packages that are needed for compiling and executing the analysis. The script also compiles everything properly.
It is recommended to first create a suitable directory and change to this directory before you checkout CMSSW in this directory. This should be a place where you can collect more versions for future versions of the analysis.
Before starting the checkout you might want to start a ssh-agent, so you do not have to type in your private key password repeatedly:
eval `ssh-agent -s` && ssh-add
Since switching CMSSW 102X, the analysis set up has to checked out inside a singularity shell, otherwise it will not work on our T2 (this might change with the upgrade of the T2 nodes from SLC 6 to 7).
wget https://raw.githubusercontent.com/cms-analysis/HiggsAnalysis-KITHiggsToTauTau/dictchanges_CMSSW102/scripts/checkout_packages.sh
source checkout_packages.sh
wget https://raw.githubusercontent.com/cms-analysis/HiggsAnalysis-KITHiggsToTauTau/master/scripts/checkout_packages.sh
source checkout_packages.sh
wget https://raw.githubusercontent.com/cms-analysis/HiggsAnalysis-KITHiggsToTauTau/dictchanges_CMSSW94/scripts/checkout_packages.sh
source checkout_packages.sh
This recipe exits and logs out from the current shell if an error occurs. If this is not wanted, remove this line.
The script requires some user input to facilitate the checkout of different CMSSW versions. First of all, you need to enter which CMSSW release you want to use. Currently we build our analysis on top of 10220 (and previously 747, 810 and 942). Enter the corresponding number according to your needs. After that you will be asked to enter the CombineHarvester developing branch you will be working on. If your analysis is based on the
- SM Htt analysis -> enter 'SM2016-dev'
- CP initial state Htt analysis -> enter 'SMCP2016-dev'
- CP final state Htt analysis -> enter 'HTTCPDecays18-dev'
- in any other case -> enter 'master'.
The script takes care that a working combination of CMSSW and CH is checked out so if you receive an error message you enter a not supported combination. Please check your settings again and do the source the checkout script again.
In every new shell, CMSSW needs to be setup up and also some settings for the analysis need to be declared.
export VO_CMS_SW_DIR=/cvmfs/cms.cern.ch
source $VO_CMS_SW_DIR/cmsset_default.sh
cd <path/to/CMSSW_X_Y_Z>/src/
cmsenv
source ${CMSSW_BASE}/src/HiggsAnalysis/KITHiggsToTauTau/scripts/ini_KITHiggsToTauTauAnalysis.sh
It is recommended to put this into a dedicated function/alias in the ~/.bashrc
or ~/.bash_profile
file.
Have a look at the "Running Artus" tutorial here.
Have a look at the "SVFit tools" tutorial here.
- Classifications:
tmvaClassification.py -h
- tmvaWrapper (recommended):
tmvaWrapper.py -h
- -i and -o are mandatory and you will be reminded if one is missing
- -S and -n are not mandatory but one of them must be specified for a proper training
- -S is the SplitValue: if this is specified you will do a regular training with a splitting of your sample defined by this variable
- -n is the number of N-Fold: with this you will do a N-Fold training and splitting is done accordingly
- --modify and --modificatioin: those parameters give you access to predefined training routines, those are defined below the argparse part
- use those parameters to use already predefined sequences or define your own in the code.
- You should write all trainings that belong together in one folder!
- Overtraining:
plot_overtraining.py -h
- ConfigWriter:
mvaConfigWriter.py -h
- store all trainings that belong together in the same folder
- use the Producer MVATestMethodsProducer to run all of your trainings with Artus, the config for this is contained in the output of ConfigWriter
- Correlations
- Producer:
correlation_SampleProducer.py -h
- Collector:
correlation_SampleCollector.py -h
- use the Producer to calculate correlations for an Artus run, there must be a merged folder within the directory of the run
- use the Collector to add Correlations of different channels and plot them
- Producer: if run for the first time you have to use -P to produce a set of rootfiles, that are the base for correlation calculation, as of now the options for BDT bin correlations and multiprocess are there but not implemented
- Collector: Per default the collector never adds MC Sample correlations and data correlations but you can add all MC Samples and compare it to data
- Producer:
- ArtusRunFilter:
MinimalPlotlevelFilter.h
-general instructions for the setup are in the source code - PostRunFilter:
reduce_mergedFiles.py -h
-this filter reduces the merged output from an Artus run to a reduced set, which is enough for plotting etc.