27 Jul 18:27

a-kapoor

v1.4.1

cc8e65c

Latest

Currently supports
Binary-classification (currently using XGBoost and DNN)	Examples: DY vs ttbar, DY prompt vs DY fake, good electrons vs bad electrons
Multi-sample classification (currently using XGBoost and DNN)	Examples: DY vs (ttbar and QCD)
Multi-class classification (currently using XGBoost and DNN)	Examples: DY vs ttbar vs QCD, good photons vs bad photons vs very bad photons

Assets 2

27 Jul 12:35

a-kapoor

v1.4

90ce300

Release for general use!

Currently supports
Binary-classification (currently using XGBoost and DNN)	Examples: DY vs ttbar, DY prompt vs DY fake, good electrons vs bad electrons
Multi-sample classification (currently using XGBoost and DNN)	Examples: DY vs (ttbar and QCD)
Multi-class classification (currently using XGBoost and DNN)	Examples: DY vs ttbar vs QCD, , good photons vs bad photons

Assets 2

25 Jul 20:40

a-kapoor

v1.3

1317697

v1.3 of ID-Trainer

ID-Trainer

A simple config-based tool for high-energy-physics machine learning tasks.

Full documentation and instructions to use are available here: https://akapoorcern.github.io/ID-Trainer/

Currently supports
Binary-classification (currently using XGBoost and DNN)	Examples: DY vs ttbar, DY prompt vs DY fake, good electrons vs bad electrons
Multi-sample classification (currently using XGBoost and DNN)	Examples: DY vs (ttbar and QCD)
Multi-class classification (currently using XGBoost and DNN)	Examples: DY vs ttbar vs QCD, , good photons vs bad photons

Salient features:
Parallel reading of root files (using DASK)
Runs on flat ntuples (even NanoAODs) out of the box
Adding multiple MVAs is very trivial (Subject to available computing power)
Cross-section and pt-eta reweighting can be handled together
Multi-Sample training possible
Multi-Class training possible
Ability to customize thresholds

What will be the output of the trainer:
Feature distributions
Statistics in training and testing
ROCs, loss plots, MVA scores
Confusion Matrices
Correlation plots
Trained models (h5/pb for DNN / pkl for XGBoost)

Optional outputs

Threshold values of scores for chosen working points
Efficiency vs pT and Efficiency vs eta plots for all classes
Reweighting plots for pT and eta
Comparison of new ID performance with benchmark ID flags

Assets 2

25 Jul 12:24

a-kapoor

v1.2

197bcfb

A simple config-based tool for machine learning tasks

ID-Trainer

A simple config-based tool for high-energy-physics machine learning tasks.

Full documentation and instructions to use are available here: https://akapoorcern.github.io/ID-Trainer/

Currently supports
Binary-classification (currently using XGBoost and DNN)	Examples: DY vs ttbar, DY prompt vs DY fake, good electrons vs bad electrons
Multi-sample classification (currently using XGBoost and DNN)	Examples: DY vs (ttbar and QCD)
Multi-class classification (currently using XGBoost and DNN)	Examples: DY vs ttbar vs QCD, , good photons vs bad photons

Salient features:
Parallel reading of root files (using DASK)
Runs on flat ntuples (even NanoAODs) out of the box
Adding multiple MVAs is very trivial (Subject to available computing power)
Cross-section and pt-eta reweighting can be handled together
Multi-Sample training possible
Multi-Class training possible
Ability to customize thresholds

What will be the output of the trainer:
Feature distributions
Statistics in training and testing
ROCs, loss plots, MVA scores
Confusion Matrices
Correlation plots
Trained models (h5 for DNN / pkl for XGBoost)

Optional outputs

Threshold values of scores for chosen working points
Efficiency vs pT and Efficiency vs eta plots for all classes
Reweighting plots for pT and eta
Comparison of new ID performance with benchmark ID flags

Assets 2

19 Apr 21:17

a-kapoor

v1.1

e652bb8

Releasing v1.1 of EGamma ID-Trainer Pre-release

Pre-release

###############Do not use v1.1 anymore############## Depreciated#################

Clone

git clone --branch v1.1 https://github.com/cms-egamma/ID-Trainer.git

Setup

source /cvmfs/sft.cern.ch/lcg/views/LCG_97python3/x86_64-centos7-gcc8-opt/setup.sh

Create a new config (Just copy the default one and start editing on top of it)

cp Tools/TrainConfig.py MyTrainConfig.py

More information on how to edit the config is in the attached pdf.

All you need to do is to edit the NewTrainConfig.py with the settings for your analysis and then run

python Trainer.py MyTrainConfig

The Trainer.py will read the settings from the config file and run training

Suggestion: Do not remove or touch the original Tools/TrainConfig.py (Keep it for reference)

Assets 3

14 Apr 20:42

a-kapoor

d22da64

Releasing v1 of EGamma ID-Trainer Pre-release

Pre-release

Clone

git clone --branch v1 https://github.com/cms-egamma/ID-Trainer.git

Setup

source /cvmfs/sft.cern.ch/lcg/views/LCG_97python3/x86_64-centos7-gcc8-opt/setup.sh

Create a new config (Just copy the default one and start editing on top of it)

cp Tools/TrainConfig.py Tools/NewTrainConfig.py

All you need to do is to edit the NewTrainConfig.py with the settings for your analysis and then run

python Trainer.py Tools/NewTrainConfig

The Trainer.py will read the settings from the config file and run training

Suggestion: Do not remove or touch the original Tools/TrainConfig.py (Keep it for reference)

More information in the attached pdf.

Assets 3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ID-Trainer

Optional outputs

ID-Trainer

Optional outputs

Clone

Setup

Create a new config (Just copy the default one and start editing on top of it)

More information on how to edit the config is in the attached pdf.

All you need to do is to edit the NewTrainConfig.py with the settings for your analysis and then run

The Trainer.py will read the settings from the config file and run training

Suggestion: Do not remove or touch the original Tools/TrainConfig.py (Keep it for reference)

Clone

Setup

Create a new config (Just copy the default one and start editing on top of it)

All you need to do is to edit the NewTrainConfig.py with the settings for your analysis and then run

The Trainer.py will read the settings from the config file and run training

Suggestion: Do not remove or touch the original Tools/TrainConfig.py (Keep it for reference)

Releases: cms-egamma/ID-Trainer

Release for general use! v1.4.1

Release for general use!

v1.3 of ID-Trainer

ID-Trainer

Optional outputs

A simple config-based tool for machine learning tasks

ID-Trainer

Optional outputs

Releasing v1.1 of EGamma ID-Trainer

Clone

Setup

Create a new config (Just copy the default one and start editing on top of it)

More information on how to edit the config is in the attached pdf.

All you need to do is to edit the NewTrainConfig.py with the settings for your analysis and then run

The Trainer.py will read the settings from the config file and run training

Suggestion: Do not remove or touch the original Tools/TrainConfig.py (Keep it for reference)

Releasing v1 of EGamma ID-Trainer

Clone

Setup

Create a new config (Just copy the default one and start editing on top of it)

All you need to do is to edit the NewTrainConfig.py with the settings for your analysis and then run

The Trainer.py will read the settings from the config file and run training

Suggestion: Do not remove or touch the original Tools/TrainConfig.py (Keep it for reference)