Jet Flavour Tagging

Version	Summary	Created by	Modification	Content Size	Created at
1		Antimo Cagnotta	--	1025	2022-11-04 10:28:57
2	format correct	Catherine Yang	+ 34 word(s)	1059	2022-11-09 02:53:28
3	format correct	Catherine Yang	Meta information modification	1059	2022-11-09 02:53:55

This entry is adapted from the peer-reviewed paper 10.3390/app122010574

Jet flavour tagging covers the main algorithms used to identify heavy-flavour jets, while jet substructure and deep tagging focus on identifying heavy-particle decays in boosted jets. These so-called tagger algorithms play an important role in physics studies: they allow researchers to reconstruct and identify the particle that originated a jet and, in some cases, make possible analyses that would otherwise be unfeasible.

Keywords: machine learning; jet tagging; particle physics

1. The CSVv2 Tagger

Heavy-flavour jet tagging relies on the properties of the heavy hadrons in the jets. The CSVv2 algorithm is based on the CSV algorithm; however, displaced-track information is combined with the related secondary-vertex information as input to a multivariate analysis. A feed-forward multilayer perceptron with one hidden layer is trained to tag b jets. The jet $p_{T}$ and $\eta$ distributions are reweighted so that all jet flavours have the same spectrum in the training, thereby avoiding discrimination based on these variables, which would introduce a dependence on the sample used. Three jet categories are defined according to the number and type of reconstructed secondary vertices: RecoVertex, PseudoVertex, and NoVertex. The discriminator values of the three categories are combined with a likelihood ratio that takes into account the jet-flavour fractions derived from a sample of top quark–antiquark ($t\bar{t}$) events. Moreover, two separate trainings are performed, one with c jets and one with light jets as the background. The final discriminator value is the weighted average of the two training outputs, with a relative weight of 1:3 for the c-jet to light-jet trainings. By default, the CSVv2 algorithm uses vertices reconstructed with the Inclusive Vertex Finder (IVF) algorithm, but it has also been studied with Adaptive Vertex Reconstruction (AVR), in which case it is referred to as CSVv2 (AVR). Figure 1 shows the output of the two versions of the CSVv2 algorithm.

Figure 1. Distribution of the CSVv2 discriminant for jets of different flavours in $t\bar{t}$ events: the output for the version with (a) IVF reconstruction and with (b) AVR reconstruction. The distributions are normalised to unit area. Jets without a selected track and secondary vertex are assigned a negative discriminator value. The first bin includes the underflow entries ^[1].
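The final combination step described in this section amounts to a weighted average of two numbers. A minimal sketch, assuming each training returns one discriminator value per jet; the function and argument names are illustrative and are not taken from the CMS software:

```python
def combine_csvv2(d_vs_c, d_vs_light, w_c=1.0, w_light=3.0):
    """Weighted average of the two training outputs.

    d_vs_c:     output of the training with c jets as background
    d_vs_light: output of the training with light jets as background
    The default weights encode the 1:3 ratio quoted in the text.
    """
    return (w_c * d_vs_c + w_light * d_vs_light) / (w_c + w_light)


# Hypothetical outputs of the two trainings for a single jet.
combined = combine_csvv2(d_vs_c=0.8, d_vs_light=0.4)
print(combined)
```

With the 1:3 ratio, the training against light jets contributes three quarters of the final value.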

2. The DeepCSV Tagger

The DeepCSV algorithm was developed to improve on the CSVv2 b-tagger by using a Deep Neural Network (DNN) with more hidden layers and more nodes per layer. The input combines the IVF secondary vertices with the variables of up to the first six tracks, taking all jet flavours and vertex categories into account. Variable preprocessing is used to speed up training: it centres the input distributions around zero and scales them to a root mean square of one. The jets used in training have $p_{T}$ between 20 GeV and 1 TeV and lie within the tracker acceptance; the preprocessed jet $p_{T}$ and $\eta$ are also used as input.
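The preprocessing described above (zero mean, unit root mean square) can be sketched as follows. This is a minimal illustration and not the CMS implementation; the function name and the sample values are hypothetical.

```python
import math


def standardise(values):
    """Shift a list of numbers to zero mean and scale it to unit RMS."""
    n = len(values)
    mean = sum(values) / n
    rms = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    if rms == 0.0:
        # Constant input: there is no spread to scale.
        return [0.0 for _ in values]
    return [(v - mean) / rms for v in values]


track_pt = [0.9, 1.4, 2.3, 5.0, 12.1]  # hypothetical track pT values in GeV
scaled = standardise(track_pt)
print(scaled)
```

In a real training, the mean and RMS would presumably be computed once on the training sample and then applied unchanged to every other sample, so that all inputs are transformed in the same way.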

The neural network, developed with Keras ^[2], uses four fully connected hidden layers of 100 nodes each. Each node uses a rectified linear unit as its activation function, with the exception of the last layer, whose output is a normalised exponential (softmax) function interpreted as the probability $P(f)$ that the jet has flavour $f$. Five jet categories, corresponding to the nodes in the output layer, are defined: jets with exactly one b hadron, jets with at least two b hadrons, jets with exactly one c hadron and no b hadron, jets with at least two c hadrons and no b hadron, and other jets. Figure 2 shows the DeepCSV probability $P(f)$ distributions.

Figure 2. Discriminator distributions of (a) DeepCSV $P(b)$, (b) DeepCSV $P(b\bar{b})$, (c) DeepCSV $P(c)$, (d) DeepCSV $P(c\bar{c})$, (e) DeepCSV $P(udsg)$, and (f) DeepCSV $P(b) + P(b\bar{b})$ ^[1].
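The network shape described in this section (four hidden layers of 100 rectified linear units, followed by a five-node softmax output) can be sketched without any deep-learning library. The forward pass below is illustrative only: the number of inputs (20), the weight initialisation, and all function names are assumptions, and the weights are random rather than trained.

```python
import math
import random

CLASSES = ["b", "bb", "c", "cc", "udsg"]


def relu(x):
    return [max(0.0, v) for v in x]


def softmax(x):
    # Subtract the maximum for numerical stability.
    m = max(x)
    exps = [math.exp(v - m) for v in x]
    total = sum(exps)
    return [e / total for e in exps]


def dense(x, weights, biases):
    """Fully connected layer: one weight row and one bias per output node."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, biases)]


def init_layer(n_in, n_out, rng):
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def build_network(n_inputs, rng, n_hidden=4, width=100, n_classes=5):
    sizes = [n_inputs] + [width] * n_hidden + [n_classes]
    return [init_layer(a, b, rng) for a, b in zip(sizes[:-1], sizes[1:])]


def forward(network, x):
    # ReLU on every hidden layer, softmax on the output layer.
    for weights, biases in network[:-1]:
        x = relu(dense(x, weights, biases))
    weights, biases = network[-1]
    return softmax(dense(x, weights, biases))


rng = random.Random(0)
net = build_network(n_inputs=20, rng=rng)  # 20 inputs is a placeholder
probs = forward(net, [rng.gauss(0.0, 1.0) for _ in range(20)])
print(dict(zip(CLASSES, probs)))
```

Because the last layer is a softmax, the five outputs are non-negative and sum to one, which is what allows them to be read as the probabilities $P(f)$.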

The DeepCSV tagger is also used for c tagging by combining the probabilities of the five categories. In particular, the DeepCSVCvsB discriminant is used to discriminate c jets from b jets and is defined as:

$$\mathrm{DeepCSVCvsB} = \frac{P(c) + P(c\bar{c})}{1 - P(udsg)},$$

where $1 - P(udsg)$ is the probability of identifying a b or c jet. In the same way, DeepCSVCvsL is defined to discriminate c jets from light jets:

$$\mathrm{DeepCSVCvsL} = \frac{P(c) + P(c\bar{c})}{1 - (P(b) + P(b\bar{b}))},$$

where the denominator is the probability of identifying a c jet or a light jet.
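The two c-tagging discriminants can be computed directly from the five output probabilities. The sketch below is a hedged illustration: the dictionary keys, the function name, and the fallback value of -1.0 for a vanishing denominator are assumptions made for this example.

```python
def deepcsv_c_discriminants(p):
    """Return (CvsB, CvsL) from a dict with keys 'b', 'bb', 'c', 'cc', 'udsg'."""
    p_c = p["c"] + p["cc"]
    p_b = p["b"] + p["bb"]
    denom_b = 1.0 - p["udsg"]   # probability of a b or c jet
    denom_l = 1.0 - p_b         # probability of a c or light jet
    # Assumed convention: return -1.0 when the denominator vanishes.
    c_vs_b = p_c / denom_b if denom_b > 0.0 else -1.0
    c_vs_l = p_c / denom_l if denom_l > 0.0 else -1.0
    return c_vs_b, c_vs_l


# Hypothetical network output for one jet (the values sum to one).
probs = {"b": 0.10, "bb": 0.05, "c": 0.50, "cc": 0.10, "udsg": 0.25}
c_vs_b, c_vs_l = deepcsv_c_discriminants(probs)
print(c_vs_b, c_vs_l)
```

For this jet, CvsB is 0.6 / 0.75 = 0.8 and CvsL is 0.6 / 0.85, or about 0.71, so the jet looks c-like against both backgrounds.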

3. The DeepJet Tagger

Recently, a new network architecture was developed: the DeepJet tagger ^[3]. Unlike the CSVv2 and DeepCSV taggers, this architecture examines all jet constituents simultaneously. The DeepJet algorithm uses a large number of input variables that can be categorised into four groups: global variables (jet kinematics, the number of tracks in the jet, etc.), charged particle-flow (PF) candidates, neutral PF candidates, and variables of the secondary vertices (SVs) associated with the jet. For the same reasons as in CSVv2, the jet $p_{T}$ and $\eta$ distributions are reweighted during data preprocessing to avoid discrimination tied to the kinematic domain of the training sample.
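One simple way to realise such a kinematic reweighting is to bin the jets in $p_{T}$ and $|\eta|$ and weight each jet by the ratio of a reference flavour's count to its own flavour's count in the same bin. The sketch below assumes this scheme; the bin edges, the choice of b jets as the reference, and all names are hypothetical and not taken from the DeepJet code.

```python
from bisect import bisect_right
from collections import Counter

PT_EDGES = [20.0, 50.0, 100.0, 200.0, 1000.0]  # GeV, hypothetical binning
ETA_EDGES = [0.0, 0.8, 1.6, 2.5]               # |eta|, hypothetical binning


def bin_index(pt, eta):
    """Return the (pT, |eta|) bin of a jet, or None if it is out of range."""
    i = bisect_right(PT_EDGES, pt) - 1
    j = bisect_right(ETA_EDGES, abs(eta)) - 1
    if 0 <= i < len(PT_EDGES) - 1 and 0 <= j < len(ETA_EDGES) - 1:
        return (i, j)
    return None


def kinematic_weights(jets, reference="b"):
    """jets: list of (flavour, pt, eta) tuples. Returns one weight per jet."""
    counts = Counter()
    for flavour, pt, eta in jets:
        counts[(flavour, bin_index(pt, eta))] += 1
    weights = []
    for flavour, pt, eta in jets:
        b = bin_index(pt, eta)
        if b is None:
            weights.append(0.0)  # jets outside the binned range are dropped
        else:
            weights.append(counts[(reference, b)] / counts[(flavour, b)])
    return weights


jets = [("b", 30.0, 0.1), ("b", 30.0, 0.2), ("c", 30.0, 0.3),
        ("udsg", 30.0, 0.1), ("udsg", 30.0, 0.1),
        ("udsg", 30.0, 0.1), ("udsg", 30.0, 0.1),
        ("c", 2000.0, 0.1)]
w = kinematic_weights(jets)
print(w)
```

After weighting, every flavour has the same summed weight in each bin as the reference flavour, so the network cannot separate flavours from the kinematic spectrum alone.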

The basic idea of the DeepJet architecture is to use low-level information from all jet constituents. Processing an input space of this size requires a suitable architecture and training procedure. In the first step, the four input groups are handled in separate branches: each group except the global variables is filtered through a $1 \times 1$ convolutional layer. Each of the three outputs is then fed into a recurrent layer of the Long Short-Term Memory (LSTM) type ^[4]. The three LSTM outputs are concatenated with the global variables and passed to fully connected layers. The six output nodes form a multi-classifier that covers b tagging, c tagging, and quark/gluon tagging.
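A $1 \times 1$ convolution over a sequence of constituents applies the same small linear map to each constituent's feature vector independently. The sketch below illustrates only this property; the ReLU activation, the feature sizes, and all names are assumptions for this example and do not reproduce the DeepJet configuration.

```python
import random


def conv1x1(constituents, kernel, bias):
    """Apply one shared linear map (followed by ReLU) to every constituent.

    constituents: list of feature vectors, one per jet constituent
    kernel:       list of n_out rows, each of length n_in
    bias:         list of n_out offsets
    """
    out = []
    for features in constituents:
        row = [max(0.0, sum(w * f for w, f in zip(k, features)) + b)
               for k, b in zip(kernel, bias)]
        out.append(row)
    return out


rng = random.Random(1)
n_in, n_out, n_constituents = 6, 3, 4  # hypothetical sizes
kernel = [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]
bias = [0.0] * n_out
candidates = [[rng.gauss(0.0, 1.0) for _ in range(n_in)]
              for _ in range(n_constituents)]

compressed = conv1x1(candidates, kernel, bias)
print(len(compressed), len(compressed[0]))
```

The sequence length is unchanged while each constituent is mapped from `n_in` to `n_out` features, which is how such layers can compress per-constituent information before a recurrent layer reads the sequence.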

Training is performed using the Adam optimiser with a learning rate of $3 \times 10^{-4}$ for 65 epochs and a categorical cross-entropy loss. The learning rate is halved if the validation loss stagnates for more than 10 epochs. Figure 3 reports the Receiver Operating Characteristic (ROC) curves for two different $p_{T}$ ranges on the same dataset, compared with the performance of the DeepCSV tagger. These curves display the background misidentification probability versus the signal efficiency, both measured in Monte Carlo simulation.
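The learning-rate rule quoted above can be sketched as a small scheduling function. This is an illustration under stated assumptions: "stagnates" is taken to mean that the validation loss has not improved on its best value, and the function and parameter names are hypothetical.

```python
def halve_on_plateau(val_losses, lr=3e-4, patience=10, factor=0.5):
    """Return the learning rate used in each epoch.

    The rate is multiplied by `factor` once the validation loss has failed
    to improve on its best value for more than `patience` epochs in a row.
    """
    best = float("inf")
    stalled = 0
    schedule = []
    for loss in val_losses:
        schedule.append(lr)
        if loss < best:
            best = loss
            stalled = 0
        else:
            stalled += 1
            if stalled > patience:
                lr *= factor
                stalled = 0
    return schedule


# Hypothetical validation losses: two improving epochs, then a plateau.
losses = [1.0, 0.9] + [0.9] * 12
schedule = halve_on_plateau(losses)
print(schedule[12], schedule[13])
```

In this example the loss stops improving after the second epoch, and the rate drops from $3 \times 10^{-4}$ to $1.5 \times 10^{-4}$ once eleven consecutive epochs have passed without improvement.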

Figure 3. ROC curves of the DeepJet and DeepCSV b-tagging algorithms on $t\bar{t}$ events in which both top quarks decay hadronically. In (a), $p_{T}^{\mathrm{jet}} > 30$ GeV, while in (b), $p_{T}^{\mathrm{jet}} > 90$ GeV ^[3].
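Each point on such a ROC curve corresponds to one threshold on the discriminator: the signal efficiency is the fraction of true b jets above the threshold, and the misidentification probability is the fraction of background jets above it. A minimal sketch with made-up discriminator values; the function name and the numbers are hypothetical.

```python
def roc_points(signal_scores, background_scores, thresholds):
    """Return a (signal efficiency, misidentification rate) pair per threshold."""
    points = []
    for t in thresholds:
        eff = sum(s > t for s in signal_scores) / len(signal_scores)
        mis = sum(s > t for s in background_scores) / len(background_scores)
        points.append((eff, mis))
    return points


b_scores = [0.95, 0.80, 0.70, 0.40]      # hypothetical values for b jets
light_scores = [0.60, 0.30, 0.10, 0.05]  # hypothetical values for light jets
pts = roc_points(b_scores, light_scores, [0.2, 0.5, 0.9])
for eff, mis in pts:
    print(eff, mis)
```

Tightening the threshold lowers both quantities, and a better tagger reaches a lower misidentification probability at the same signal efficiency.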

References

[1] Sirunyan, A.M.; Tumasyan, A.; Adam, W.; Ambrogi, F.; Asilar, E.; Bergauer, T.; Brandstetter, J.; Dragicevic, M.; Erö, J.; Del Valle, A.E.; et al. Identification of heavy-flavour jets with the CMS detector in pp collisions at 13 TeV. J. Instrum. 2018, 13, P05011.
[2] Chollet, F. Keras; GitHub: Seattle, WA, USA, 2015; Available online: https://keras.io/ (accessed on 16 October 2022).
[3] Bols, E.; Kieseler, J.; Verzetti, M.; Stoye, M.; Stakia, A. Jet Flavour Classification Using DeepJet. J. Instrum. 2020, 15, P12012.
[4] Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780.

© Text is available under the terms and conditions of the Creative Commons Attribution (CC BY) license; additional terms may apply.

Information

Subjects: Physics, Particles & Fields

Contributors:

Antimo Cagnotta

Francesco Carnevali

Agostino De Iorio

Update Date: 09 Nov 2022
