Improvement of $q^2$ Resolution in Semileptonic Decays Based on Machine Learning

Ge, Panting; Huang, Xiaotao; Saur, Miroslav; Sun, Liang

doi:https://doi.org/10.1155/2023/8127604

Advances in High Energy Physics


Research Article | Open Access

Volume 2023 | Article ID 8127604 | https://doi.org/10.1155/2023/8127604


Panting Ge,¹ Xiaotao Huang,² Miroslav Saur,³ and Liang Sun¹

Academic Editor: Mariana Frank

Received 22 Jul 2022

Revised 19 Jan 2023

Accepted 10 Feb 2023

Published 27 Mar 2023

Abstract

The neutrino closure method is often used to obtain the kinematics of semileptonic decays with one unreconstructed particle in hadron collider experiments. The decay kinematics can be deduced from a quadratic equation, up to a twofold ambiguity. To resolve this ambiguity, a novel method based on machine learning (ML) is proposed. We study the effect of different sets of features and regressors on the resolution of the reconstructed invariant mass squared of the $\ell\nu$ system ($q^2$). The best performance is obtained by using the flight vector as the features and the multilayer perceptron (MLP) model as the regressor. Compared with a random choice, the MLP model improves the resolution of the reconstructed $q^2$ by ~40%. Furthermore, the possibility of using this method on various semileptonic decays is shown.

1. Introduction

Semileptonic decays, mediated by a virtual $W$ boson which produces one lepton and the corresponding neutrino in addition to one or more hadrons, offer a good platform to study both weak and strong interaction effects [1]. Studies of semileptonic decays have therefore received increasing attention in recent years, especially for precise measurements of the Cabibbo-Kobayashi-Maskawa (CKM) matrix elements [2, 3], such as the determination of $|V_{ub}|$ and $|V_{cb}|$. The precision measurement of the CKM matrix elements helps predict other branching fractions, such as . Additionally, recent experimental measurements of branching fraction ratios show a slight disagreement with the Standard Model predictions [4]. For these reasons, semileptonic decays are studied increasingly by the LHCb experiment, which focuses on heavy-flavour physics in the forward region, although the presence of an unreconstructed neutrino is experimentally challenging.

At $B$-factories operating at the $\Upsilon(4S)$ resonance, the kinematics of missing particles in $B$ meson decays can be reconstructed by balancing against the decay of the other $B$ meson [5], while in hadron collider experiments, the study of semileptonic decays poses a technical challenge [6] due to the unreconstructed neutrino in the final state. First of all, a large Lorentz boost is produced in hadron collider experiments, especially at the forward rapidity covered by the LHCb experiment [7], which is one of the major experiments at the LHC. Secondly, the decay kinematics can be restricted by the $b$-hadron decay vertex and the measured flight vector, which connects it with the primary interaction vertex [8]. Finally, the mass of a single missing particle can be deduced from four-momentum conservation. Conservation of the momentum transverse to the flight vector provides two independent constraints on the semileptonic decay as well. A third constraint is that the parent $b$-hadron mass should be conserved, though this condition has an ambiguity which produces two solutions.

A recently proposed lattice QCD method [9] for the precise calculation of the relevant hadronic form factors shows that the magnitudes of the CKM matrix elements can be calculated from these known form factors and measurements of $\Lambda_b \to p \ell^- \bar{\nu}_\ell$ and $\Lambda_b \to \Lambda_c \ell^- \bar{\nu}_\ell$. At the same time, a measurement of the ratio $|V_{ub}|/|V_{cb}|$ with the newly observed exclusive decay $\Lambda_b^0 \to p \mu^- \bar{\nu}_\mu$ and with $\Lambda_b^0 \to \Lambda_c^+ \mu^- \bar{\nu}_\mu$ has been performed by the LHCb experiment [10]. This measurement has a significant effect on global fits to the parameters of the CKM matrix. Similarly, the single most precise determination of $|V_{ub}|$ has been obtained from a $2+1$-flavour lattice QCD calculation with domain-wall light quarks and relativistic heavy quarks, which is based on the mentioned decay mode [11]. LHCb recently made the first observation of the suppressed semileptonic decay $B_s^0 \to K^- \mu^+ \nu_\mu$ and subsequently measured the ratio of the CKM matrix elements $|V_{ub}|/|V_{cb}|$ at low and high momentum transfer [12]. One of the challenges for the determination of CKM matrix elements in hadron collider experiments is to infer $q^2$. To calculate it, the neutrino momentum must be reconstructed with reasonable precision.

In Ref. [5], a linear regression estimating the $b$-hadron momentum, with the flight vector as input, is used to resolve the quadratic ambiguity. Building on that study, we propose a method using the MLP regressor, motivated by the correlation coefficient of 0.54 between a flight-vector feature and the $b$-hadron momentum. Such a moderate coefficient implies an underlying nonlinear dependence of the target on the features, which cannot be captured by a linear regressor. The work presented below is aimed at improving the $q^2$ resolution of semileptonic decays in a hadron collider environment, based on ML with the Python library scikit-learn [13]. First, the formula for the decay kinematics with a missing particle is briefly introduced. Next, the RapidSim fast Monte Carlo (MC) generator [14] is used to simulate semileptonic decays in $pp$ collisions. Different sets of features and regressors are then studied, leading to the selection of the flight vector and the MLP model as the best-performing combination. Then, using the decay as a test channel, the resolution improvement of $q^2$ is compared with the random choice and with the linear regressor method introduced in Ref. [5]. Finally, in order to examine the performance and to obtain a credible conclusion, other semileptonic decay channels are tested as well. This paper uses LHCb as a model detector, but the ideas should be applicable to any other hadron collider experiment in the future.

2. Theoretical Derivation of Neutrino Momentum

The decay is used as the example case in this article, and its topology is shown in Figure 1.

The momentum is required to be aligned with the reconstructed flight direction [15]. By the symmetry of the decay, the momentum of the neutrino transverse to the flight direction must be equal in magnitude and opposite in sign to the transverse momentum of the visible system [5, 16, 17], as shown in the following:

From the momentum and energy conservation, we then have

Next, we use the mass constraint to derive ,

Then, we can get a quadratic equation for neutrino momentum in the following form: where the coefficients are defined as follows:

Finally, the neutrino momentum parallel to the flight direction can be determined up to a twofold ambiguity as
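The steps outlined in this section can be sketched in LaTeX as follows. This is a hedged reconstruction of the standard neutrino-closure algebra for a massless neutrino, arranged in six lines to mirror the six steps described above; it is not necessarily the authors' exact notation. The symbols are assumptions: $H_b$ denotes the parent hadron, "vis" the visible system, $\parallel$ and $\perp$ the components along and transverse to the flight direction, $p_{\perp} = |\vec{p}_{\perp}(\mathrm{vis})|$, and $K$ is a shorthand introduced here.

```latex
\begin{align}
\vec{p}_{\perp}(\nu) &= -\,\vec{p}_{\perp}(\mathrm{vis}), \\
E_{H_b} &= E_{\mathrm{vis}} + E_{\nu}, \qquad
p_{\parallel}(H_b) = p_{\parallel}(\mathrm{vis}) + p_{\parallel}(\nu), \\
m_{H_b}^{2} &= \bigl(E_{\mathrm{vis}} + E_{\nu}\bigr)^{2}
  - \bigl(p_{\parallel}(\mathrm{vis}) + p_{\parallel}(\nu)\bigr)^{2},
  \qquad E_{\nu} = \sqrt{p_{\perp}^{2} + p_{\parallel}(\nu)^{2}}, \\
0 &= a\,p_{\parallel}(\nu)^{2} + b\,p_{\parallel}(\nu) + c, \\
a &= m_{\mathrm{vis}}^{2} + p_{\perp}^{2}, \quad
b = -2K\,p_{\parallel}(\mathrm{vis}), \quad
c = E_{\mathrm{vis}}^{2}\,p_{\perp}^{2} - K^{2}, \quad
K = \tfrac{1}{2}\bigl(m_{H_b}^{2} - m_{\mathrm{vis}}^{2}\bigr) - p_{\perp}^{2}, \\
p_{\parallel}(\nu) &= \frac{-b \pm \sqrt{b^{2} - 4ac}}{2a}.
\end{align}
```

The quadratic follows from isolating $E_{\mathrm{vis}} E_{\nu} = K + p_{\parallel}(\mathrm{vis})\,p_{\parallel}(\nu)$ in the mass constraint and squaring both sides; a negative discriminant $b^{2} - 4ac$ corresponds to the unphysical case.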

Due to the LHCb detector resolution effects [5], approximately 20~40% of the events selected by the properties of the decay chains have no physical solution for the neutrino momentum, that is, the discriminant of the quadratic equation is negative. Such events are discarded in this work. The $b$-hadron momentum and the $q^2$ of signal candidates may now be determined with a twofold ambiguity, so a choice must be made between the two solutions. The simplest way is to pick one of the two solutions at random, but this leads to a poor resolution. To improve the resolution, Ref. [5] uses a linear regression algorithm with the flight length and the polar angle of the flight vector as the features. Building on that study, this paper proposes a novel ML-based method to further improve the resolution.
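As a minimal sketch of the step described above, the following Python code solves a quadratic of the neutrino-closure type, returns both solutions, and signals the unphysical case with `None`. The function names, argument names, and the specific form of the coefficients are assumptions made for illustration (standard closure algebra with a massless neutrino); this is not the authors' code.

```python
import math
import random


def neutrino_p_parallel(e_vis, p_par_vis, p_perp, m_vis, m_parent):
    """Return both solutions for the neutrino momentum along the flight
    direction, or None when the discriminant is negative (unphysical).

    e_vis, p_par_vis, p_perp, m_vis describe the visible system relative to
    the flight direction; m_parent is the known parent-hadron mass.
    """
    k = 0.5 * (m_parent**2 - m_vis**2) - p_perp**2
    a = e_vis**2 - p_par_vis**2          # equals m_vis**2 + p_perp**2
    b = -2.0 * k * p_par_vis
    c = e_vis**2 * p_perp**2 - k**2
    discriminant = b * b - 4.0 * a * c
    if discriminant < 0.0:
        return None                      # event would be discarded
    root = math.sqrt(discriminant)
    return ((-b - root) / (2.0 * a), (-b + root) / (2.0 * a))


def pick_random(solutions, rng):
    """Baseline choice: pick one of the two solutions at random."""
    return rng.choice(solutions)
```

A typical use would call `neutrino_p_parallel` per candidate, drop candidates that return `None`, and then pass the surviving pair either to `pick_random(solutions, random.Random(0))` or to a regression-based selection.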

3. Simulation of Semileptonic Decay Production

The RapidSim event generator is used to simulate semileptonic decays in $pp$ collisions at TeV. About 1 million MC events are generated. This paper uses the LHCb coordinate system, in which $z$ points along the beam axis into the LHCb detector, $y$ is vertical, and $x$ is horizontal. Signal heavy-quark hadron events are restricted to the pseudorapidity ($\eta$) range $2 < \eta < 5$, which corresponds to the approximate kinematic acceptance of the LHCb detector [18].

As the variables used in this study depend on the flight direction between the heavy-quark hadron production and decay vertices, the resolution of the associated features must be modeled; that is, a suitable smearing is applied first in order to simulate the expected experimental resolution. The $x$ and $y$ coordinates of the heavy-quark hadron decay vertices are smeared by a Gaussian distribution with a sigma of ±20 μm. A much larger resolution of ±200 μm is applied in the $z$ direction [5]. To reflect the known performance of the LHCb VELO detector [5, 19], the resolutions of the production vertices in the $x$, $y$, and $z$ coordinates are assumed to be ±13 μm, ±13 μm, and ±70 μm, respectively. In all presented studies, the smeared flight length is required to be larger than 3 mm. These assumptions approximately reproduce the effect of online and offline selections of heavy-quark hadron decays in LHCb [5, 20].
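The smearing and flight-length requirement described above can be sketched with NumPy as follows. The resolutions and the 3 mm cut are taken from the text; the function name, array layout (one row per event, columns $x$, $y$, $z$ in mm), and random seed are assumptions for illustration only.

```python
import numpy as np

# Gaussian resolutions in mm for (x, y, z), as quoted in the text.
SIGMA_DECAY_VTX = np.array([0.020, 0.020, 0.200])
SIGMA_PROD_VTX = np.array([0.013, 0.013, 0.070])
MIN_FLIGHT_MM = 3.0


def smear_flight(prod_vtx, decay_vtx, rng):
    """Smear both vertices and apply the flight-length requirement.

    prod_vtx, decay_vtx: arrays of shape (n_events, 3) in mm.
    Returns the smeared flight vectors that pass the cut and the boolean
    mask of events that were kept.
    """
    prod = prod_vtx + rng.normal(0.0, SIGMA_PROD_VTX, size=prod_vtx.shape)
    decay = decay_vtx + rng.normal(0.0, SIGMA_DECAY_VTX, size=decay_vtx.shape)
    flight = decay - prod
    length = np.linalg.norm(flight, axis=1)
    keep = length > MIN_FLIGHT_MM
    return flight[keep], keep


rng = np.random.default_rng(seed=1)
true_prod = np.zeros((1000, 3))
true_decay = np.tile([0.5, 0.2, 10.0], (1000, 1))
smeared_flight, kept = smear_flight(true_prod, true_decay, rng)
```

Because the two vertices are smeared independently, the flight vector in $z$ ends up with a spread of about $\sqrt{0.2^2 + 0.07^2} \approx 0.21$ mm in this sketch.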

4. Features and Regressors

Regression analysis is a set of statistical methods for estimating a target value from its relationship with the input features [13]. It is therefore important to select well-suited regressors and efficient features for different use-case scenarios.

In Ref. [5], the momentum of the $b$-hadron, the mother particle, is inferred with a linear regression algorithm using two flight variables: the flight distance of the mother particle and the polar angle of the flight vector. In our case, five sets of features have been chosen, as summarized in Table 1. All features are selected based on Section 2 and on those used in Ref. [5], where , , and are the components of . Three different regressors are studied in this paper, labeled as “regressor A-C” [21–23] and shown in Table 1. These regressors are selected from the full range of regression models included in the scikit-learn toolkit.

To test the performance of the different sets of features and select the best one, we compare them under otherwise identical conditions. Figure 2 shows the resolution improvement and the root mean square (RMS) value of the reconstructed $b$-hadron momentum resolution for the different sets of input variables, using the MLP regressor. “Label A” and “label C” give the same improvement of 40%, while the other sets give less than 35%. The (mean, RMS) values of the momentum resolution for “label A,” “label B,” and “label C” are (10, 93) MeV/$c$, (9, 95) MeV/$c$, and (8, 93) MeV/$c$, respectively. Based on these results, we select “label C,” the flight-vector information, as the feature set for this study. Figure 3 shows the distributions of the four selected features versus the $b$-hadron momentum, with correlation coefficients of 0.54, -0.01, -0.00, and 0.52, respectively.
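For reference, a linear (Pearson) correlation coefficient of the kind quoted above can be computed with NumPy as in the short sketch below. The helper name `pearson_r` and the toy arrays are illustrative assumptions. The second example shows why a small coefficient does not rule out a dependence: a purely quadratic relation on a symmetric range has zero linear correlation.

```python
import numpy as np


def pearson_r(x, y):
    """Pearson linear correlation coefficient between two 1D arrays."""
    return float(np.corrcoef(x, y)[0, 1])


# Perfectly linear toy relation: coefficient of 1.
linear_r = pearson_r([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])

# Purely nonlinear toy relation (y = x**2 on a symmetric range):
# the linear coefficient vanishes although y is fully determined by x.
x_sym = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])
nonlinear_r = pearson_r(x_sym, x_sym**2)
```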

Once the input features are determined, the best regressor is selected by a similar method. Figure 4 shows the resolution improvement and the RMS value of the momentum resolution for the different regressors, using the “label C” input features. The resolution improvements observed for regressors A, B, and C are 34%, 39%, and 40%, with (mean, RMS) values of the momentum resolution of (8, 94) MeV/$c$, (9, 94) MeV/$c$, and (8, 93) MeV/$c$, respectively. The best feature set is therefore “label C,” the flight-vector information, and the best regressor is the MLP regressor.
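A comparison of this kind can be sketched with scikit-learn as below. This is an illustration under stated assumptions, not the configuration used in the paper: the toy event model in `make_toy_sample`, its constants, the four-column feature layout (flight length and three Cartesian components), and the MLP hyperparameters are all hypothetical. Only `LinearRegression`, `MLPRegressor`, `StandardScaler`, and `make_pipeline` are real scikit-learn objects.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def make_toy_sample(n_events, rng):
    """Toy flight-vector features and parent momentum (hypothetical model)."""
    momentum = rng.lognormal(mean=4.5, sigma=0.5, size=n_events)  # toy, GeV/c
    p_t = rng.uniform(2.0, 10.0, size=n_events)                   # toy, GeV/c
    theta = np.arcsin(np.clip(p_t / momentum, 1e-3, 0.3))
    phi = rng.uniform(-np.pi, np.pi, size=n_events)
    # Flight length grows with momentum times an exponential decay time.
    length = 0.08 * momentum * rng.exponential(1.0, size=n_events)  # toy, mm
    features = np.column_stack([
        length,
        length * np.sin(theta) * np.cos(phi),
        length * np.sin(theta) * np.sin(phi),
        length * np.cos(theta),
    ])
    return features, momentum


def fit_regressors(features, target):
    """Fit a linear baseline and a small MLP on the same features."""
    linear = LinearRegression().fit(features, target)
    mlp = make_pipeline(
        StandardScaler(),  # MLPs train more reliably on scaled inputs
        MLPRegressor(hidden_layer_sizes=(32, 32), max_iter=300, random_state=0),
    ).fit(features, target)
    return linear, mlp


toy_rng = np.random.default_rng(0)
toy_features, toy_momentum = make_toy_sample(2000, toy_rng)
linear_model, mlp_model = fit_regressors(toy_features, toy_momentum)
```

Either fitted model then provides `predict(features)`, whose output can be used to choose between the two kinematic solutions. With so few iterations scikit-learn may warn that the MLP has not fully converged, which is acceptable for a sketch.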

5. Performance of MLP Regressor

This section describes the applications of the best regressor, MLP regressor, for different semileptonic decays, such as , , , and .

5.1. Tests on Channel

The decay channel has been used to study the improvement of the $q^2$ resolution with the MLP regressor and the “label C” features. Figure 5 shows the distributions of the resolution, computed from the reconstructed and input values, for three selections labeled “best,” “correct,” and “random.” “Best” is the solution that corresponds to the regression value. “Correct” is the solution closest to the true value from the input MC; it is included here for comparison. “Random” is the solution obtained by selecting a random result of Equation (6). The result indicates an obvious improvement of “best” over “random.” The flowchart of the methodology is shown in Figure 6.
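The three selections can be sketched as follows, assuming that "corresponds to the regression value" means choosing, per event, the solution nearest to the regressed value. The function names and the toy numbers are illustrative assumptions.

```python
import numpy as np


def select_closest(solution_1, solution_2, reference):
    """Per event, return the solution closer to the reference value.

    With the regression output as reference this gives the "best" choice;
    with the MC truth as reference it gives the "correct" choice.
    """
    solution_1 = np.asarray(solution_1, dtype=float)
    solution_2 = np.asarray(solution_2, dtype=float)
    reference = np.asarray(reference, dtype=float)
    take_first = np.abs(solution_1 - reference) <= np.abs(solution_2 - reference)
    return np.where(take_first, solution_1, solution_2)


def select_random(solution_1, solution_2, rng):
    """Per event, return one of the two solutions with equal probability."""
    solution_1 = np.asarray(solution_1, dtype=float)
    solution_2 = np.asarray(solution_2, dtype=float)
    take_first = rng.random(solution_1.shape[0]) < 0.5
    return np.where(take_first, solution_1, solution_2)


# Toy example with two events.
sol_a = np.array([10.0, 50.0])
sol_b = np.array([30.0, 20.0])
regressed = np.array([12.0, 25.0])
best = select_closest(sol_a, sol_b, regressed)
```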

Table 2 shows the resolution of the reconstructed $q^2$ for different ways of selecting a solution of the twofold ambiguity, together with the improvement relative to a random selection. Using the output of the MLP regression model with the “label C” features to select a solution improves the resolution of the reconstructed $q^2$ by ~40% compared with a random selection.
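One plausible way to quantify such an improvement is sketched below. The exact definition used for the table is not spelled out in the text, so both the RMS about zero and the "one minus ratio of RMS values" figure of merit are assumptions, as are the function names.

```python
import numpy as np


def rms(residuals):
    """Root mean square about zero of reconstructed-minus-true residuals."""
    residuals = np.asarray(residuals, dtype=float)
    return float(np.sqrt(np.mean(residuals**2)))


def relative_improvement(residuals_method, residuals_random):
    """Fractional reduction of the RMS relative to the random choice."""
    return 1.0 - rms(residuals_method) / rms(residuals_random)


# Toy residuals: an RMS of 3 against an RMS of 5 is a 40% improvement.
toy_improvement = relative_improvement([3.0, -3.0], [5.0, -5.0])
```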

To illustrate the robustness of the model, data samples of different sizes are tested. Figure 7 shows the improvement of the reconstructed $q^2$ resolution and the RMS value of the resolution for various sample sizes, with the linear regressor included for comparison. The improvement obtained with the MLP regressor is on average 5% higher than that obtained with the linear regressor. The RMS values from the MLP regressor are clearly smaller than those from the random choice in all tested data samples, and on average an improvement of around 40% in the reconstructed $q^2$ resolution is achieved by the MLP regressor.

To summarize, the MLP regression method improves the $q^2$ resolution by up to 40% compared with the random choice, or by up to 5% compared with the linear regressor, so that a more precise measurement of the CKM matrix element ratio based on our method is expected in this channel.

5.2. Tests on Other Channels

To scrutinize the obtained results, the selected method is applied to other semileptonic decays and re-evaluated. Three channels, namely, , , and , have been chosen to check the performance. These tests confirm that using the output of the MLP regression improves the resolution in all tested channels. More specifically, the resolution improvement on the reconstructed $q^2$ with respect to a random selection is, on average, 40% by using the MLP regressor in the decay mode. For the channels of and , the resolution is improved by 37% and 20% on average, respectively. Compared with the linear regressor, the MLP regressor yields on average a ~5% improvement in the obtained resolution for all studied decay channels.

6. Conclusions

A novel method to improve the $q^2$ resolution in semileptonic decays using an ML approach is studied in this paper. The flight-vector information, labeled as “label C,” shows the highest discrimination power, while the MLP regressor is the best regressor. We found the following:

(i) Using the MLP regression model with the “label C” features improves the resolution on the reconstructed $q^2$ by an average of ~40% compared with the random choice, or by up to 5% compared with the linear regressor method introduced in Ref. [5], when the decay is used as a test channel

(ii) The method also has similar performance in improving the reconstructed $q^2$ resolution in a wide range of semileptonic decays, namely, , , and

(iii) Moreover, the proposed method can potentially improve measurements of differential decay rates of semileptonic heavy-flavour hadron decays in hadron collider experiments such as LHCb

(iv) The studies presented here use the example of the LHCb experiment, but the ideas should be applicable to any other hadron collider experiment, now and in the future

However, the room for improvement by software means alone is rather limited, owing to the experimental resolution of the vertex positions that we have assumed (±200 μm in the $z$ direction and ±20 μm in the $x$ or $y$ direction) based on the LHCb experiment.

Data Availability

The RapidSim event generator is used to simulate semileptonic decays in $pp$ collisions at TeV. About 1 million MC events are generated.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by grants from the Natural Science Foundation of China (nos. 11735010, U1932108, U2032102, and 12061131006). The authors would like to thank Murphy Zheng (Murphy-Zheng Creative Studio) for polishing Figure 1 and Zhihao Xu (University of Chinese Academy of Sciences) for the useful discussion. M.S. acknowledges the support from the European Union’s Horizon 2020 Research and Innovation Programme under grant agreement no. 714536: PRECISION.

References

[1] J. Dingfelder and T. Mannel, “Leptonic and semileptonic decays of $B$ mesons,” Reviews of Modern Physics, vol. 88, no. 3, article 035008, 2016.
[2] J. Charles, A. Höcker, H. Lacker et al., “$CP$ violation and the CKM matrix: assessing the impact of the asymmetric $B$ factories,” The European Physical Journal C, vol. 41, no. 1, pp. 1–131, 2005.
[3] G. Ricciardi and M. Rotondo, “Determination of the Cabibbo-Kobayashi-Maskawa matrix element $|V_{cb}|$,” Journal of Physics G: Nuclear and Particle Physics, vol. 47, article 113001, 2020.
[4] Heavy Flavor Averaging Group (HFLAV), Y. Amhis, S. Banerjee et al., “Averages of $b$-hadron, $c$-hadron, and $\tau$-lepton properties as of summer 2016,” The European Physical Journal C, vol. 77, no. 12, p. 895, 2017.
[5] G. Ciezarek, A. Lupato, M. Rotondo, and M. Vesterinen, “Reconstruction of semileptonically decaying beauty hadrons produced in high energy $pp$ collisions,” Journal of High Energy Physics, vol. 2017, article 21, 2017.
[6] P. Gambino, A. S. Kronfeld, M. Rotondo et al., “Challenges in semileptonic $B$ decays,” The European Physical Journal C, vol. 80, no. 10, p. 966, 2020.
[7] The LHCb Collaboration, A. Augusto Alves Jr., L. M. A. Filho et al., “The LHCb detector at the LHC,” Journal of Instrumentation, vol. 3, article S08005, 2008.
[8] S. Dambach, U. Langenegger, and A. Starodumov, “Neutrino reconstruction with topological information,” Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, vol. 569, no. 3, pp. 824–828, 2006.
[9] W. Detmold, C. Lehner, and S. Meinel, “$\Lambda_b \to p \ell^- \bar{\nu}_\ell$ and $\Lambda_b \to \Lambda_c \ell^- \bar{\nu}_\ell$ form factors from lattice QCD with relativistic heavy quarks,” Physical Review D, vol. 92, no. 3, article 034503, 2015.
[10] The LHCb collaboration, R. Aaij, B. Adeva et al., “Determination of the quark coupling strength $|V_{ub}|$ using baryonic decays,” Nature Physics, vol. 11, no. 9, pp. 743–747, 2015.
[11] J. M. Flynn, T. Izubuchi, T. Kawanai et al., “$B \to \pi \ell \nu$ and $B_s \to K \ell \nu$ form factors and $|V_{ub}|$ from $2+1$-flavor lattice QCD with domain-wall light quarks and relativistic heavy quarks,” Physical Review D, vol. 91, no. 7, article 074510, 2015.
[12] R. Aaij, C. Abellan Beteta, T. Ackernley et al., “First observation of the decay $B_s^0 \to K^- \mu^+ \nu_\mu$ and measurement of $|V_{ub}|/|V_{cb}|$,” Physical Review Letters, vol. 126, no. 8, article 081804, 2021.
[13] F. Pedregosa, G. Varoquaux, A. Gramfort et al., “Scikit-learn: machine learning in Python,” Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.
[14] G. A. Cowan, D. C. Craik, and M. D. Needham, “RapidSim: an application for the fast simulation of heavy-quark hadron decays,” Computer Physics Communications, vol. 214, pp. 239–246, 2017.
[15] S. Stone and L. Zhang, “Method of studying $\Lambda_b^0$ decays with one missing particle,” Advances in High Energy Physics, vol. 2014, Article ID 931257, 5 pages, 2014.
[16] E791 Collaboration, E. M. Aitala, S. Amato et al., “Search for $D^0$-$\bar{D}^0$ mixing in semileptonic decay modes,” Physical Review Letters, vol. 77, pp. 2384–2387, 1996.
[17] The LHCb collaboration, R. Aaij, C. A. Beteta et al., “Measurement of the shape of the $B_s^0 \to D_s^{*-} \mu^+ \nu_\mu$ differential decay rate,” Journal of High Energy Physics, vol. 12, p. 144, 2020.
[18] B. Audurier and LHCb Collaboration, “Highlights from the LHCb experiment,” Nuclear Physics A, vol. 1005, article 122001, 2021.
[19] R. Geertsema, K. Akiba, M. van Beuzekom et al., “Charge collection properties of prototype sensors for the LHCb VELO upgrade,” Journal of Instrumentation, vol. 16, no. 2, article P02029, 2021.
[20] LHCb Collaboration, R. Aaij, S. Akar et al., “Design and performance of the LHCb trigger and full real-time reconstruction in run 2 of the LHC,” Journal of Instrumentation, vol. 14, no. 4, article P04013, 2019.
[21] R. Garcia, M. Anzorena, J. F. Valdés-Galicia et al., “Particle identification and analysis in the SciCRT using machine learning tools,” Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, vol. 1003, article 165326, 2021.
[22] T. Lebese and X. Ruan, “The use of generative adversarial networks to characterise new physics in multi-lepton final states at the LHC,” https://arxiv.org/abs/2105.14933.
[23] E. Yüksel, D. Soydaner, and H. Bahtiyar, “Nuclear binding energy predictions using neural networks: application of the multilayer perceptron,” International Journal of Modern Physics E, vol. 30, no. 3, article 2150017, 2021.

Copyright

Copyright © 2023 Panting Ge et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The publication of this article was funded by SCOAP³.
