Abstract

The neutrino closure method is often used to obtain kinematics of semileptonic decays with one unreconstructed particle in hadron collider experiments. The kinematics of decays can be deducted by a twofold ambiguity with a quadratic equation. To resolve the twofold ambiguity, a novel method based on machine learning (ML) is proposed. We study the effect of different sets of features and regressors on the improvement of reconstructed invariant mass squared of system (). The result shows that the best performance is obtained by using the flight vector as the features and the multilayer perceptron (MLP) model as the regressor. Compared with the random choice, the MLP model improves the resolution of reconstructed by ~40%. Furthermore, the possibility of using this method on various semileptonic decays is shown.

1. Introduction

Semileptonic decays, mediated by a virtual boson which produces one lepton and the corresponding neutrino in addition to one or more hadrons, offer a good platform to study the weak as well as strong interaction effects [1]. Studies of semileptonic decays, therefore, have been paid much more attention in recent years, especially for the purposes of precise measurements on the Cabibbo-Kobayashi-Maskawa (CKM) matrix elements [2, 3], such as the determination of and . The precision measurement of the CKM matrix elements helps predict other branching fractions, such as . Additionally, the recent measurements of the branching fraction ratios measured in experiments show a slight disagreement with the standard model predictions [4]. Based on the above, the studies of semileptonic decays by LHCb experiment, which focuses on a heavy flavour studies in a forward region, show an increasing trend, although the presence of an unreconstructed neutrino is experimentally challenging.

At -factories operating at the (4S) resonance, the kinematics of missing particles in mesons can be reconstructed by balancing against the decay [5], while in hadron collider experiments, the studies of semileptonic decays pose a technical challenge [6] due to the unreconstructed neutrino in the final state. First of all, a large Lorentz boost can be produced by hadron collider experiments, especially at the forward rapidity covered by the LHCb experiment [7], which is one of the major experiments at LHC. Secondly, the decay kinematics can be restricted by the -hadron decay vertex and the measured flight vector which connects with the primary interaction vertex [8]. Finally, the mass of single missing particles can be deduced from the conservation of four momentums. Conservation of the transverse momentum to the flight vector provides two independent constraints on the semileptonic decays as well. A third constraint is that the parent -hadron mass should be conserved, though this condition has an ambiguity which produces two solutions.

A recently proposed lattice QCD method [9] for the precise calculation of the relevant hadronic form factors shows that the magnitudes of the CKM matrix elements can be calculated based on these known form factors and measurements of and . At the same time, a measurement of the ratio with a newly observed exclusive decay and has been performed by LHCb experiment [10]. This measurement has a significant effect on global fits to the parameters of the CKM matrix. Similarly, the single most precise determination of has been obtained from a -flavour lattice QCD calculation with domain-wall light quarks and relativistic heavy quarks, which is based on the mentioned decay mode [11]. LHCb recently made the first observation of the suppressed semileptonic decay and subsequently measured the ratio of the CKM matrix elements at low and high momentum transfer [12]. One of the challenges for the determination of CKM matrix elements in hadron collider experiments is to infer . To calculate the above, we need to reconstruct the neutrino momentum with a reasonable precision.

In Ref. [5], a linear regression based on estimating the -hadron momentum, using flight vector as input, can then be used to resolve the quadratic ambiguity. Based on the above study, we proposed a method using the MLP regressor based on 0.54 of the correlation coefficient of versus the -hadron momentum. This implies that there is underlying nonlinear dependence of the target on features which can not be captured by linear regressor. The work presented below is aimed at improving the resolution of semileptonic decays in hadron collider environment, based on ML with the Python library scikit-learn [13]. At first, the formula for the decay kinematics with a missing particle is briefly introduced. Then for this study, simulated events based on the RapidSim fast Monte Carlo (MC) generator [14] are used to simulate semileptonic decays in collision. Furthermore, different sets of features and regressors have been studied to select the flight vector and MLP model with the best performance. Then, using the decay as a test channel, the resolution improvement of is compared with random choice and the linear regressor method introduced in Ref. [5]. Finally, in order to examine the performance and to obtain a credible conclusion, other semileptonic decay channels are tested as well. This paper will use LHCb as a model detector, but the ideas should be available to any other hadron collider experiment in the future.

2. Theoretical Derivation of Neutrino Momentum

The decay is used as the example case in this articles and its topology described in Figure 1.

The momentum is required to be aligned with the reconstructed flight direction [15]. It can be known from the symmetry of the decay that the transverse momentum of the neutrino must be equal and its sign needs to be opposite to the transverse momentum of the visible system [5, 16, 17], that is, shown in the following:

From the momentum and energy conservation, we then have

Next, we use the mass constraint to derive ,

Then, we can get a quadratic equation for neutrino momentum in the following form: where the coefficients are defined as follows:

Finally, the neutrino momentum parallel to the flight direction can be determined up to a twofold ambiguity as

Due to the LHCb detector resolution effects [5], approximately 20~40% of the events selected by the properties of decay chains have an unphysical solution for , that is, the negative values of . Such events are discarded in this work. The momentum and the of signal candidates may now be determined with a twofold ambiguity. A choice needs to be made on which of the two solutions of or will be selected. The simplest way is to randomly pick one of the two solutions, but it will lead to a poor resolution of or . In order to improve the resolution, a linear regression algorithm is used by using the flight length and the polar angle of the flight vector as the features. Based on the above study [5], in this paper, a novel method based on ML has been proposed to further improve the resolution.

3. Simulation of Semileptonic Decay Production

The RapidSim event generator is used to simulate semileptonic decays in collision at TeV. About 1 million MC events are generated. The paper is using LHCb coordinate system which is defined as horizontal the beam axis into the LHCb detector, vertical and along the beam axis. Signal heavy-quark hadron events are restricted to be within a pseudorapidity () range , which corresponds to the approximate kinematic acceptance of the LHCb detector [18].

As the variables used in this study are dependent on the flight direction between the heavy-quark hadron production and its decay vertices, it is necessary for us to model the resolution in associated features; that is, we need to apply a proper smearing at first in order to simulate expected experimental resolution. The and coordinates of the heavy-quark hadron decay vertices are smeared by a Gaussian distribution with a sigma value of ±20 μm. A much larger resolution of ±200 μm is applied in the direction [5]. To reflect the known performance from the LHCb VELO detector [5, 19], the resolutions of production vertices for , , and ordinates are assumed at ±13 μm, ±13 μm, and ±70 μm, respectively. In all presented studies, the smeared flight length needs to be larger than 3 mm. These assumptions approximately meet the effect of online and offline selections from heavy-quark hadron decays in LHCb [5, 20].

4. Features and Regressors

The regression analysis is a set of statistical methods used for estimating the targeted value based on the relationships between regressor and features [13]. Therefore, it is important to select well-suited regressors and efficient features for different user-case scenarios.

In Ref. [5], the momentum of the -hadron as the mother particle is inferred based on a linear regression algorithm using two flight variables, and , where represents the flight distance of mother particle and is the polar angle of the flight vector. In our case, five sets of features have been chosen, as summarized in Table 1. All features are selected based on Section 2 and those used in Ref. [5], where , , and are the components of . Three different regressors are studied in this paper, labeled as “regressor A-C” [2123], shown in Table 1. These regressors are selected from a full range of regression models included in the scikit-learn toolkit.

To test the performance of different sets of features and select the best one, we make conditional experiments. Figure 2 shows the performance on improvement and the root mean square (RMS) value of reconstructed -hadron momentum resolution () with different sets of input variables from the MLP regressor. It indicates that “label A” and “label C” have the same performance on improvement, which increased by 40%, while other sets are less than 35%. The mean and RMS values of in “label A,” “label B,” and “label C” are (10, 93) MeV/, (9, 95) MeV/, and (8, 93) MeV/, respectively. Based on the obtained results, we select “label C” as the main method for this study, that is, , , , and . Figure 3 shows the distributions of , , , and versus the -hadron momentum with the correlation coefficients of 0.54, -0.01, -0.00, and 0.52.

Once the input features are determined, the best regressor is selected by a similar method. Figure 4 shows the performance on improvement and the RMS value of momentum using the different regressors based on the “label C” input features. The resolution increase has been observed for regressors A, B, and C as 34%, 39%, and 40%, while the (mean and RMS) values of for that are (8, 94) MeV/, (9, 94) MeV/, and (8, 93) MeV/, respectively. The best of features is “label C” which consists of , , , and , while the best regressor is the MLP regressor.

5. Performance of MLP Regressor

This section describes the applications of the best regressor, MLP regressor, for different semileptonic decays, such as , , , and .

5.1. Tests on Channel

decay channel has been used to study the improvement of resolution with MLP regressor and “label C” feature. Figure 5 shows the distributions of resolution (, where and are the reconstructed and input value, respectively) in different conditions, labeled as “best,” “correct,” and “random.” “Best” represents the result which corresponds to the regression value. “Correct” is defined as the solution being the one closest to the true from the input MC. The value is set up here for comparison. “Random” is the solution based on selecting a random result of Equation (6). The result indicates an obvious improvement from “best” compared with that from “random.” The flowchart of the methodology is shown in Figure 6.

Table 2 shows the resolution on the reconstructed for different ways of selecting a solution of the twofold ambiguity and shows the improvement on the resolution of reconstructed compared to a random selection. Using the output of the MLP regression model with “label C” feature to select a solution improves the resolution on the reconstructed by ~40% when compared with a random selection.

To illustrate the robustness of the model, data samples with different sizes are tested. Figure 7 shows the improvement of reconstructed resolution and the RMS value of based on various input statistics, with the linear regressor included for comparison. The improvement of resolution in case of the MLP regressor is on average higher by 5% with respect to values obtained using the linear regressor. The RMS values of from the MLP regressor are clearer smaller than those from the random choice in all tested data samples; meanwhile on average, around 40% of improvements for reconstructed resolution can be achieved by MLP regressor.

To summarize this part, the MLP regression method can significantly improve the resolution up to 40% when compared to the random choice or up to 5% when compared with the linear regressor, so that a more precise measurement on the ratio based on our method is expected in the channel.

5.2. Tests on Other Channels

In order to scrutinize obtained results, selected method is applied to other semileptonic decays and revaluated. Three channels, namely, , , and , have been chosen to check the performance. The performance tests on other channels confirm that using the output of MLP regression, improved resolution can be obtained in all tested channels. More specially, the resolution improvement on the reconstructed with respect to a random selection is, on average, 40% by using the MLP regressor in the decay mode. For the channels of and , the resolution is improved by 37% and 20% on average, respectively. The MLP regressor, when compared with the linear regressor, can on average result in ~5% improvement on the obtained resolution for all studied decay channels.

6. Conclusions

A novel method to improve the resolution in semileptonic decays using a ML approach is studied in this paper. The information of flight vector (, , , and ), labeled as “label C,” shows the highest discrimination power, while the MLP regressor is the best regressor. We found the following: (i)Using the MLP regression model with “label C” feature improves the resolution on the reconstructed by an average of ~40% when compared to the random choice or up to 5% when compared with the linear regressor method introduced in Ref. [5], when the decay is used as a test channel(ii)The method also have similar performance on improving the reconstructed resolution in a wide range of semileptonic decays, namely, , , and (iii)What is more, the proposed method can potentially improve measurements of differential decay rates of semileptonic heavy flavour hadrons decays in hadron collider experiments such as LHCb(iv)The studies presented here use the example of the LHCb experiment, but the ideas should be available to any other hadron collider experiment in the current and future

However, the room for improvement using sole software means is rather limited due to the experimental resolution of the vertex positioning that we have assumed (±200 μm in the direction and ±20 μm in or direction) based on the LHCb experiment.

Data Availability

The RapidSim event generator is used to simulate semileptonic decays in collision at TeV. About 1 million MC events are generated.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by grants from the Natural Science Foundation of China (nos. 11735010, U1932108, U2032102, and 12061131006). The authors would like to thank Murphy Zheng (Murphy-Zheng Creative Studio) for polishing Figure 1 and Zhihao Xu (University of Chinese Academy of Sciences) for the useful discussion. M.S. acknowledges the support from the European Union’s Horizon 2020 Research and Innovation Programme under grant agreement no. 714536: PRECISION.