Data analysis in high energy physics often deals with data samples consisting of a mixture of signal and background events. The sPlot technique is a common method to subtract the contribution of the ...background by assigning weights to events. Part of the weights are by design negative. Negative weights lead to the divergence of some machine learning algorithms training due to absence of the lower bound in the loss function. In this paper we propose a mathematically rigorous way to train machine learning algorithms on data samples with background described by sPlot to obtain signal probabilities conditioned on observables, without encountering negative event weight at all. This allows usage of any out-of-the-box machine learning methods on such data.
Machine Learning on sWeighted data Borisyak, M; Kazeev, N
Journal of physics. Conference series,
04/2020, Letnik:
1525, Številka:
1
Journal Article
Recenzirano
Odprti dostop
Data analysis in high energy physics has to deal with data samples produced from different sources. One of the most widely used ways to unfold their contributions is the sPlot technique. It uses the ...results of a maximum likelihood fit to assign weights to events. Some weights produced by sPlot are by design negative. Negative weights make it difficult to apply machine learning methods. The loss function becomes unbounded. This leads to divergent neural network training. In this paper we propose a mathematically rigorous way to transform the weights obtained by sPlot into class probabilities conditioned on observables, thus enabling to apply any machine learning algorithm out-of-the-box.
Daily operation of a large-scale experiment is a challenging task, particularly from perspectives of routine monitoring of quality for data being taken. We describe an approach that uses Machine ...Learning for the automated system to monitor data quality, which is based on partial use of data qualified manually by detector experts. The system automatically classifies marginal cases: both of good an bad data, and use human expert decision to classify remaining "grey area" cases. This study uses collision data collected by the CMS experiment at LHC in 2010. We demonstrate that proposed workflow is able to automatically process at least 20% of samples without noticeable degradation of the result.
Muon Trigger for Mobile Phones Borisyak, M; Usvyatsov, M; Mulhearn, M ...
Journal of physics. Conference series,
10/2017, Letnik:
898, Številka:
3
Journal Article
Recenzirano
Odprti dostop
The CRAYFIS experiment proposes to use privately owned mobile phones as a ground detector array for Ultra High Energy Cosmic Rays. Upon interacting with Earth's atmosphere, these events produce ...extensive particle showers which can be detected by cameras on mobile phones. A typical shower contains minimally-ionizing particles such as muons. As these particles interact with CMOS image sensors, they may leave tracks of faintly-activated pixels that are sometimes hard to distinguish from random detector noise. Triggers that rely on the presence of very bright pixels within an image frame are not efficient in this case. We present a trigger algorithm based on Convolutional Neural Networks which selects images containing such tracks and are evaluated in a lazy manner: the response of each successive layer is computed only if activation of the current layer satisfies a continuation criterion. Usage of neural networks increases the sensitivity considerably comparable with image thresholding, while the lazy evaluation allows for execution of the trigger under the limited computational power of mobile phones.
High-energy physics experiments rely on reconstruction of the trajectories of particles produced at the interaction point. This is a challenging task, especially in the high track multiplicity ...environment generated by p-p collisions at the LHC energies. A typical event includes hundreds of signal examples (interesting decays) and a significant amount of noise (uninteresting examples). This work describes a modification of the Artificial Retina algorithm for fast track finding: numerical optimization methods were adopted for fast local track search. This approach allows for considerable reduction of the total computational time per event. Test results on simplified simulated model of LHCb VELO (VErtex LOcator) detector are presented. Also this approach is well-suited for implementation of paralleled computations as GPGPU which look very attractive in the context of upcoming detector upgrades.
Daily operation of a large-scale experiment is a resource consuming task, particularly from perspectives of routine data quality monitoring. Typically, data comes from different sub-detectors and the ...global quality of data depends on the combinatorial performance of each of them. In this paper, the problem of identifying channels in which anomalies occurred is considered. We introduce a generic deep learning model and prove that, under reasonable assumptions, the model learns to identify 'channels' which are affected by an anomaly. Such model could be used for data quality manager cross-check and assistance and identifying good channels in anomalous data samples. The main novelty of the method is that the model does not require ground truth labels for each channel, only global flag is used. This effectively distinguishes the model from classical classification methods. Being applied to CMS data collected in the year 2010, this approach proves its ability to decompose anomaly by separate channels.
The data sample of Λ0b→J/ψpK− decays acquired with the LHCb detector from 7 and 8 TeV pp collisions, corresponding to an integrated luminosity of 3 fb−1, is inspected for the presence of J/ψp or ...J/ψK− contributions with minimal assumptions about K−p contributions. It is demonstrated at more than nine standard deviations that Λ0b→J/ψpK− decays cannot be described with K−p contributions alone, and that J/ψp contributions play a dominant role in this incompatibility. These model-independent results support the previously obtained model-dependent evidence for P+c→J/ψp charmonium-pentaquark states in the same data sample.
A full amplitude analysis of Λ0b→J/ψpπ− decays is performed with a data sample acquired with the LHCb detector from 7 and 8 TeV pp collisions, corresponding to an integrated luminosity of 3 fb−1. A ...significantly better description of the data is achieved when, in addition to the previously observed nucleon excitations N→pπ−, either the Pc(4380)+ and Pc(4450)+→J/ψp states, previously observed in Λ0b→J/ψpK− decays, or the Zc(4200)−→J/ψπ− state, previously reported in B0→J/ψK+π− decays, or all three, are included in the amplitude models. The data support a model containing all three exotic states, with a significance of more than three standard deviations. Within uncertainties, the data are consistent with the Pc(4380)+ and Pc(4450)+ production rates expected from their previous observation taking account of Cabibbo suppression.
A search for CP violation in D-0 -> K-K+ and D-0 -> pi(-)pi(+) decays is performed using pp collision data, corresponding to an integrated luminosity of 3 fb(-1), collected using the LHCb detector at ...center-of-mass energies of 7 and 8 TeV. The flavor of the charm meson is inferred from the charge of the pion in D*(+) -> D-0 pi(+) and D*(-) -> (D) over bar (0)pi(-) decays. The difference between the CP asymmetries in D-0 -> K-K+ and D-0 -> pi(-)pi(+) decays, Delta A(CP) A(CP)(K-K+) - A(CP)(pi(-)pi(+)), is measured to be -0.10 +/- 0.08(stat) +/- 0.03(syst)%. This is the most precise measurement of a time-integrated CP asymmetry in the charm sector from a single experiment.
A
bstract
Associated production of bottomonia and open charm hadrons in pp collisions at
s
=
7
and 8 TeV is observed using data corresponding to an integrated luminosity of 3 fb
−1
accumulated with ...the LHCb detector. The observation of five combinations, Y(1S)D
0
, Y(2S)D
0
, Y(1S)D
+
, Y(2S)D
+
and Y(1S)D
s
+
, is reported. Production crosssections are measured for Y(1S)D
0
and Y(1S)D
+
pairs in the forward region. The measured cross-sections and the differential distributions indicate the dominance of double parton scattering as the main production mechanism.