-
Is Tokenization Needed for Masked Particle Modelling?
Authors:
Matthew Leigh,
Samuel Klein,
Fran�ois Charton,
Tobias Golling,
Lukas Heinrich,
Michael Kagan,
In�s Ochoa,
Margarita Osadchy
Abstract:
In this work, we significantly enhance masked particle modeling (MPM), a self-supervised learning scheme for constructing highly expressive representations of unordered sets relevant to developing foundation models for high-energy physics. In MPM, a model is trained to recover the missing elements of a set, a learning objective that requires no labels and can be applied directly to experimental da…
▽ More
In this work, we significantly enhance masked particle modeling (MPM), a self-supervised learning scheme for constructing highly expressive representations of unordered sets relevant to developing foundation models for high-energy physics. In MPM, a model is trained to recover the missing elements of a set, a learning objective that requires no labels and can be applied directly to experimental data. We achieve significant performance improvements over previous work on MPM by addressing inefficiencies in the implementation and incorporating a more powerful decoder. We compare several pre-training tasks and introduce new reconstruction methods that utilize conditional generative models without data tokenization or discretization. We show that these new methods outperform the tokenized learning objective from the original MPM on a new test bed for foundation models for jets, which includes using a wide variety of downstream tasks relevant to jet physics, such as classification, secondary vertex finding, and track identification.
△ Less
Submitted 1 October, 2024; v1 submitted 19 September, 2024;
originally announced September 2024.
-
RODEM Jet Datasets
Authors:
Knut Zoch,
John Andrew Raine,
Debajyoti Sengupta,
Tobias Golling
Abstract:
We present the RODEM Jet Datasets, a comprehensive collection of simulated large-radius jets designed to support the development and evaluation of machine-learning algorithms in particle physics. These datasets encompass a diverse range of jet sources, including quark/gluon jets, jets from the decay of W bosons, top quarks, and heavy new-physics particles. The datasets provide detailed substructur…
▽ More
We present the RODEM Jet Datasets, a comprehensive collection of simulated large-radius jets designed to support the development and evaluation of machine-learning algorithms in particle physics. These datasets encompass a diverse range of jet sources, including quark/gluon jets, jets from the decay of W bosons, top quarks, and heavy new-physics particles. The datasets provide detailed substructure information, including jet kinematics, constituent kinematics, and track displacement details, enabling a wide range of applications in jet tagging, anomaly detection, and generative modelling.
△ Less
Submitted 21 August, 2024;
originally announced August 2024.
-
Accelerating template generation in resonant anomaly detection searches with optimal transport
Authors:
Matthew Leigh,
Debajyoti Sengupta,
Benjamin Nachman,
Tobias Golling
Abstract:
We introduce Resonant Anomaly Detection with Optimal Transport (RAD-OT), a method for generating signal templates in resonant anomaly detection searches. RAD-OT leverages the fact that the conditional probability density of the target features vary approximately linearly along the optimal transport path connecting the resonant feature. This does not assume that the conditional density itself is li…
▽ More
We introduce Resonant Anomaly Detection with Optimal Transport (RAD-OT), a method for generating signal templates in resonant anomaly detection searches. RAD-OT leverages the fact that the conditional probability density of the target features vary approximately linearly along the optimal transport path connecting the resonant feature. This does not assume that the conditional density itself is linear with the resonant feature, allowing RAD-OT to efficiently capture multimodal relationships, changes in resolution, etc. By solving the optimal transport problem, RAD-OT can quickly build a template by interpolating between the background distributions in two sideband regions. We demonstrate the performance of RAD-OT using the LHC Olympics R\&D dataset, where we find comparable sensitivity and improved stability with respect to deep learning-based approaches.
△ Less
Submitted 29 July, 2024;
originally announced July 2024.
-
PIPPIN: Generating variable length full events from partons
Authors:
Guillaume Qu�tant,
John Andrew Raine,
Matthew Leigh,
Debajyoti Sengupta,
Tobias Golling
Abstract:
This paper presents a novel approach for directly generating full events at detector-level from parton-level information, leveraging cutting-edge machine learning techniques. To address the challenge of multiplicity variations between parton and reconstructed object spaces, we employ transformers, score-based models and normalizing flows. Our method tackles the inherent complexities of the stochas…
▽ More
This paper presents a novel approach for directly generating full events at detector-level from parton-level information, leveraging cutting-edge machine learning techniques. To address the challenge of multiplicity variations between parton and reconstructed object spaces, we employ transformers, score-based models and normalizing flows. Our method tackles the inherent complexities of the stochastic transition between these two spaces and achieves remarkably accurate results. The combination of innovative techniques and the achieved accuracy demonstrates the potential of our approach in advancing the field and opens avenues for further exploration. This research contributes to the ongoing efforts in high-energy physics and generative modelling, providing a promising direction for enhanced precision in fast detector simulation.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
SkyCURTAINs: Model agnostic search for Stellar Streams with Gaia data
Authors:
Debajyoti Sengupta,
Stephen Mulligan,
David Shih,
John Andrew Raine,
Tobias Golling
Abstract:
We present SkyCURTAINs, a data driven and model agnostic method to search for stellar streams in the Milky Way galaxy using data from the Gaia telescope. SkyCURTAINs is a weakly supervised machine learning algorithm that builds a background enriched template in the signal region by leveraging the correlation of the source's characterising features with their proper motion in the sky. This allows f…
▽ More
We present SkyCURTAINs, a data driven and model agnostic method to search for stellar streams in the Milky Way galaxy using data from the Gaia telescope. SkyCURTAINs is a weakly supervised machine learning algorithm that builds a background enriched template in the signal region by leveraging the correlation of the source's characterising features with their proper motion in the sky. This allows for a more representative template of the background in the signal region, and reduces the false positives in the search for stellar streams. The minimal model assumptions in the SkyCURTAINs method allow for a flexible and efficient search for various kinds of anomalies such as streams, globular clusters, or dwarf galaxies directly from the data. We test the performance of SkyCURTAINs on the GD-1 stream and show that it is able to recover the stream with a purity of 75.4% which is an improvement of over 10% over existing machine learning based methods while retaining a signal efficiency of 37.9%.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Cluster Scanning: a novel approach to resonance searches
Authors:
Ivan Oleksiyuk,
John Andrew Raine,
Michael Kr�mer,
Svyatoslav Voloshynovskiy,
Tobias Golling
Abstract:
We propose a new model-independent method for new physics searches called Cluster Scanning. It uses the k-means algorithm to perform clustering in the space of low-level event or jet observables, and separates potentially anomalous clusters to construct a signal-enriched region. The spectra of a selected observable (e.g. invariant mass) in these two regions are then used to determine whether a res…
▽ More
We propose a new model-independent method for new physics searches called Cluster Scanning. It uses the k-means algorithm to perform clustering in the space of low-level event or jet observables, and separates potentially anomalous clusters to construct a signal-enriched region. The spectra of a selected observable (e.g. invariant mass) in these two regions are then used to determine whether a resonant signal is present. A pseudo-analysis on the LHC Olympics dataset with a $Z'$ resonance shows that Cluster Scanning outperforms the widely used 4-parameter functional background fitting procedures, reducing the number of signal events needed to reach a $3σ$ significant access by a factor of 0.61. Emphasis is placed on the speed of the method, which allows the test statistic to be calibrated on synthetic data.
△ Less
Submitted 21 May, 2024; v1 submitted 27 February, 2024;
originally announced February 2024.
-
Masked Particle Modeling on Sets: Towards Self-Supervised High Energy Physics Foundation Models
Authors:
Tobias Golling,
Lukas Heinrich,
Michael Kagan,
Samuel Klein,
Matthew Leigh,
Margarita Osadchy,
John Andrew Raine
Abstract:
We propose masked particle modeling (MPM) as a self-supervised method for learning generic, transferable, and reusable representations on unordered sets of inputs for use in high energy physics (HEP) scientific data. This work provides a novel scheme to perform masked modeling based pre-training to learn permutation invariant functions on sets. More generally, this work provides a step towards bui…
▽ More
We propose masked particle modeling (MPM) as a self-supervised method for learning generic, transferable, and reusable representations on unordered sets of inputs for use in high energy physics (HEP) scientific data. This work provides a novel scheme to perform masked modeling based pre-training to learn permutation invariant functions on sets. More generally, this work provides a step towards building large foundation models for HEP that can be generically pre-trained with self-supervised learning and later fine-tuned for a variety of down-stream tasks. In MPM, particles in a set are masked and the training objective is to recover their identity, as defined by a discretized token representation of a pre-trained vector quantized variational autoencoder. We study the efficacy of the method in samples of high energy jets at collider physics experiments, including studies on the impact of discretization, permutation invariance, and ordering. We also study the fine-tuning capability of the model, showing that it can be adapted to tasks such as supervised and weakly supervised jet classification, and that the model can transfer efficiently with small fine-tuning data sets to new classes and new data domains.
△ Less
Submitted 11 July, 2024; v1 submitted 24 January, 2024;
originally announced January 2024.
-
Improving new physics searches with diffusion models for event observables and jet constituents
Authors:
Debajyoti Sengupta,
Matthew Leigh,
John Andrew Raine,
Samuel Klein,
Tobias Golling
Abstract:
We introduce a new technique called Drapes to enhance the sensitivity in searches for new physics at the LHC. By training diffusion models on side-band data, we show how background templates for the signal region can be generated either directly from noise, or by partially applying the diffusion process to existing data. In the partial diffusion case, data can be drawn from side-band regions, with…
▽ More
We introduce a new technique called Drapes to enhance the sensitivity in searches for new physics at the LHC. By training diffusion models on side-band data, we show how background templates for the signal region can be generated either directly from noise, or by partially applying the diffusion process to existing data. In the partial diffusion case, data can be drawn from side-band regions, with the inverse diffusion performed for new target conditional values, or from the signal region, preserving the distribution over the conditional property that defines the signal region. We apply this technique to the hunt for resonances using the LHCO di-jet dataset, and achieve state-of-the-art performance for background template generation using high level input features. We also show how Drapes can be applied to low level inputs with jet constituents, reducing the model dependence on the choice of input observables. Using jet constituents we can further improve sensitivity to the signal process, but observe a loss in performance where the signal significance before applying any selection is below 4$σ$.
△ Less
Submitted 19 December, 2023; v1 submitted 15 December, 2023;
originally announced December 2023.
-
EPiC-ly Fast Particle Cloud Generation with Flow-Matching and Diffusion
Authors:
Erik Buhmann,
Cedric Ewen,
Darius A. Faroughy,
Tobias Golling,
Gregor Kasieczka,
Matthew Leigh,
Guillaume Quétant,
John Andrew Raine,
Debajyoti Sengupta,
David Shih
Abstract:
Jets at the LHC, typically consisting of a large number of highly correlated particles, are a fascinating laboratory for deep generative modeling. In this paper, we present two novel methods that generate LHC jets as point clouds efficiently and accurately. We introduce \epcjedi, which combines score-matching diffusion models with the Equivariant Point Cloud (EPiC) architecture based on the deep s…
▽ More
Jets at the LHC, typically consisting of a large number of highly correlated particles, are a fascinating laboratory for deep generative modeling. In this paper, we present two novel methods that generate LHC jets as point clouds efficiently and accurately. We introduce \epcjedi, which combines score-matching diffusion models with the Equivariant Point Cloud (EPiC) architecture based on the deep sets framework. This model offers a much faster alternative to previous transformer-based diffusion models without reducing the quality of the generated jets. In addition, we introduce \epcfm, the first permutation equivariant continuous normalizing flow (CNF) for particle cloud generation. This model is trained with {\it flow-matching}, a scalable and easy-to-train objective based on optimal transport that directly regresses the vector fields connecting the Gaussian noise prior to the data distribution. Our experiments demonstrate that \epcjedi and \epcfm both achieve state-of-the-art performance on the top-quark JetNet datasets whilst maintaining fast generation speed. Most notably, we find that the \epcfm model consistently outperforms all the other generative models considered here across every metric. Finally, we also introduce two new particle cloud performance metrics: the first based on the Kullback-Leibler divergence between feature distributions, the second is the negative log-posterior of a multi-model ParticleNet classifier.
△ Less
Submitted 29 September, 2023;
originally announced October 2023.
-
Flows for Flows: Morphing one Dataset into another with Maximum Likelihood Estimation
Authors:
Tobias Golling,
Samuel Klein,
Radha Mastandrea,
Benjamin Nachman,
John Andrew Raine
Abstract:
Many components of data analysis in high energy physics and beyond require morphing one dataset into another. This is commonly solved via reweighting, but there are many advantages of preserving weights and shifting the data points instead. Normalizing flows are machine learning models with impressive precision on a variety of particle physics tasks. Naively, normalizing flows cannot be used for m…
▽ More
Many components of data analysis in high energy physics and beyond require morphing one dataset into another. This is commonly solved via reweighting, but there are many advantages of preserving weights and shifting the data points instead. Normalizing flows are machine learning models with impressive precision on a variety of particle physics tasks. Naively, normalizing flows cannot be used for morphing because they require knowledge of the probability density of the starting dataset. In most cases in particle physics, we can generate more examples, but we do not know densities explicitly. We propose a protocol called flows for flows for training normalizing flows to morph one dataset into another even if the underlying probability density of neither dataset is known explicitly. This enables a morphing strategy trained with maximum likelihood estimation, a setup that has been shown to be highly effective in related tasks. We study variations on this protocol to explore how far the data points are moved to statistically match the two datasets. Furthermore, we show how to condition the learned flows on particular features in order to create a morphing function for every value of the conditioning feature. For illustration, we demonstrate flows for flows for toy examples as well as a collider physics example involving dijet events
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
The Interplay of Machine Learning--based Resonant Anomaly Detection Methods
Authors:
Tobias Golling,
Gregor Kasieczka,
Claudius Krause,
Radha Mastandrea,
Benjamin Nachman,
John Andrew Raine,
Debajyoti Sengupta,
David Shih,
Manuel Sommerhalder
Abstract:
Machine learning--based anomaly detection (AD) methods are promising tools for extending the coverage of searches for physics beyond the Standard Model (BSM). One class of AD methods that has received significant attention is resonant anomaly detection, where the BSM is assumed to be localized in at least one known variable. While there have been many methods proposed to identify such a BSM signal…
▽ More
Machine learning--based anomaly detection (AD) methods are promising tools for extending the coverage of searches for physics beyond the Standard Model (BSM). One class of AD methods that has received significant attention is resonant anomaly detection, where the BSM is assumed to be localized in at least one known variable. While there have been many methods proposed to identify such a BSM signal that make use of simulated or detected data in different ways, there has not yet been a study of the methods' complementarity. To this end, we address two questions. First, in the absence of any signal, do different methods pick the same events as signal-like? If not, then we can significantly reduce the false-positive rate by comparing different methods on the same dataset. Second, if there is a signal, are different methods fully correlated? Even if their maximum performance is the same, since we do not know how much signal is present, it may be beneficial to combine approaches. Using the Large Hadron Collider (LHC) Olympics dataset, we provide quantitative answers to these questions. We find that there are significant gains possible by combining multiple methods, which will strengthen the search program at the LHC and beyond.
△ Less
Submitted 14 March, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.
-
PC-Droid: Faster diffusion and improved quality for particle cloud generation
Authors:
Matthew Leigh,
Debajyoti Sengupta,
John Andrew Raine,
Guillaume Qu�tant,
Tobias Golling
Abstract:
Building on the success of PC-JeDi we introduce PC-Droid, a substantially improved diffusion model for the generation of jet particle clouds. By leveraging a new diffusion formulation, studying more recent integration solvers, and training on all jet types simultaneously, we are able to achieve state-of-the-art performance for all types of jets across all evaluation metrics. We study the trade-off…
▽ More
Building on the success of PC-JeDi we introduce PC-Droid, a substantially improved diffusion model for the generation of jet particle clouds. By leveraging a new diffusion formulation, studying more recent integration solvers, and training on all jet types simultaneously, we are able to achieve state-of-the-art performance for all types of jets across all evaluation metrics. We study the trade-off between generation speed and quality by comparing two attention based architectures, as well as the potential of consistency distillation to reduce the number of diffusion steps. Both the faster architecture and consistency models demonstrate performance surpassing many competing models, with generation time up to two orders of magnitude faster than PC-JeDi and three orders of magnitude faster than Delphes.
△ Less
Submitted 18 August, 2023; v1 submitted 13 July, 2023;
originally announced July 2023.
-
Decorrelation using Optimal Transport
Authors:
Malte Algren,
John Andrew Raine,
Tobias Golling
Abstract:
Being able to decorrelate a feature space from protected attributes is an area of active research and study in ethics, fairness, and also natural sciences. We introduce a novel decorrelation method using Convex Neural Optimal Transport Solvers (Cnots) that is able to decorrelate a continuous feature space against protected attributes with optimal transport. We demonstrate how well it performs in t…
▽ More
Being able to decorrelate a feature space from protected attributes is an area of active research and study in ethics, fairness, and also natural sciences. We introduce a novel decorrelation method using Convex Neural Optimal Transport Solvers (Cnots) that is able to decorrelate a continuous feature space against protected attributes with optimal transport. We demonstrate how well it performs in the context of jet classification in high energy physics, where classifier scores are desired to be decorrelated from the mass of a jet. The decorrelation achieved in binary classification approaches the levels achieved by the state-of-the-art using conditional normalising flows. When moving to multiclass outputs the optimal transport approach performs significantly better than the state-of-the-art, suggesting substantial gains at decorrelating multidimensional feature spaces.
△ Less
Submitted 14 July, 2023; v1 submitted 11 July, 2023;
originally announced July 2023.
-
$ν^2$-Flows: Fast and improved neutrino reconstruction in multi-neutrino final states with conditional normalizing flows
Authors:
John Andrew Raine,
Matthew Leigh,
Knut Zoch,
Tobias Golling
Abstract:
In this work we introduce $ν^2$-Flows, an extension of the $ν$-Flows method to final states containing multiple neutrinos. The architecture can natively scale for all combinations of object types and multiplicities in the final state for any desired neutrino multiplicities. In $t\bar{t}$ dilepton events, the momenta of both neutrinos and correlations between them are reconstructed more accurately…
▽ More
In this work we introduce $ν^2$-Flows, an extension of the $ν$-Flows method to final states containing multiple neutrinos. The architecture can natively scale for all combinations of object types and multiplicities in the final state for any desired neutrino multiplicities. In $t\bar{t}$ dilepton events, the momenta of both neutrinos and correlations between them are reconstructed more accurately than when using the most popular standard analytical techniques, and solutions are found for all events. Inference time is significantly faster than competing methods, and can be reduced further by evaluating in parallel on graphics processing units. We apply $ν^2$-Flows to $t\bar{t}$ dilepton events and show that the per-bin uncertainties in unfolded distributions is much closer to the limit of performance set by perfect neutrino reconstruction than standard techniques. For the chosen double differential observables $ν^2$-Flows results in improved statistical precision for each bin by a factor of 1.5 to 2 in comparison to the Neutrino Weighting method and up to a factor of four in comparison to the Ellipse approach.
△ Less
Submitted 15 December, 2023; v1 submitted 5 July, 2023;
originally announced July 2023.
-
CURTAINs Flows For Flows: Constructing Unobserved Regions with Maximum Likelihood Estimation
Authors:
Debajyoti Sengupta,
Samuel Klein,
John Andrew Raine,
Tobias Golling
Abstract:
Model independent techniques for constructing background data templates using generative models have shown great promise for use in searches for new physics processes at the LHC. We introduce a major improvement to the CURTAINs method by training the conditional normalizing flow between two side-band regions using maximum likelihood estimation instead of an optimal transport loss. The new training…
▽ More
Model independent techniques for constructing background data templates using generative models have shown great promise for use in searches for new physics processes at the LHC. We introduce a major improvement to the CURTAINs method by training the conditional normalizing flow between two side-band regions using maximum likelihood estimation instead of an optimal transport loss. The new training objective improves the robustness and fidelity of the transformed data and is much faster and easier to train.
We compare the performance against the previous approach and the current state of the art using the LHC Olympics anomaly detection dataset, where we see a significant improvement in sensitivity over the original CURTAINs method. Furthermore, CURTAINsF4F requires substantially less computational resources to cover a large number of signal regions than other fully data driven approaches. When using an efficient configuration, an order of magnitude more models can be trained in the same time required for ten signal regions, without a significant drop in performance.
△ Less
Submitted 8 May, 2023;
originally announced May 2023.
-
Flow Away your Differences: Conditional Normalizing Flows as an Improvement to Reweighting
Authors:
Malte Algren,
Tobias Golling,
Manuel Guth,
Chris Pollard,
John Andrew Raine
Abstract:
We present an alternative to reweighting techniques for modifying distributions to account for a desired change in an underlying conditional distribution, as is often needed to correct for mis-modelling in a simulated sample. We employ conditional normalizing flows to learn the full conditional probability distribution from which we sample new events for conditional values drawn from the target di…
▽ More
We present an alternative to reweighting techniques for modifying distributions to account for a desired change in an underlying conditional distribution, as is often needed to correct for mis-modelling in a simulated sample. We employ conditional normalizing flows to learn the full conditional probability distribution from which we sample new events for conditional values drawn from the target distribution to produce the desired, altered distribution. In contrast to common reweighting techniques, this procedure is independent of binning choice and does not rely on an estimate of the density ratio between two distributions.
In several toy examples we show that normalizing flows outperform reweighting approaches to match the distribution of the target.We demonstrate that the corrected distribution closes well with the ground truth, and a statistical uncertainty on the training dataset can be ascertained with bootstrapping. In our examples, this leads to a statistical precision up to three times greater than using reweighting techniques with identical sample sizes for the source and target distributions. We also explore an application in the context of high energy particle physics.
△ Less
Submitted 28 April, 2023;
originally announced April 2023.
-
The Mass-ive Issue: Anomaly Detection in Jet Physics
Authors:
Tobias Golling,
Takuya Nobe,
Dimitrios Proios,
John Andrew Raine,
Debajyoti Sengupta,
Slava Voloshynovskiy,
Jean-Francois Arguin,
Julien Leissner Martin,
Jacinthe Pilette,
Debottam Bakshi Gupta,
Amir Farbin
Abstract:
In the hunt for new and unobserved phenomena in particle physics, attention has turned in recent years to using advanced machine learning techniques for model independent searches. In this paper we highlight the main challenge of applying anomaly detection to jet physics, where preserving an unbiased estimator of the jet mass remains a critical piece of any model independent search. Using Variatio…
▽ More
In the hunt for new and unobserved phenomena in particle physics, attention has turned in recent years to using advanced machine learning techniques for model independent searches. In this paper we highlight the main challenge of applying anomaly detection to jet physics, where preserving an unbiased estimator of the jet mass remains a critical piece of any model independent search. Using Variational Autoencoders and multiple industry-standard anomaly detection metrics, we demonstrate the unavoidable nature of this problem.
△ Less
Submitted 24 March, 2023;
originally announced March 2023.
-
Topological Reconstruction of Particle Physics Processes using Graph Neural Networks
Authors:
Lukas Ehrke,
John Andrew Raine,
Knut Zoch,
Manuel Guth,
Tobias Golling
Abstract:
We present a new approach, the Topograph, which reconstructs underlying physics processes, including the intermediary particles, by leveraging underlying priors from the nature of particle physics decays and the flexibility of message passing graph neural networks. The Topograph not only solves the combinatoric assignment of observed final state objects, associating them to their original mother p…
▽ More
We present a new approach, the Topograph, which reconstructs underlying physics processes, including the intermediary particles, by leveraging underlying priors from the nature of particle physics decays and the flexibility of message passing graph neural networks. The Topograph not only solves the combinatoric assignment of observed final state objects, associating them to their original mother particles, but directly predicts the properties of intermediate particles in hard scatter processes and their subsequent decays. In comparison to standard combinatoric approaches or modern approaches using graph neural networks, which scale exponentially or quadratically, the complexity of Topographs scales linearly with the number of reconstructed objects.
We apply Topographs to top quark pair production in the all hadronic decay channel, where we outperform the standard approach and match the performance of the state-of-the-art machine learning technique.
△ Less
Submitted 13 October, 2023; v1 submitted 24 March, 2023;
originally announced March 2023.
-
PC-JeDi: Diffusion for Particle Cloud Generation in High Energy Physics
Authors:
Matthew Leigh,
Debajyoti Sengupta,
Guillaume Quétant,
John Andrew Raine,
Knut Zoch,
Tobias Golling
Abstract:
In this paper, we present a new method to efficiently generate jets in High Energy Physics called PC-JeDi. This method utilises score-based diffusion models in conjunction with transformers which are well suited to the task of generating jets as particle clouds due to their permutation equivariance. PC-JeDi achieves competitive performance with current state-of-the-art methods across several metri…
▽ More
In this paper, we present a new method to efficiently generate jets in High Energy Physics called PC-JeDi. This method utilises score-based diffusion models in conjunction with transformers which are well suited to the task of generating jets as particle clouds due to their permutation equivariance. PC-JeDi achieves competitive performance with current state-of-the-art methods across several metrics that evaluate the quality of the generated jets. Although slower than other models, due to the large number of forward passes required by diffusion models, it is still substantially faster than traditional detailed simulation. Furthermore, PC-JeDi uses conditional generation to produce jets with a desired mass and transverse momentum for two different particles, top quarks and gluons.
△ Less
Submitted 21 February, 2024; v1 submitted 9 March, 2023;
originally announced March 2023.
-
FETA: Flow-Enhanced Transportation for Anomaly Detection
Authors:
Tobias Golling,
Samuel Klein,
Radha Mastandrea,
Benjamin Nachman
Abstract:
Resonant anomaly detection is a promising framework for model-independent searches for new particles. Weakly supervised resonant anomaly detection methods compare data with a potential signal against a template of the Standard Model (SM) background inferred from sideband regions. We propose a means to generate this background template that uses a flow-based model to create a mapping between high-f…
▽ More
Resonant anomaly detection is a promising framework for model-independent searches for new particles. Weakly supervised resonant anomaly detection methods compare data with a potential signal against a template of the Standard Model (SM) background inferred from sideband regions. We propose a means to generate this background template that uses a flow-based model to create a mapping between high-fidelity SM simulations and the data. The flow is trained in sideband regions with the signal region blinded, and the flow is conditioned on the resonant feature (mass) such that it can be interpolated into the signal region. To illustrate this approach, we use simulated collisions from the Large Hadron Collider (LHC) Olympics Dataset. We find that our flow-constructed background method has competitive sensitivity with other recent proposals and can therefore provide complementary information to improve future searches.
△ Less
Submitted 14 June, 2023; v1 submitted 21 December, 2022;
originally announced December 2022.
-
Flows for Flows: Training Normalizing Flows Between Arbitrary Distributions with Maximum Likelihood Estimation
Authors:
Samuel Klein,
John Andrew Raine,
Tobias Golling
Abstract:
Normalizing flows are constructed from a base distribution with a known density and a diffeomorphism with a tractable Jacobian. The base density of a normalizing flow can be parameterised by a different normalizing flow, thus allowing maps to be found between arbitrary distributions. We demonstrate and explore the utility of this approach and show it is particularly interesting in the case of cond…
▽ More
Normalizing flows are constructed from a base distribution with a known density and a diffeomorphism with a tractable Jacobian. The base density of a normalizing flow can be parameterised by a different normalizing flow, thus allowing maps to be found between arbitrary distributions. We demonstrate and explore the utility of this approach and show it is particularly interesting in the case of conditional normalizing flows and for introducing optimal transport constraints on maps that are constructed using normalizing flows.
△ Less
Submitted 4 November, 2022;
originally announced November 2022.
-
Decorrelation with conditional normalizing flows
Authors:
Samuel Klein,
Tobias Golling
Abstract:
The sensitivity of many physics analyses can be enhanced by constructing discriminants that preferentially select signal events. Such discriminants become much more useful if they are uncorrelated with a set of protected attributes. In this paper we show that a normalizing flow conditioned on the protected attributes can be used to find a decorrelated representation for any discriminant. As a norm…
▽ More
The sensitivity of many physics analyses can be enhanced by constructing discriminants that preferentially select signal events. Such discriminants become much more useful if they are uncorrelated with a set of protected attributes. In this paper we show that a normalizing flow conditioned on the protected attributes can be used to find a decorrelated representation for any discriminant. As a normalizing flow is invertible the separation power of the resulting discriminant will be unchanged at any fixed value of the protected attributes. We demonstrate the efficacy of our approach by building supervised jet taggers that produce almost no sculpting in the mass distribution of the background.
△ Less
Submitted 15 December, 2022; v1 submitted 4 November, 2022;
originally announced November 2022.
-
ν-Flows: Conditional Neutrino Regression
Authors:
Matthew Leigh,
John Andrew Raine,
Knut Zoch,
Tobias Golling
Abstract:
We present $ν$-Flows, a novel method for restricting the likelihood space of neutrino kinematics in high energy collider experiments using conditional normalizing flows and deep invertible neural networks. This method allows the recovery of the full neutrino momentum which is usually left as a free parameter and permits one to sample neutrino values under a learned conditional likelihood given eve…
▽ More
We present $ν$-Flows, a novel method for restricting the likelihood space of neutrino kinematics in high energy collider experiments using conditional normalizing flows and deep invertible neural networks. This method allows the recovery of the full neutrino momentum which is usually left as a free parameter and permits one to sample neutrino values under a learned conditional likelihood given event observations. We demonstrate the success of $ν$-Flows in a case study by applying it to simulated semileptonic $t\bar{t}$ events and show that it can lead to more accurate momentum reconstruction, particularly of the longitudinal coordinate. We also show that this has direct benefits in a downstream task of jet association, leading to an improvement of up to a factor of 1.41 compared to conventional methods.
△ Less
Submitted 22 June, 2023; v1 submitted 1 July, 2022;
originally announced July 2022.
-
Flowification: Everything is a Normalizing Flow
Authors:
Bálint Máté,
Samuel Klein,
Tobias Golling,
François Fleuret
Abstract:
The two key characteristics of a normalizing flow is that it is invertible (in particular, dimension preserving) and that it monitors the amount by which it changes the likelihood of data points as samples are propagated along the network. Recently, multiple generalizations of normalizing flows have been introduced that relax these two conditions. On the other hand, neural networks only perform a…
▽ More
The two key characteristics of a normalizing flow is that it is invertible (in particular, dimension preserving) and that it monitors the amount by which it changes the likelihood of data points as samples are propagated along the network. Recently, multiple generalizations of normalizing flows have been introduced that relax these two conditions. On the other hand, neural networks only perform a forward pass on the input, there is neither a notion of an inverse of a neural network nor is there one of its likelihood contribution. In this paper we argue that certain neural network architectures can be enriched with a stochastic inverse pass and that their likelihood contribution can be monitored in a way that they fall under the generalized notion of a normalizing flow mentioned above. We term this enrichment flowification. We prove that neural networks only containing linear layers, convolutional layers and invertible activations such as LeakyReLU can be flowified and evaluate them in the generative setting on image datasets.
△ Less
Submitted 26 January, 2023; v1 submitted 30 May, 2022;
originally announced May 2022.
-
CURTAINs for your Sliding Window: Constructing Unobserved Regions by Transforming Adjacent Intervals
Authors:
John Andrew Raine,
Samuel Klein,
Debajyoti Sengupta,
Tobias Golling
Abstract:
We propose a new model independent technique for constructing background data templates for use in searches for new physics processes at the LHC. This method, called CURTAINs, uses invertible neural networks to parametrise the distribution of side band data as a function of the resonant observable. The network learns a transformation to map any data point from its value of the resonant observable…
▽ More
We propose a new model independent technique for constructing background data templates for use in searches for new physics processes at the LHC. This method, called CURTAINs, uses invertible neural networks to parametrise the distribution of side band data as a function of the resonant observable. The network learns a transformation to map any data point from its value of the resonant observable to another chosen value. Using CURTAINs, a template for the background data in the signal window is constructed by mapping the data from the side-bands into the signal region. We perform anomaly detection using the CURTAINs background template to enhance the sensitivity to new physics in a bump hunt. We demonstrate its performance in a sliding window search across a wide range of mass values. Using the LHC Olympics dataset, we demonstrate that CURTAINs matches the performance of other leading approaches which aim to improve the sensitivity of bump hunts, can be trained on a much smaller range of the invariant mass, and is fully data driven.
△ Less
Submitted 10 February, 2023; v1 submitted 17 March, 2022;
originally announced March 2022.
-
SUPA: A Lightweight Diagnostic Simulator for Machine Learning in Particle Physics
Authors:
Atul Kumar Sinha,
Daniele Paliotta,
Bálint Máté,
Sebastian Pina-Otey,
John A. Raine,
Tobias Golling,
François Fleuret
Abstract:
Deep learning methods have gained popularity in high energy physics for fast modeling of particle showers in detectors. Detailed simulation frameworks such as the gold standard Geant4 are computationally intensive, and current deep generative architectures work on discretized, lower resolution versions of the detailed simulation. The development of models that work at higher spatial resolutions is…
▽ More
Deep learning methods have gained popularity in high energy physics for fast modeling of particle showers in detectors. Detailed simulation frameworks such as the gold standard Geant4 are computationally intensive, and current deep generative architectures work on discretized, lower resolution versions of the detailed simulation. The development of models that work at higher spatial resolutions is currently hindered by the complexity of the full simulation data, and by the lack of simpler, more interpretable benchmarks. Our contribution is SUPA, the SUrrogate PArticle propagation simulator, an algorithm and software package for generating data by simulating simplified particle propagation, scattering and shower development in matter. The generation is extremely fast and easy to use compared to Geant4, but still exhibits the key characteristics and challenges of the detailed simulation. We support this claim experimentally by showing that performance of generative models on data from our simulator reflects the performance on a dataset generated with Geant4. The proposed simulator generates thousands of particle showers per second on a desktop machine, a speed up of up to 6 orders of magnitudes over Geant4, and stores detailed geometric information about the shower propagation. SUPA provides much greater flexibility for setting initial conditions and defining multiple benchmarks for the development of models. Moreover, interpreting particle showers as point clouds creates a connection to geometric machine learning and provides challenging and fundamentally new datasets for the field.
The code for SUPA is available at https://github.com/itsdaniele/SUPA.
△ Less
Submitted 21 October, 2022; v1 submitted 10 February, 2022;
originally announced February 2022.
-
Turbo-Sim: a generalised generative model with a physical latent space
Authors:
Guillaume Qu�tant,
Mariia Drozdova,
Vitaliy Kinakh,
Tobias Golling,
Slava Voloshynovskiy
Abstract:
We present Turbo-Sim, a generalised autoencoder framework derived from principles of information theory that can be used as a generative model. By maximising the mutual information between the input and the output of both the encoder and the decoder, we are able to rediscover the loss terms usually found in adversarial autoencoders and generative adversarial networks, as well as various more sophi…
▽ More
We present Turbo-Sim, a generalised autoencoder framework derived from principles of information theory that can be used as a generative model. By maximising the mutual information between the input and the output of both the encoder and the decoder, we are able to rediscover the loss terms usually found in adversarial autoencoders and generative adversarial networks, as well as various more sophisticated related models. Our generalised framework makes these models mathematically interpretable and allows for a diversity of new ones by setting the weight of each loss term separately. The framework is also independent of the intrinsic architecture of the encoder and the decoder thus leaving a wide choice for the building blocks of the whole network. We apply Turbo-Sim to a collider physics generation problem: the transformation of the properties of several particles from a theory space, right after the collision, to an observation space, right after the detection in an experiment.
△ Less
Submitted 21 December, 2021; v1 submitted 20 December, 2021;
originally announced December 2021.
-
Information-theoretic stochastic contrastive conditional GAN: InfoSCC-GAN
Authors:
Vitaliy Kinakh,
Mariia Drozdova,
Guillaume Qu�tant,
Tobias Golling,
Slava Voloshynovskiy
Abstract:
Conditional generation is a subclass of generative problems where the output of the generation is conditioned by the attribute information. In this paper, we present a stochastic contrastive conditional generative adversarial network (InfoSCC-GAN) with an explorable latent space. The InfoSCC-GAN architecture is based on an unsupervised contrastive encoder built on the InfoNCE paradigm, an attribut…
▽ More
Conditional generation is a subclass of generative problems where the output of the generation is conditioned by the attribute information. In this paper, we present a stochastic contrastive conditional generative adversarial network (InfoSCC-GAN) with an explorable latent space. The InfoSCC-GAN architecture is based on an unsupervised contrastive encoder built on the InfoNCE paradigm, an attribute classifier and an EigenGAN generator. We propose a novel training method, based on generator regularization using external or internal attributes every $n$-th iteration, using a pre-trained contrastive encoder and a pre-trained classifier. The proposed InfoSCC-GAN is derived based on an information-theoretic formulation of mutual information maximization between input data and latent space representation as well as latent space and generated data. Thus, we demonstrate a link between the training objective functions and the above information-theoretic formulation. The experimental results show that InfoSCC-GAN outperforms the "vanilla" EigenGAN in the image generation on AFHQ and CelebA datasets. In addition, we investigate the impact of discriminator architectures and loss functions by performing ablation studies. Finally, we demonstrate that thanks to the EigenGAN generator, the proposed framework enjoys a stochastic generation in contrast to vanilla deterministic GANs yet with the independent training of encoder, classifier, and generator in contrast to existing frameworks. Code, experimental results, and demos are available online at https://github.com/vkinakh/InfoSCC-GAN.
△ Less
Submitted 17 December, 2021;
originally announced December 2021.
-
Generation of data on discontinuous manifolds via continuous stochastic non-invertible networks
Authors:
Mariia Drozdova,
Vitaliy Kinakh,
Guillaume Qu�tant,
Tobias Golling,
Slava Voloshynovskiy
Abstract:
The generation of discontinuous distributions is a difficult task for most known frameworks such as generative autoencoders and generative adversarial networks. Generative non-invertible models are unable to accurately generate such distributions, require long training and often are subject to mode collapse. Variational autoencoders (VAEs), which are based on the idea of keeping the latent space t…
▽ More
The generation of discontinuous distributions is a difficult task for most known frameworks such as generative autoencoders and generative adversarial networks. Generative non-invertible models are unable to accurately generate such distributions, require long training and often are subject to mode collapse. Variational autoencoders (VAEs), which are based on the idea of keeping the latent space to be Gaussian for the sake of a simple sampling, allow an accurate reconstruction, while they experience significant limitations at generation task. In this work, instead of trying to keep the latent space to be Gaussian, we use a pre-trained contrastive encoder to obtain a clustered latent space. Then, for each cluster, representing a unimodal submanifold, we train a dedicated low complexity network to generate this submanifold from the Gaussian distribution. The proposed framework is based on the information-theoretic formulation of mutual information maximization between the input data and latent space representation. We derive a link between the cost functions and the information-theoretic formulation. We apply our approach to synthetic 2D distributions to demonstrate both reconstruction and generation of discontinuous distributions using continuous stochastic networks.
△ Less
Submitted 17 December, 2021;
originally announced December 2021.
-
Funnels: Exact maximum likelihood with dimensionality reduction
Authors:
Samuel Klein,
John A. Raine,
Sebastian Pina-Otey,
Slava Voloshynovskiy,
Tobias Golling
Abstract:
Normalizing flows are diffeomorphic, typically dimension-preserving, models trained using the likelihood of the model. We use the SurVAE framework to construct dimension reducing surjective flows via a new layer, known as the funnel. We demonstrate its efficacy on a variety of datasets, and show it improves upon or matches the performance of existing flows while having a reduced latent space size.…
▽ More
Normalizing flows are diffeomorphic, typically dimension-preserving, models trained using the likelihood of the model. We use the SurVAE framework to construct dimension reducing surjective flows via a new layer, known as the funnel. We demonstrate its efficacy on a variety of datasets, and show it improves upon or matches the performance of existing flows while having a reduced latent space size. The funnel layer can be constructed from a wide range of transformations including restricted convolution and feed forward layers.
△ Less
Submitted 15 December, 2021;
originally announced December 2021.
-
The Tracking Machine Learning challenge : Throughput phase
Authors:
Sabrina Amrouche,
Laurent Basara,
Paolo Calafiura,
Dmitry Emeliyanov,
Victor Estrade,
Steven Farrell,
C�cile Germain,
Vladimir Vava Gligorov,
Tobias Golling,
Sergey Gorbunov,
Heather Gray,
Isabelle Guyon,
Mikhail Hushchyn,
Vincenzo Innocente,
Moritz Kiehn,
Marcel Kunze,
Edward Moyse,
David Rousseau,
Andreas Salzburger,
Andrey Ustyuzhanin,
Jean-Roch Vlimant
Abstract:
This paper reports on the second "Throughput" phase of the Tracking Machine Learning (TrackML) challenge on the Codalab platform. As in the first "Accuracy" phase, the participants had to solve a difficult experimental problem linked to tracking accurately the trajectory of particles as e.g. created at the Large Hadron Collider (LHC): given O($10^5$) points, the participants had to connect them in…
▽ More
This paper reports on the second "Throughput" phase of the Tracking Machine Learning (TrackML) challenge on the Codalab platform. As in the first "Accuracy" phase, the participants had to solve a difficult experimental problem linked to tracking accurately the trajectory of particles as e.g. created at the Large Hadron Collider (LHC): given O($10^5$) points, the participants had to connect them into O($10^4$) individual groups that represent the particle trajectories which are approximated helical. While in the first phase only the accuracy mattered, the goal of this second phase was a compromise between the accuracy and the speed of inference. Both were measured on the Codalab platform where the participants had to upload their software. The best three participants had solutions with good accuracy and speed an order of magnitude faster than the state of the art when the challenge was designed. Although the core algorithms were less diverse than in the first phase, a diversity of techniques have been used and are described in this paper. The performance of the algorithms are analysed in depth and lessons derived.
△ Less
Submitted 14 May, 2021; v1 submitted 3 May, 2021;
originally announced May 2021.
-
Hashing and metric learning for charged particle tracking
Authors:
Sabrina Amrouche,
Moritz Kiehn,
Tobias Golling,
Andreas Salzburger
Abstract:
We propose a novel approach to charged particle tracking at high intensity particle colliders based on Approximate Nearest Neighbors search. With hundreds of thousands of measurements per collision to be reconstructed e.g. at the High Luminosity Large Hadron Collider, the currently employed combinatorial track finding approaches become inadequate. Here, we use hashing techniques to separate measur…
▽ More
We propose a novel approach to charged particle tracking at high intensity particle colliders based on Approximate Nearest Neighbors search. With hundreds of thousands of measurements per collision to be reconstructed e.g. at the High Luminosity Large Hadron Collider, the currently employed combinatorial track finding approaches become inadequate. Here, we use hashing techniques to separate measurements into buckets of 20-50 hits and increase their purity using metric learning. Two different approaches are studied to further resolve tracks inside buckets: Local Fisher Discriminant Analysis and Neural Networks for triplet similarity learning. We demonstrate the proposed approach on simulated collisions and show significant speed improvement with bucket tracking efficiency of 96% and a fake rate of 8% on unseen particle events.
△ Less
Submitted 16 January, 2021;
originally announced January 2021.
-
Variational Autoencoders for Anomalous Jet Tagging
Authors:
Taoli Cheng,
Jean-Fran�ois Arguin,
Julien Leissner-Martin,
Jacinthe Pilette,
Tobias Golling
Abstract:
We present a detailed study on Variational Autoencoders (VAEs) for anomalous jet tagging at the Large Hadron Collider. By taking in low-level jet constituents' information, and training with background QCD jets in an unsupervised manner, the VAE is able to encode important information for reconstructing jets, while learning an expressive posterior distribution in the latent space. When using the V…
▽ More
We present a detailed study on Variational Autoencoders (VAEs) for anomalous jet tagging at the Large Hadron Collider. By taking in low-level jet constituents' information, and training with background QCD jets in an unsupervised manner, the VAE is able to encode important information for reconstructing jets, while learning an expressive posterior distribution in the latent space. When using the VAE as an anomaly detector, we present different approaches to detect anomalies: directly comparing in the input space or, instead, working in the latent space. In order to facilitate general search approaches such as bump-hunt, mass-decorrelated VAEs based on distance correlation regularization are also studied. We find that the naive mass-decorrelated VAEs fail at maintaining proper detection performance, by assigning higher probabilities to some anomalous samples. To build a performant mass-decorrelated anomalous jet tagger, we propose the Outlier Exposed VAE (OE-VAE), for which some outlier samples are introduced in the training process to guide the learned information. OE-VAEs are employed to achieve two goals at the same time: increasing sensitivity of outlier detection and decorrelating jet mass from the anomaly score. We succeed in reaching excellent results from both aspects. Code implementation of this work can be found at https://github.com/taolicheng/VAE-Jet
△ Less
Submitted 29 November, 2022; v1 submitted 3 July, 2020;
originally announced July 2020.
-
MuPix and ATLASPix -- Architectures and Results
Authors:
A. Sch�ning,
J. Anders,
H. Augustin,
M. Benoit,
N. Berger,
S. Dittmeier,
F. Ehrler,
A. Fehr,
T. Golling,
S. Gonzalez Sevilla,
J. Hammerich,
A. Herkert,
L. Huth,
G. Iacobucci,
D. Immig,
M. Kiehn,
J. Kr�ger,
F. Meier,
A. Meneses Gonzalez,
A. Miucci,
L. O. S. Noehte,
I. Peric,
M. Prathapan,
T. Rudzki,
R. Schimassek
, et al. (7 additional authors not shown)
Abstract:
High Voltage Monolithic Active Pixel Sensors (HV-MAPS) are based on a commercial High Voltage CMOS process and collect charge by drift inside a reversely biased diode. HV-MAPS represent a promising technology for future pixel tracking detectors. Two recent developments are presented. The MuPix has a continuous readout and is being developed for the Mu3e experiment whereas the ATLASPix is being dev…
▽ More
High Voltage Monolithic Active Pixel Sensors (HV-MAPS) are based on a commercial High Voltage CMOS process and collect charge by drift inside a reversely biased diode. HV-MAPS represent a promising technology for future pixel tracking detectors. Two recent developments are presented. The MuPix has a continuous readout and is being developed for the Mu3e experiment whereas the ATLASPix is being developed for LHC applications with a triggered readout. Both variants have a fully monolithic design including state machines, clock circuitries and serial drivers. Several prototypes and design variants were characterised in the lab and in testbeam campaigns to measure efficiencies, noise, time resolution and radiation tolerance. Results from recent MuPix and ATLASPix prototypes are presented and prospects for future improvements are discussed.
△ Less
Submitted 17 February, 2020;
originally announced February 2020.
-
The Tracking Machine Learning challenge : Accuracy phase
Authors:
Sabrina Amrouche,
Laurent Basara,
Paolo Calafiura,
Victor Estrade,
Steven Farrell,
Diogo R. Ferreira,
Liam Finnie,
Nicole Finnie,
C�cile Germain,
Vladimir Vava Gligorov,
Tobias Golling,
Sergey Gorbunov,
Heather Gray,
Isabelle Guyon,
Mikhail Hushchyn,
Vincenzo Innocente,
Moritz Kiehn,
Edward Moyse,
Jean-Francois Puget,
Yuval Reina,
David Rousseau,
Andreas Salzburger,
Andrey Ustyuzhanin,
Jean-Roch Vlimant,
Johan Sokrates Wind
, et al. (2 additional authors not shown)
Abstract:
This paper reports the results of an experiment in high energy physics: using the power of the "crowd" to solve difficult experimental problems linked to tracking accurately the trajectory of particles in the Large Hadron Collider (LHC). This experiment took the form of a machine learning challenge organized in 2018: the Tracking Machine Learning Challenge (TrackML). Its results were discussed at…
▽ More
This paper reports the results of an experiment in high energy physics: using the power of the "crowd" to solve difficult experimental problems linked to tracking accurately the trajectory of particles in the Large Hadron Collider (LHC). This experiment took the form of a machine learning challenge organized in 2018: the Tracking Machine Learning Challenge (TrackML). Its results were discussed at the competition session at the Neural Information Processing Systems conference (NeurIPS 2018). Given 100.000 points, the participants had to connect them into about 10.000 arcs of circles, following the trajectory of particles issued from very high energy proton collisions. The competition was difficult with a dozen front-runners well ahead of a pack. The single competition score is shown to be accurate and effective in selecting the best algorithms from the domain point of view. The competition has exposed a diversity of approaches, with various roles for Machine Learning, a number of which are discussed in the document
△ Less
Submitted 3 May, 2021; v1 submitted 14 April, 2019;
originally announced April 2019.
-
Searching for long-lived particles beyond the Standard Model at the Large Hadron Collider
Authors:
Juliette Alimena,
James Beacham,
Martino Borsato,
Yangyang Cheng,
Xabier Cid Vidal,
Giovanna Cottin,
Albert De Roeck,
Nishita Desai,
David Curtin,
Jared A. Evans,
Simon Knapen,
Sabine Kraml,
Andre Lessa,
Zhen Liu,
Sascha Mehlhase,
Michael J. Ramsey-Musolf,
Heather Russell,
Jessie Shelton,
Brian Shuve,
Monica Verducci,
Jose Zurita,
Todd Adams,
Michael Adersberger,
Cristiano Alpigiani,
Artur Apresyan
, et al. (176 additional authors not shown)
Abstract:
Particles beyond the Standard Model (SM) can generically have lifetimes that are long compared to SM particles at the weak scale. When produced at experiments such as the Large Hadron Collider (LHC) at CERN, these long-lived particles (LLPs) can decay far from the interaction vertex of the primary proton-proton collision. Such LLP signatures are distinct from those of promptly decaying particles t…
▽ More
Particles beyond the Standard Model (SM) can generically have lifetimes that are long compared to SM particles at the weak scale. When produced at experiments such as the Large Hadron Collider (LHC) at CERN, these long-lived particles (LLPs) can decay far from the interaction vertex of the primary proton-proton collision. Such LLP signatures are distinct from those of promptly decaying particles that are targeted by the majority of searches for new physics at the LHC, often requiring customized techniques to identify, for example, significantly displaced decay vertices, tracks with atypical properties, and short track segments. Given their non-standard nature, a comprehensive overview of LLP signatures at the LHC is beneficial to ensure that possible avenues of the discovery of new physics are not overlooked. Here we report on the joint work of a community of theorists and experimentalists with the ATLAS, CMS, and LHCb experiments --- as well as those working on dedicated experiments such as MoEDAL, milliQan, MATHUSLA, CODEX-b, and FASER --- to survey the current state of LLP searches at the LHC, and to chart a path for the development of LLP searches into the future, both in the upcoming Run 3 and at the High-Luminosity LHC. The work is organized around the current and future potential capabilities of LHC experiments to generally discover new LLPs, and takes a signature-based approach to surveying classes of models that give rise to LLPs rather than emphasizing any particular theory motivation. We develop a set of simplified models; assess the coverage of current searches; document known, often unexpected backgrounds; explore the capabilities of proposed detector upgrades; provide recommendations for the presentation of search results; and look towards the newest frontiers, namely high-multiplicity "dark showers", highlighting opportunities for expanding the LHC reach for these signals.
△ Less
Submitted 11 March, 2019;
originally announced March 2019.
-
Charge collection characterisation with the Transient Current Technique of the ams H35DEMO CMOS detector after proton irradiation
Authors:
John Anders,
Mathieu Benoit,
Saverio Braccini,
Raimon Casanova,
Hucheng Chen,
Kai Chen,
Francesco Armando di Bello,
Armin Fehr,
Didier Ferrere,
Dean Forshaw,
Tobias Golling,
Sergio Gonzalez-Sevilla,
Giuseppe Iacobucci,
Moritz Kiehn,
Francesco Lanni,
Hongbin Liu,
Lingxin Meng,
Claudia Merlassino,
Antonio Miucci,
Marzio Nessi,
Ivan Perić,
Marco Rimoldi,
D M S Sultan,
Mateus Vincente Barreto Pinto,
Eva Vilella
, et al. (4 additional authors not shown)
Abstract:
This paper reports on the characterisation with Transient Current Technique measurements of the charge collection and depletion depth of a radiation-hard high-voltage CMOS pixel sensor produced at ams AG. Several substrate resistivities were tested before and after proton irradiation with two different sources: the 24 GeV Proton Synchrotron at CERN and the 16.7 MeV Cyclotron at Bern Inselspital.
This paper reports on the characterisation with Transient Current Technique measurements of the charge collection and depletion depth of a radiation-hard high-voltage CMOS pixel sensor produced at ams AG. Several substrate resistivities were tested before and after proton irradiation with two different sources: the 24 GeV Proton Synchrotron at CERN and the 16.7 MeV Cyclotron at Bern Inselspital.
△ Less
Submitted 25 July, 2018;
originally announced July 2018.
-
Machine Learning in High Energy Physics Community White Paper
Authors:
Kim Albertsson,
Piero Altoe,
Dustin Anderson,
John Anderson,
Michael Andrews,
Juan Pedro Araque Espinosa,
Adam Aurisano,
Laurent Basara,
Adrian Bevan,
Wahid Bhimji,
Daniele Bonacorsi,
Bjorn Burkle,
Paolo Calafiura,
Mario Campanelli,
Louis Capps,
Federico Carminati,
Stefano Carrazza,
Yi-fan Chen,
Taylor Childers,
Yann Coadou,
Elias Coniavitis,
Kyle Cranmer,
Claire David,
Douglas Davis,
Andrea De Simone
, et al. (103 additional authors not shown)
Abstract:
Machine learning has been applied to several problems in particle physics research, beginning with applications to high-level physics analysis in the 1990s and 2000s, followed by an explosion of applications in particle and event identification and reconstruction in the 2010s. In this document we discuss promising future research and development areas for machine learning in particle physics. We d…
▽ More
Machine learning has been applied to several problems in particle physics research, beginning with applications to high-level physics analysis in the 1990s and 2000s, followed by an explosion of applications in particle and event identification and reconstruction in the 2010s. In this document we discuss promising future research and development areas for machine learning in particle physics. We detail a roadmap for their implementation, software and hardware resource requirements, collaborative initiatives with the data science community, academia and industry, and training the particle physics community in data science. The main objective of the document is to connect and motivate these areas of research and development with the physics drivers of the High-Luminosity Large Hadron Collider and future neutrino experiments and identify the resource needs for their implementation. Additionally we identify areas where collaboration with external communities will be of great benefit.
△ Less
Submitted 16 May, 2019; v1 submitted 8 July, 2018;
originally announced July 2018.
-
Test beam measurement of ams H35 HV-CMOS capacitively coupled pixel sensor prototypes with high-resistivity substrate
Authors:
M. Benoit,
S. Braccini,
R. Casanova,
E. Cavallaro,
H. Chen,
K. Chen,
F. A. Di Bello,
D. Ferrere,
D. Frizzell,
T. Golling,
S. Gonzalez-Sevilla,
S. Grinstein,
G. Iacobucci,
M. Kiehn,
F. Lanni,
H. Liu,
J. Metcalfe,
L. Meng,
C. Merlassino,
A. Miucci,
D. Muenstermann,
M. Nessi,
H. Okawa,
I. Perić,
M. Rimoldi
, et al. (12 additional authors not shown)
Abstract:
In the context of the studies of the ATLAS High Luminosity LHC programme, radiation tolerant pixel detectors in CMOS technologies are investigated. To evaluate the effects of substrate resistivity on CMOS sensor performance, the H35DEMO demonstrator, containing different diode and amplifier designs, was produced in ams H35 HV-CMOS technology using four different substrate resistivities spanning fr…
▽ More
In the context of the studies of the ATLAS High Luminosity LHC programme, radiation tolerant pixel detectors in CMOS technologies are investigated. To evaluate the effects of substrate resistivity on CMOS sensor performance, the H35DEMO demonstrator, containing different diode and amplifier designs, was produced in ams H35 HV-CMOS technology using four different substrate resistivities spanning from $\mathrm{80}$ to $\mathrm{1000~Ω\cdot cm}$. A glueing process using a high-precision flip-chip machine was developed in order to capacitively couple the sensors to FE-I4 Readout ASIC using a thin layer of epoxy glue with good uniformity over a large surface. The resulting assemblies were measured in beam test at the Fermilab Test Beam Facilities with 120 GeV protons and CERN SPS H8 beamline using 80 GeV pions. The in-time efficiency and tracking properties measured for the different sensor types are shown to be compatible with the ATLAS ITk requirements for its pixel sensors.
△ Less
Submitted 3 December, 2018; v1 submitted 22 December, 2017;
originally announced December 2017.
-
Testbeam results of irradiated ams H18 HV-CMOS pixel sensor prototypes
Authors:
M. Benoit,
S. Braccini,
G. Casse,
H. Chen,
K. Chen,
F. A. Di Bello,
D. Ferrere,
T. Golling,
S. Gonzalez-Sevilla,
G. Iacobucci,
M. Kiehn,
F. Lanni,
H. Liu,
L. Meng,
C. Merlassino,
A. Miucci,
D. Muenstermann,
M. Nessi,
H. Okawa,
I. Peric,
M. Rimoldi,
B. Ristic,
M. Vicente Barrero Pinto,
J. Vossebeld,
M. Weber
, et al. (4 additional authors not shown)
Abstract:
HV-CMOS pixel sensors are a promising option for the tracker upgrade of the ATLAS experiment at the LHC, as well as for other future tracking applications in which large areas are to be instrumented with radiation-tolerant silicon pixel sensors. We present results of testbeam characterisations of the $4^{\mathrm{th}}$ generation of Capacitively Coupled Pixel Detectors (CCPDv4) produced with the am…
▽ More
HV-CMOS pixel sensors are a promising option for the tracker upgrade of the ATLAS experiment at the LHC, as well as for other future tracking applications in which large areas are to be instrumented with radiation-tolerant silicon pixel sensors. We present results of testbeam characterisations of the $4^{\mathrm{th}}$ generation of Capacitively Coupled Pixel Detectors (CCPDv4) produced with the ams H18 HV-CMOS process that have been irradiated with different particles (reactor neutrons and 18 MeV protons) to fluences between $1\cdot 10^{14}$ and $5\cdot 10^{15}$ 1-MeV-n$_\textrm{eq}$/cm$^2$. The sensors were glued to ATLAS FE-I4 pixel readout chips and measured at the CERN SPS H8 beamline using the FE-I4 beam telescope. Results for all fluences are very encouraging with all hit efficiencies being better than 97% for bias voltages of $85\,$V. The sample irradiated to a fluence of $1\cdot 10^{15}$ n$_\textrm{eq}$/cm$^2$ - a relevant value for a large volume of the upgraded tracker - exhibited 99.7% average hit efficiency. The results give strong evidence for the radiation tolerance of HV-CMOS sensors and their suitability as sensors for the experimental HL-LHC upgrades and future large-area silicon-based tracking detectors in high-radiation environments.
△ Less
Submitted 28 November, 2017; v1 submitted 8 November, 2016;
originally announced November 2016.
-
Physics at a 100 TeV pp collider: beyond the Standard Model phenomena
Authors:
T. Golling,
M. Hance,
P. Harris,
M. L. Mangano,
M. McCullough,
F. Moortgat,
P. Schwaller,
R. Torre,
P. Agrawal,
D. S. M. Alves,
S. Antusch,
A. Arbey,
B. Auerbach,
G. Bambhaniya,
M. Battaglia,
M. Bauer,
P. S. Bhupal Dev,
A. Boveia,
J. Bramante,
O. Buchmueller,
M. Buschmann,
J. Chakrabortty,
M. Chala,
S. Chekanov,
C. -Y. Chen
, et al. (89 additional authors not shown)
Abstract:
This report summarises the physics opportunities in the search and study of physics beyond the Standard Model at a 100 TeV pp collider.
This report summarises the physics opportunities in the search and study of physics beyond the Standard Model at a 100 TeV pp collider.
△ Less
Submitted 2 June, 2016;
originally announced June 2016.
-
Results of the 2015 testbeam of a 180 nm AMS High-Voltage CMOS sensor prototype
Authors:
M. Benoit,
J. Bilbao de Mendizabal,
G. Casse,
H. Chen,
K. Chen,
F. A. Di Bello,
D. Ferrere,
T. Golling,
S. Gonzalez-Sevilla,
G. Iacobucci,
F. Lanni,
H. Liu,
F. Meloni,
L. Meng,
A. Miucci,
D. Muenstermann,
M. Nessi,
I. Peric,
M. Rimoldi,
B. Ristic,
M. Vicente Barrero Pinto,
J. Vossebeld,
M. Weber,
W. Wu,
L. Xu
Abstract:
Active pixel sensors based on the High-Voltage CMOS technology are being investigated as a viable option for the future pixel tracker of the ATLAS experiment at the High-Luminosity LHC. This paper reports on the testbeam measurements performed at the H8 beamline of the CERN Super Proton Synchrotron on a High-Voltage CMOS sensor prototype produced in 180 nm AMS technology. Results in terms of track…
▽ More
Active pixel sensors based on the High-Voltage CMOS technology are being investigated as a viable option for the future pixel tracker of the ATLAS experiment at the High-Luminosity LHC. This paper reports on the testbeam measurements performed at the H8 beamline of the CERN Super Proton Synchrotron on a High-Voltage CMOS sensor prototype produced in 180 nm AMS technology. Results in terms of tracking efficiency and timing performance, for different threshold and bias conditions, are shown.
△ Less
Submitted 30 June, 2016; v1 submitted 24 March, 2016;
originally announced March 2016.
-
The FE-I4 Telescope for particle tracking in testbeam experiments
Authors:
M. Benoit,
J. Bilbao De Mendizabal,
F. A. Di Bello,
D. Ferrere,
T. Golling,
S. Gonzalez-Sevilla,
G. Iacobucci,
M. Kocian,
D. Muenstermann,
B. Ristic,
A. Sciuccati
Abstract:
A testbeam telescope, based on ATLAS IBL silicon pixel modules, has been built. It comprises six planes of planar silicon sensors with 250 x 50 um^2 pitch, read out by ATLAS FE-I4 chips. In the CERN SPS H8 beamline (180 GeV pi+) a resolution of better than 8 x 12 um^2 at the position of the device under test was achieved. The telescope reached a trigger rate of 6kHz with two measured devices. It i…
▽ More
A testbeam telescope, based on ATLAS IBL silicon pixel modules, has been built. It comprises six planes of planar silicon sensors with 250 x 50 um^2 pitch, read out by ATLAS FE-I4 chips. In the CERN SPS H8 beamline (180 GeV pi+) a resolution of better than 8 x 12 um^2 at the position of the device under test was achieved. The telescope reached a trigger rate of 6kHz with two measured devices. It is mainly designed for studies using FE-I4 based prototypes, but has also been successfully run with independent DAQ systems. Specialised trigger schemes ensure data synchronisation between these external devices and the telescope. A region-of-interest trigger can be formed by setting masks on the first and the last pixel sensor planes. The setup infrastructure provides centrally controlled and monitored high and low voltage power supplies, silicon oil cooling, temperature and humidity sensors and movable stages.
△ Less
Submitted 20 June, 2016; v1 submitted 24 March, 2016;
originally announced March 2016.
-
SUSY Simplified Models at 14, 33, and 100 TeV Proton Colliders
Authors:
Timothy Cohen,
Tobias Golling,
Mike Hance,
Anna Henrichs,
Kiel Howe,
Joshua Loyal,
Sanjay Padhi,
Jay G. Wacker
Abstract:
Results are presented for a variety of SUSY Simplified Models at the 14 TeV LHC as well as a 33 and 100 TeV proton collider. Our focus is on models whose signals are driven by colored production. We present projections of the upper limit and discovery reach in the gluino-neutralino (for both light and heavy flavor decays), squark-neutralino, and gluino-squark Simplified Model planes. Depending on…
▽ More
Results are presented for a variety of SUSY Simplified Models at the 14 TeV LHC as well as a 33 and 100 TeV proton collider. Our focus is on models whose signals are driven by colored production. We present projections of the upper limit and discovery reach in the gluino-neutralino (for both light and heavy flavor decays), squark-neutralino, and gluino-squark Simplified Model planes. Depending on the model a jets + MET, mono-jet, or same-sign di-lepton search is applied. The impact of pileup is explored. This study utilizes the Snowmass backgrounds and combined detector. Assuming 3000 fb^{-1} of integrated luminosity, a gluino that decays to light flavor quarks can be discovered below 2.3 TeV at the 14 TeV LHC and below 11 TeV at a 100 TeV machine.
△ Less
Submitted 14 May, 2014; v1 submitted 25 November, 2013;
originally announced November 2013.
-
Snowmass 2013 Top quark working group report
Authors:
K. Agashe,
R. Erbacher,
C. E. Gerber,
K. Melnikov,
R. Schwienhorst,
A. Mitov,
M. Vos,
S. Wimpenny,
J. Adelman,
M. Baumgart,
A. Garcia-Bellido,
A. Loginov,
A. Jung,
M. Schulze,
J. Shelton,
N. Craig,
M. Velasco,
T. Golling,
J. Hubisz,
A. Ivanov,
M. Perelstein,
S. Chekanov,
J. Dolen,
J. Pilot,
R. Pöschl
, et al. (145 additional authors not shown)
Abstract:
This report summarizes the work of the Energy Frontier Top Quark working group of the 2013 Community Summer Study (Snowmass).
This report summarizes the work of the Energy Frontier Top Quark working group of the 2013 Community Summer Study (Snowmass).
△ Less
Submitted 8 November, 2013;
originally announced November 2013.
-
New Particles Working Group Report of the Snowmass 2013 Community Summer Study
Authors:
Y. Gershtein,
M. Luty,
M. Narain,
L. -T. Wang,
D. Whiteson,
K. Agashe,
L. Apanasevich,
G. Artoni,
A. Avetisyan,
H. Baer,
C. Bartels,
M. Bauer,
D. Berge,
M. Berggren,
S. Bhattacharya,
K. Black,
T. Bose,
J. Brau,
R. Brock,
E. Brownson,
M. Cahill-Rowley,
A. Cakir,
A. Chaus,
T. Cohen,
B. Coleppa
, et al. (70 additional authors not shown)
Abstract:
This report summarizes the work of the Energy Frontier New Physics working group of the 2013 Community Summer Study (Snowmass).
This report summarizes the work of the Energy Frontier New Physics working group of the 2013 Community Summer Study (Snowmass).
△ Less
Submitted 1 November, 2013;
originally announced November 2013.
-
Charming the Higgs
Authors:
Cédric Delaunay,
Tobias Golling,
Gilad Perez,
Yotam Soreq
Abstract:
We show that current Higgs data permit a significantly enhanced Higgs coupling to charm pairs, comparable to the Higgs to bottom pairs coupling in the Standard Model, without resorting to additional new physics sources in Higgs production. With a mild level of the latter current data even allow for the Higgs to charm pairs to be the dominant decay channel. An immediate consequence of such a large…
▽ More
We show that current Higgs data permit a significantly enhanced Higgs coupling to charm pairs, comparable to the Higgs to bottom pairs coupling in the Standard Model, without resorting to additional new physics sources in Higgs production. With a mild level of the latter current data even allow for the Higgs to charm pairs to be the dominant decay channel. An immediate consequence of such a large charm coupling is a significant reduction of the Higgs signal strengths into the known final states as in particular into bottom pairs. This might reduce the visible vector-boson associated Higgs production rate to a level that could compromise the prospects of ever observing it. We however demonstrate that a significant fraction of this reduced signal can be recovered by jet-flavor-tagging targeted towards charm-flavored jets. Finally we argue that an enhanced Higgs to charm pairs coupling can be obtained in various new physics scenarios in the presence of only a mild accidental cancellation between various contributions.
△ Less
Submitted 25 October, 2013;
originally announced October 2013.
-
A Comparison of Future Proton Colliders Using SUSY Simplified Models: A Snowmass Whitepaper
Authors:
Timothy Cohen,
Tobias Golling,
Mike Hance,
Anna Henrichs,
Kiel Howe,
Joshua Loyal,
Sanjay Padhi,
Jay G. Wacker
Abstract:
We present a summary of results for SUSY Simplified Model searches at future proton colliders: the 14 TeV LHC as well as a 33 TeV proton collider and a 100 TeV proton collider. Upper limits and discovery significances are provided for the gluino-neutralino (for both light and heavy flavor decays), squark-neutralino, and gluino-squark Simplified Model planes. Events are processed with the Snowmass…
▽ More
We present a summary of results for SUSY Simplified Model searches at future proton colliders: the 14 TeV LHC as well as a 33 TeV proton collider and a 100 TeV proton collider. Upper limits and discovery significances are provided for the gluino-neutralino (for both light and heavy flavor decays), squark-neutralino, and gluino-squark Simplified Model planes. Events are processed with the Snowmass combined detector and Standard Model backgrounds are computed using the Snowmass samples. We place emphasis on comparisons between different collider scenarios, along with the lessons learned regarding the impact of systematic errors and pileup. More details are provided in a companion paper.
△ Less
Submitted 30 November, 2013; v1 submitted 30 September, 2013;
originally announced October 2013.
-
Warped Extra Dimensional Benchmarks for Snowmass 2013
Authors:
Kaustubh Agashe,
Oleg Antipin,
Mihailo Backović,
Aaron Effron,
Alex Emerman,
Johannes Erdmann,
Tobias Golling,
Shrihari Gopalakrishna,
Tuomas Hapola,
Shih-Chieh Hsu,
Jos� Juknevich,
Seung J. Lee,
Tanumoy Mandal,
August Miller,
Edward Moyse,
Tuhin Subhra Mukherjee,
Chris Pollard,
Soumya Sadhukhan,
Daniel Whiteson,
Stephane Willocq
Abstract:
The framework of a warped extra dimension with the Standard Model (SM) fields propagating in it is a very well-motivated extension of the SM since it can address both the Planck-weak and flavor hierarchy problems of the SM. We consider signals at the 14 and 33 TeV large hadron collider (LHC) resulting from the direct production of the new particles in this framework, i.e.,Kaluza-Klein (KK) excitat…
▽ More
The framework of a warped extra dimension with the Standard Model (SM) fields propagating in it is a very well-motivated extension of the SM since it can address both the Planck-weak and flavor hierarchy problems of the SM. We consider signals at the 14 and 33 TeV large hadron collider (LHC) resulting from the direct production of the new particles in this framework, i.e.,Kaluza-Klein (KK) excitations of the SM particles. We focus on spin-1 (gauge boson) and spin-2 (graviton) KK particles and their decays to top/bottom quarks (flavor-conserving) and W/Z and Higgs bosons, in particular. We propose two benchmarks for this purpose, with the right-handed (RH) or LH top quark, respectively, being localized very close to the TeV end of the extra dimension. We present some new results at the 14 TeV (with 300 fb$^-1$ and 3000 fb$^-1$) and 33 TeV LHC. We find that the prospects for discovery of these particles are quite promising, especially at the high-luminosity upgrade.
△ Less
Submitted 30 September, 2013;
originally announced September 2013.
-
LHC searches for physics beyond the Standard Model with top quarks
Authors:
Tobias Golling
Abstract:
Searches are presented for physics beyond the Standard Model involving top-quark and related signatures. The results are based on proton-proton collision data corresponding to integrated luminosities between 1 fb-1 and 5 fb-1 collected at a center-of-mass energy of 7 TeV with the ATLAS and CMS detectors at the Large Hadron Collider in 2011. The data are found to be consistent with the Standard Mod…
▽ More
Searches are presented for physics beyond the Standard Model involving top-quark and related signatures. The results are based on proton-proton collision data corresponding to integrated luminosities between 1 fb-1 and 5 fb-1 collected at a center-of-mass energy of 7 TeV with the ATLAS and CMS detectors at the Large Hadron Collider in 2011. The data are found to be consistent with the Standard Model. The non-observation of a signal is converted to limits at the 95% confidence level on the production cross section times branching ratio and on the masses of the hypothesized new particles for appropriate benchmark models.
△ Less
Submitted 1 February, 2013;
originally announced February 2013.