-
Applications of flow models to the generation of correlated lattice QCD ensembles
Authors:
Ryan Abbott,
Aleksandar Botev,
Denis Boyda,
Daniel C. Hackett,
Gurtej Kanwar,
Sébastien Racanière,
Danilo J. Rezende,
Fernando Romero-López,
Phiala E. Shanahan,
Julian M. Urban
Abstract:
Machine-learned normalizing flows can be used in the context of lattice quantum field theory to generate statistically correlated ensembles of lattice gauge fields at different action parameters. This work demonstrates how these correlations can be exploited for variance reduction in the computation of observables. Three different proof-of-concept applications are demonstrated using a novel residu…
▽ More
Machine-learned normalizing flows can be used in the context of lattice quantum field theory to generate statistically correlated ensembles of lattice gauge fields at different action parameters. This work demonstrates how these correlations can be exploited for variance reduction in the computation of observables. Three different proof-of-concept applications are demonstrated using a novel residual flow architecture: continuum limits of gauge theories, the mass dependence of QCD observables, and hadronic matrix elements based on the Feynman-Hellmann approach. In all three cases, it is shown that statistical uncertainties are significantly reduced when machine-learned flows are incorporated as compared with the same calculations performed with uncorrelated ensembles or direct reweighting.
△ Less
Submitted 28 May, 2024; v1 submitted 19 January, 2024;
originally announced January 2024.
-
Advances in machine-learning-based sampling motivated by lattice quantum chromodynamics
Authors:
Kyle Cranmer,
Gurtej Kanwar,
Sébastien Racanière,
Danilo J. Rezende,
Phiala E. Shanahan
Abstract:
Sampling from known probability distributions is a ubiquitous task in computational science, underlying calculations in domains from linguistics to biology and physics. Generative machine-learning (ML) models have emerged as a promising tool in this space, building on the success of this approach in applications such as image, text, and audio generation. Often, however, generative tasks in scienti…
▽ More
Sampling from known probability distributions is a ubiquitous task in computational science, underlying calculations in domains from linguistics to biology and physics. Generative machine-learning (ML) models have emerged as a promising tool in this space, building on the success of this approach in applications such as image, text, and audio generation. Often, however, generative tasks in scientific domains have unique structures and features -- such as complex symmetries and the requirement of exactness guarantees -- that present both challenges and opportunities for ML. This Perspective outlines the advances in ML-based sampling motivated by lattice quantum field theory, in particular for the theory of quantum chromodynamics. Enabling calculations of the structure and interactions of matter from our most fundamental understanding of particle physics, lattice quantum chromodynamics is one of the main consumers of open-science supercomputing worldwide. The design of ML algorithms for this application faces profound challenges, including the necessity of scaling custom ML architectures to the largest supercomputers, but also promises immense benefits, and is spurring a wave of development in ML-based sampling more broadly. In lattice field theory, if this approach can realize its early promise it will be a transformative step towards first-principles physics calculations in particle, nuclear and condensed matter physics that are intractable with traditional approaches.
△ Less
Submitted 3 September, 2023;
originally announced September 2023.
-
Normalizing flows for lattice gauge theory in arbitrary space-time dimension
Authors:
Ryan Abbott,
Michael S. Albergo,
Aleksandar Botev,
Denis Boyda,
Kyle Cranmer,
Daniel C. Hackett,
Gurtej Kanwar,
Alexander G. D. G. Matthews,
Sébastien Racanière,
Ali Razavi,
Danilo J. Rezende,
Fernando Romero-López,
Phiala E. Shanahan,
Julian M. Urban
Abstract:
Applications of normalizing flows to the sampling of field configurations in lattice gauge theory have so far been explored almost exclusively in two space-time dimensions. We report new algorithmic developments of gauge-equivariant flow architectures facilitating the generalization to higher-dimensional lattice geometries. Specifically, we discuss masked autoregressive transformations with tracta…
▽ More
Applications of normalizing flows to the sampling of field configurations in lattice gauge theory have so far been explored almost exclusively in two space-time dimensions. We report new algorithmic developments of gauge-equivariant flow architectures facilitating the generalization to higher-dimensional lattice geometries. Specifically, we discuss masked autoregressive transformations with tractable and unbiased Jacobian determinants, a key ingredient for scalable and asymptotically exact flow-based sampling algorithms. For concreteness, results from a proof-of-principle application to SU(3) lattice gauge theory in four space-time dimensions are reported.
△ Less
Submitted 3 May, 2023;
originally announced May 2023.
-
Aspects of scaling and scalability for flow-based sampling of lattice QCD
Authors:
Ryan Abbott,
Michael S. Albergo,
Aleksandar Botev,
Denis Boyda,
Kyle Cranmer,
Daniel C. Hackett,
Alexander G. D. G. Matthews,
Sébastien Racanière,
Ali Razavi,
Danilo J. Rezende,
Fernando Romero-López,
Phiala E. Shanahan,
Julian M. Urban
Abstract:
Recent applications of machine-learned normalizing flows to sampling in lattice field theory suggest that such methods may be able to mitigate critical slowing down and topological freezing. However, these demonstrations have been at the scale of toy models, and it remains to be determined whether they can be applied to state-of-the-art lattice quantum chromodynamics calculations. Assessing the vi…
▽ More
Recent applications of machine-learned normalizing flows to sampling in lattice field theory suggest that such methods may be able to mitigate critical slowing down and topological freezing. However, these demonstrations have been at the scale of toy models, and it remains to be determined whether they can be applied to state-of-the-art lattice quantum chromodynamics calculations. Assessing the viability of sampling algorithms for lattice field theory at scale has traditionally been accomplished using simple cost scaling laws, but as we discuss in this work, their utility is limited for flow-based approaches. We conclude that flow-based approaches to sampling are better thought of as a broad family of algorithms with different scaling properties, and that scalability must be assessed experimentally.
△ Less
Submitted 14 November, 2022;
originally announced November 2022.
-
Sampling QCD field configurations with gauge-equivariant flow models
Authors:
Ryan Abbott,
Michael S. Albergo,
Aleksandar Botev,
Denis Boyda,
Kyle Cranmer,
Daniel C. Hackett,
Gurtej Kanwar,
Alexander G. D. G. Matthews,
Sébastien Racanière,
Ali Razavi,
Danilo J. Rezende,
Fernando Romero-López,
Phiala E. Shanahan,
Julian M. Urban
Abstract:
Machine learning methods based on normalizing flows have been shown to address important challenges, such as critical slowing-down and topological freezing, in the sampling of gauge field configurations in simple lattice field theories. A critical question is whether this success will translate to studies of QCD. This Proceedings presents a status update on advances in this area. In particular, it…
▽ More
Machine learning methods based on normalizing flows have been shown to address important challenges, such as critical slowing-down and topological freezing, in the sampling of gauge field configurations in simple lattice field theories. A critical question is whether this success will translate to studies of QCD. This Proceedings presents a status update on advances in this area. In particular, it is illustrated how recently developed algorithmic components may be combined to construct flow-based sampling algorithms for QCD in four dimensions. The prospects and challenges for future use of this approach in at-scale applications are summarized.
△ Less
Submitted 20 August, 2022; v1 submitted 7 August, 2022;
originally announced August 2022.
-
Gauge-equivariant flow models for sampling in lattice field theories with pseudofermions
Authors:
Ryan Abbott,
Michael S. Albergo,
Denis Boyda,
Kyle Cranmer,
Daniel C. Hackett,
Gurtej Kanwar,
Sébastien Racanière,
Danilo J. Rezende,
Fernando Romero-López,
Phiala E. Shanahan,
Betsy Tian,
Julian M. Urban
Abstract:
This work presents gauge-equivariant architectures for flow-based sampling in fermionic lattice field theories using pseudofermions as stochastic estimators for the fermionic determinant. This is the default approach in state-of-the-art lattice field theory calculations, making this development critical to the practical application of flow models to theories such as QCD. Methods by which flow-base…
▽ More
This work presents gauge-equivariant architectures for flow-based sampling in fermionic lattice field theories using pseudofermions as stochastic estimators for the fermionic determinant. This is the default approach in state-of-the-art lattice field theory calculations, making this development critical to the practical application of flow models to theories such as QCD. Methods by which flow-based sampling approaches can be improved via standard techniques such as even/odd preconditioning and the Hasenbusch factorization are also outlined. Numerical demonstrations in two-dimensional U(1) and SU(3) gauge theories with $N_f=2$ flavors of fermions are provided.
△ Less
Submitted 16 October, 2022; v1 submitted 18 July, 2022;
originally announced July 2022.
-
Symmetry-Based Representations for Artificial and Biological General Intelligence
Authors:
Irina Higgins,
Sébastien Racanière,
Danilo Rezende
Abstract:
Biological intelligence is remarkable in its ability to produce complex behaviour in many diverse situations through data efficient, generalisable and transferable skill acquisition. It is believed that learning "good" sensory representations is important for enabling this, however there is little agreement as to what a good representation should look like. In this review article we are going to a…
▽ More
Biological intelligence is remarkable in its ability to produce complex behaviour in many diverse situations through data efficient, generalisable and transferable skill acquisition. It is believed that learning "good" sensory representations is important for enabling this, however there is little agreement as to what a good representation should look like. In this review article we are going to argue that symmetry transformations are a fundamental principle that can guide our search for what makes a good representation. The idea that there exist transformations (symmetries) that affect some aspects of the system but not others, and their relationship to conserved quantities has become central in modern physics, resulting in a more unified theoretical framework and even ability to predict the existence of new particles. Recently, symmetries have started to gain prominence in machine learning too, resulting in more data efficient and generalisable algorithms that can mimic some of the complex behaviours produced by biological intelligence. Finally, first demonstrations of the importance of symmetry transformations for representation learning in the brain are starting to arise in neuroscience. Taken together, the overwhelming positive effect that symmetries bring to these disciplines suggest that they may be an important general framework that determines the structure of the universe, constrains the nature of natural tasks and consequently shapes both biological and artificial intelligence.
△ Less
Submitted 17 March, 2022;
originally announced March 2022.
-
Flow-based sampling in the lattice Schwinger model at criticality
Authors:
Michael S. Albergo,
Denis Boyda,
Kyle Cranmer,
Daniel C. Hackett,
Gurtej Kanwar,
Sébastien Racanière,
Danilo J. Rezende,
Fernando Romero-López,
Phiala E. Shanahan,
Julian M. Urban
Abstract:
Recent results suggest that flow-based algorithms may provide efficient sampling of field distributions for lattice field theory applications, such as studies of quantum chromodynamics and the Schwinger model. In this work, we provide a numerical demonstration of robust flow-based sampling in the Schwinger model at the critical value of the fermion mass. In contrast, at the same parameters, conven…
▽ More
Recent results suggest that flow-based algorithms may provide efficient sampling of field distributions for lattice field theory applications, such as studies of quantum chromodynamics and the Schwinger model. In this work, we provide a numerical demonstration of robust flow-based sampling in the Schwinger model at the critical value of the fermion mass. In contrast, at the same parameters, conventional methods fail to sample all parts of configuration space, leading to severely underestimated uncertainties.
△ Less
Submitted 23 February, 2022;
originally announced February 2022.
-
Normalizing flows for atomic solids
Authors:
Peter Wirnsberger,
George Papamakarios,
Borja Ibarz,
Sébastien Racanière,
Andrew J. Ballard,
Alexander Pritzel,
Charles Blundell
Abstract:
We present a machine-learning approach, based on normalizing flows, for modelling atomic solids. Our model transforms an analytically tractable base distribution into the target solid without requiring ground-truth samples for training. We report Helmholtz free energy estimates for cubic and hexagonal ice modelled as monatomic water as well as for a truncated and shifted Lennard-Jones system, and…
▽ More
We present a machine-learning approach, based on normalizing flows, for modelling atomic solids. Our model transforms an analytically tractable base distribution into the target solid without requiring ground-truth samples for training. We report Helmholtz free energy estimates for cubic and hexagonal ice modelled as monatomic water as well as for a truncated and shifted Lennard-Jones system, and find them to be in excellent agreement with literature values and with estimates from established baseline methods. We further investigate structural properties and show that the model samples are nearly indistinguishable from the ones obtained with molecular dynamics. Our results thus demonstrate that normalizing flows can provide high-quality samples and free energy estimates without the need for multi-staging.
△ Less
Submitted 28 April, 2022; v1 submitted 16 November, 2021;
originally announced November 2021.
-
Implicit Riemannian Concave Potential Maps
Authors:
Danilo J. Rezende,
Sébastien Racanière
Abstract:
We are interested in the challenging problem of modelling densities on Riemannian manifolds with a known symmetry group using normalising flows. This has many potential applications in physical sciences such as molecular dynamics and quantum simulations. In this work we combine ideas from implicit neural layers and optimal transport theory to propose a generalisation of existing work on exponentia…
▽ More
We are interested in the challenging problem of modelling densities on Riemannian manifolds with a known symmetry group using normalising flows. This has many potential applications in physical sciences such as molecular dynamics and quantum simulations. In this work we combine ideas from implicit neural layers and optimal transport theory to propose a generalisation of existing work on exponential map flows, Implicit Riemannian Concave Potential Maps, IRCPMs. IRCPMs have some nice properties such as simplicity of incorporating symmetries and are less expensive than ODE-flows. We provide an initial theoretical analysis of its properties and layout sufficient conditions for stable optimisation. Finally, we illustrate the properties of IRCPMs with density estimation experiments on tori and spheres.
△ Less
Submitted 4 October, 2021;
originally announced October 2021.
-
Flow-based sampling for fermionic lattice field theories
Authors:
Michael S. Albergo,
Gurtej Kanwar,
Sébastien Racanière,
Danilo J. Rezende,
Julian M. Urban,
Denis Boyda,
Kyle Cranmer,
Daniel C. Hackett,
Phiala E. Shanahan
Abstract:
Algorithms based on normalizing flows are emerging as promising machine learning approaches to sampling complicated probability distributions in a way that can be made asymptotically exact. In the context of lattice field theory, proof-of-principle studies have demonstrated the effectiveness of this approach for scalar theories, gauge theories, and statistical systems. This work develops approache…
▽ More
Algorithms based on normalizing flows are emerging as promising machine learning approaches to sampling complicated probability distributions in a way that can be made asymptotically exact. In the context of lattice field theory, proof-of-principle studies have demonstrated the effectiveness of this approach for scalar theories, gauge theories, and statistical systems. This work develops approaches that enable flow-based sampling of theories with dynamical fermions, which is necessary for the technique to be applied to lattice field theory studies of the Standard Model of particle physics and many condensed matter systems. As a practical demonstration, these methods are applied to the sampling of field configurations for a two-dimensional theory of massless staggered fermions coupled to a scalar field via a Yukawa interaction.
△ Less
Submitted 28 December, 2021; v1 submitted 10 June, 2021;
originally announced June 2021.
-
Introduction to Normalizing Flows for Lattice Field Theory
Authors:
Michael S. Albergo,
Denis Boyda,
Daniel C. Hackett,
Gurtej Kanwar,
Kyle Cranmer,
Sébastien Racanière,
Danilo Jimenez Rezende,
Phiala E. Shanahan
Abstract:
This notebook tutorial demonstrates a method for sampling Boltzmann distributions of lattice field theories using a class of machine learning models known as normalizing flows. The ideas and approaches proposed in arXiv:1904.12072, arXiv:2002.02428, and arXiv:2003.06413 are reviewed and a concrete implementation of the framework is presented. We apply this framework to a lattice scalar field theor…
▽ More
This notebook tutorial demonstrates a method for sampling Boltzmann distributions of lattice field theories using a class of machine learning models known as normalizing flows. The ideas and approaches proposed in arXiv:1904.12072, arXiv:2002.02428, and arXiv:2003.06413 are reviewed and a concrete implementation of the framework is presented. We apply this framework to a lattice scalar field theory and to U(1) gauge theory, explicitly encoding gauge symmetries in the flow-based approach to the latter. This presentation is intended to be interactive and working with the attached Jupyter notebook is recommended.
△ Less
Submitted 6 August, 2021; v1 submitted 20 January, 2021;
originally announced January 2021.
-
Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Authors:
Mehdi Mirza,
Andrew Jaegle,
Jonathan J. Hunt,
Arthur Guez,
Saran Tunyasuvunakool,
Alistair Muldal,
Théophane Weber,
Peter Karkus,
Sébastien Racanière,
Lars Buesing,
Timothy Lillicrap,
Nicolas Heess
Abstract:
Recent work in deep reinforcement learning (RL) has produced algorithms capable of mastering challenging games such as Go, chess, or shogi. In these works the RL agent directly observes the natural state of the game and controls that state directly with its actions. However, when humans play such games, they do not just reason about the moves but also interact with their physical environment. They…
▽ More
Recent work in deep reinforcement learning (RL) has produced algorithms capable of mastering challenging games such as Go, chess, or shogi. In these works the RL agent directly observes the natural state of the game and controls that state directly with its actions. However, when humans play such games, they do not just reason about the moves but also interact with their physical environment. They understand the state of the game by looking at the physical board in front of them and modify it by manipulating pieces using touch and fine-grained motor control. Mastering complicated physical systems with abstract goals is a central challenge for artificial intelligence, but it remains out of reach for existing RL algorithms. To encourage progress towards this goal we introduce a set of physically embedded planning problems and make them publicly available. We embed challenging symbolic tasks (Sokoban, tic-tac-toe, and Go) in a physics engine to produce a set of tasks that require perception, reasoning, and motor control over long time horizons. Although existing RL algorithms can tackle the symbolic versions of these tasks, we find that they struggle to master even the simplest of their physically embedded counterparts. As a first step towards characterizing the space of solution to these tasks, we introduce a strong baseline that uses a pre-trained expert game player to provide hints in the abstract space to an RL agent's policy while training it on the full sensorimotor control task. The resulting agent solves many of the tasks, underlining the need for methods that bridge the gap between abstract planning and embodied control. See illustrating video at https://youtu.be/RwHiHlym_1k.
△ Less
Submitted 29 October, 2020; v1 submitted 11 September, 2020;
originally announced September 2020.
-
Sampling using $SU(N)$ gauge equivariant flows
Authors:
Denis Boyda,
Gurtej Kanwar,
Sébastien Racanière,
Danilo Jimenez Rezende,
Michael S. Albergo,
Kyle Cranmer,
Daniel C. Hackett,
Phiala E. Shanahan
Abstract:
We develop a flow-based sampling algorithm for $SU(N)$ lattice gauge theories that is gauge-invariant by construction. Our key contribution is constructing a class of flows on an $SU(N)$ variable (or on a $U(N)$ variable by a simple alternative) that respect matrix conjugation symmetry. We apply this technique to sample distributions of single $SU(N)$ variables and to construct flow-based samplers…
▽ More
We develop a flow-based sampling algorithm for $SU(N)$ lattice gauge theories that is gauge-invariant by construction. Our key contribution is constructing a class of flows on an $SU(N)$ variable (or on a $U(N)$ variable by a simple alternative) that respect matrix conjugation symmetry. We apply this technique to sample distributions of single $SU(N)$ variables and to construct flow-based samplers for $SU(2)$ and $SU(3)$ lattice gauge theory in two dimensions.
△ Less
Submitted 18 September, 2020; v1 submitted 12 August, 2020;
originally announced August 2020.
-
Disentangling by Subspace Diffusion
Authors:
David Pfau,
Irina Higgins,
Aleksandar Botev,
Sébastien Racanière
Abstract:
We present a novel nonparametric algorithm for symmetry-based disentangling of data manifolds, the Geometric Manifold Component Estimator (GEOMANCER). GEOMANCER provides a partial answer to the question posed by Higgins et al. (2018): is it possible to learn how to factorize a Lie group solely from observations of the orbit of an object it acts on? We show that fully unsupervised factorization of…
▽ More
We present a novel nonparametric algorithm for symmetry-based disentangling of data manifolds, the Geometric Manifold Component Estimator (GEOMANCER). GEOMANCER provides a partial answer to the question posed by Higgins et al. (2018): is it possible to learn how to factorize a Lie group solely from observations of the orbit of an object it acts on? We show that fully unsupervised factorization of a data manifold is possible if the true metric of the manifold is known and each factor manifold has nontrivial holonomy -- for example, rotation in 3D. Our algorithm works by estimating the subspaces that are invariant under random walk diffusion, giving an approximation to the de Rham decomposition from differential geometry. We demonstrate the efficacy of GEOMANCER on several complex synthetic manifolds. Our work reduces the question of whether unsupervised disentangling is possible to the question of whether unsupervised metric learning is possible, providing a unifying insight into the geometric nature of representation learning.
△ Less
Submitted 18 November, 2020; v1 submitted 23 June, 2020;
originally announced June 2020.
-
Equivariant flow-based sampling for lattice gauge theory
Authors:
Gurtej Kanwar,
Michael S. Albergo,
Denis Boyda,
Kyle Cranmer,
Daniel C. Hackett,
Sébastien Racanière,
Danilo Jimenez Rezende,
Phiala E. Shanahan
Abstract:
We define a class of machine-learned flow-based sampling algorithms for lattice gauge theories that are gauge-invariant by construction. We demonstrate the application of this framework to U(1) gauge theory in two spacetime dimensions, and find that near critical points in parameter space the approach is orders of magnitude more efficient at sampling topological quantities than more traditional sa…
▽ More
We define a class of machine-learned flow-based sampling algorithms for lattice gauge theories that are gauge-invariant by construction. We demonstrate the application of this framework to U(1) gauge theory in two spacetime dimensions, and find that near critical points in parameter space the approach is orders of magnitude more efficient at sampling topological quantities than more traditional sampling procedures such as Hybrid Monte Carlo and Heat Bath.
△ Less
Submitted 13 March, 2020;
originally announced March 2020.
-
Targeted free energy estimation via learned mappings
Authors:
Peter Wirnsberger,
Andrew J. Ballard,
George Papamakarios,
Stuart Abercrombie,
Sébastien Racanière,
Alexander Pritzel,
Danilo Jimenez Rezende,
Charles Blundell
Abstract:
Free energy perturbation (FEP) was proposed by Zwanzig more than six decades ago as a method to estimate free energy differences, and has since inspired a huge body of related methods that use it as an integral building block. Being an importance sampling based estimator, however, FEP suffers from a severe limitation: the requirement of sufficient overlap between distributions. One strategy to mit…
▽ More
Free energy perturbation (FEP) was proposed by Zwanzig more than six decades ago as a method to estimate free energy differences, and has since inspired a huge body of related methods that use it as an integral building block. Being an importance sampling based estimator, however, FEP suffers from a severe limitation: the requirement of sufficient overlap between distributions. One strategy to mitigate this problem, called Targeted Free Energy Perturbation, uses a high-dimensional mapping in configuration space to increase overlap of the underlying distributions. Despite its potential, this method has attracted only limited attention due to the formidable challenge of formulating a tractable mapping. Here, we cast Targeted FEP as a machine learning problem in which the mapping is parameterized as a neural network that is optimized so as to increase overlap. We develop a new model architecture that respects permutational and periodic symmetries often encountered in atomistic simulations and test our method on a fully-periodic solvation system. We demonstrate that our method leads to a substantial variance reduction in free energy estimates when compared against baselines, without requiring any additional data.
△ Less
Submitted 18 August, 2020; v1 submitted 12 February, 2020;
originally announced February 2020.
-
Normalizing Flows on Tori and Spheres
Authors:
Danilo Jimenez Rezende,
George Papamakarios,
Sébastien Racanière,
Michael S. Albergo,
Gurtej Kanwar,
Phiala E. Shanahan,
Kyle Cranmer
Abstract:
Normalizing flows are a powerful tool for building expressive distributions in high dimensions. So far, most of the literature has concentrated on learning flows on Euclidean spaces. Some problems however, such as those involving angles, are defined on spaces with more complex geometries, such as tori or spheres. In this paper, we propose and compare expressive and numerically stable flows on such…
▽ More
Normalizing flows are a powerful tool for building expressive distributions in high dimensions. So far, most of the literature has concentrated on learning flows on Euclidean spaces. Some problems however, such as those involving angles, are defined on spaces with more complex geometries, such as tori or spheres. In this paper, we propose and compare expressive and numerically stable flows on such spaces. Our flows are built recursively on the dimension of the space, starting from flows on circles, closed intervals or spheres.
△ Less
Submitted 1 July, 2020; v1 submitted 6 February, 2020;
originally announced February 2020.
-
Hamiltonian Generative Networks
Authors:
Peter Toth,
Danilo Jimenez Rezende,
Andrew Jaegle,
Sébastien Racanière,
Aleksandar Botev,
Irina Higgins
Abstract:
The Hamiltonian formalism plays a central role in classical and quantum physics. Hamiltonians are the main tool for modelling the continuous time evolution of systems with conserved quantities, and they come equipped with many useful properties, like time reversibility and smooth interpolation in time. These properties are important for many machine learning problems - from sequence prediction to…
▽ More
The Hamiltonian formalism plays a central role in classical and quantum physics. Hamiltonians are the main tool for modelling the continuous time evolution of systems with conserved quantities, and they come equipped with many useful properties, like time reversibility and smooth interpolation in time. These properties are important for many machine learning problems - from sequence prediction to reinforcement learning and density modelling - but are not typically provided out of the box by standard tools such as recurrent neural networks. In this paper, we introduce the Hamiltonian Generative Network (HGN), the first approach capable of consistently learning Hamiltonian dynamics from high-dimensional observations (such as images) without restrictive domain assumptions. Once trained, we can use HGN to sample new trajectories, perform rollouts both forward and backward in time and even speed up or slow down the learned dynamics. We demonstrate how a simple modification of the network architecture turns HGN into a powerful normalising flow model, called Neural Hamiltonian Flow (NHF), that uses Hamiltonian dynamics to model expressive densities. We hope that our work serves as a first practical demonstration of the value that the Hamiltonian formalism can bring to deep learning.
△ Less
Submitted 14 February, 2020; v1 submitted 30 September, 2019;
originally announced September 2019.
-
Equivariant Hamiltonian Flows
Authors:
Danilo Jimenez Rezende,
Sébastien Racanière,
Irina Higgins,
Peter Toth
Abstract:
This paper introduces equivariant hamiltonian flows, a method for learning expressive densities that are invariant with respect to a known Lie-algebra of local symmetry transformations while providing an equivariant representation of the data. We provide proof of principle demonstrations of how such flows can be learnt, as well as how the addition of symmetry invariance constraints can improve dat…
▽ More
This paper introduces equivariant hamiltonian flows, a method for learning expressive densities that are invariant with respect to a known Lie-algebra of local symmetry transformations while providing an equivariant representation of the data. We provide proof of principle demonstrations of how such flows can be learnt, as well as how the addition of symmetry invariance constraints can improve data efficiency and generalisation. Finally, we make connections to disentangled representation learning and show how this work relates to a recently proposed definition.
△ Less
Submitted 30 September, 2019;
originally announced September 2019.
-
Automated curricula through setter-solver interactions
Authors:
Sebastien Racaniere,
Andrew K. Lampinen,
Adam Santoro,
David P. Reichert,
Vlad Firoiu,
Timothy P. Lillicrap
Abstract:
Reinforcement learning algorithms use correlations between policies and rewards to improve agent performance. But in dynamic or sparsely rewarding environments these correlations are often too small, or rewarding events are too infrequent to make learning feasible. Human education instead relies on curricula--the breakdown of tasks into simpler, static challenges with dense rewards--to build up to…
▽ More
Reinforcement learning algorithms use correlations between policies and rewards to improve agent performance. But in dynamic or sparsely rewarding environments these correlations are often too small, or rewarding events are too infrequent to make learning feasible. Human education instead relies on curricula--the breakdown of tasks into simpler, static challenges with dense rewards--to build up to complex behaviors. While curricula are also useful for artificial agents, hand-crafting them is time consuming. This has lead researchers to explore automatic curriculum generation. Here we explore automatic curriculum generation in rich, dynamic environments. Using a setter-solver paradigm we show the importance of considering goal validity, goal feasibility, and goal coverage to construct useful curricula. We demonstrate the success of our approach in rich but sparsely rewarding 2D and 3D environments, where an agent is tasked to achieve a single goal selected from a set of possible goals that varies between episodes, and identify challenges for future work. Finally, we demonstrate the value of a novel technique that guides agents towards a desired goal distribution. Altogether, these results represent a substantial step towards applying automatic task curricula to learn complex, otherwise unlearnable goals, and to our knowledge are the first to demonstrate automated curriculum generation for goal-conditioned agents in environments where the possible goals vary between episodes.
△ Less
Submitted 21 January, 2020; v1 submitted 27 September, 2019;
originally announced September 2019.
-
Differentiable Game Mechanics
Authors:
Alistair Letcher,
David Balduzzi,
Sebastien Racaniere,
James Martens,
Jakob Foerster,
Karl Tuyls,
Thore Graepel
Abstract:
Deep learning is built on the foundational guarantee that gradient descent on an objective function converges to local minima. Unfortunately, this guarantee fails in settings, such as generative adversarial nets, that exhibit multiple interacting losses. The behavior of gradient-based methods in games is not well understood -- and is becoming increasingly important as adversarial and multi-objecti…
▽ More
Deep learning is built on the foundational guarantee that gradient descent on an objective function converges to local minima. Unfortunately, this guarantee fails in settings, such as generative adversarial nets, that exhibit multiple interacting losses. The behavior of gradient-based methods in games is not well understood -- and is becoming increasingly important as adversarial and multi-objective architectures proliferate. In this paper, we develop new tools to understand and control the dynamics in n-player differentiable games.
The key result is to decompose the game Jacobian into two components. The first, symmetric component, is related to potential games, which reduce to gradient descent on an implicit function. The second, antisymmetric component, relates to Hamiltonian games, a new class of games that obey a conservation law akin to conservation laws in classical mechanical systems. The decomposition motivates Symplectic Gradient Adjustment (SGA), a new algorithm for finding stable fixed points in differentiable games. Basic experiments show SGA is competitive with recently proposed algorithms for finding stable fixed points in GANs -- while at the same time being applicable to, and having guarantees in, much more general cases.
△ Less
Submitted 13 May, 2019;
originally announced May 2019.
-
An investigation of model-free planning
Authors:
Arthur Guez,
Mehdi Mirza,
Karol Gregor,
Rishabh Kabra,
Sébastien Racanière,
Théophane Weber,
David Raposo,
Adam Santoro,
Laurent Orseau,
Tom Eccles,
Greg Wayne,
David Silver,
Timothy Lillicrap
Abstract:
The field of reinforcement learning (RL) is facing increasingly challenging domains with combinatorial complexity. For an RL agent to address these challenges, it is essential that it can plan effectively. Prior work has typically utilized an explicit model of the environment, combined with a specific planning algorithm (such as tree search). More recently, a new family of methods have been propos…
▽ More
The field of reinforcement learning (RL) is facing increasingly challenging domains with combinatorial complexity. For an RL agent to address these challenges, it is essential that it can plan effectively. Prior work has typically utilized an explicit model of the environment, combined with a specific planning algorithm (such as tree search). More recently, a new family of methods have been proposed that learn how to plan, by providing the structure for planning via an inductive bias in the function approximator (such as a tree structured neural network), trained end-to-end by a model-free RL algorithm. In this paper, we go even further, and demonstrate empirically that an entirely model-free approach, without special structure beyond standard neural network components such as convolutional networks and LSTMs, can learn to exhibit many of the characteristics typically associated with a model-based planner. We measure our agent's effectiveness at planning in terms of its ability to generalize across a combinatorial and irreversible state space, its data efficiency, and its ability to utilize additional thinking time. We find that our agent has many of the characteristics that one might expect to find in a planning algorithm. Furthermore, it exceeds the state-of-the-art in challenging combinatorial domains such as Sokoban and outperforms other model-free approaches that utilize strong inductive biases toward planning.
△ Less
Submitted 20 May, 2019; v1 submitted 11 January, 2019;
originally announced January 2019.
-
Towards a Definition of Disentangled Representations
Authors:
Irina Higgins,
David Amos,
David Pfau,
Sebastien Racaniere,
Loic Matthey,
Danilo Rezende,
Alexander Lerchner
Abstract:
How can intelligent agents solve a diverse set of tasks in a data-efficient manner? The disentangled representation learning approach posits that such an agent would benefit from separating out (disentangling) the underlying structure of the world into disjoint parts of its representation. However, there is no generally agreed-upon definition of disentangling, not least because it is unclear how t…
▽ More
How can intelligent agents solve a diverse set of tasks in a data-efficient manner? The disentangled representation learning approach posits that such an agent would benefit from separating out (disentangling) the underlying structure of the world into disjoint parts of its representation. However, there is no generally agreed-upon definition of disentangling, not least because it is unclear how to formalise the notion of world structure beyond toy datasets with a known ground truth generative process. Here we propose that a principled solution to characterising disentangled representations can be found by focusing on the transformation properties of the world. In particular, we suggest that those transformations that change only some properties of the underlying world state, while leaving all other properties invariant, are what gives exploitable structure to any kind of data. Similar ideas have already been successfully applied in physics, where the study of symmetry transformations has revolutionised the understanding of the world structure. By connecting symmetry transformations to vector representations using the formalism of group and representation theory we arrive at the first formal definition of disentangled representations. Our new definition is in agreement with many of the current intuitions about disentangling, while also providing principled resolutions to a number of previous points of contention. While this work focuses on formally defining disentangling - as opposed to solving the learning problem - we believe that the shift in perspective to studying data transformations can stimulate the development of better representation learning algorithms.
△ Less
Submitted 5 December, 2018;
originally announced December 2018.
-
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Authors:
Lars Buesing,
Theophane Weber,
Yori Zwols,
Sebastien Racaniere,
Arthur Guez,
Jean-Baptiste Lespiau,
Nicolas Heess
Abstract:
Learning policies on data synthesized by models can in principle quench the thirst of reinforcement learning algorithms for large amounts of real experience, which is often costly to acquire. However, simulating plausible experience de novo is a hard problem for many complex environments, often resulting in biases for model-based policy evaluation and search. Instead of de novo synthesis of data,…
▽ More
Learning policies on data synthesized by models can in principle quench the thirst of reinforcement learning algorithms for large amounts of real experience, which is often costly to acquire. However, simulating plausible experience de novo is a hard problem for many complex environments, often resulting in biases for model-based policy evaluation and search. Instead of de novo synthesis of data, here we assume logged, real experience and model alternative outcomes of this experience under counterfactual actions, actions that were not actually taken. Based on this, we propose the Counterfactually-Guided Policy Search (CF-GPS) algorithm for learning policies in POMDPs from off-policy experience. It leverages structural causal models for counterfactual evaluation of arbitrary policies on individual off-policy episodes. CF-GPS can improve on vanilla model-based RL algorithms by making use of available logged data to de-bias model predictions. In contrast to off-policy algorithms based on Importance Sampling which re-weight data, CF-GPS leverages a model to explicitly consider alternative outcomes, allowing the algorithm to make better use of experience data. We find empirically that these advantages translate into improved policy evaluation and search results on a non-trivial grid-world task. Finally, we show that CF-GPS generalizes the previously proposed Guided Policy Search and that reparameterization-based algorithms such Stochastic Value Gradient can be interpreted as counterfactual methods.
△ Less
Submitted 15 November, 2018;
originally announced November 2018.
-
The Mechanics of n-Player Differentiable Games
Authors:
David Balduzzi,
Sebastien Racaniere,
James Martens,
Jakob Foerster,
Karl Tuyls,
Thore Graepel
Abstract:
The cornerstone underpinning deep learning is the guarantee that gradient descent on an objective converges to local minima. Unfortunately, this guarantee fails in settings, such as generative adversarial nets, where there are multiple interacting losses. The behavior of gradient-based methods in games is not well understood -- and is becoming increasingly important as adversarial and multi-object…
▽ More
The cornerstone underpinning deep learning is the guarantee that gradient descent on an objective converges to local minima. Unfortunately, this guarantee fails in settings, such as generative adversarial nets, where there are multiple interacting losses. The behavior of gradient-based methods in games is not well understood -- and is becoming increasingly important as adversarial and multi-objective architectures proliferate. In this paper, we develop new techniques to understand and control the dynamics in general games. The key result is to decompose the second-order dynamics into two components. The first is related to potential games, which reduce to gradient descent on an implicit function; the second relates to Hamiltonian games, a new class of games that obey a conservation law, akin to conservation laws in classical mechanical systems. The decomposition motivates Symplectic Gradient Adjustment (SGA), a new algorithm for finding stable fixed points in general games. Basic experiments show SGA is competitive with recently proposed algorithms for finding stable fixed points in GANs -- whilst at the same time being applicable to -- and having guarantees in -- much more general games.
△ Less
Submitted 6 June, 2018; v1 submitted 15 February, 2018;
originally announced February 2018.
-
Learning and Querying Fast Generative Models for Reinforcement Learning
Authors:
Lars Buesing,
Theophane Weber,
Sebastien Racaniere,
S. M. Ali Eslami,
Danilo Rezende,
David P. Reichert,
Fabio Viola,
Frederic Besse,
Karol Gregor,
Demis Hassabis,
Daan Wierstra
Abstract:
A key challenge in model-based reinforcement learning (RL) is to synthesize computationally efficient and accurate environment models. We show that carefully designed generative models that learn and operate on compact state representations, so-called state-space models, substantially reduce the computational costs for predicting outcomes of sequences of actions. Extensive experiments establish th…
▽ More
A key challenge in model-based reinforcement learning (RL) is to synthesize computationally efficient and accurate environment models. We show that carefully designed generative models that learn and operate on compact state representations, so-called state-space models, substantially reduce the computational costs for predicting outcomes of sequences of actions. Extensive experiments establish that state-space models accurately capture the dynamics of Atari games from the Arcade Learning Environment from raw pixels. The computational speed-up of state-space models while maintaining high accuracy makes their application in RL feasible: We demonstrate that agents which query these models for decision making outperform strong model-free baselines on the game MSPACMAN, demonstrating the potential of using learned environment models for planning.
△ Less
Submitted 8 February, 2018;
originally announced February 2018.
-
Imagination-Augmented Agents for Deep Reinforcement Learning
Authors:
Théophane Weber,
Sébastien Racanière,
David P. Reichert,
Lars Buesing,
Arthur Guez,
Danilo Jimenez Rezende,
Adria Puigdomènech Badia,
Oriol Vinyals,
Nicolas Heess,
Yujia Li,
Razvan Pascanu,
Peter Battaglia,
Demis Hassabis,
David Silver,
Daan Wierstra
Abstract:
We introduce Imagination-Augmented Agents (I2As), a novel architecture for deep reinforcement learning combining model-free and model-based aspects. In contrast to most existing model-based reinforcement learning and planning methods, which prescribe how a model should be used to arrive at a policy, I2As learn to interpret predictions from a learned environment model to construct implicit plans in…
▽ More
We introduce Imagination-Augmented Agents (I2As), a novel architecture for deep reinforcement learning combining model-free and model-based aspects. In contrast to most existing model-based reinforcement learning and planning methods, which prescribe how a model should be used to arrive at a policy, I2As learn to interpret predictions from a learned environment model to construct implicit plans in arbitrary ways, by using the predictions as additional context in deep policy networks. I2As show improved data efficiency, performance, and robustness to model misspecification compared to several baselines.
△ Less
Submitted 14 February, 2018; v1 submitted 19 July, 2017;
originally announced July 2017.
-
Learning model-based planning from scratch
Authors:
Razvan Pascanu,
Yujia Li,
Oriol Vinyals,
Nicolas Heess,
Lars Buesing,
Sebastien Racanière,
David Reichert,
Théophane Weber,
Daan Wierstra,
Peter Battaglia
Abstract:
Conventional wisdom holds that model-based planning is a powerful approach to sequential decision-making. It is often very challenging in practice, however, because while a model can be used to evaluate a plan, it does not prescribe how to construct a plan. Here we introduce the "Imagination-based Planner", the first model-based, sequential decision-making agent that can learn to construct, evalua…
▽ More
Conventional wisdom holds that model-based planning is a powerful approach to sequential decision-making. It is often very challenging in practice, however, because while a model can be used to evaluate a plan, it does not prescribe how to construct a plan. Here we introduce the "Imagination-based Planner", the first model-based, sequential decision-making agent that can learn to construct, evaluate, and execute plans. Before any action, it can perform a variable number of imagination steps, which involve proposing an imagined action and evaluating it with its model-based imagination. All imagined actions and outcomes are aggregated, iteratively, into a "plan context" which conditions future real and imagined actions. The agent can even decide how to imagine: testing out alternative imagined actions, chaining sequences of actions together, or building a more complex "imagination tree" by navigating flexibly among the previously imagined states using a learned policy. And our agent can learn to plan economically, jointly optimizing for external rewards and computational costs associated with using its imagination. We show that our architecture can learn to solve a challenging continuous control problem, and also learn elaborate planning strategies in a discrete maze-solving task. Our work opens a new direction toward learning the components of a model-based planning system and how to use them.
△ Less
Submitted 19 July, 2017;
originally announced July 2017.
-
Recurrent Environment Simulators
Authors:
Silvia Chiappa,
Sébastien Racaniere,
Daan Wierstra,
Shakir Mohamed
Abstract:
Models that can simulate how environments change in response to actions can be used by agents to plan and act efficiently. We improve on previous environment simulators from high-dimensional pixel observations by introducing recurrent neural networks that are able to make temporally and spatially coherent predictions for hundreds of time-steps into the future. We present an in-depth analysis of th…
▽ More
Models that can simulate how environments change in response to actions can be used by agents to plan and act efficiently. We improve on previous environment simulators from high-dimensional pixel observations by introducing recurrent neural networks that are able to make temporally and spatially coherent predictions for hundreds of time-steps into the future. We present an in-depth analysis of the factors affecting performance, providing the most extensive attempt to advance the understanding of the properties of these models. We address the issue of computationally inefficiency with a model that does not need to generate a high-dimensional image at each time-step. We show that our approach can be used to improve exploration and is adaptable to many diverse environments, namely 10 Atari games, a 3D car racing environment, and complex 3D mazes.
△ Less
Submitted 19 April, 2017; v1 submitted 7 April, 2017;
originally announced April 2017.
-
Quantisation of Lie-Poisson manifolds
Authors:
Sebastien Racaniere
Abstract:
In quantum physics, the operators associated with the position and the momentum of a particle are unbounded operators and $C^*$-algebraic quantisation does therefore not deal with such operators. In the present article, I propose a quantisation of the Lie-Poisson structure of the dual of a Lie algebroid which deals with a big enough class of functions to include the above mentioned example. As a…
▽ More
In quantum physics, the operators associated with the position and the momentum of a particle are unbounded operators and $C^*$-algebraic quantisation does therefore not deal with such operators. In the present article, I propose a quantisation of the Lie-Poisson structure of the dual of a Lie algebroid which deals with a big enough class of functions to include the above mentioned example. As an application, I show with an example how the quantisation of the dual of the Lie algebroid associated to a Poisson manifold can lead to a quantisation of the Poisson manifold itself. The example I consider is the torus with constant Poisson structure, in which case I recover its usual $C^*$-algebraic quantisation.
△ Less
Submitted 3 November, 2004;
originally announced November 2004.
-
Quasi-Poisson actions and massive non-rotating BTZ black holes
Authors:
Sebastien Racaniere
Abstract:
Using ideas from an article of P. Bieliavsky, M. Rooman and Ph. Spindel on BTZ black holes, I construct a family of interesting examples of quasi-Poisson actions as defined by A. Alekseev and Y. Kosmann-Schwarzbach. As an application, I obtain a genuine Poisson structure on $SL(2,R)$ which induces a Poisson structure on a BTZ black hole.
Using ideas from an article of P. Bieliavsky, M. Rooman and Ph. Spindel on BTZ black holes, I construct a family of interesting examples of quasi-Poisson actions as defined by A. Alekseev and Y. Kosmann-Schwarzbach. As an application, I obtain a genuine Poisson structure on $SL(2,R)$ which induces a Poisson structure on a BTZ black hole.
△ Less
Submitted 29 September, 2004;
originally announced September 2004.
-
Kirwan map and moduli space of flat connections
Authors:
Sebastien Racaniere
Abstract:
If $K$ is a compact Lie group and $g\geq 2$ an integer, the space $K^{2g}$ is endowed with the structure of a Hamiltonian space with a Lie group valued moment map $Φ$. Let $β$ be in the centre of $K$. The reduction $Φ^{-1}(β)/K$ is homeomorphic to a moduli space of flat connections. When $K$ is simply connected, a direct consequence of a recent paper of Bott, Tolman and Weitsman is to give a set…
▽ More
If $K$ is a compact Lie group and $g\geq 2$ an integer, the space $K^{2g}$ is endowed with the structure of a Hamiltonian space with a Lie group valued moment map $Φ$. Let $β$ be in the centre of $K$. The reduction $Φ^{-1}(β)/K$ is homeomorphic to a moduli space of flat connections. When $K$ is simply connected, a direct consequence of a recent paper of Bott, Tolman and Weitsman is to give a set of generators for the $K$-equivariant cohomology of $Φ^{-1}(β)$. Another method to construct classes in $H^*_K(Φ^{-1}(β))$ is by using the so called universal bundle. When the group is $\Sun$ and $β$ is a generator of the centre, these last classes are known to also generate the equivariant cohomology of $Φ^{-1}(β)$. The aim of this paper is to compare the classes constructed using the result of Bott, Tolman and Weitsman and the ones using the universal bundle.
△ Less
Submitted 15 December, 2003; v1 submitted 24 June, 2003;
originally announced June 2003.