-
Hierarchical Bayesian estimation of motor-evoked potential recruitment curves yields accurate and robust estimates
Authors:
Vishweshwar Tyagi,
Lynda M. Murray,
Ahmet S. Asan,
Christopher Mandigo,
Michael S. Virk,
Noam Y. Harel,
Jason B. Carmel,
James R. McIntosh
Abstract:
Electromagnetic stimulation probes and modulates the neural systems that control movement. Key to understanding their effects is the muscle recruitment curve, which maps evoked potential size against stimulation intensity. Current methods to estimate curve parameters require large samples; however, obtaining these is often impractical due to experimental constraints. Here, we present a hierarchica…
▽ More
Electromagnetic stimulation probes and modulates the neural systems that control movement. Key to understanding their effects is the muscle recruitment curve, which maps evoked potential size against stimulation intensity. Current methods to estimate curve parameters require large samples; however, obtaining these is often impractical due to experimental constraints. Here, we present a hierarchical Bayesian framework that accounts for small samples, handles outliers, simulates high-fidelity data, and returns a posterior distribution over curve parameters that quantify estimation uncertainty. It uses a rectified-logistic function that estimates motor threshold and outperforms conventionally used sigmoidal alternatives in predictive performance, as demonstrated through cross-validation. In simulations, our method outperforms non-hierarchical models by reducing threshold estimation error on sparse data and requires fewer participants to detect shifts in threshold compared to frequentist testing. We present two common use cases involving electrical and electromagnetic stimulation data and provide an open-source library for Python, called hbMEP, for diverse applications.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
New contexts, old heuristics: How young people in India and the US trust online content in the age of generative AI
Authors:
Rachel Xu,
Nhu Le,
Rebekah Park,
Laura Murray,
Vishnupriya Das,
Devika Kumar,
Beth Goldberg
Abstract:
We conducted an in-person ethnography in India and the US to investigate how young people (18-24) trusted online content, with a focus on generative AI (GenAI). We had four key findings about how young people use GenAI and determine what to trust online. First, when online, we found participants fluidly shifted between mindsets and emotional states, which we term "information modes." Second, these…
▽ More
We conducted an in-person ethnography in India and the US to investigate how young people (18-24) trusted online content, with a focus on generative AI (GenAI). We had four key findings about how young people use GenAI and determine what to trust online. First, when online, we found participants fluidly shifted between mindsets and emotional states, which we term "information modes." Second, these information modes shaped how and why participants trust GenAI and how they applied literacy skills. In the modes where they spent most of their time, they eschewed literacy skills. Third, with the advent of GenAI, participants imported existing trust heuristics from familiar online contexts into their interactions with GenAI. Fourth, although study participants had reservations about GenAI, they saw it as a requisite tool to adopt to keep up with the times. Participants valued efficiency above all else, and used GenAI to further their goals quickly at the expense of accuracy. Our findings suggest that young people spend the majority of their time online not concerned with truth because they are seeking only to pass the time. As a result, literacy interventions should be designed to intervene at the right time, to match users' distinct information modes, and to work with their existing fact-checking practices.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Advancing Forest Fire Prevention: Deep Reinforcement Learning for Effective Firebreak Placement
Authors:
Lucas Murray,
Tatiana Castillo,
Jaime Carrasco,
Andrés Weintraub,
Richard Weber,
Isaac Martín de Diego,
José Ramón González,
Jordi García-Gonzalo
Abstract:
Over the past decades, the increase in both frequency and intensity of large-scale wildfires due to climate change has emerged as a significant natural threat. The pressing need to design resilient landscapes capable of withstanding such disasters has become paramount, requiring the development of advanced decision-support tools. Existing methodologies, including Mixed Integer Programming, Stochas…
▽ More
Over the past decades, the increase in both frequency and intensity of large-scale wildfires due to climate change has emerged as a significant natural threat. The pressing need to design resilient landscapes capable of withstanding such disasters has become paramount, requiring the development of advanced decision-support tools. Existing methodologies, including Mixed Integer Programming, Stochastic Optimization, and Network Theory, have proven effective but are hindered by computational demands, limiting their applicability.
In response to this challenge, we propose using artificial intelligence techniques, specifically Deep Reinforcement Learning, to address the complex problem of firebreak placement in the landscape. We employ value-function based approaches like Deep Q-Learning, Double Deep Q-Learning, and Dueling Double Deep Q-Learning. Utilizing the Cell2Fire fire spread simulator combined with Convolutional Neural Networks, we have successfully implemented a computational agent capable of learning firebreak locations within a forest environment, achieving good results.
Furthermore, we incorporate a pre-training loop, initially teaching our agent to mimic a heuristic-based algorithm and observe that it consistently exceeds the performance of these solutions. Our findings underscore the immense potential of Deep Reinforcement Learning for operational research challenges, especially in fire prevention. Our approach demonstrates convergence with highly favorable results in problem instances as large as 40 x 40 cells, marking a significant milestone in applying Reinforcement Learning to this critical issue.
To the best of our knowledge, this study represents a pioneering effort in using Reinforcement Learning to address the aforementioned problem, offering promising perspectives in fire prevention and landscape management
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Nested cobordisms, Cyl-objects and Temperley-Lieb algebras
Authors:
Maxine E. Calle,
Renee S. Hoekzema,
Laura Murray,
Natalia Pacheco-Tallaj,
Carmen Rovi,
Shruthi Sridhar-Shapiro
Abstract:
We introduce a discrete cobordism category for nested manifolds and nested cobordisms between them. A variation of stratified Morse theory applies in this case, and yields generators for a general nested cobordism category. Restricting to a low-dimensional example of the ``striped cylinder'' cobordism category Cyl, we give a complete set of relations for the generators. With an eye towards the stu…
▽ More
We introduce a discrete cobordism category for nested manifolds and nested cobordisms between them. A variation of stratified Morse theory applies in this case, and yields generators for a general nested cobordism category. Restricting to a low-dimensional example of the ``striped cylinder'' cobordism category Cyl, we give a complete set of relations for the generators. With an eye towards the study of TQFTs defined on a nested cobordism category, we describe functors Cyl$\to\mathcal{C}$, which we call Cyl-objects in $\mathcal{C}$, and show that they are related to known algebraic structures such as Temperley-Lieb algebras and cyclic objects. We moreover define novel algebraic constructions inspired by the structure of Cyl-objects, namely a doubling construction on cyclic objects analogous to edgewise subdivision, and a cylindrical bar construction on self-dual objects in a monoidal category.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
Like-minded, like-bodied: How users (18-26) trust online eating and health information
Authors:
Rachel Xu,
Nhu Le,
Rebekah Park,
Laura Murray
Abstract:
This paper investigates the relationship between social media and eating practices amongst 42 internet users aged 18-26. We conducted an ethnography in the US and India to observe how they navigated eating and health information online. We found that participants portrayed themselves online through a vocabulary we have labeled "the good life": performing holistic health by displaying a socially-id…
▽ More
This paper investigates the relationship between social media and eating practices amongst 42 internet users aged 18-26. We conducted an ethnography in the US and India to observe how they navigated eating and health information online. We found that participants portrayed themselves online through a vocabulary we have labeled "the good life": performing holistic health by displaying a socially-ideal body. In doing so, participants unconsciously engaged in behaviors of disordered eating while actively eschewing them. They also valued personal testimonies, and readily tested tips from content creators who shared similar beliefs and bodies to them. In doing so, they discarded probabilistic thinking and opened themselves to harm. Our study found that their social media feeds did not unidirectionally influence participants - they also reflected participants' internalized views of health, in an intertwined, non-linear journey. Reducing the online spread of disordered eating practices requires addressing it within young people's social context.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Fast non-iterative algorithm for 3D point-cloud holography
Authors:
Nathan Tessema Ersaro,
Cem Yalcin,
Liz Murray,
Leyla Kabuli,
Laura Waller,
Rikky Muller
Abstract:
Recently developed iterative and deep learning-based approaches to computer-generated holography (CGH) have been shown to achieve high-quality photorealistic 3D images with spatial light modulators. However, such approaches remain overly cumbersome for patterning sparse collections of target points across a photoresponsive volume in applications including biological microscopy and material process…
▽ More
Recently developed iterative and deep learning-based approaches to computer-generated holography (CGH) have been shown to achieve high-quality photorealistic 3D images with spatial light modulators. However, such approaches remain overly cumbersome for patterning sparse collections of target points across a photoresponsive volume in applications including biological microscopy and material processing. Specifically, in addition to requiring heavy computation that cannot accommodate real-time operation in mobile or hardware-light settings, existing sampling-dependent 3D CGH methods preclude the ability to place target points with arbitrary precision, limiting accessible depths to a handful of planes. Accordingly, we present a non-iterative point cloud holography algorithm that employs fast deterministic calculations in order to efficiently allocate patches of SLM pixels to different target points in the 3D volume and spread the patterning of all points across multiple time frames. Compared to a matched-performance implementation of the iterative Gerchberg-Saxton algorithm, our algorithm's relative computation speed advantage was found to increase with SLM pixel count, exceeding 100,000x at 512x512 array format.
△ Less
Submitted 7 September, 2023; v1 submitted 21 June, 2023;
originally announced June 2023.
-
Epitaxial growth of atomically thin Ga2Se2 films on c-plane sapphire substrates
Authors:
Mingyu Yu,
Lottie Murray,
Matthew Doty,
Stephanie Law
Abstract:
Broadening the variety of two-dimensional (2D) materials and improving the synthesis of ultrathin films are crucial to the development of the semiconductor industry. As a state-of-the-art 2D material, Ga2Se2 has attractive optoelectronic properties when it reaches the atomically-thin regime. However, its van der Waals epitaxial growth, especially for the atomically-thin films, has seldom been stud…
▽ More
Broadening the variety of two-dimensional (2D) materials and improving the synthesis of ultrathin films are crucial to the development of the semiconductor industry. As a state-of-the-art 2D material, Ga2Se2 has attractive optoelectronic properties when it reaches the atomically-thin regime. However, its van der Waals epitaxial growth, especially for the atomically-thin films, has seldom been studied. In this paper, we used molecular beam epitaxy to synthesize Ga2Se2 single-crystal films with a surface roughness down to 1.82 nm on c-plane sapphire substrates by optimizing substrate temperature, Se:Ga flux ratio, and growth rate. Then we used a 3-step mode to grow Ga2Se2 films with a thickness as low as 3 tetralayers and a surface roughness as low as 0.61 nm, far exceeding the performance of direct growth. Finally, we found that the surface morphology strongly depends on the Se:Ga flux ratio, and higher growth rates widened the suitable flux ratio window for growing Ga2Se2. Overall, this work advances the understanding of the vdW epitaxy growth mechanism for post-transition metal monochalcogenides on sapphire substrates.
△ Less
Submitted 16 October, 2023; v1 submitted 22 December, 2022;
originally announced December 2022.
-
String structures, 2-group bundles, and a categorification of the Freed-Quinn line bundle
Authors:
Daniel Berwick-Evans,
Emily Cliff,
Laura Murray,
Apurva Nakade,
Emma Phillips
Abstract:
For a 2-group constructed from a finite group and 3-cocycle, we provide an explicit description of the bicategory of flat 2-group bundles on an oriented surface in terms of weak representations of the fundamental group. We show that this bicategory encodes (flat) string structures. Furthermore, we identify the space of isomorphism classes of objects with Freed and Quinn's line bundle appearing in…
▽ More
For a 2-group constructed from a finite group and 3-cocycle, we provide an explicit description of the bicategory of flat 2-group bundles on an oriented surface in terms of weak representations of the fundamental group. We show that this bicategory encodes (flat) string structures. Furthermore, we identify the space of isomorphism classes of objects with Freed and Quinn's line bundle appearing in Chern-Simons theory of a finite group.
△ Less
Submitted 14 October, 2021;
originally announced October 2021.
-
MedKnowts: Unified Documentation and Information Retrieval for Electronic Health Records
Authors:
Luke Murray,
Divya Gopinath,
Monica Agrawal,
Steven Horng,
David Sontag,
David R. Karger
Abstract:
Clinical documentation can be transformed by Electronic Health Records, yet the documentation process is still a tedious, time-consuming, and error-prone process. Clinicians are faced with multi-faceted requirements and fragmented interfaces for information exploration and documentation. These challenges are only exacerbated in the Emergency Department -- clinicians often see 35 patients in one sh…
▽ More
Clinical documentation can be transformed by Electronic Health Records, yet the documentation process is still a tedious, time-consuming, and error-prone process. Clinicians are faced with multi-faceted requirements and fragmented interfaces for information exploration and documentation. These challenges are only exacerbated in the Emergency Department -- clinicians often see 35 patients in one shift, during which they have to synthesize an often previously unknown patient's medical records in order to reach a tailored diagnosis and treatment plan. To better support this information synthesis, clinical documentation tools must enable rapid contextual access to the patient's medical record. MedKnowts is an integrated note-taking editor and information retrieval system which unifies the documentation and search process and provides concise synthesized concept-oriented slices of the patient's medical record. MedKnowts automatically captures structured data while still allowing users the flexibility of natural language. MedKnowts leverages this structure to enable easier parsing of long notes, auto-populated text, and proactive information retrieval, easing the documentation burden.
△ Less
Submitted 23 September, 2021;
originally announced September 2021.
-
Alternative ways to initiate students' intuition, and hence internalization, of why zero factorial is equal to one
Authors:
Munir Mahmood,
Lori L. Murray,
Ricardas Zitikis,
Ibtihal Mahmood
Abstract:
Zero factorial, defined to be one, is often counterintuitive to students but nonetheless an interesting concept to convey in a classroom environment. The challenge is to delineate the concept in a simple and effective way through the practice of justification, a familiar concept in mathematics and science education. In this regard two algebraic and one statistical justification using the squeeze t…
▽ More
Zero factorial, defined to be one, is often counterintuitive to students but nonetheless an interesting concept to convey in a classroom environment. The challenge is to delineate the concept in a simple and effective way through the practice of justification, a familiar concept in mathematics and science education. In this regard two algebraic and one statistical justification using the squeeze theorem are presented. To assess the effectiveness of the justifications, a student survey was conducted at a comprehensive university incorporating the analysis of the pre- and post-presentation statements. They clearly present that the justifications are useful in giving credence to zero factorial equals one. Overall, the results from the online survey supports that the students preferred Justification 1. The justifications provide instructors alternative ways to initiate exploration of students' intuitive set up of comprehending unobvious facts like zero factorial equals one.
For a range of learners with their varied abilities to perform various mental activities most closely associated with learning and problem-solving, the justifications as simple alternative methods offer the potential to raise the current level of cognitive skills to inspire differentiated paths of learning. These are evident from survey results noting the role of statistical thinking and techniques.
△ Less
Submitted 18 June, 2024; v1 submitted 16 March, 2021;
originally announced March 2021.
-
Fast, Structured Clinical Documentation via Contextual Autocomplete
Authors:
Divya Gopinath,
Monica Agrawal,
Luke Murray,
Steven Horng,
David Karger,
David Sontag
Abstract:
We present a system that uses a learned autocompletion mechanism to facilitate rapid creation of semi-structured clinical documentation. We dynamically suggest relevant clinical concepts as a doctor drafts a note by leveraging features from both unstructured and structured medical data. By constraining our architecture to shallow neural networks, we are able to make these suggestions in real time.…
▽ More
We present a system that uses a learned autocompletion mechanism to facilitate rapid creation of semi-structured clinical documentation. We dynamically suggest relevant clinical concepts as a doctor drafts a note by leveraging features from both unstructured and structured medical data. By constraining our architecture to shallow neural networks, we are able to make these suggestions in real time. Furthermore, as our algorithm is used to write a note, we can automatically annotate the documentation with clean labels of clinical concepts drawn from medical vocabularies, making notes more structured and readable for physicians, patients, and future algorithms. To our knowledge, this system is the only machine learning-based documentation utility for clinical notes deployed in a live hospital setting, and it reduces keystroke burden of clinical concepts by 67% in real environments.
△ Less
Submitted 29 July, 2020;
originally announced July 2020.
-
Anytime Parallel Tempering
Authors:
A. Marie d'Avigneau,
S. S. Singh,
L. M. Murray
Abstract:
Developing efficient MCMC algorithms is indispensable in Bayesian inference. In parallel tempering, multiple interacting MCMC chains run to more efficiently explore the state space and improve performance. The multiple chains advance independently through local moves, and the performance enhancement steps are exchange moves, where the chains pause to exchange their current sample amongst each othe…
▽ More
Developing efficient MCMC algorithms is indispensable in Bayesian inference. In parallel tempering, multiple interacting MCMC chains run to more efficiently explore the state space and improve performance. The multiple chains advance independently through local moves, and the performance enhancement steps are exchange moves, where the chains pause to exchange their current sample amongst each other. To accelerate the independent local moves, they may be performed simultaneously on multiple processors. Another problem is then encountered: depending on the MCMC implementation and inference problem, local moves can take a varying and random amount of time to complete. There may also be infrastructure-induced variations, such as competing jobs on the same processors, which arises in cloud computing. Before exchanges can occur, all chains must complete the local moves they are engaged in to avoid introducing a potentially substantial bias (Proposition 2.1). To solve this issue of randomly varying local move completion times in multi-processor parallel tempering, we adopt the Anytime Monte Carlo framework of Murray et al. (2016): we impose real-time deadlines on the parallel local moves and perform exchanges at these deadlines without any processor idling. We show our methodology for exchanges at real-time deadlines does not introduce a bias and leads to significant performance enhancements over the naïve approach of idling until every processor's local moves complete. The methodology is then applied in an ABC setting, where an Anytime ABC parallel tempering algorithm is derived for the difficult task of estimating the parameters of a Lotka-Volterra predator-prey model, and similar efficiency enhancements are observed.
△ Less
Submitted 14 September, 2021; v1 submitted 26 June, 2020;
originally announced June 2020.
-
Lazy object copy as a platform for population-based probabilistic programming
Authors:
Lawrence M. Murray
Abstract:
This work considers dynamic memory management for population-based probabilistic programs, such as those using particle methods for inference. Such programs exhibit a pattern of allocating, copying, potentially mutating, and deallocating collections of similar objects through successive generations. These objects may assemble data structures such as stacks, queues, lists, ragged arrays, and trees,…
▽ More
This work considers dynamic memory management for population-based probabilistic programs, such as those using particle methods for inference. Such programs exhibit a pattern of allocating, copying, potentially mutating, and deallocating collections of similar objects through successive generations. These objects may assemble data structures such as stacks, queues, lists, ragged arrays, and trees, which may be of random, and possibly unbounded, size. For the simple case of $N$ particles, $T$ generations, $D$ objects, and resampling at each generation, dense representation requires $O(DNT)$ memory, while sparse representation requires only $O(DT+DN\log DN)$ memory, based on existing theoretical results. This work describes an object copy-on-write platform to automate this saving for the programmer. The core idea is formalized using labeled directed multigraphs, where vertices represent objects, edges the pointers between them, and labels the necessary bookkeeping. A specific labeling scheme is proposed for high performance under the motivating pattern. The platform is implemented for the Birch probabilistic programming language, using smart pointers, hash tables, and reference-counting garbage collection. It is tested empirically on a number of realistic probabilistic programs, and shown to significantly reduce memory use and execution time in a manner consistent with theoretical expectations. This enables copy-on-write for the imperative programmer, lazy deep copies for the object-oriented programmer, and in-place write optimizations for the functional programmer.
△ Less
Submitted 8 January, 2020;
originally announced January 2020.
-
Cut and paste invariants of manifolds via algebraic K-theory
Authors:
Renee S. Hoekzema,
Mona Merling,
Laura Murray,
Carmen Rovi,
Julia Semikina
Abstract:
Recent work of Jonathan Campbell and Inna Zakharevich has focused on building machinery for studying scissors congruence problems via algebraic $K$-theory, and applying these tools to studying the Grothendieck ring of varieties. In this paper we give a new application of their framework: we construct a $K$-space that recovers the classical $\mathrm{SK}$ ("schneiden und kleben," German for "cut and…
▽ More
Recent work of Jonathan Campbell and Inna Zakharevich has focused on building machinery for studying scissors congruence problems via algebraic $K$-theory, and applying these tools to studying the Grothendieck ring of varieties. In this paper we give a new application of their framework: we construct a $K$-space that recovers the classical $\mathrm{SK}$ ("schneiden und kleben," German for "cut and paste") groups for manifolds on $π_0$, and we construct a derived version of the Euler characteristic.
△ Less
Submitted 5 December, 2021; v1 submitted 1 January, 2020;
originally announced January 2020.
-
Parameter elimination in particle Gibbs sampling
Authors:
Anna Wigren,
Riccardo Sven Risuleo,
Lawrence Murray,
Fredrik Lindsten
Abstract:
Bayesian inference in state-space models is challenging due to high-dimensional state trajectories. A viable approach is particle Markov chain Monte Carlo, combining MCMC and sequential Monte Carlo to form "exact approximations" to otherwise intractable MCMC methods. The performance of the approximation is limited to that of the exact method. We focus on particle Gibbs and particle Gibbs with ance…
▽ More
Bayesian inference in state-space models is challenging due to high-dimensional state trajectories. A viable approach is particle Markov chain Monte Carlo, combining MCMC and sequential Monte Carlo to form "exact approximations" to otherwise intractable MCMC methods. The performance of the approximation is limited to that of the exact method. We focus on particle Gibbs and particle Gibbs with ancestor sampling, improving their performance beyond that of the underlying Gibbs sampler (which they approximate) by marginalizing out one or more parameters. This is possible when the parameter prior is conjugate to the complete data likelihood. Marginalization yields a non-Markovian model for inference, but we show that, in contrast to the general case, this method still scales linearly in time. While marginalization can be cumbersome to implement, recent advances in probabilistic programming have enabled its automation. We demonstrate how the marginalized methods are viable as efficient inference backends in probabilistic programming, and demonstrate with examples in ecology and epidemiology.
△ Less
Submitted 30 October, 2019;
originally announced October 2019.
-
Particle filter with rejection control and unbiased estimator of the marginal likelihood
Authors:
Jan Kudlicka,
Lawrence M. Murray,
Thomas B. Schön,
Fredrik Lindsten
Abstract:
We consider the combined use of resampling and partial rejection control in sequential Monte Carlo methods, also known as particle filters. While the variance reducing properties of rejection control are known, there has not been (to the best of our knowledge) any work on unbiased estimation of the marginal likelihood (also known as the model evidence or the normalizing constant) in this type of p…
▽ More
We consider the combined use of resampling and partial rejection control in sequential Monte Carlo methods, also known as particle filters. While the variance reducing properties of rejection control are known, there has not been (to the best of our knowledge) any work on unbiased estimation of the marginal likelihood (also known as the model evidence or the normalizing constant) in this type of particle filter. Being able to estimate the marginal likelihood without bias is highly relevant for model comparison, computation of interpretable and reliable confidence intervals, and in exact approximation methods, such as particle Markov chain Monte Carlo. In the paper we present a particle filter with rejection control that enables unbiased estimation of the marginal likelihood.
△ Less
Submitted 4 March, 2020; v1 submitted 21 October, 2019;
originally announced October 2019.
-
Trimmed Constrained Mixed Effects Models: Formulations and Algorithms
Authors:
Peng Zheng,
Ryan Barber,
Reed J. D. Sorensen,
Christopher J. L. Murray,
Aleksandr Y. Aravkin
Abstract:
Mixed effects (ME) models inform a vast array of problems in the physical and social sciences, and are pervasive in meta-analysis. We consider ME models where the random effects component is linear. We then develop an efficient approach for a broad problem class that allows nonlinear measurements, priors, and constraints, and finds robust estimates in all of these cases using trimming in the assoc…
▽ More
Mixed effects (ME) models inform a vast array of problems in the physical and social sciences, and are pervasive in meta-analysis. We consider ME models where the random effects component is linear. We then develop an efficient approach for a broad problem class that allows nonlinear measurements, priors, and constraints, and finds robust estimates in all of these cases using trimming in the associated marginal likelihood.
The software accompanying this paper is disseminated as an open-source Python package called LimeTr. LimeTr is able to recover results more accurately in the presence of outliers compared to available packages for both standard longitudinal analysis and meta-analysis, and is also more computationally efficient than competing robust alternatives. Supplementary materials that reproduce the simulations, as well as run LimeTr and third party code are available online. We also present analyses of global health data, where we use advanced functionality of LimeTr, including constraints to impose monotonicity and concavity for dose-response relationships. Nonlinear observation models allow new analyses in place of classic approximations, such as log-linear models. Robust extensions in all analyses ensure that spurious data points do not drive our understanding of either mean relationships or between-study heterogeneity.
△ Less
Submitted 27 October, 2020; v1 submitted 23 September, 2019;
originally announced September 2019.
-
Homological perspective on edge modes in linear Yang-Mills and Chern-Simons theory
Authors:
Philippe Mathieu,
Laura Murray,
Alexander Schenkel,
Nicholas J. Teh
Abstract:
We provide an elegant homological construction of the extended phase space for linear Yang-Mills theory on an oriented and time-oriented Lorentzian manifold $M$ with a time-like boundary $\partial M$ that was proposed by Donnelly and Freidel [JHEP 1609, 102 (2016)]. This explains and formalizes many of the rather ad hoc constructions for edge modes appearing in the theoretical physics literature.…
▽ More
We provide an elegant homological construction of the extended phase space for linear Yang-Mills theory on an oriented and time-oriented Lorentzian manifold $M$ with a time-like boundary $\partial M$ that was proposed by Donnelly and Freidel [JHEP 1609, 102 (2016)]. This explains and formalizes many of the rather ad hoc constructions for edge modes appearing in the theoretical physics literature. Our construction also applies to linear Chern-Simons theory, in which case we obtain the extended phase space introduced by Geiller [Nucl. Phys. B 924, 312 (2017)].
△ Less
Submitted 18 February, 2020; v1 submitted 24 July, 2019;
originally announced July 2019.
-
Probabilistic programming for birth-death models of evolution using an alive particle filter with delayed sampling
Authors:
Jan Kudlicka,
Lawrence M. Murray,
Fredrik Ronquist,
Thomas B. Schön
Abstract:
We consider probabilistic programming for birth-death models of evolution and introduce a new widely-applicable inference method that combines an extension of the alive particle filter (APF) with automatic Rao-Blackwellization via delayed sampling. Birth-death models of evolution are an important family of phylogenetic models of the diversification processes that lead to evolutionary trees. Probab…
▽ More
We consider probabilistic programming for birth-death models of evolution and introduce a new widely-applicable inference method that combines an extension of the alive particle filter (APF) with automatic Rao-Blackwellization via delayed sampling. Birth-death models of evolution are an important family of phylogenetic models of the diversification processes that lead to evolutionary trees. Probabilistic programming languages (PPLs) give phylogeneticists a new and exciting tool: their models can be implemented as probabilistic programs with just a basic knowledge of programming. The general inference methods in PPLs reduce the need for external experts, allow quick prototyping and testing, and accelerate the development and deployment of new models. We show how these birth-death models can be implemented as simple programs in existing PPLs, and demonstrate the usefulness of the proposed inference method for such models. For the popular BiSSE model the method yields an increase of the effective sample size and the conditional acceptance rate by a factor of 30 in comparison with a standard bootstrap particle filter. Although concentrating on phylogenetics, the extended APF is a general inference method that shows its strength in situations where particles are often assigned zero weight. In the case when the weights are always positive, the extra cost of using the APF rather than the bootstrap particle filter is negligible, making our method a suitable drop-in replacement for the bootstrap particle filter in probabilistic programming inference.
△ Less
Submitted 14 February, 2021; v1 submitted 10 July, 2019;
originally announced July 2019.
-
Automatic Alignment of Sequential Monte Carlo Inference in Higher-Order Probabilistic Programs
Authors:
Daniel Lundén,
David Broman,
Fredrik Ronquist,
Lawrence M. Murray
Abstract:
Probabilistic programming is a programming paradigm for expressing flexible probabilistic models. Implementations of probabilistic programming languages employ a variety of inference algorithms, where sequential Monte Carlo methods are commonly used. A problem with current state-of-the-art implementations using sequential Monte Carlo inference is the alignment of program synchronization points. We…
▽ More
Probabilistic programming is a programming paradigm for expressing flexible probabilistic models. Implementations of probabilistic programming languages employ a variety of inference algorithms, where sequential Monte Carlo methods are commonly used. A problem with current state-of-the-art implementations using sequential Monte Carlo inference is the alignment of program synchronization points. We propose a new static analysis approach based on the 0-CFA algorithm for automatically aligning higher-order probabilistic programs. We evaluate the automatic alignment on a phylogenetic model, showing a significant decrease in runtime and increase in accuracy.
△ Less
Submitted 18 December, 2018;
originally announced December 2018.
-
Automated learning with a probabilistic programming language: Birch
Authors:
Lawrence M. Murray,
Thomas B. Schön
Abstract:
This work offers a broad perspective on probabilistic modeling and inference in light of recent advances in probabilistic programming, in which models are formally expressed in Turing-complete programming languages. We consider a typical workflow and how probabilistic programming languages can help to automate this workflow, especially in the matching of models with inference methods. We focus on…
▽ More
This work offers a broad perspective on probabilistic modeling and inference in light of recent advances in probabilistic programming, in which models are formally expressed in Turing-complete programming languages. We consider a typical workflow and how probabilistic programming languages can help to automate this workflow, especially in the matching of models with inference methods. We focus on two properties of a model that are critical in this matching: its structure---the conditional dependencies between random variables---and its form---the precise mathematical definition of those dependencies. While the structure and form of a probabilistic model are often fixed a priori, it is a curiosity of probabilistic programming that they need not be, and may instead vary according to random choices made during program execution. We introduce a formal description of models expressed as programs, and discuss some of the ways in which probabilistic programming languages can reveal the structure and form of these, in order to tailor inference methods. We demonstrate the ideas with a new probabilistic programming language called Birch, with a multiple object tracking example.
△ Less
Submitted 16 April, 2020; v1 submitted 2 October, 2018;
originally announced October 2018.
-
Improving the particle filter in high dimensions using conjugate artificial process noise
Authors:
Anna Wigren,
Lawrence Murray,
Fredrik Lindsten
Abstract:
The particle filter is one of the most successful methods for state inference and identification of general non-linear and non-Gaussian models. However, standard particle filters suffer from degeneracy of the particle weights, in particular for high-dimensional problems. We propose a method for improving the performance of the particle filter for certain challenging state space models, with implic…
▽ More
The particle filter is one of the most successful methods for state inference and identification of general non-linear and non-Gaussian models. However, standard particle filters suffer from degeneracy of the particle weights, in particular for high-dimensional problems. We propose a method for improving the performance of the particle filter for certain challenging state space models, with implications for high-dimensional inference. First we approximate the model by adding artificial process noise in an additional state update, then we design a proposal that combines the standard and the locally optimal proposal. This results in a bias-variance trade-off, where adding more noise reduces the variance of the estimate but increases the model bias. The performance of the proposed method is empirically evaluated on a linear-Gaussian state space model and on the non-linear Lorenz'96 model. For both models we observe a significant improvement in performance over the standard particle filter.
△ Less
Submitted 22 November, 2018; v1 submitted 22 January, 2018;
originally announced January 2018.
-
Better together? Statistical learning in models made of modules
Authors:
Pierre E. Jacob,
Lawrence M. Murray,
Chris C. Holmes,
Christian P. Robert
Abstract:
In modern applications, statisticians are faced with integrating heterogeneous data modalities relevant for an inference, prediction, or decision problem. In such circumstances, it is convenient to use a graphical model to represent the statistical dependencies, via a set of connected "modules", each relating to a specific data modality, and drawing on specific domain expertise in their developmen…
▽ More
In modern applications, statisticians are faced with integrating heterogeneous data modalities relevant for an inference, prediction, or decision problem. In such circumstances, it is convenient to use a graphical model to represent the statistical dependencies, via a set of connected "modules", each relating to a specific data modality, and drawing on specific domain expertise in their development. In principle, given data, the conventional statistical update then allows for coherent uncertainty quantification and information propagation through and across the modules. However, misspecification of any module can contaminate the estimate and update of others, often in unpredictable ways. In various settings, particularly when certain modules are trusted more than others, practitioners have preferred to avoid learning with the full model in favor of approaches that restrict the information propagation between modules, for example by restricting propagation to only particular directions along the edges of the graph. In this article, we investigate why these modular approaches might be preferable to the full model in misspecified settings. We propose principled criteria to choose between modular and full-model approaches. The question arises in many applied settings, including large stochastic dynamical systems, meta-analysis, epidemiological models, air pollution models, pharmacokinetics-pharmacodynamics, and causal inference with propensity scores.
△ Less
Submitted 29 August, 2017;
originally announced August 2017.
-
Delayed Sampling and Automatic Rao-Blackwellization of Probabilistic Programs
Authors:
Lawrence M. Murray,
Daniel Lundén,
Jan Kudlicka,
David Broman,
Thomas B. Schön
Abstract:
We introduce a dynamic mechanism for the solution of analytically-tractable substructure in probabilistic programs, using conjugate priors and affine transformations to reduce variance in Monte Carlo estimators. For inference with Sequential Monte Carlo, this automatically yields improvements such as locally-optimal proposals and Rao-Blackwellization. The mechanism maintains a directed graph along…
▽ More
We introduce a dynamic mechanism for the solution of analytically-tractable substructure in probabilistic programs, using conjugate priors and affine transformations to reduce variance in Monte Carlo estimators. For inference with Sequential Monte Carlo, this automatically yields improvements such as locally-optimal proposals and Rao-Blackwellization. The mechanism maintains a directed graph alongside the running program that evolves dynamically as operations are triggered upon it. Nodes of the graph represent random variables, edges the analytically-tractable relationships between them. Random variables remain in the graph for as long as possible, to be sampled only when they are used by the program in a way that cannot be resolved analytically. In the meantime, they are conditioned on as many observations as possible. We demonstrate the mechanism with a few pedagogical examples, as well as a linear-nonlinear state-space model with simulated data, and an epidemiological model with real data of a dengue outbreak in Micronesia. In all cases one or more variables are automatically marginalized out to significantly reduce variance in estimates of the marginal likelihood, in the final case facilitating a random-weight or pseudo-marginal-type importance sampler for parameter estimation. We have implemented the approach in Anglican and a new probabilistic programming language called Birch.
△ Less
Submitted 21 March, 2018; v1 submitted 25 August, 2017;
originally announced August 2017.
-
Probabilistic learning of nonlinear dynamical systems using sequential Monte Carlo
Authors:
Thomas B. Schön,
Andreas Svensson,
Lawrence Murray,
Fredrik Lindsten
Abstract:
Probabilistic modeling provides the capability to represent and manipulate uncertainty in data, models, predictions and decisions. We are concerned with the problem of learning probabilistic models of dynamical systems from measured data. Specifically, we consider learning of probabilistic nonlinear state-space models. There is no closed-form solution available for this problem, implying that we a…
▽ More
Probabilistic modeling provides the capability to represent and manipulate uncertainty in data, models, predictions and decisions. We are concerned with the problem of learning probabilistic models of dynamical systems from measured data. Specifically, we consider learning of probabilistic nonlinear state-space models. There is no closed-form solution available for this problem, implying that we are forced to use approximations. In this tutorial we will provide a self-contained introduction to one of the state-of-the-art methods---the particle Metropolis--Hastings algorithm---which has proven to offer a practical approximation. This is a Monte Carlo based method, where the particle filter is used to guide a Markov chain Monte Carlo method through the parameter space. One of the key merits of the particle Metropolis--Hastings algorithm is that it is guaranteed to converge to the "true solution" under mild assumptions, despite being based on a particle filter with only a finite number of particles. We will also provide a motivating numerical example illustrating the method using a modeling language tailored for sequential Monte Carlo methods. The intention of modeling languages of this kind is to open up the power of sophisticated Monte Carlo methods---including particle Metropolis--Hastings---to a large group of users without requiring them to know all the underlying mathematical details.
△ Less
Submitted 15 December, 2017; v1 submitted 7 March, 2017;
originally announced March 2017.
-
Anytime Monte Carlo
Authors:
Lawrence M. Murray,
Sumeetpal Singh,
Anthony Lee
Abstract:
Monte Carlo algorithms simulate some prescribed number of samples, taking some random real time to complete the computations necessary. This work considers the converse: to impose a real-time budget on the computation, which results in the number of samples simulated being random. To complicate matters, the real time taken for each simulation may depend on the sample produced, so that the samples…
▽ More
Monte Carlo algorithms simulate some prescribed number of samples, taking some random real time to complete the computations necessary. This work considers the converse: to impose a real-time budget on the computation, which results in the number of samples simulated being random. To complicate matters, the real time taken for each simulation may depend on the sample produced, so that the samples themselves are not independent of their number, and a length bias with respect to compute time is apparent. This is especially problematic when a Markov chain Monte Carlo (MCMC) algorithm is used and the final state of the Markov chain -- rather than an average over all states -- is required, which is the case in parallel tempering implementations of MCMC. The length bias does not diminish with the compute budget in this case. It also occurs in sequential Monte Carlo (SMC) algorithms, which is the focus of this paper. We propose an anytime framework to address the concern, using a continuous-time Markov jump process to study the progress of the computation in real time. We first show that for any MCMC algorithm, the length bias of the final state's distribution due to the imposed real-time computing budget can be eliminated by using a multiple chain construction. The utility of this construction is then demonstrated on a large-scale SMC^2 implementation, using four billion particles distributed across a cluster of 128 graphics processing units on the Amazon EC2 service. The anytime framework imposes a real-time budget on the MCMC move steps within the SMC$^{2}$ algorithm, ensuring that all processors are simultaneously ready for the resampling step, demonstrably reducing idleness to due waiting times and providing substantial control over the total compute budget.
△ Less
Submitted 10 June, 2021; v1 submitted 10 December, 2016;
originally announced December 2016.
-
Predicting local and non-local effects of resources on animal space use using a mechanistic step selection model
Authors:
Jonathan R. Potts,
Guillaume Bastille-Rousseau,
Dennis L. Murray,
James A. Schaefer,
Mark A. Lewis
Abstract:
1. Predicting space use patterns of animals from their interactions with the environment is fundamental for understanding the effect of habitat changes on ecosystem functioning. Recent attempts to address this problem have sought to unify resource selection analysis, where animal space use is derived from available habitat quality, and mechanistic movement models, where detailed movement processes…
▽ More
1. Predicting space use patterns of animals from their interactions with the environment is fundamental for understanding the effect of habitat changes on ecosystem functioning. Recent attempts to address this problem have sought to unify resource selection analysis, where animal space use is derived from available habitat quality, and mechanistic movement models, where detailed movement processes of an animal are used to predict its emergent utilization distribution. Such models bias the animal's movement towards patches that are easily available and resource-rich, and the result is a predicted probability density at a given position being a function of the habitat quality at that position. However, in reality, the probability that an animal will use a patch of the terrain tends to be a function of the resource quality in both that patch and the surrounding habitat. 2. We propose a mechanistic model where this non-local effect of resources naturally emerges from the local movement processes, by taking into account the relative utility of both the habitat where the animal currently resides and that of where it is moving. We give statistical techniques to parametrize the model from location data and demonstrate application of these techniques to GPS data of caribou in Newfoundland. 3. Steady-state animal probability distributions arising from the model have complex patterns that cannot be expressed simply as a function of the local quality of the habitat. In particular, large areas of good habitat are used more intensively than smaller patches of equal quality habitat, whereas isolated patches are used less frequently. 4. Whilst we focus on habitats in this study, our modelling framework can be readily used with any environmental covariates and therefore represents a unification of mechanistic modelling and step selection approaches to understanding animal space use.
△ Less
Submitted 22 January, 2015;
originally announced January 2015.
-
Sequential Monte Carlo with Highly Informative Observations
Authors:
Pierre Del Moral,
Lawrence M. Murray
Abstract:
We propose sequential Monte Carlo (SMC) methods for sampling the posterior distribution of state-space models under highly informative observation regimes, a situation in which standard SMC methods can perform poorly. A special case is simulating bridges between given initial and final values. The basic idea is to introduce a schedule of intermediate weighting and resampling times between observat…
▽ More
We propose sequential Monte Carlo (SMC) methods for sampling the posterior distribution of state-space models under highly informative observation regimes, a situation in which standard SMC methods can perform poorly. A special case is simulating bridges between given initial and final values. The basic idea is to introduce a schedule of intermediate weighting and resampling times between observation times, which guide particles towards the final state. This can always be done for continuous-time models, and may be done for discrete-time models under sparse observation regimes; our main focus is on continuous-time diffusion processes. The methods are broadly applicable in that they support multivariate models with partial observation, do not require simulation of the backward transition (which is often unavailable), and, where possible, avoid pointwise evaluation of the forward transition. When simulating bridges, the last cannot be avoided entirely without concessions, and we suggest an epsilon-ball approach (reminiscent of Approximate Bayesian Computation) as a workaround. Compared to the bootstrap particle filter, the new methods deliver substantially reduced mean squared error in normalising constant estimates, even after accounting for execution time. The methods are demonstrated for state estimation with two toy examples, and for parameter estimation (within a particle marginal Metropolis--Hastings sampler) with three applied examples in econometrics, epidemiology and marine biogeochemistry.
△ Less
Submitted 9 July, 2015; v1 submitted 16 May, 2014;
originally announced May 2014.
-
Path storage in the particle filter
Authors:
Pierre E. Jacob,
Lawrence Murray,
Sylvain Rubenthaler
Abstract:
This article considers the problem of storing the paths generated by a particle filter and more generally by a sequential Monte Carlo algorithm. It provides a theoretical result bounding the expected memory cost by $T + C N \log N$ where $T$ is the time horizon, $N$ is the number of particles and $C$ is a constant, as well as an efficient algorithm to realise this. The theoretical result and the a…
▽ More
This article considers the problem of storing the paths generated by a particle filter and more generally by a sequential Monte Carlo algorithm. It provides a theoretical result bounding the expected memory cost by $T + C N \log N$ where $T$ is the time horizon, $N$ is the number of particles and $C$ is a constant, as well as an efficient algorithm to realise this. The theoretical result and the algorithm are illustrated with numerical experiments.
△ Less
Submitted 29 January, 2014; v1 submitted 11 July, 2013;
originally announced July 2013.
-
Bayesian State-Space Modelling on High-Performance Hardware Using LibBi
Authors:
Lawrence M. Murray
Abstract:
LibBi is a software package for state-space modelling and Bayesian inference on modern computer hardware, including multi-core central processing units (CPUs), many-core graphics processing units (GPUs) and distributed-memory clusters of such devices. The software parses a domain-specific language for model specification, then optimises, generates, compiles and runs code for the given model, infer…
▽ More
LibBi is a software package for state-space modelling and Bayesian inference on modern computer hardware, including multi-core central processing units (CPUs), many-core graphics processing units (GPUs) and distributed-memory clusters of such devices. The software parses a domain-specific language for model specification, then optimises, generates, compiles and runs code for the given model, inference method and hardware platform. In presenting the software, this work serves as an introduction to state-space models and the specialised methods developed for Bayesian inference with them. The focus is on sequential Monte Carlo (SMC) methods such as the particle filter for state estimation, and the particle Markov chain Monte Carlo (PMCMC) and SMC^2 methods for parameter estimation. All are well-suited to current computer hardware. Two examples are given and developed throughout, one a linear three-element windkessel model of the human arterial system, the other a nonlinear Lorenz '96 model. These are specified in the prescribed modelling language, and LibBi demonstrated by performing inference with them. Empirical results are presented, including a performance comparison of the software with different hardware configurations.
△ Less
Submitted 13 June, 2013;
originally announced June 2013.
-
Parallel resampling in the particle filter
Authors:
Lawrence M. Murray,
Anthony Lee,
Pierre E. Jacob
Abstract:
Modern parallel computing devices, such as the graphics processing unit (GPU), have gained significant traction in scientific and statistical computing. They are particularly well-suited to data-parallel algorithms such as the particle filter, or more generally Sequential Monte Carlo (SMC), which are increasingly used in statistical inference. SMC methods carry a set of weighted particles through…
▽ More
Modern parallel computing devices, such as the graphics processing unit (GPU), have gained significant traction in scientific and statistical computing. They are particularly well-suited to data-parallel algorithms such as the particle filter, or more generally Sequential Monte Carlo (SMC), which are increasingly used in statistical inference. SMC methods carry a set of weighted particles through repeated propagation, weighting and resampling steps. The propagation and weighting steps are straightforward to parallelise, as they require only independent operations on each particle. The resampling step is more difficult, as standard schemes require a collective operation, such as a sum, across particle weights. Focusing on this resampling step, we analyse two alternative schemes that do not involve a collective operation (Metropolis and rejection resamplers), and compare them to standard schemes (multinomial, stratified and systematic resamplers). We find that, in certain circumstances, the alternative resamplers can perform significantly faster on a GPU, and to a lesser extent on a CPU, than the standard approaches. Moreover, in single precision, the standard approaches are numerically biased for upwards of hundreds of thousands of particles, while the alternatives are not. This is particularly important given greater single- than double-precision throughput on modern devices, and the consequent temptation to use single precision with a greater number of particles. Finally, we provide auxiliary functions useful for implementation, such as for the permutation of ancestry vectors to enable in-place propagation.
△ Less
Submitted 11 June, 2015; v1 submitted 17 January, 2013;
originally announced January 2013.
-
Feynman-Kac particle integration with geometric interacting jumps
Authors:
Pierre Del Moral,
Pierre E. Jacob,
Anthony Lee,
Lawrence Murray,
Gareth W. Peters
Abstract:
This article is concerned with the design and analysis of discrete time Feynman-Kac particle integration models with geometric interacting jump processes. We analyze two general types of model, corresponding to whether the reference process is in continuous or discrete time. For the former, we consider discrete generation particle models defined by arbitrarily fine time mesh approximations of the…
▽ More
This article is concerned with the design and analysis of discrete time Feynman-Kac particle integration models with geometric interacting jump processes. We analyze two general types of model, corresponding to whether the reference process is in continuous or discrete time. For the former, we consider discrete generation particle models defined by arbitrarily fine time mesh approximations of the Feynman-Kac models with continuous time path integrals. For the latter, we assume that the discrete process is observed at integer times and we design new approximation models with geometric interacting jumps in terms of a sequence of intermediate time steps between the integers. In both situations, we provide non asymptotic bias and variance theorems w.r.t. the time step and the size of the system, yielding what appear to be the first results of this type for this class of Feynman-Kac particle integration models. We also discuss uniform convergence estimates w.r.t. the time horizon. Our approach is based on an original semigroup analysis with first order decompositions of the fluctuation errors.
△ Less
Submitted 30 November, 2012;
originally announced November 2012.
-
Bayesian Learning and Predictability in a Stochastic Nonlinear Dynamical Model
Authors:
John Parslow,
Noel Cressie,
Edward P. Campbell,
Emlyn Jones,
Lawrence Murray
Abstract:
Bayesian inference methods are applied within a Bayesian hierarchical modelling framework to the problems of joint state and parameter estimation, and of state forecasting. We explore and demonstrate the ideas in the context of a simple nonlinear marine biogeochemical model. A novel approach is proposed to the formulation of the stochastic process model, in which ecophysiological properties of pla…
▽ More
Bayesian inference methods are applied within a Bayesian hierarchical modelling framework to the problems of joint state and parameter estimation, and of state forecasting. We explore and demonstrate the ideas in the context of a simple nonlinear marine biogeochemical model. A novel approach is proposed to the formulation of the stochastic process model, in which ecophysiological properties of plankton communities are represented by autoregressive stochastic processes. This approach captures the effects of changes in plankton communities over time, and it allows the incorporation of literature metadata on individual species into prior distributions for process model parameters. The approach is applied to a case study at Ocean Station Papa, using Particle Markov chain Monte Carlo computational techniques. The results suggest that, by drawing on objective prior information, it is possible to extract useful information about model state and a subset of parameters, and even to make useful long-term forecasts, based on sparse and noisy observations.
△ Less
Submitted 7 November, 2012;
originally announced November 2012.
-
GPU acceleration of the particle filter: the Metropolis resampler
Authors:
Lawrence Murray
Abstract:
We consider deployment of the particle filter on modern massively parallel hardware architectures, such as Graphics Processing Units (GPUs), with a focus on the resampling stage. While standard multinomial and stratified resamplers require a sum of importance weights computed collectively between threads, a Metropolis resampler favourably requires only pair-wise ratios between weights, computed in…
▽ More
We consider deployment of the particle filter on modern massively parallel hardware architectures, such as Graphics Processing Units (GPUs), with a focus on the resampling stage. While standard multinomial and stratified resamplers require a sum of importance weights computed collectively between threads, a Metropolis resampler favourably requires only pair-wise ratios between weights, computed independently by threads, and can be further tuned for performance by adjusting its number of iterations. While achieving respectable results for the stratified and multinomial resamplers, we demonstrate that a Metropolis resampler can be faster where the variance in importance weights is modest, and so is worth considering in a performance-critical context, such as particle Markov chain Monte Carlo and real-time applications.
△ Less
Submitted 28 February, 2012;
originally announced February 2012.
-
On Disturbance State-Space Models and the Particle Marginal Metropolis-Hastings Sampler
Authors:
Lawrence M. Murray,
Emlyn M. Jones,
John Parslow
Abstract:
We investigate nonlinear state-space models without a closed-form transition density, and propose reformulating such models over their latent noise variables rather than their latent state variables. In doing so the tractable noise density emerges in place of the intractable transition density. For importance sampling methods such as the auxiliary particle filter, this enables importance weights t…
▽ More
We investigate nonlinear state-space models without a closed-form transition density, and propose reformulating such models over their latent noise variables rather than their latent state variables. In doing so the tractable noise density emerges in place of the intractable transition density. For importance sampling methods such as the auxiliary particle filter, this enables importance weights to be computed where they could not be otherwise. As case studies we take two multivariate marine biogeochemical models and perform state and parameter estimation using the particle marginal Metropolis-Hastings sampler. For the particle filter within this sampler, we compare several proposal strategies over noise variables, all based on lookaheads with the unscented Kalman filter. These strategies are compared using conventional means for assessing Metropolis-Hastings efficiency, as well as with a novel metric called the conditional acceptance rate for assessing the consequences of using an estimated, and not exact, likelihood. Results indicate the utility of reformulating the model over noise variables, particularly for fast-mixing process models.
△ Less
Submitted 10 December, 2013; v1 submitted 28 February, 2012;
originally announced February 2012.
-
High-Performance Pseudo-Random Number Generation on Graphics Processing Units
Authors:
Nimalan Nandapalan,
Richard P. Brent,
Lawrence M. Murray,
Alistair Rendell
Abstract:
This work considers the deployment of pseudo-random number generators (PRNGs) on graphics processing units (GPUs), developing an approach based on the xorgens generator to rapidly produce pseudo-random numbers of high statistical quality. The chosen algorithm has configurable state size and period, making it ideal for tuning to the GPU architecture. We present a comparison of both speed and statis…
▽ More
This work considers the deployment of pseudo-random number generators (PRNGs) on graphics processing units (GPUs), developing an approach based on the xorgens generator to rapidly produce pseudo-random numbers of high statistical quality. The chosen algorithm has configurable state size and period, making it ideal for tuning to the GPU architecture. We present a comparison of both speed and statistical quality with other common parallel, GPU-based PRNGs, demonstrating favourable performance of the xorgens-based approach.
△ Less
Submitted 2 August, 2011;
originally announced August 2011.
-
Hänsch--Couillaud locking of Mach--Zehnder interferometer for carrier removal from a phase-modulated optical spectrum
Authors:
J. E. Bateman,
R. L. D. Murray,
M. Himsworth,
H. Ohadi,
A. Xuereb,
T. Freegarde
Abstract:
We describe and analyse the operation and stabilization of a Mach--Zehnder interferometer, which separates the carrier and the first-order sidebands of a phase-modulated laser field, and which is locked using the Hänsch--Couillaud method. In addition to the necessary attenuation, our interferometer introduces, via total internal reflection, a significant polarization-dependent phase delay. We empl…
▽ More
We describe and analyse the operation and stabilization of a Mach--Zehnder interferometer, which separates the carrier and the first-order sidebands of a phase-modulated laser field, and which is locked using the Hänsch--Couillaud method. In addition to the necessary attenuation, our interferometer introduces, via total internal reflection, a significant polarization-dependent phase delay. We employ a general treatment to describe an interferometer with an object which affects the field along one path, and we examine how this phase delay affects the error signal. We discuss the requirements necessary to ensure the lock point remains unchanged when phase modulation is introduced, and we demonstrate and characterize this locking experimentally. Finally, we suggest an extension to this locking strategy using heterodyne detection.
△ Less
Submitted 11 June, 2010; v1 submitted 9 November, 2009;
originally announced November 2009.