-
Global prescribed-time control of a class of uncertain nonholonomic systems by smooth time-varying feedback
Authors:
Kang-Kang Zhang,
Bin Zhou,
Chenchen Fan,
James Lam
Abstract:
This paper investigates the prescribed-time smooth control problem for a class of uncertain nonholonomic systems. With a novel smooth time-varying state transformation, the uncertain chained nonholonomic system is reformulated as an uncertain linear time-varying system. By fully utilizing the properties of a class of parametric Lyapunov equations and constructing time-varying Lyapunov-like functio…
▽ More
This paper investigates the prescribed-time smooth control problem for a class of uncertain nonholonomic systems. With a novel smooth time-varying state transformation, the uncertain chained nonholonomic system is reformulated as an uncertain linear time-varying system. By fully utilizing the properties of a class of parametric Lyapunov equations and constructing time-varying Lyapunov-like functions, smooth time-varying high-gain state and output feedback controllers are designed. The states and controllers are proven to converge to zero at any prescribed time. The proposed smooth time-varying method combines the advantage of a time-varying high-gain function, which enhances control performance, and a smooth time-varying function that can drive the states to zero at the prescribed time. The effectiveness of the proposed methods is verified by a numerical example.
△ Less
Submitted 5 September, 2024;
originally announced September 2024.
-
MODOC: A Modular Interface for Flexible Interlinking of Text Retrieval and Text Generation Functions
Authors:
Yingqiang Gao,
Jhony Prada,
Nianlong Gu,
Jessica Lam,
Richard H. R. Hahnloser
Abstract:
Large Language Models (LLMs) produce eloquent texts but often the content they generate needs to be verified. Traditional information retrieval systems can assist with this task, but most systems have not been designed with LLM-generated queries in mind. As such, there is a compelling need for integrated systems that provide both retrieval and generation functionality within a single user interfac…
▽ More
Large Language Models (LLMs) produce eloquent texts but often the content they generate needs to be verified. Traditional information retrieval systems can assist with this task, but most systems have not been designed with LLM-generated queries in mind. As such, there is a compelling need for integrated systems that provide both retrieval and generation functionality within a single user interface.
We present MODOC, a modular user interface that leverages the capabilities of LLMs and provides assistance with detecting their confabulations, promoting integrity in scientific writing. MODOC represents a significant step forward in scientific writing assistance. Its modular architecture supports flexible functions for retrieving information and for writing and generating text in a single, user-friendly interface.
△ Less
Submitted 26 August, 2024;
originally announced August 2024.
-
Local systems which do not come from abelian varieties
Authors:
Paul Brommer-Wierig,
Yeuk Hay Joshua Lam
Abstract:
For each punctured curve over a finite field, we construct local systems which do not come from a family of abelian varieties. We do so by proving a criterion which must be satisfied by local systems which do come from abelian varieties, inspired by an analogous Hodge theoretic criterion in characteristic zero. Our tools include $F$-isocrystals and some $p$-adic Hodge theory.
For each punctured curve over a finite field, we construct local systems which do not come from a family of abelian varieties. We do so by proving a criterion which must be satisfied by local systems which do come from abelian varieties, inspired by an analogous Hodge theoretic criterion in characteristic zero. Our tools include $F$-isocrystals and some $p$-adic Hodge theory.
△ Less
Submitted 5 August, 2024;
originally announced August 2024.
-
Artificially built Kondo chains with organic radicals on metallic surfaces: new model system of heavy fermion quantum criticality
Authors:
En Li,
Bimla Danu,
Yufeng Liu,
Huilin Xie,
Jacky Wing Yip Lam,
Ben Zhong Tang,
Shiyong Wang,
Fakher F. Assaad,
Nian Lin
Abstract:
Heavy fermion quantum criticality is an extremely rich domain of research which represents a framework to understand strange metals as a consequence of a Kondo breakdown transition. Here we provide an experimental realization of such systems in terms of organic radicals on a metallic surface. The ground state of organic radicals is a Kramer's doublet that can be modeled by a spin 1/2 degree of fre…
▽ More
Heavy fermion quantum criticality is an extremely rich domain of research which represents a framework to understand strange metals as a consequence of a Kondo breakdown transition. Here we provide an experimental realization of such systems in terms of organic radicals on a metallic surface. The ground state of organic radicals is a Kramer's doublet that can be modeled by a spin 1/2 degree of freedom. Using on-surface synthesis and scanning tunneling microscopy (STM) tip manipulation, one can controllably engineer and characterize chains of organic radicals on a Au(111) surface. The spatial-resolved differential conductance reveals site-dependent low-energy excitations, which support the picture of emergent many-body Kondo physics. Using quantum Monte Carlo simulations, we show that a Kondo lattice model of spin chains on a metallic surface reproduces accurately the experimental results. This allows us to interpret the experimental results in terms of a heavy fermion metal, below the coherence temperature. We foresee that the tunability of these systems will pave the way to realize quantum simulators of heavy fermion criticality.
△ Less
Submitted 2 August, 2024;
originally announced August 2024.
-
Improving Prediction of Need for Mechanical Ventilation using Cross-Attention
Authors:
Anwesh Mohanty,
Supreeth P. Shashikumar,
Jonathan Y. Lam,
Shamim Nemati
Abstract:
In the intensive care unit, the capability to predict the need for mechanical ventilation (MV) facilitates more timely interventions to improve patient outcomes. Recent works have demonstrated good performance in this task utilizing machine learning models. This paper explores the novel application of a deep learning model with multi-head attention (FFNN-MHA) to make more accurate MV predictions a…
▽ More
In the intensive care unit, the capability to predict the need for mechanical ventilation (MV) facilitates more timely interventions to improve patient outcomes. Recent works have demonstrated good performance in this task utilizing machine learning models. This paper explores the novel application of a deep learning model with multi-head attention (FFNN-MHA) to make more accurate MV predictions and reduce false positives by learning personalized contextual information of individual patients. Utilizing the publicly available MIMIC-IV dataset, FFNN-MHA demonstrates an improvement of 0.0379 in AUC and a 17.8\% decrease in false positives compared to baseline models such as feed-forward neural networks. Our results highlight the potential of the FFNN-MHA model as an effective tool for accurate prediction of the need for mechanical ventilation in critical care settings.
△ Less
Submitted 21 July, 2024;
originally announced July 2024.
-
Competing nucleation pathways in nanocrystal formation
Authors:
Carlos R. Salazar,
Akshay Krishna Ammothum Kandy,
Jean Furstoss,
Quentin Gromoff,
Jacek Goniakowski,
Julien Lam
Abstract:
Despite numerous efforts from numerical approaches to complement experimental measurements, several fundamental challenges have still hindered one's ability to truly provide an atomistic picture of the nucleation process in nanocrystals. Among them, our study resolves three obstacles: (1) Machine-learning force fields including long-range interactions able to capture the finesse of the underlying…
▽ More
Despite numerous efforts from numerical approaches to complement experimental measurements, several fundamental challenges have still hindered one's ability to truly provide an atomistic picture of the nucleation process in nanocrystals. Among them, our study resolves three obstacles: (1) Machine-learning force fields including long-range interactions able to capture the finesse of the underlying atomic interactions, (2) Data-driven characterization of the local ordering in a complex structural landscape associated with several crystal polymorphs and (3) Comparing results from a large range of temperatures using both brute-force and rare-event sampling. Altogether, our simulation strategy has allowed us to study zinc oxide crystallization from nano-droplet melt. Remarkably, our results show that different nucleation pathways compete depending on the investigated degree of supercooling.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Analyzing the factors that are involved in length of inpatient stay at the hospital for diabetes patients
Authors:
Jorden Lam,
Kunpeng Xu
Abstract:
The paper investigates the escalating concerns surrounding the surge in diabetes cases, exacerbated by the COVID-19 pandemic, and the subsequent strain on medical resources. The research aims to construct a predictive model quantifying factors influencing inpatient hospital stay durations for diabetes patients, offering insights to hospital administrators for improved patient management strategies…
▽ More
The paper investigates the escalating concerns surrounding the surge in diabetes cases, exacerbated by the COVID-19 pandemic, and the subsequent strain on medical resources. The research aims to construct a predictive model quantifying factors influencing inpatient hospital stay durations for diabetes patients, offering insights to hospital administrators for improved patient management strategies. The literature review highlights the increasing prevalence of diabetes, emphasizing the need for continued attention and analysis of urban-rural disparities in healthcare access. International studies underscore the financial implications and healthcare burden associated with diabetes-related hospitalizations and complications, emphasizing the significance of effective management strategies. The methodology involves a quantitative approach, utilizing a dataset comprising 10,000 observations of diabetic inpatient encounters in U.S. hospitals from 1999 to 2008. Predictive modeling techniques, particularly Generalized Linear Models (GLM), are employed to develop a model predicting hospital stay durations based on patient demographics, admission types, medical history, and treatment regimen. The results highlight the influence of age, medical history, and treatment regimen on hospital stay durations for diabetes patients. Despite model limitations, such as heteroscedasticity and deviations from normality in residual analysis, the findings offer valuable insights for hospital administrators in patient management. The paper concludes with recommendations for future research to address model limitations and explore the implications of predictive models on healthcare management strategies, ensuring equitable patient care and resource allocation.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Constructing abelian varieties from rank 3 Galois representations with real trace field
Authors:
Raju Krishnamoorthy,
Yeuk Hay Joshua Lam
Abstract:
Let $U/K$ be a smooth affine curve over a number field and let $L$ be an irreducible rank 3 $\overline{\mathbb Q}_{\ell}$-local system on $U$ with trivial determinant and infinite geometric monodromy around a cusp. Suppose further that $L$ extends to an integral model such that the Frobenius traces are contained in a fixed totally real number field. Then, after potentially shrinking $U$, there exi…
▽ More
Let $U/K$ be a smooth affine curve over a number field and let $L$ be an irreducible rank 3 $\overline{\mathbb Q}_{\ell}$-local system on $U$ with trivial determinant and infinite geometric monodromy around a cusp. Suppose further that $L$ extends to an integral model such that the Frobenius traces are contained in a fixed totally real number field. Then, after potentially shrinking $U$, there exists an abelian scheme $f\colon B_U\rightarrow U$ such that $L$ is a summand of $R^2f_*\overline{\mathbb Q}_{\ell}(1)$.
The key ingredients are: (1) the totally real assumption implies $L$ admits a square root $M$; (2) the trace field of $M$ is sufficiently bounded, allowing us to use recent work of Krishnamoorthy-Yang-Zuo to construct an abelian scheme over $U_{\bar K}$ geometrically realizing $L$; and (3) Deligne's weight-monodromy theorem and the Rapoport-Zink spectral sequence, which allow us to pin down the arithmetizations using the total degeneration.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Exploring the formation of gold/silver nanoalloys with gas-phase synthesis and machine-learning assisted simulations
Authors:
Quentin Gromoff,
Patrizio Benzo,
Wissam A. Saidi,
Christopher M. Andolina,
Marie-José Casanove,
Teresa Hungria,
Sophie Barre,
Magali Benoit,
Julien Lam
Abstract:
While nanoalloys are of paramount scientific and practical interests, the main processes leading to their formation are still poorly understood. Key structural features in the alloy systems, including crystal phase, chemical ordering, and morphology, are challenging to control at the nanoscale, making it difficult to transfer their usage to industrial applications. In this contribution, we focus o…
▽ More
While nanoalloys are of paramount scientific and practical interests, the main processes leading to their formation are still poorly understood. Key structural features in the alloy systems, including crystal phase, chemical ordering, and morphology, are challenging to control at the nanoscale, making it difficult to transfer their usage to industrial applications. In this contribution, we focus on the gold/silver system that has two of the most prevalent noble metals, and combine experiments with simulations to uncover the formation mechanisms at the atomic-level. Nanoparticles are produced using state-of-the-art inert-gas aggregation source and analyzed using transmission electron microscopy and energy-dispersive x-ray spectroscopy. Machine-learning-assisted molecular dynamics simulations are employed to model the crystallization process from liquid droplets to nanocrystals. Our study finds a preponderance of nanoparticles with five-fold symmetric morphology, including icosahedron and decahedron which is consistent with previous results on mono-metallic nanoparticles. However, we observe that gold atoms, rather than silver atoms, segregate at the surface of the obtained nanoparticles for all the considered alloy compositions. These segregation tendencies are in contrast to previous studies and have consequences on the crystallization dynamics and the subsequent crystal ordering. We finally show that the underpinnings of this surprising segregation dynamics is due to charge transfer and electrostatic interactions rather than surface energy considerations.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Harvesting nucleating structures in nanoparticle crystallization: The example of gold, silver and iron
Authors:
Arthur France-Lanord,
Sarath Menon,
Julien Lam
Abstract:
The thermodynamics and kinetics of crystallization of nanoparticles, as opposed to bulk phases, may be influenced by surface and size effects. We investigate the importance of such factors in the crystallization process of gold, silver, and iron nanodroplets using numerical simulations in the form of molecular dynamics combined with path sampling. This modeling strategy is targeted at obtaining re…
▽ More
The thermodynamics and kinetics of crystallization of nanoparticles, as opposed to bulk phases, may be influenced by surface and size effects. We investigate the importance of such factors in the crystallization process of gold, silver, and iron nanodroplets using numerical simulations in the form of molecular dynamics combined with path sampling. This modeling strategy is targeted at obtaining representative ensembles of structures located at the transition state of the crystallization process. A structural analysis of the transition state ensembles reveals that both the average size and location of the critical nucleation cluster are influenced by surface and nanoscale size effets. Furthermore, we also show that transition state structures in smaller nanodroplets exhibit a more ordered liquid phase, and differentiating between a well-ordered critical cluster and its surrounding disordered liquid phase becomes less evident. All in all, these findings demonstrate that crystallization mechanisms in nanoparticles go beyond the assumptions of classical nucleation theory.
△ Less
Submitted 8 July, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Families of subvarieties of manifolds with a canonical form
Authors:
Yeuk Hay Joshua Lam,
Federico Moretti,
Giovanni Passeri
Abstract:
Suppose $Y$ is a smooth variety equipped with a top form. We prove a simple theorem giving a sharp lower bound on the geometric genus of a family of subvarieties of $Y$, in terms of the dimension of this family. Two elementary applications are presented. On the one hand, we show that for a very general curve $C$ and a very general hypersurface $Y\subset \mathbb P^{n+1}$ of degree $\ge 2n+1$, any m…
▽ More
Suppose $Y$ is a smooth variety equipped with a top form. We prove a simple theorem giving a sharp lower bound on the geometric genus of a family of subvarieties of $Y$, in terms of the dimension of this family. Two elementary applications are presented. On the one hand, we show that for a very general curve $C$ and a very general hypersurface $Y\subset \mathbb P^{n+1}$ of degree $\ge 2n+1$, any map $C \to Y$ is constant. On the other hand, we give a lower bound on the genus of a family of curves with an isotrivial factor in the associated family of Jacobians; we also characterize the families of curves attaining this bound as the families of degree $2$ branched covers of a fixed curve.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
Semi-supervised Domain Adaptation on Graphs with Contrastive Learning and Minimax Entropy
Authors:
Jiaren Xiao,
Quanyu Dai,
Xiao Shen,
Xiaochen Xie,
Jing Dai,
James Lam,
Ka-Wai Kwok
Abstract:
Label scarcity in a graph is frequently encountered in real-world applications due to the high cost of data labeling. To this end, semi-supervised domain adaptation (SSDA) on graphs aims to leverage the knowledge of a labeled source graph to aid in node classification on a target graph with limited labels. SSDA tasks need to overcome the domain gap between the source and target graphs. However, to…
▽ More
Label scarcity in a graph is frequently encountered in real-world applications due to the high cost of data labeling. To this end, semi-supervised domain adaptation (SSDA) on graphs aims to leverage the knowledge of a labeled source graph to aid in node classification on a target graph with limited labels. SSDA tasks need to overcome the domain gap between the source and target graphs. However, to date, this challenging research problem has yet to be formally considered by the existing approaches designed for cross-graph node classification. This paper proposes a novel method called SemiGCL to tackle the graph \textbf{Semi}-supervised domain adaptation with \textbf{G}raph \textbf{C}ontrastive \textbf{L}earning and minimax entropy training. SemiGCL generates informative node representations by contrasting the representations learned from a graph's local and global views. Additionally, SemiGCL is adversarially optimized with the entropy loss of unlabeled target nodes to reduce domain divergence. Experimental results on benchmark datasets demonstrate that SemiGCL outperforms the state-of-the-art baselines on the SSDA tasks. The source codes of SemiGCL are publicly available at https://github.com/ JiarenX/SemiGCL.
△ Less
Submitted 4 April, 2024; v1 submitted 13 September, 2023;
originally announced September 2023.
-
SeamlessM4T: Massively Multilingual & Multimodal Machine Translation
Authors:
Seamless Communication,
Loïc Barrault,
Yu-An Chung,
Mariano Cora Meglioli,
David Dale,
Ning Dong,
Paul-Ambroise Duquenne,
Hady Elsahar,
Hongyu Gong,
Kevin Heffernan,
John Hoffman,
Christopher Klaiber,
Pengwei Li,
Daniel Licht,
Jean Maillard,
Alice Rakotoarison,
Kaushik Ram Sadagopan,
Guillaume Wenzek,
Ethan Ye,
Bapi Akula,
Peng-Jen Chen,
Naji El Hachem,
Brian Ellis,
Gabriel Mejia Gonzalez,
Justin Haaheim
, et al. (43 additional authors not shown)
Abstract:
What does it take to create the Babel Fish, a tool that can help individuals translate speech between any two languages? While recent breakthroughs in text-based models have pushed machine translation coverage beyond 200 languages, unified speech-to-speech translation models have yet to achieve similar strides. More specifically, conventional speech-to-speech translation systems rely on cascaded s…
▽ More
What does it take to create the Babel Fish, a tool that can help individuals translate speech between any two languages? While recent breakthroughs in text-based models have pushed machine translation coverage beyond 200 languages, unified speech-to-speech translation models have yet to achieve similar strides. More specifically, conventional speech-to-speech translation systems rely on cascaded systems that perform translation progressively, putting high-performing unified systems out of reach. To address these gaps, we introduce SeamlessM4T, a single model that supports speech-to-speech translation, speech-to-text translation, text-to-speech translation, text-to-text translation, and automatic speech recognition for up to 100 languages. To build this, we used 1 million hours of open speech audio data to learn self-supervised speech representations with w2v-BERT 2.0. Subsequently, we created a multimodal corpus of automatically aligned speech translations. Filtered and combined with human-labeled and pseudo-labeled data, we developed the first multilingual system capable of translating from and into English for both speech and text. On FLEURS, SeamlessM4T sets a new standard for translations into multiple target languages, achieving an improvement of 20% BLEU over the previous SOTA in direct speech-to-text translation. Compared to strong cascaded models, SeamlessM4T improves the quality of into-English translation by 1.3 BLEU points in speech-to-text and by 2.6 ASR-BLEU points in speech-to-speech. Tested for robustness, our system performs better against background noises and speaker variations in speech-to-text tasks compared to the current SOTA model. Critically, we evaluated SeamlessM4T on gender bias and added toxicity to assess translation safety. Finally, all contributions in this work are open-sourced and accessible at https://github.com/facebookresearch/seamless_communication
△ Less
Submitted 24 October, 2023; v1 submitted 22 August, 2023;
originally announced August 2023.
-
Frobenius trace fields of cohomologically rigid local systems
Authors:
Raju Krishnamoorthy,
Yeuk Hay Joshua Lam
Abstract:
Let $X/\mathbb{C}$ be a smooth variety with simple normal crossings compactification $\bar{X}$, and let $L$ be an irreducible $\overline{\mathbb{Q}}_{\ell}$-local system on $X$ with torsion determinant. Suppose $L$ is cohomologically rigid. The pair $(X, L)$ may be spread out to a finitely generated base, and therefore reduced modulo $p$ for almost all $p$; the Frobenius traces of this mod $p$ red…
▽ More
Let $X/\mathbb{C}$ be a smooth variety with simple normal crossings compactification $\bar{X}$, and let $L$ be an irreducible $\overline{\mathbb{Q}}_{\ell}$-local system on $X$ with torsion determinant. Suppose $L$ is cohomologically rigid. The pair $(X, L)$ may be spread out to a finitely generated base, and therefore reduced modulo $p$ for almost all $p$; the Frobenius traces of this mod $p$ reduction lie in a number field $F_p$, by a theorem of Deligne. We investigate to what extent the fields $F_p$ are bounded, meaning that they are contained in a fixed number field, independent of $p$. We prove a host of results around this question. For instance: assuming $L$ has totally degenerate unipotent monodromy around some component of $Z$, then we prove that $L$ admits a spreading out such that the $F_p$'s are bounded; without any local monodromy assumptions, we show that the $F_p$'s are bounded as soon as they are bounded at one point of $X$.
We also speculate on the relation between the boundedness of the $F_p$'s, and the local system $L$ being strongly of geometric origin, a notion due to Langer-Simpson.
△ Less
Submitted 4 December, 2023; v1 submitted 21 August, 2023;
originally announced August 2023.
-
Finite braid group orbits on $SL_2$-character varieties
Authors:
Yeuk Hay Joshua Lam,
Aaron Landesman,
Daniel Litt
Abstract:
Let X be a 2-sphere with n punctures. We classify all conjugacy classes of Zariski-dense representations $$ρ: π_1(X)\to SL_2(\mathbb{C})$$ with finite orbit under the mapping class group of X, such that the local monodromy at one or more punctures has infinite order. We show that all such representations are "of pullback type" or arise via middle convolution from finite complex reflection groups.…
▽ More
Let X be a 2-sphere with n punctures. We classify all conjugacy classes of Zariski-dense representations $$ρ: π_1(X)\to SL_2(\mathbb{C})$$ with finite orbit under the mapping class group of X, such that the local monodromy at one or more punctures has infinite order. We show that all such representations are "of pullback type" or arise via middle convolution from finite complex reflection groups. In particular, we classify all rank 2 local systems of geometric origin on the projective line with n generic punctures, and with local monodromy of infinite order about at least one puncture.
△ Less
Submitted 2 August, 2023;
originally announced August 2023.
-
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation
Authors:
David Dale,
Elena Voita,
Janice Lam,
Prangthip Hansanti,
Christophe Ropers,
Elahe Kalbassi,
Cynthia Gao,
Loïc Barrault,
Marta R. Costa-jussà
Abstract:
Hallucinations in machine translation are translations that contain information completely unrelated to the input. Omissions are translations that do not include some of the input information. While both cases tend to be catastrophic errors undermining user trust, annotated data with these types of pathologies is extremely scarce and is limited to a few high-resource languages. In this work, we re…
▽ More
Hallucinations in machine translation are translations that contain information completely unrelated to the input. Omissions are translations that do not include some of the input information. While both cases tend to be catastrophic errors undermining user trust, annotated data with these types of pathologies is extremely scarce and is limited to a few high-resource languages. In this work, we release an annotated dataset for the hallucination and omission phenomena covering 18 translation directions with varying resource levels and scripts. Our annotation covers different levels of partial and full hallucinations as well as omissions both at the sentence and at the word level. Additionally, we revisit previous methods for hallucination and omission detection, show that conclusions made based on a single language pair largely do not hold for a large-scale evaluation, and establish new solid baselines.
△ Less
Submitted 5 December, 2023; v1 submitted 19 May, 2023;
originally announced May 2023.
-
Unsupervised Scientific Abstract Segmentation with Normalized Mutual Information
Authors:
Yingqiang Gao,
Jessica Lam,
Nianlong Gu,
Richard H. R. Hahnloser
Abstract:
The abstracts of scientific papers consist of premises and conclusions. Structured abstracts explicitly highlight the conclusion sentences, whereas non-structured abstracts may have conclusion sentences at uncertain positions. This implicit nature of conclusion positions makes the automatic segmentation of scientific abstracts into premises and conclusions a challenging task. In this work, we empi…
▽ More
The abstracts of scientific papers consist of premises and conclusions. Structured abstracts explicitly highlight the conclusion sentences, whereas non-structured abstracts may have conclusion sentences at uncertain positions. This implicit nature of conclusion positions makes the automatic segmentation of scientific abstracts into premises and conclusions a challenging task. In this work, we empirically explore using Normalized Mutual Information (NMI) for abstract segmentation. We consider each abstract as a recurrent cycle of sentences and place segmentation boundaries by greedily optimizing the NMI score between premises and conclusions. On non-structured abstracts, our proposed unsupervised approach GreedyCAS achieves the best performance across all evaluation metrics; on structured abstracts, GreedyCAS outperforms all baseline methods measured by $P_k$. The strong correlation of NMI to our evaluation metrics reveals the effectiveness of NMI for abstract segmentation.
△ Less
Submitted 19 May, 2023;
originally announced May 2023.
-
Geometric local systems on the projective line minus four points
Authors:
Yeuk Hay Joshua Lam,
Daniel Litt
Abstract:
Let $J(m)$ be an $m\times m$ Jordan block with eigenvalue $1$. For $λ\in \mathbb{C}\setminus\{0,1\}$, we explicitly construct all rank $2$ local systems of geometric origin on $\mathbb{P}^1\setminus\{0,1,λ, \infty\}$, with local monodromy conjugate to $J(2)$ at $0,1,λ$ and conjugate to $-J(2)$ at $\infty$. The construction relies on Katz's middle convolution operation. We use our construction to p…
▽ More
Let $J(m)$ be an $m\times m$ Jordan block with eigenvalue $1$. For $λ\in \mathbb{C}\setminus\{0,1\}$, we explicitly construct all rank $2$ local systems of geometric origin on $\mathbb{P}^1\setminus\{0,1,λ, \infty\}$, with local monodromy conjugate to $J(2)$ at $0,1,λ$ and conjugate to $-J(2)$ at $\infty$. The construction relies on Katz's middle convolution operation. We use our construction to prove two conjectures of Sun-Yang-Zuo (one of which was proven earlier by Lin-Sheng-Wang; the other was proven independently from us by Yang-Zuo).
△ Less
Submitted 18 May, 2023;
originally announced May 2023.
-
Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators
Authors:
Alexander Herzog,
Kanishka Rao,
Karol Hausman,
Yao Lu,
Paul Wohlhart,
Mengyuan Yan,
Jessica Lin,
Montserrat Gonzalez Arenas,
Ted Xiao,
Daniel Kappler,
Daniel Ho,
Jarek Rettinghouse,
Yevgen Chebotar,
Kuang-Huei Lee,
Keerthana Gopalakrishnan,
Ryan Julian,
Adrian Li,
Chuyuan Kelly Fu,
Bob Wei,
Sangeetha Ramesh,
Khem Holden,
Kim Kleiven,
David Rendleman,
Sean Kirmani,
Jeff Bingham
, et al. (15 additional authors not shown)
Abstract:
We describe a system for deep reinforcement learning of robotic manipulation skills applied to a large-scale real-world task: sorting recyclables and trash in office buildings. Real-world deployment of deep RL policies requires not only effective training algorithms, but the ability to bootstrap real-world training and enable broad generalization. To this end, our system combines scalable deep RL…
▽ More
We describe a system for deep reinforcement learning of robotic manipulation skills applied to a large-scale real-world task: sorting recyclables and trash in office buildings. Real-world deployment of deep RL policies requires not only effective training algorithms, but the ability to bootstrap real-world training and enable broad generalization. To this end, our system combines scalable deep RL from real-world data with bootstrapping from training in simulation, and incorporates auxiliary inputs from existing computer vision systems as a way to boost generalization to novel objects, while retaining the benefits of end-to-end training. We analyze the tradeoffs of different design decisions in our system, and present a large-scale empirical validation that includes training on real-world data gathered over the course of 24 months of experimentation, across a fleet of 23 robots in three office buildings, with a total training set of 9527 hours of robotic experience. Our final validation also consists of 4800 evaluation trials across 240 waste station configurations, in order to evaluate in detail the impact of the design decisions in our system, the scaling effects of including more real-world data, and the performance of the method on novel objects. The projects website and videos can be found at \href{http://rl-at-scale.github.io}{rl-at-scale.github.io}.
△ Less
Submitted 5 May, 2023;
originally announced May 2023.
-
A critical comparison of general-purpose collective variables for crystal nucleation
Authors:
Julien Lam,
Fabio Pietrucci
Abstract:
The nucleation of crystals is a prominent phenomenon in science and technology that still lacks a full atomic-scale understanding. Much work has been devoted to identifying order parameters able to track the process, from the inception of early nuclei to their maturing to critical size until growth of an extended crystal. We critically assess and compare two powerful distance-based collective vari…
▽ More
The nucleation of crystals is a prominent phenomenon in science and technology that still lacks a full atomic-scale understanding. Much work has been devoted to identifying order parameters able to track the process, from the inception of early nuclei to their maturing to critical size until growth of an extended crystal. We critically assess and compare two powerful distance-based collective variables, an effective entropy derived from liquid state theory and the path variable based on permutation invariant vectors using the Kob-Andersen binary mixture and a combination of enhanced-sampling techniques. Our findings reveal a comparable ability to drive nucleation when a bias potential is applied, and comparable free-energy barriers and structural features. Yet, we also found an imperfect correlation with the committor probability on the barrier top which was bypassed by changing the order parameter definition.
△ Less
Submitted 23 January, 2023;
originally announced January 2023.
-
Motivic local systems on curves and Maeda's conjecture
Authors:
Yeuk Hay Joshua Lam
Abstract:
We show that only finitely many complex genus two curves and four punctured spheres admit rank two local systems of geometric origin, and moreover each carries finitely many. This gives further counterexamples to a conjecture of Esnault and Kerz: counterexamples over very general curves were recently obtained by Landesman and Litt. In the second part we prove an analogue of this result in positive…
▽ More
We show that only finitely many complex genus two curves and four punctured spheres admit rank two local systems of geometric origin, and moreover each carries finitely many. This gives further counterexamples to a conjecture of Esnault and Kerz: counterexamples over very general curves were recently obtained by Landesman and Litt. In the second part we prove an analogue of this result in positive characteristic, namely that over $\overline{\mathbb{F}}_p$, only finitely many genus two curves admit non-trivial rank two local systems pulled back from a fixed quaternionic Shimura variety, and the same for $\mathbb{P}^1$ minus four points; conjecturally, every rank two local system arises as such a pullback. This provides results towards Maeda's conjecture on Galois orbits of eigenforms over function fields. The proofs make use of ideas from the work of Landesman and Litt such as isomonodromy, as well as crucially the description of the Goren-Oort strata due to Tian and Xiao.
△ Less
Submitted 11 November, 2022;
originally announced November 2022.
-
Boundedness of trace fields of rank two local systems
Authors:
Yeuk Hay Joshua Lam
Abstract:
Let $p$ be a fixed prime number, and $q$ a power of $p$. For any curve over $\mb{F}_q$ and any local system on it, we have a number field generated by the traces of Frobenii at closed points, known as the trace field. We show that as we range over all pointed curves of type $(g,n)$ in characteristic $p$ and rank two local systems satisfying a condition at infinity, the set of trace fields which ar…
▽ More
Let $p$ be a fixed prime number, and $q$ a power of $p$. For any curve over $\mb{F}_q$ and any local system on it, we have a number field generated by the traces of Frobenii at closed points, known as the trace field. We show that as we range over all pointed curves of type $(g,n)$ in characteristic $p$ and rank two local systems satisfying a condition at infinity, the set of trace fields which are unramified at $p$ and of bounded degree is finite. This proves observations of Kontsevich obtained via numerical computations, which are in turn closely related to the analogue of Maeda's conjecture over function fields. We also prove a similar finiteness result across all primes $p$. One of the key steps in the proofs is the boundedness of abelian schemes of $\mathrm{GL}_2$-type over curves in positive characteristics, which is an analogue of Faltings' Arakelov theorem for abelian varieties in our setting.
△ Less
Submitted 18 November, 2022; v1 submitted 24 October, 2022;
originally announced October 2022.
-
Improving the Efficiency of Payments Systems Using Quantum Computing
Authors:
Christopher McMahon,
Donald McGillivray,
Ajit Desai,
Francisco Rivadeneyra,
Jean-Paul Lam,
Thomas Lo,
Danica Marsden,
Vladimir Skavysh
Abstract:
High-value payment systems (HVPSs) are typically liquidity-intensive as the payment requests are indivisible and settled on a gross basis. Finding the right order in which payments should be processed to maximize the liquidity efficiency of these systems is an $NP$-hard combinatorial optimization problem, which quantum algorithms may be able to tackle at meaningful scales. We developed an algorith…
▽ More
High-value payment systems (HVPSs) are typically liquidity-intensive as the payment requests are indivisible and settled on a gross basis. Finding the right order in which payments should be processed to maximize the liquidity efficiency of these systems is an $NP$-hard combinatorial optimization problem, which quantum algorithms may be able to tackle at meaningful scales. We developed an algorithm and ran it on a hybrid quantum annealing solver to find an ordering of payments that reduced the amount of system liquidity necessary without substantially increasing payment delays. Despite the limitations in size and speed of today's quantum computers, our algorithm provided quantifiable efficiency improvements when applied to the Canadian HVPS using a 30-day sample of transaction data. By reordering each batch of 70 payments as they entered the queue, we achieved an average of C\$240 million in daily liquidity savings, with a settlement delay of approximately 90 seconds. For a few days in the sample, the liquidity savings exceeded C\$1 billion. This algorithm could be incorporated as a centralized preprocessor into existing HVPS without entailing a fundamental change to their risk management models.
△ Less
Submitted 17 January, 2023; v1 submitted 19 September, 2022;
originally announced September 2022.
-
Lifts of supersingular abelian varieties with small Mumford-Tate groups
Authors:
Yeuk Hay Joshua Lam,
Abhishek Oswal
Abstract:
We investigate to what extent an abelian variety over a finite field can be lifted to one in characteristic zero with small Mumford-Tate group. We prove that supersingular abelian surfaces, respectively threefolds, can be lifted to ones isogenous to a square, respectively product, of elliptic curves. On the other hand, we show that supersingular abelian threefolds cannot be lifted to one isogenous…
▽ More
We investigate to what extent an abelian variety over a finite field can be lifted to one in characteristic zero with small Mumford-Tate group. We prove that supersingular abelian surfaces, respectively threefolds, can be lifted to ones isogenous to a square, respectively product, of elliptic curves. On the other hand, we show that supersingular abelian threefolds cannot be lifted to one isogenous to the cube of an elliptic curve over the Witt vectors.
△ Less
Submitted 16 August, 2022;
originally announced August 2022.
-
EEGNN: Edge Enhanced Graph Neural Network with a Bayesian Nonparametric Graph Model
Authors:
Yirui Liu,
Xinghao Qiao,
Liying Wang,
Jessica Lam
Abstract:
Training deep graph neural networks (GNNs) poses a challenging task, as the performance of GNNs may suffer from the number of hidden message-passing layers. The literature has focused on the proposals of {over-smoothing} and {under-reaching} to explain the performance deterioration of deep GNNs. In this paper, we propose a new explanation for such deteriorated performance phenomenon, {mis-simplifi…
▽ More
Training deep graph neural networks (GNNs) poses a challenging task, as the performance of GNNs may suffer from the number of hidden message-passing layers. The literature has focused on the proposals of {over-smoothing} and {under-reaching} to explain the performance deterioration of deep GNNs. In this paper, we propose a new explanation for such deteriorated performance phenomenon, {mis-simplification}, that is, mistakenly simplifying graphs by preventing self-loops and forcing edges to be unweighted. We show that such simplifying can reduce the potential of message-passing layers to capture the structural information of graphs. In view of this, we propose a new framework, edge enhanced graph neural network (EEGNN). EEGNN uses the structural information extracted from the proposed Dirichlet mixture Poisson graph model (DMPGM), a Bayesian nonparametric model for graphs, to improve the performance of various deep message-passing GNNs. We propose a Markov chain Monte Carlo inference framework for DMPGM. Experiments over different datasets show that our method achieves considerable performance increase compared to baselines.
△ Less
Submitted 23 February, 2023; v1 submitted 12 August, 2022;
originally announced August 2022.
-
No Language Left Behind: Scaling Human-Centered Machine Translation
Authors:
NLLB Team,
Marta R. Costa-jussà,
James Cross,
Onur Çelebi,
Maha Elbayad,
Kenneth Heafield,
Kevin Heffernan,
Elahe Kalbassi,
Janice Lam,
Daniel Licht,
Jean Maillard,
Anna Sun,
Skyler Wang,
Guillaume Wenzek,
Al Youngblood,
Bapi Akula,
Loic Barrault,
Gabriel Mejia Gonzalez,
Prangthip Hansanti,
John Hoffman,
Semarley Jarrett,
Kaushik Ram Sadagopan,
Dirk Rowe,
Shannon Spruit,
Chau Tran
, et al. (14 additional authors not shown)
Abstract:
Driven by the goal of eradicating language barriers on a global scale, machine translation has solidified itself as a key focus of artificial intelligence research today. However, such efforts have coalesced around a small subset of languages, leaving behind the vast majority of mostly low-resource languages. What does it take to break the 200 language barrier while ensuring safe, high quality res…
▽ More
Driven by the goal of eradicating language barriers on a global scale, machine translation has solidified itself as a key focus of artificial intelligence research today. However, such efforts have coalesced around a small subset of languages, leaving behind the vast majority of mostly low-resource languages. What does it take to break the 200 language barrier while ensuring safe, high quality results, all while keeping ethical considerations in mind? In No Language Left Behind, we took on this challenge by first contextualizing the need for low-resource language translation support through exploratory interviews with native speakers. Then, we created datasets and models aimed at narrowing the performance gap between low and high-resource languages. More specifically, we developed a conditional compute model based on Sparsely Gated Mixture of Experts that is trained on data obtained with novel and effective data mining techniques tailored for low-resource languages. We propose multiple architectural and training improvements to counteract overfitting while training on thousands of tasks. Critically, we evaluated the performance of over 40,000 different translation directions using a human-translated benchmark, Flores-200, and combined human evaluation with a novel toxicity benchmark covering all languages in Flores-200 to assess translation safety. Our model achieves an improvement of 44% BLEU relative to the previous state-of-the-art, laying important groundwork towards realizing a universal translation system. Finally, we open source all contributions described in this work, accessible at https://github.com/facebookresearch/fairseq/tree/nllb.
△ Less
Submitted 25 August, 2022; v1 submitted 11 July, 2022;
originally announced July 2022.
-
Consistent Human Evaluation of Machine Translation across Language Pairs
Authors:
Daniel Licht,
Cynthia Gao,
Janice Lam,
Francisco Guzman,
Mona Diab,
Philipp Koehn
Abstract:
Obtaining meaningful quality scores for machine translation systems through human evaluation remains a challenge given the high variability between human evaluators, partly due to subjective expectations for translation quality for different language pairs. We propose a new metric called XSTS that is more focused on semantic equivalence and a cross-lingual calibration method that enables more cons…
▽ More
Obtaining meaningful quality scores for machine translation systems through human evaluation remains a challenge given the high variability between human evaluators, partly due to subjective expectations for translation quality for different language pairs. We propose a new metric called XSTS that is more focused on semantic equivalence and a cross-lingual calibration method that enables more consistent assessment. We demonstrate the effectiveness of these novel contributions in large scale evaluation studies across up to 14 language pairs, with translation both into and out of English.
△ Less
Submitted 17 May, 2022;
originally announced May 2022.
-
Differential Private Discrete Noise Adding Mechanism: Conditions, Properties and Optimization
Authors:
Shuying Qin,
Jianping He,
Chongrong Fang,
James Lam
Abstract:
Differential privacy is a standard framework to quantify the privacy loss in the data anonymization process. To preserve differential privacy, a random noise adding mechanism is widely adopted, where the trade-off between data privacy level and data utility is of great concern. The privacy and utility properties for the continuous noise adding mechanism have been well studied. However, the related…
▽ More
Differential privacy is a standard framework to quantify the privacy loss in the data anonymization process. To preserve differential privacy, a random noise adding mechanism is widely adopted, where the trade-off between data privacy level and data utility is of great concern. The privacy and utility properties for the continuous noise adding mechanism have been well studied. However, the related works are insufficient for the discrete random mechanism on discretely distributed data, e.g., traffic data, health records. This paper focuses on the discrete random noise adding mechanisms. We study the basic differential privacy conditions and properties for the general discrete random mechanisms, as well as the trade-off between data privacy and data utility. Specifically, we derive a sufficient and necessary condition for discrete epsilon-differential privacy and a sufficient condition for discrete (epsilon, delta)-differential privacy, with the numerical estimation of differential privacy parameters. These conditions can be applied to analyze the differential privacy properties for the discrete noise adding mechanisms with various kinds of noises. Then, with the differential privacy guarantees, we propose an optimal discrete epsilon-differential private noise adding mechanism under the utility-maximization framework, where the utility is characterized by the similarity of the statistical properties between the mechanism's input and output. For this setup, we find that the class of the discrete noise probability distributions in the optimal mechanism is Staircase-shaped.
△ Less
Submitted 19 March, 2022;
originally announced March 2022.
-
The SLAC T-510 experiment for radio emission from particle showers: detailed simulation study and interpretation
Authors:
K. Bechtol,
K. Belov,
K. Borch,
P. Chen,
J. Clem,
P. Gorham,
C. Hast,
T. Huege,
R. Hyneman,
K. Jobe,
K. Kuwatani,
J. Lam,
T. C. Liu,
K. Mulrey,
J. Nam,
C. Naudet,
R. J. Nichol,
C. Paciaroni,
B. F. Rauch,
A. Romero-Wolf,
B. Rotter,
D. Saltzberg,
H. Schoorlemmer,
D. Seckel,
B. Strutt
, et al. (4 additional authors not shown)
Abstract:
Over the last several decades, radio detection of air showers has been widely used to detect ultra-high-energy cosmic rays. We developed an experiment under controlled laboratory conditions at SLAC with which we measured the radio-frequency radiation from a charged particle shower produced by bunches of electrons as primaries with known energy. The shower took place in a target made of High Densit…
▽ More
Over the last several decades, radio detection of air showers has been widely used to detect ultra-high-energy cosmic rays. We developed an experiment under controlled laboratory conditions at SLAC with which we measured the radio-frequency radiation from a charged particle shower produced by bunches of electrons as primaries with known energy. The shower took place in a target made of High Density Polyethylene located in a strong magnetic field. The experiment was designed so that Askaryan and magnetically-induced components of the radio emission could be measured independently. At the same time, we performed a detailed simulation of this experiment to predict the radio signal using two microscopic formalisms, endpoint and ZHS. In this paper, we present the simulation scheme and make a comparison with data characteristics such as linearity with magnetic field and amplitude. The simulations agree with the measurements within uncertainties and present a good description of the data. In particular, reflections within the target that accounted for the largest systematic uncertainties are addressed. The prediction of the amplitude of Askaryan emission agrees with measurements to within 5% for the endpoint formalism and 11% for the ZHS formalism. The amplitudes of magnetically-induced emission agree to within 5% for the endpoint formalism and less than 1% for the ZHS formalism. The agreement of the absolute scale of emission gives confidence in state-of-the-art air shower simulations which are based on the applied formalisms.
△ Less
Submitted 3 March, 2022; v1 submitted 8 November, 2021;
originally announced November 2021.
-
Machine Learning Statistical Gravity from Multi-Region Entanglement Entropy
Authors:
Jonathan Lam,
Yi-Zhuang You
Abstract:
The Ryu-Takayanagi formula directly connects quantum entanglement and geometry. Yet the assumption of static geometry lead to an exponentially small mutual information between far-separated disjoint regions, which does not hold in many systems such as free fermion conformal field theories. In this work, we proposed a microscopic model by superimposing entanglement features of an ensemble of random…
▽ More
The Ryu-Takayanagi formula directly connects quantum entanglement and geometry. Yet the assumption of static geometry lead to an exponentially small mutual information between far-separated disjoint regions, which does not hold in many systems such as free fermion conformal field theories. In this work, we proposed a microscopic model by superimposing entanglement features of an ensemble of random tensor networks of different bond dimensions, which can be mapped to a statistical gravity model consisting of a massive scalar field on a fluctuating background geometry. We propose a machine-learning algorithm that recovers the underlying geometry fluctuation from multi-region entanglement entropy data by modeling the bulk geometry distribution via a generative neural network. To demonstrate its effectiveness, we tested the model on a free fermion system and showed mutual information can be mediated effectively by geometric fluctuation. Remarkably, locality emerged from the learned distribution of bulk geometries, pointing to a local statistical gravity theory in the holographic bulk.
△ Less
Submitted 3 October, 2021;
originally announced October 2021.
-
Finiteness of reductions of Hecke orbits
Authors:
Mark Kisin,
Yeuk Hay Joshua Lam,
Ananth N. Shankar,
Padmavathi Srinivasan
Abstract:
We prove two finiteness results for reductions of Hecke orbits of abelian varieties over local fields: one in the case of supersingular reduction and one in the case of reductive monodromy. As an application, we show that only finitely many abelian varieties on a fixed isogeny leaf admit CM lifts, which in particular implies that in each fixed dimension $g$ only finitely many supersingular abelian…
▽ More
We prove two finiteness results for reductions of Hecke orbits of abelian varieties over local fields: one in the case of supersingular reduction and one in the case of reductive monodromy. As an application, we show that only finitely many abelian varieties on a fixed isogeny leaf admit CM lifts, which in particular implies that in each fixed dimension $g$ only finitely many supersingular abelian varieties admit CM lifts. Combining this with the Kuga-Satake construction, we also show that only finitely many supersingular $K3$-surfaces admit CM lifts. Our tools include $p$-adic Hodge theory and group theoretic techniques.
△ Less
Submitted 10 September, 2021;
originally announced September 2021.
-
Non-classical nucleation of zinc oxide from a physically-motivated machine-learning approach
Authors:
Gaétan Laurens,
Jacek Goniakowski,
Julien Lam
Abstract:
Observing non-classical nucleation pathways remains challenging in simulations of complex materials with technological interests. This is because it requires very accurate force fields that can capture the whole complexity of their underlying interatomic interactions and an advanced structural analysis. Here, we first report the construction of a machine-learning force field for zinc oxide interac…
▽ More
Observing non-classical nucleation pathways remains challenging in simulations of complex materials with technological interests. This is because it requires very accurate force fields that can capture the whole complexity of their underlying interatomic interactions and an advanced structural analysis. Here, we first report the construction of a machine-learning force field for zinc oxide interactions using the Physical LassoLars Interaction Potentials approach which allows us to be predictive even for untrained structures. Then, we carried out freezing simulations from a liquid and observed the crystal formation with atomistic precision. Our results, which are analyzed using a data-driven approach based on bond order parameters, demonstrate the presence of both prenucleation clusters and two-step nucleation scenarios thus retrieving seminal predictions of non-classical nucleation pathways made on much simpler models.
△ Less
Submitted 9 August, 2022; v1 submitted 24 August, 2021;
originally announced August 2021.
-
Adversarially Regularized Graph Attention Networks for Inductive Learning on Partially Labeled Graphs
Authors:
Jiaren Xiao,
Quanyu Dai,
Xiaochen Xie,
James Lam,
Ka-Wai Kwok
Abstract:
The high cost of data labeling often results in node label shortage in real applications. To improve node classification accuracy, graph-based semi-supervised learning leverages the ample unlabeled nodes to train together with the scarce available labeled nodes. However, most existing methods require the information of all nodes, including those to be predicted, during model training, which is not…
▽ More
The high cost of data labeling often results in node label shortage in real applications. To improve node classification accuracy, graph-based semi-supervised learning leverages the ample unlabeled nodes to train together with the scarce available labeled nodes. However, most existing methods require the information of all nodes, including those to be predicted, during model training, which is not practical for dynamic graphs with newly added nodes. To address this issue, an adversarially regularized graph attention model is proposed to classify newly added nodes in a partially labeled graph. An attention-based aggregator is designed to generate the representation of a node by aggregating information from its neighboring nodes, thus naturally generalizing to previously unseen nodes. In addition, adversarial training is employed to improve the model's robustness and generalization ability by enforcing node representations to match a prior distribution. Experiments on real-world datasets demonstrate the effectiveness of the proposed method in comparison with the state-of-the-art methods. The code is available at https://github.com/JiarenX/AGAIN.
△ Less
Submitted 13 March, 2023; v1 submitted 7 June, 2021;
originally announced June 2021.
-
Strong and fast rising pressure waves emitted by plasmonic vapor nanobubbles
Authors:
Julien Lombard,
Julien Lam,
Francois Detcheverry,
Thierry Biben,
Samy Merabia
Abstract:
Plasmonic vapour nanobubbles are currently considered for a wide variety of applications ranging from solar energy harvesting and photoacoustic imaging to nanoparticle-assisted cancer therapy. Yet, due their small size and unstable nature, their generation and consequences remain difficult to characterize. Here, building on a phase-field model, we report on the existence of strong pressure waves t…
▽ More
Plasmonic vapour nanobubbles are currently considered for a wide variety of applications ranging from solar energy harvesting and photoacoustic imaging to nanoparticle-assisted cancer therapy. Yet, due their small size and unstable nature, their generation and consequences remain difficult to characterize. Here, building on a phase-field model, we report on the existence of strong pressure waves that are emitted when vapor nanobubbles first form around a laser-heated nanoparticle immersed in water, and subsequently after bubble rebound. These effects are strongest when the fluid is locally brought high in its supercritical state, which may be realized with a short laser pulse. Because of the highly out-of-equilibrium nature of nanobubble generation, the waves combine a high pressure peak with a fast pressure rising time, and propagate in water over micron distances, opening the way to induce spatially and temporally localized damage. Discussing the consequences on biological cell membranes, we conclude that acoustic-mediated perforation is more efficient than nanobubble expansion to breach membrane. Our findings should serve as guide for optimizing the thermoacoustic conversion efficiency of plasmonic vapor nanobubbles.
△ Less
Submitted 11 May, 2021;
originally announced May 2021.
-
Deep-AIR: A Hybrid CNN-LSTM Framework for Air Quality Modeling in Metropolitan Cities
Authors:
Yang Han,
Qi Zhang,
Victor O. K. Li,
Jacqueline C. K. Lam
Abstract:
Air pollution has long been a serious environmental health challenge, especially in metropolitan cities, where air pollutant concentrations are exacerbated by the street canyon effect and high building density. Whilst accurately monitoring and forecasting air pollution are highly crucial, existing data-driven models fail to fully address the complex interaction between air pollution and urban dyna…
▽ More
Air pollution has long been a serious environmental health challenge, especially in metropolitan cities, where air pollutant concentrations are exacerbated by the street canyon effect and high building density. Whilst accurately monitoring and forecasting air pollution are highly crucial, existing data-driven models fail to fully address the complex interaction between air pollution and urban dynamics. Our Deep-AIR, a novel hybrid deep learning framework that combines a convolutional neural network with a long short-term memory network, aims to address this gap to provide fine-grained city-wide air pollution estimation and station-wide forecast. Our proposed framework creates 1x1 convolution layers to strengthen the learning of cross-feature spatial interaction between air pollution and important urban dynamic features, particularly road density, building density/height, and street canyon effect. Using Hong Kong and Beijing as case studies, Deep-AIR achieves a higher accuracy than our baseline models. Our model attains an accuracy of 67.6%, 77.2%, and 66.1% in fine-grained hourly estimation, 1-hr, and 24-hr air pollution forecast for Hong Kong, and an accuracy of 65.0%, 75.3%, and 63.5% for Beijing. Our saliency analysis has revealed that for Hong Kong, street canyon and road density are the best estimators for NO2, while meteorology is the best estimator for PM2.5.
△ Less
Submitted 25 March, 2021;
originally announced March 2021.
-
Modeling the locomotion of articulated soft robots in granular medium
Authors:
Yayun Du,
Jacqueline Lam,
Karunesh Sachanandani,
Mohammad Khalid Jawed
Abstract:
Soft robots, in contrast to their rigid counter parts, have infinite degrees of freedom that are coupled with their interaction with the environment. We consider the locomotion of an untethered robot, in the granular medium, comprised of multiple flexible flagella that rotate about an axis by a motor. Drag from the grains causes the flagella to deform and the deformed shape generates a net forward…
▽ More
Soft robots, in contrast to their rigid counter parts, have infinite degrees of freedom that are coupled with their interaction with the environment. We consider the locomotion of an untethered robot, in the granular medium, comprised of multiple flexible flagella that rotate about an axis by a motor. Drag from the grains causes the flagella to deform and the deformed shape generates a net forward propulsion. This external drag force depends on the shape of the flagella, while the change in flagellar shape is the result of the competition between the external loading and elastic forces. We introduce a numerical tool that couples discrete differential geometry based simulation of elastic rods - our model for flagella - and a resistive force theory based model for the drag. In parallel with simulations, we conduct experiments to quantify the propulsive speed of this class of robots. We find reasonable quantitative agreement between experiments and simulations. Owing to a rod-based kinematic representation of the robot, the simulation runs faster than real-time, and, therefore, we can use it as a design tool for this class of soft robots. We find that there is an optimal rotational speed at which maximum efficiency is achieved. Moreover, both experiments and simulations show that increasing the number of flagella decreases the speed of the robot. We also gain insight into the mechanics of granular medium - while resistive force theory can successfully describe the propulsion at low number of flagella, it fails when more flagella are added to the robot.
△ Less
Submitted 5 March, 2021;
originally announced March 2021.
-
Infrared spectra of neutral polycyclic aromatic hydrocarbons by machine learning
Authors:
Gaétan Laurens,
Malalatiana Rabary,
Julien Lam,
Daniel Peláez,
Abdul-Rahman Allouche
Abstract:
The Interest in polycyclic aromatic hydrocarbons (PAHs) spans numerous fields and infrared spectroscopy is usually the method of choice to disentangle their molecular structure. In order to compute vibrational frequencies, numerous theoretical studies employ either quantum calculation methods, or empirical potentials, but it remains difficult to combine the accuracy of the first approach with the…
▽ More
The Interest in polycyclic aromatic hydrocarbons (PAHs) spans numerous fields and infrared spectroscopy is usually the method of choice to disentangle their molecular structure. In order to compute vibrational frequencies, numerous theoretical studies employ either quantum calculation methods, or empirical potentials, but it remains difficult to combine the accuracy of the first approach with the computational cost of the second. In this work, we employed Machine Learning techniques to develop a potential energy surface and a dipole mapping based on an artificial neural network (ANN) architecture. Altogether, while trained on only 11 small PAH molecules, the obtained ANNs are able to retrieve the infrared spectra of those small molecules, but more importantly of 8 large PAHs different from the training set, thus demonstrating the transferability of our approach.
△ Less
Submitted 26 October, 2020;
originally announced October 2020.
-
The Attractor Conjecture for Calabi-Yau variations of Hodge structures
Authors:
Yeuk Hay Joshua Lam
Abstract:
We study attractor points for Calabi-Yau variations of Hodge structures. In particular, for certain moduli spaces which are Shimura varieties, we prove that the attractor points are CM points, thus proving Moore's Attractor Conjecture in these cases. We also study non-BPS examples of attractors, obtaining special points on locally symmetric spaces without hermitian structures, as well as locally s…
▽ More
We study attractor points for Calabi-Yau variations of Hodge structures. In particular, for certain moduli spaces which are Shimura varieties, we prove that the attractor points are CM points, thus proving Moore's Attractor Conjecture in these cases. We also study non-BPS examples of attractors, obtaining special points on locally symmetric spaces without hermitian structures, as well as locally symmetric spaces inside Shimura varieties; for the latter we point out a possible analogy with subspaces studied by Goresky-Tai. Finally we give an explicit geometric description of non-BPS attractors in the simplest case.
△ Less
Submitted 5 October, 2020;
originally announced October 2020.
-
Attractors are not algebraic
Authors:
Yeuk Hay Joshua Lam,
Arnav Tripathy
Abstract:
The Attractor Conjecture for Calabi-Yau moduli spaces predicts the algebraicity of the moduli values of certain isolated points picked out by Hodge-theoretic conditions. We provide a family of counterexamples to the Attractor Conjecture in all suitably high, odd dimensions conditional on the Zilber-Pink conjecture.
The Attractor Conjecture for Calabi-Yau moduli spaces predicts the algebraicity of the moduli values of certain isolated points picked out by Hodge-theoretic conditions. We provide a family of counterexamples to the Attractor Conjecture in all suitably high, odd dimensions conditional on the Zilber-Pink conjecture.
△ Less
Submitted 29 September, 2020; v1 submitted 26 September, 2020;
originally announced September 2020.
-
Calabi-Yau threefolds over finite fields and torsion in cohomologies
Authors:
Yeuk Hay Joshua Lam
Abstract:
We study various examples of Calabi-Yau threefolds over finite fields. In particular, we provide a counterexample to a conjecture of K. Joshi on lifting Calabi-Yau threefolds to characteristic zero. We also compute the p-adic cohomologies of some Calabi-Yau threefolds constructed by Cynk-van Straten which have remarkable arithmetic properties, as well as those of the Hirokado threefold. These exam…
▽ More
We study various examples of Calabi-Yau threefolds over finite fields. In particular, we provide a counterexample to a conjecture of K. Joshi on lifting Calabi-Yau threefolds to characteristic zero. We also compute the p-adic cohomologies of some Calabi-Yau threefolds constructed by Cynk-van Straten which have remarkable arithmetic properties, as well as those of the Hirokado threefold. These examples and computations answer some outstanding questions of B. Bhatt, T. Ekedahl, van der Geer-Katsura and Patakfalvi-Zdanowicz, and shed new light on the Beauville-Bogomolov decomposition in positive characteristic. Our tools include p-adic Hodge theory as well as classical algebraic topology. We also give potential examples showing that Hodge numbers of threefolds in positive characteristic are not derived invariants, contrary to the case of characteristic zero.
△ Less
Submitted 21 September, 2020;
originally announced September 2020.
-
Out-of-equilibrium Polymorph Selection in Nanoparticle Freezing
Authors:
Jonathan Amodeo,
Fabio Pietrucci,
Julien Lam
Abstract:
The ability to design synthesis processes that are out of equilibrium has opened the possibility of creating nanomaterials with remarkable physico-chemical properties, choosing from a much richer palette of possible atomic architectures compared to equilibrium processes in extended systems. In this work, we employ atomistic simulations to demonstrate how to control polymorph selection via cooling…
▽ More
The ability to design synthesis processes that are out of equilibrium has opened the possibility of creating nanomaterials with remarkable physico-chemical properties, choosing from a much richer palette of possible atomic architectures compared to equilibrium processes in extended systems. In this work, we employ atomistic simulations to demonstrate how to control polymorph selection via cooling rate during nanoparticle freezing in the case of Ni$_3$Al, a material with a rich structural landscape. State-of-the-art free-energy calculations allow to rationalize the complex nucleation process, discovering a switch between two kinetic pathways, yielding the equilibrium structure at room temperature and an alternative metastable one at higher temperature. Our findings address the key challenge in the synthesis of nano-alloys for technological applications, i.e., rationally exploiting the competition between kinetics and thermodynamics by designing a treatment history that forces the system into desirable metastable states.
△ Less
Submitted 11 September, 2020;
originally announced September 2020.
-
Generalized Bockstein maps and Massey products
Authors:
Yeuk Hay Joshua Lam,
Yuan Liu,
Romyar Sharifi,
Preston Wake,
Jiuya Wang
Abstract:
Given a profinite group G of finite p-cohomological dimension and a pro-p quotient H of G by a closed normal subgroup N, we study the filtration on the Iwasawa cohomology of N by powers of the augmentation ideal in the group algebra of H. We show that the graded pieces are related to the cohomology of G via analogues of Bockstein maps for the powers of the augmentation ideal. For certain groups H,…
▽ More
Given a profinite group G of finite p-cohomological dimension and a pro-p quotient H of G by a closed normal subgroup N, we study the filtration on the Iwasawa cohomology of N by powers of the augmentation ideal in the group algebra of H. We show that the graded pieces are related to the cohomology of G via analogues of Bockstein maps for the powers of the augmentation ideal. For certain groups H, we relate the values of these generalized Bockstein maps to Massey products relative to a restricted class of defining systems depending on H. We apply our study to prove lower bounds on the p-ranks of class groups of certain nonabelian extensions of the rational numbers and to give a new proof of the vanishing of triple Massey products in Galois cohomology.
△ Less
Submitted 25 November, 2022; v1 submitted 23 April, 2020;
originally announced April 2020.
-
Facial Action Unit Intensity Estimation via Semantic Correspondence Learning with Dynamic Graph Convolution
Authors:
Yingruo Fan,
Jacqueline C. K. Lam,
Victor O. K. Li
Abstract:
The intensity estimation of facial action units (AUs) is challenging due to subtle changes in the person's facial appearance. Previous approaches mainly rely on probabilistic models or predefined rules for modeling co-occurrence relationships among AUs, leading to limited generalization. In contrast, we present a new learning framework that automatically learns the latent relationships of AUs via…
▽ More
The intensity estimation of facial action units (AUs) is challenging due to subtle changes in the person's facial appearance. Previous approaches mainly rely on probabilistic models or predefined rules for modeling co-occurrence relationships among AUs, leading to limited generalization. In contrast, we present a new learning framework that automatically learns the latent relationships of AUs via establishing semantic correspondences between feature maps. In the heatmap regression-based network, feature maps preserve rich semantic information associated with AU intensities and locations. Moreover, the AU co-occurring pattern can be reflected by activating a set of feature channels, where each channel encodes a specific visual pattern of AU. This motivates us to model the correlation among feature channels, which implicitly represents the co-occurrence relationship of AU intensity levels. Specifically, we introduce a semantic correspondence convolution (SCC) module to dynamically compute the correspondences from deep and low resolution feature maps, and thus enhancing the discriminability of features. The experimental results demonstrate the effectiveness and the superior performance of our method on two benchmark datasets.
△ Less
Submitted 20 April, 2020;
originally announced April 2020.
-
Long-wavelength density fluctuations as nucleation precursors
Authors:
James F. Lutsko,
Julien Lam
Abstract:
Recent theories of nucleation that go beyond Classical Nucleation Theory predict that diffusion-limited nucleation of both liquid droplets and of crystals from a low-density vapor (or weak solution) begins with long-wavelength density fluctuations. This means that in the early stages of nucleation, 'clusters' can have low density but large spatial extent, which is at odds with the classical pictur…
▽ More
Recent theories of nucleation that go beyond Classical Nucleation Theory predict that diffusion-limited nucleation of both liquid droplets and of crystals from a low-density vapor (or weak solution) begins with long-wavelength density fluctuations. This means that in the early stages of nucleation, 'clusters' can have low density but large spatial extent, which is at odds with the classical picture of arbitrarily small clusters of the condensed phase. We present the results of kinetic Monte Carlo simulations using Forward Flux Sampling to show that these predictions are confirmed: namely that on average, nucleation begins in the presence of low-amplitude, but spatially extended density fluctuations thus confirming a significant prediction of the non-classical theory.
△ Less
Submitted 24 March, 2020;
originally announced March 2020.
-
Machine Learning based Anomaly Detection for 5G Networks
Authors:
Jordan Lam,
Robert Abbas
Abstract:
Protecting the networks of tomorrow is set to be a challenging domain due to increasing cyber security threats and widening attack surfaces created by the Internet of Things (IoT), increased network heterogeneity, increased use of virtualisation technologies and distributed architectures. This paper proposes SDS (Software Defined Security) as a means to provide an automated, flexible and scalable…
▽ More
Protecting the networks of tomorrow is set to be a challenging domain due to increasing cyber security threats and widening attack surfaces created by the Internet of Things (IoT), increased network heterogeneity, increased use of virtualisation technologies and distributed architectures. This paper proposes SDS (Software Defined Security) as a means to provide an automated, flexible and scalable network defence system. SDS will harness current advances in machine learning to design a CNN (Convolutional Neural Network) using NAS (Neural Architecture Search) to detect anomalous network traffic. SDS can be applied to an intrusion detection system to create a more proactive and end-to-end defence for a 5G network. To test this assumption, normal and anomalous network flows from a simulated environment have been collected and analyzed with a CNN. The results from this method are promising as the model has identified benign traffic with a 100% accuracy rate and anomalous traffic with a 96.4% detection rate. This demonstrates the effectiveness of network flow analysis for a variety of common malicious attacks and also provides a viable option for detection of encrypted malicious network traffic.
△ Less
Submitted 6 March, 2020;
originally announced March 2020.
-
Deep-AIR: A Hybrid CNN-LSTM Framework forFine-Grained Air Pollution Forecast
Authors:
Qi Zhang,
Jacqueline CK Lam,
Victor OK Li,
Yang Han
Abstract:
Poor air quality has become an increasingly critical challenge for many metropolitan cities, which carries many catastrophicphysical and mental consequences on human health and quality of life. However, accurately monitoring and forecasting air qualityremains a highly challenging endeavour. Limited by geographically sparse data, traditional statistical models and newly emergingdata-driven methods…
▽ More
Poor air quality has become an increasingly critical challenge for many metropolitan cities, which carries many catastrophicphysical and mental consequences on human health and quality of life. However, accurately monitoring and forecasting air qualityremains a highly challenging endeavour. Limited by geographically sparse data, traditional statistical models and newly emergingdata-driven methods of air quality forecasting mainly focused on the temporal correlation between the historical temporal datasets of airpollutants. However, in reality, both distribution and dispersion of air pollutants are highly location-dependant. In this paper, we proposea novel hybrid deep learning model that combines Convolutional Neural Networks (CNN) and Long Short Term Memory (LSTM)together to forecast air quality at high-resolution. Our model can utilize the spatial correlation characteristic of our air pollutant datasetsto achieve higher forecasting accuracy than existing deep learning models of air pollution forecast.
△ Less
Submitted 29 January, 2020;
originally announced January 2020.
-
CATVI: Conditional and Adaptively Truncated Variational Inference for Hierarchical Bayesian Nonparametric Models
Authors:
Yirui Liu,
Xinghao Qiao,
Jessica Lam
Abstract:
Current variational inference methods for hierarchical Bayesian nonparametric models can neither characterize the correlation structure among latent variables due to the mean-field setting, nor infer the true posterior dimension because of the universal truncation. To overcome these limitations, we propose the conditional and adaptively truncated variational inference method (CATVI) by maximizing…
▽ More
Current variational inference methods for hierarchical Bayesian nonparametric models can neither characterize the correlation structure among latent variables due to the mean-field setting, nor infer the true posterior dimension because of the universal truncation. To overcome these limitations, we propose the conditional and adaptively truncated variational inference method (CATVI) by maximizing the nonparametric evidence lower bound and integrating Monte Carlo into the variational inference framework. CATVI enjoys several advantages over traditional methods, including a smaller divergence between variational and true posteriors, reduced risk of underfitting or overfitting, and improved prediction accuracy. Empirical studies on three large datasets reveal that CATVI applied in Bayesian nonparametric topic models substantially outperforms competing models, providing lower perplexity and clearer topic-words clustering.
△ Less
Submitted 5 April, 2022; v1 submitted 13 January, 2020;
originally announced January 2020.
-
Measuring transferability issues in machine-learning force fields: The example of Gold-Iron interactions with linearized potentials
Authors:
Magali Benoit,
Jonathan Amodeo,
Ségolène Combettes,
Ibrahim Khaled,
Aurélien Roux,
Julien Lam
Abstract:
Machine-learning force fields have been increasingly employed in order to extend the possibility of current first-principles calculations. However, the transferability of the obtained potential can not always be guaranteed in situations that are outside the original database. To study such limitation, we examined the very difficult case of the interactions in gold-iron nanoparticles. For the machi…
▽ More
Machine-learning force fields have been increasingly employed in order to extend the possibility of current first-principles calculations. However, the transferability of the obtained potential can not always be guaranteed in situations that are outside the original database. To study such limitation, we examined the very difficult case of the interactions in gold-iron nanoparticles. For the machine-learning potential, we employed a linearized formulation that is parameterized using a penalizing regression scheme which allows us to control the complexity of the obtained potential. We showed that while having a more complex potential allows for a better agreement with the training database, it can also lead to overfitting issues and a lower accuracy in untrained systems.
△ Less
Submitted 9 November, 2020; v1 submitted 23 December, 2019;
originally announced December 2019.
-
Entropy-driven impurity-induced nematic-isotropic transition of liquid crystals
Authors:
Pritam Kumar Jana,
Julien Lam,
Nagma Parveen,
Mikko J Alava,
Lasse Laurson
Abstract:
Phase behavior of liquid crystals is of long-standing interest due to numerous applications, with one of the key issues being how the presence of impurities affects the liquid crystalline order. Here we study the orientational order of 4-cyano-4$^{'}$-pentylbiphenyl (5CB) and 4-cyano-4$^{'}$-hexylbiphenyl (6CB) nematic liquid crystals in the presence of varying concentrations of water and n-hexane…
▽ More
Phase behavior of liquid crystals is of long-standing interest due to numerous applications, with one of the key issues being how the presence of impurities affects the liquid crystalline order. Here we study the orientational order of 4-cyano-4$^{'}$-pentylbiphenyl (5CB) and 4-cyano-4$^{'}$-hexylbiphenyl (6CB) nematic liquid crystals in the presence of varying concentrations of water and n-hexane molecules serving as impurities, by carrying out both fully atomistic simulations and experiments. Our results reveal that mixing of the impurities (in case of hexane) with the host liquid crystals causes a nematic-to-isotropic phase transition with hexane concentration as the control parameter while demixing (in case of water) results in only weak impurity-induced perturbations to the nematic liquid-crystalline order. We develop a coarse-grained model illustrating the general nature and entropic origin of the mixing-induced phase transition.
△ Less
Submitted 12 October, 2019;
originally announced October 2019.
-
Combining quantum mechanics and machine-learning calculations for anharmonic corrections to vibrational frequencies
Authors:
Julien Lam,
Saleh Abdul-Al,
Abdul-Rahman Allouche
Abstract:
Several methods are available to compute the anharmonicity in semi-rigid molecules. However, such methods are not routinely employed yet because of their large computational cost, especially for large molecules. The potential energy surface is required and generally approximated by a quartic force field potential based on ab initio calculation, thus limiting this approach to medium-sized molecules…
▽ More
Several methods are available to compute the anharmonicity in semi-rigid molecules. However, such methods are not routinely employed yet because of their large computational cost, especially for large molecules. The potential energy surface is required and generally approximated by a quartic force field potential based on ab initio calculation, thus limiting this approach to medium-sized molecules. We developed a new, fast and accurate hybrid Quantum Mechanic/Machine learning (QM//ML) approach to reduce the computational time for large systems. With this novel approach, we evaluated anharmonic frequencies of 37 molecules thus covering a broad range of vibrational modes and chemical environments. The obtained fundamental frequencies reproduce results obtained using B2PLYP/def2tzvpp with a root-mean-square deviation (RMSD) of 21 cm-1 and experimental results with a RMSD of 23 cm-1. Along with this very good accuracy, the computational time with our hybrid QM//ML approach scales linearly with N while the traditional full ab initio method scales as N2, where N is the number of atoms.
△ Less
Submitted 3 February, 2020; v1 submitted 27 September, 2019;
originally announced September 2019.