-
Insights from the exact analytical solution of periodically driven transverse field Ising chain
Authors:
Pritam Das,
Anirban Dutta
Abstract:
We derive an exact analytical expression, at stroboscopic intervals, for the time-dependent wave function of a class of integrable quantum many-body systems, driven by the periodic delta-kick protocol. To investigate long-time dynamics, we use the wave-function to obtain an exact analytical expression for the expectation value of defect density, magnetization, residual energy, fidelity, and correl…
▽ More
We derive an exact analytical expression, at stroboscopic intervals, for the time-dependent wave function of a class of integrable quantum many-body systems, driven by the periodic delta-kick protocol. To investigate long-time dynamics, we use the wave-function to obtain an exact analytical expression for the expectation value of defect density, magnetization, residual energy, fidelity, and correlation function after the $n$th drive cycle. Periodically driven integrable closed quantum systems absorb energy, and the long-time universal dynamics are described by the periodic generalized Gibbs ensemble(GGE). We demonstrate that the expectation values of all observables are divided into two parts: one highly oscillatory term that depends on the drive cycle $n$, and the rest of the terms are independent of it. Typically, the $n$-independent part constitutes the saturation at large $n$ and periodic GGE. The contribution from the highly oscillatory term vanishes in large $n$.
△ Less
Submitted 13 September, 2024;
originally announced September 2024.
-
Principles of hydrodynamic particle manipulation in internal Stokes flow
Authors:
Xuchen Liu,
Partha Kumar Das,
Sascha Hilgenfeldt
Abstract:
Manipulation of small-scale particles across streamlines is the elementary task of microfluidic devices. Many such devices operate at very low Reynolds numbers and deflect particles using arrays of obstacles, but a systematic quantification of relevant hydrodynamic effects has been lacking. Here, we explore an alternate approach, rigorously modeling the displacement of force-free spherical particl…
▽ More
Manipulation of small-scale particles across streamlines is the elementary task of microfluidic devices. Many such devices operate at very low Reynolds numbers and deflect particles using arrays of obstacles, but a systematic quantification of relevant hydrodynamic effects has been lacking. Here, we explore an alternate approach, rigorously modeling the displacement of force-free spherical particles in vortical Stokes flows under hydrodynamic particle-wall interaction. Certain Moffatt-like eddy geometries with broken symmetry allow for systematic deflection of particles across streamlines, leading to particle accumulation at either Faxen field fixed points or limit cycles. Moreover, particles can be forced onto trajectories approaching channel walls exponentially closely, making quantitative predictions of particle capture (sticking) by short-range forces possible. This rich, particle size-dependent behavior suggests the versatile use of inertial-less flow in devices with a long particle residence time for concentration, sorting, or filtering.
△ Less
Submitted 12 September, 2024;
originally announced September 2024.
-
Tuning the Planarity of an Aromatic Thianthrene-Based Molecule on Au(111)
Authors:
Kwan Ho Au-Yeung,
Suchetana Sarkar,
Sattwick Haldar,
Pranjit Das,
Tim Kühne,
Dmitry A. Ryndyk,
Preeti Bhauriyal,
Stefan Kaskel,
Thomas Heine,
Gianaurelio Cuniberti,
Andreas Schneemann,
Francesca Moresco
Abstract:
Non-planar aromatic molecules are interesting systems for organic electronics and optoelectronics applications due to their high stability and electronic properties. By using scanning tunneling microscopy and spectroscopy, we investigated thianthrene-based molecules adsorbed on Au(111), which are non-planar in the gas phase and the bulk solid state. Varying the molecular coverage leads to the form…
▽ More
Non-planar aromatic molecules are interesting systems for organic electronics and optoelectronics applications due to their high stability and electronic properties. By using scanning tunneling microscopy and spectroscopy, we investigated thianthrene-based molecules adsorbed on Au(111), which are non-planar in the gas phase and the bulk solid state. Varying the molecular coverage leads to the formation of two different kinds of self-assembled structures: close-packed islands and quasi one-dimensional chains. We found that the molecules are non-planar within the close-packed islands, while the configuration is planar in the molecular chain and for single adsorbed molecules. Using vertical tip manipulation to isolate a molecule from the island, we demonstrate the conversion of a non-planar molecule to its planar configuration. We discuss the two different geometries and their electronic properties with the support of density functional theory calculations.
△ Less
Submitted 9 September, 2024;
originally announced September 2024.
-
Leveraging Machine Learning for Official Statistics: A Statistical Manifesto
Authors:
Marco Puts,
David Salgado,
Piet Daas
Abstract:
It is important for official statistics production to apply ML with statistical rigor, as it presents both opportunities and challenges. Although machine learning has enjoyed rapid technological advances in recent years, its application does not possess the methodological robustness necessary to produce high quality statistical results. In order to account for all sources of error in machine learn…
▽ More
It is important for official statistics production to apply ML with statistical rigor, as it presents both opportunities and challenges. Although machine learning has enjoyed rapid technological advances in recent years, its application does not possess the methodological robustness necessary to produce high quality statistical results. In order to account for all sources of error in machine learning models, the Total Machine Learning Error (TMLE) is presented as a framework analogous to the Total Survey Error Model used in survey methodology. As a means of ensuring that ML models are both internally valid as well as externally valid, the TMLE model addresses issues such as representativeness and measurement errors. There are several case studies presented, illustrating the importance of applying more rigor to the application of machine learning in official statistics.
△ Less
Submitted 6 September, 2024;
originally announced September 2024.
-
Building FKG.in: a Knowledge Graph for Indian Food
Authors:
Saransh Kumar Gupta,
Lipika Dey,
Partha Pratim Das,
Ramesh Jain
Abstract:
This paper presents an ontology design along with knowledge engineering, and multilingual semantic reasoning techniques to build an automated system for assimilating culinary information for Indian food in the form of a knowledge graph. The main focus is on designing intelligent methods to derive ontology designs and capture all-encompassing knowledge about food, recipes, ingredients, cooking char…
▽ More
This paper presents an ontology design along with knowledge engineering, and multilingual semantic reasoning techniques to build an automated system for assimilating culinary information for Indian food in the form of a knowledge graph. The main focus is on designing intelligent methods to derive ontology designs and capture all-encompassing knowledge about food, recipes, ingredients, cooking characteristics, and most importantly, nutrition, at scale. We present our ongoing work in this workshop paper, describe in some detail the relevant challenges in curating knowledge of Indian food, and propose our high-level ontology design. We also present a novel workflow that uses AI, LLM, and language technology to curate information from recipe blog sites in the public domain to build knowledge graphs for Indian food. The methods for knowledge curation proposed in this paper are generic and can be replicated for any domain. The design is application-agnostic and can be used for AI-driven smart analysis, building recommendation systems for Personalized Digital Health, and complementing the knowledge graph for Indian food with contextual information such as user information, food biochemistry, geographic information, agricultural information, etc.
△ Less
Submitted 1 September, 2024;
originally announced September 2024.
-
On isomorphism of the space of $α$-Hölder continuous functions with finite $p$-th variation
Authors:
Purba Das,
Donghan Kim
Abstract:
We study the concept of (generalized) $p$-th variation of a real-valued continuous function along a general class of refining sequence of partitions. We show that the finiteness of the $p$-th variation of a given function is closely related to the finiteness of $\ell^p$-norm of the coefficients along a Schauder basis, similar to the fact that Hölder coefficient of the function is connected to…
▽ More
We study the concept of (generalized) $p$-th variation of a real-valued continuous function along a general class of refining sequence of partitions. We show that the finiteness of the $p$-th variation of a given function is closely related to the finiteness of $\ell^p$-norm of the coefficients along a Schauder basis, similar to the fact that Hölder coefficient of the function is connected to $\ell^{\infty}$-norm of the Schauder coefficients. This result provides an isomorphism between the space of $α$-Hölder continuous functions with finite (generalized) $p$-th variation along a given partition sequence and a subclass of infinite-dimensional matrices equipped with an appropriate norm, in the spirit of Ciesielski.
△ Less
Submitted 1 September, 2024;
originally announced September 2024.
-
LLMs as Evaluators: A Novel Approach to Evaluate Bug Report Summarization
Authors:
Abhishek Kumar,
Sonia Haiduc,
Partha Pratim Das,
Partha Pratim Chakrabarti
Abstract:
Summarizing software artifacts is an important task that has been thoroughly researched. For evaluating software summarization approaches, human judgment is still the most trusted evaluation. However, it is time-consuming and fatiguing for evaluators, making it challenging to scale and reproduce. Large Language Models (LLMs) have demonstrated remarkable capabilities in various software engineering…
▽ More
Summarizing software artifacts is an important task that has been thoroughly researched. For evaluating software summarization approaches, human judgment is still the most trusted evaluation. However, it is time-consuming and fatiguing for evaluators, making it challenging to scale and reproduce. Large Language Models (LLMs) have demonstrated remarkable capabilities in various software engineering tasks, motivating us to explore their potential as automatic evaluators for approaches that aim to summarize software artifacts. In this study, we investigate whether LLMs can evaluate bug report summarization effectively. We conducted an experiment in which we presented the same set of bug summarization problems to humans and three LLMs (GPT-4o, LLaMA-3, and Gemini) for evaluation on two tasks: selecting the correct bug report title and bug report summary from a set of options. Our results show that LLMs performed generally well in evaluating bug report summaries, with GPT-4o outperforming the other LLMs. Additionally, both humans and LLMs showed consistent decision-making, but humans experienced fatigue, impacting their accuracy over time. Our results indicate that LLMs demonstrate potential for being considered as automated evaluators for bug report summarization, which could allow scaling up evaluations while reducing human evaluators effort and fatigue.
△ Less
Submitted 1 September, 2024;
originally announced September 2024.
-
EDGE: Predictable Scatter in the Stellar Mass--Halo Mass Relation of Dwarf Galaxies
Authors:
Stacy Y. Kim,
Justin I. Read,
Martin P. Rey,
Matthew D. A. Orkney,
Sushanta Nigudkar,
Andrew Pontzen,
Ethan Taylor,
Oscar Agertz,
Payel Das
Abstract:
The stellar-mass--halo-mass (SMHM) relation is central to our understanding of galaxy formation and the nature of dark matter. However, its normalisation, slope, and scatter are highly uncertain at dwarf galaxy scales. In this paper, we present DarkLight, a new semi-empirical dwarf galaxy formation model designed to robustly predict the SMHM relation for the smallest galaxies. DarkLight harnesses…
▽ More
The stellar-mass--halo-mass (SMHM) relation is central to our understanding of galaxy formation and the nature of dark matter. However, its normalisation, slope, and scatter are highly uncertain at dwarf galaxy scales. In this paper, we present DarkLight, a new semi-empirical dwarf galaxy formation model designed to robustly predict the SMHM relation for the smallest galaxies. DarkLight harnesses a correlation between the mean star formation rate of dwarfs and their peak rotation speed -- the $\langle$SFR$\rangle$-$v_{\rm max}$ relation -- that we derive from simulations and observations. Given the sparsity of data for isolated dwarfs with $v_{\rm max} \lesssim 20$ km/s, we fit the $\langle$SFR$\rangle$-$v_{\rm max}$ relation to observational data for dwarfs above this velocity scale and to the high-resolution EDGE cosmological simulations below. Reionisation quenching is implemented via distinct $\langle$SFR$\rangle$-$v_{\rm max}$ relations before and after reionisation. We find that the SMHM scatter is small at reionisation, $\sim$0.2 dex, but rises to $\sim$0.5 dex ($1σ$) at a halo mass of $\sim$10$^9$ M$_\odot$ as star formation is quenched by reionisation but dark matter halo masses continue to grow. While we do not find a significant break in the slope of the SMHM relation, one can be introduced if reionisation occurs early ($z_{\rm quench} \gtrsim 5$). Finally, we find that dwarfs can be star forming today down to a halo mass of $\sim$2 $\times 10^9$ M$_\odot$. We predict that the lowest mass star forming dwarf irregulars in the nearby universe are the tip of the iceberg of a much larger population of quiescent isolated dwarfs.
△ Less
Submitted 27 August, 2024;
originally announced August 2024.
-
Implications of Fermionic Dark Matter Interactions on Anisotropic Neutron Stars
Authors:
Premachand Mahapatra,
Chiranjeeb Singha,
Ayush Hazarika,
Prasanta Kumar Das
Abstract:
The presence of Dark matter (DM) within a neutron star (NS) can substantially influence the macroscopic properties. It is commonly assumed that the pressure inside an NS is isotropic, but in reality, pressure is locally anisotropic. This study explores the properties of anisotropic NS with a subfraction of DM (isotropic) trapped inside. Implementing a two-fluid formalism with three Equations of St…
▽ More
The presence of Dark matter (DM) within a neutron star (NS) can substantially influence the macroscopic properties. It is commonly assumed that the pressure inside an NS is isotropic, but in reality, pressure is locally anisotropic. This study explores the properties of anisotropic NS with a subfraction of DM (isotropic) trapped inside. Implementing a two-fluid formalism with three Equations of State (EOS): AP3 (a realistic nucleon-nucleon interaction model), BSk22 (modeling atomic nuclei and neutron-matter), and MPA1 (considering relativistic effects in nuclear interactions). The properties of NS, such as mass ($M$), radius ($R$), and dimensionless tidal deformability ($Λ$), for various DM-anisotropic configurations, have been rigorously tested against observational constraints. These constraints include data from the binary NS merger GW170817, NICER x-ray measurements, and pulsar mass-radius observations. We observe that with increasing DM subfraction, higher anisotropies could also satisfy the observational constraints. Furthermore, increasing the coupling ($g$) between DM and its mediator leads to the formation of a core-halo structure, with a DM halo surrounding the baryonic matter (BM). Specifically, for coupling values of $g = 10^{-4}$, $10^{-3.7}$, and $10^{-3.5}$, we observe that the maximum radius ($R_{max}$) decreases with increasing anisotropy, which contrasts with the behavior at $g = 10^{-5}$ and in scenarios with no DM. Our analysis indicates that binary pulsar systems could potentially constrain the extent of admixed anisotropic NS or, more optimistically, provide evidence for the existence of DM-admixed anisotropic NS.
△ Less
Submitted 12 September, 2024; v1 submitted 26 August, 2024;
originally announced August 2024.
-
Spin dynamics in itinerant antiferromagnet ${\rm\bf SrCr_2As_2}$
Authors:
Zhenhua Ning,
Pinaki Das,
Y. Lee,
N. S. Sangeetha,
D. L. Abernathy,
D. C. Johnston,
R. J. McQueeney,
D. Vaknin,
Liqin Ke
Abstract:
SrCr$_2$As$_2$ is an itinerant antiferromagnet in the same structural family as the SrFe2As2 high-temperature superconductors. We report our calculations of exchange coupling parameters $J_{ij}$ for SrCr$_2$As$_2$ using a static linear-response method based on first-principles electronic structure calculations. We find that the dominant nearest neighbor exchange coupling $J_{\rm{1}} > 0$ is antife…
▽ More
SrCr$_2$As$_2$ is an itinerant antiferromagnet in the same structural family as the SrFe2As2 high-temperature superconductors. We report our calculations of exchange coupling parameters $J_{ij}$ for SrCr$_2$As$_2$ using a static linear-response method based on first-principles electronic structure calculations. We find that the dominant nearest neighbor exchange coupling $J_{\rm{1}} > 0$ is antiferromagnetic whereas the next-nearest neighbor interaction $J_{\rm{2}} < 0$ is ferromagnetic with $J_{\rm{2}}$/$J_{\rm{1}}$~=~$-0.68$, reinforcing the checkerboard in-plane structure. Thus, unlike other transition-metal arsenides based on Mn, Fe, or Co, we find no competing magnetic interactions in SrCr$_2$As$_2$, which aligns with experimental findings. Moreover, the orbital resolution of exchange interactions shows that $J_1$ and $J_2$ are dominated by direct exchange mediated by the Cr $d$ orbitals. To validate the calculations we conduct inelastic neutron-scattering measurements on powder samples that show steeply dispersive magnetic excitations arising from the magnetic $Γ$ points and persisting up to energies of at least 175 meV. The spin-wave spectra are then modeled using the Heisenberg Hamiltonian with the theoretically calculated exchange couplings. The calculated neutron scattering spectra are in good agreement with the experimental data.
△ Less
Submitted 10 August, 2024;
originally announced August 2024.
-
Study of Stable Dark Energy Stars in Hořava-Lifshitz gravity
Authors:
Krishna Pada Das,
Ujjal Debnath
Abstract:
We study the structure and basic physical properties of non-rotating dark energy stars in Ho$\Check{\text{r}}$ava-Lifshitz (HL) gravity. The interior of propsed stellar structure is made of isotropic matter obeys extended Chaplygin gas EoS. The structure equations representing the state of hydrostatic equilibrium i.e., generalize TOV equation in HL gravity is numerically solved by using chosen rea…
▽ More
We study the structure and basic physical properties of non-rotating dark energy stars in Ho$\Check{\text{r}}$ava-Lifshitz (HL) gravity. The interior of propsed stellar structure is made of isotropic matter obeys extended Chaplygin gas EoS. The structure equations representing the state of hydrostatic equilibrium i.e., generalize TOV equation in HL gravity is numerically solved by using chosen realistic EoS. Next, we investigate the deviation of physical features of dark energy stars in HL gravity as compared with general relativity (GR). Such investigation is depicted by varying a parameter $ω$, whereas for $ω\rightarrow \infty$ HL coincide with GR. As a results, we find that necessary features of our stellar structure are significantly affected by $ω$ in HL gravity specifically on the estimation of the maximum mass and corresponding predicted radius of the star. In conclusion, we can predict the existence of heavior massive dark energy stars in the context of HL gravity as compared with GR with not collapsing into a black hole. Moreover, we investigate the stability of our proposed stellar system. By integrating the modified perturbations equations in support of suitable boundary conditions at the center and the surface of the stellar object, we evaluate the frequencies and eigenfunctions corresponding to six lowest excited modes. Finally, we find that physically viable and stable dark energy stars can be successfully discussed in HL gravity by this study.
△ Less
Submitted 5 August, 2024;
originally announced August 2024.
-
Origin of unexpected weak Gilbert damping in the LSMO/Pt bilayer system
Authors:
Pritam Das,
Pushpendra Gupta,
Seung-Cheol Lee,
Subhankar Bedanta,
Satadeep Bhattacharjee
Abstract:
We investigated the Gilbert damping in La$_{0.7}$Sr$_{0.3}$MnO$_3$ (LSMO) and La$_{0.7}$Sr$_{0.3}$MnO$_3$/Pt (LSMO/Pt) heterostructures using first-principles calculations and Wannier interpolation techniques. Our work is motivated by recent experimental observations showing smaller Gilbert damping in LSMO/Pt films compared to their reference single-layer LSMO films, despite expectations of enhanc…
▽ More
We investigated the Gilbert damping in La$_{0.7}$Sr$_{0.3}$MnO$_3$ (LSMO) and La$_{0.7}$Sr$_{0.3}$MnO$_3$/Pt (LSMO/Pt) heterostructures using first-principles calculations and Wannier interpolation techniques. Our work is motivated by recent experimental observations showing smaller Gilbert damping in LSMO/Pt films compared to their reference single-layer LSMO films, despite expectations of enhanced spin-pumping effects in the former. We analyze the electronic structures and transport behaviors, finding that LSMO thin films have a high spin Hall angle ($|{θ_{\mathrm{SH}}}|$). However, in LSMO/Pt, the presence of platinum significantly increases longitudinal conductivity, reducing $|θ_{\mathrm{SH}}|$. Despite the lower $|θ_{\mathrm{SH}}|$, LSMO/Pt shows a notable anti-damping contribution to Gilbert damping due to a larger spin diffusion length. In contrast, pure LSMO films with large $|θ_{\mathrm{SH}}|$ exhibit higher damping due to efficient spin-to-charge conversion via a self-induced inverse spin Hall effect (ISHE), as reported in a recent experiment. Finally, this work demonstrates that by fine-tuning the ratio of spin Hall conductivity to longitudinal charge conductivity, it is possible to engineer heterostructures with desired spin-to-charge or charge-to-spin conversion efficiencies even with weaker spin-orbit couplings.
△ Less
Submitted 2 August, 2024;
originally announced August 2024.
-
The accreted Galaxy: An overview of TESS metal-poor accreted stars candidates
Authors:
Danielle de Brito Silva,
Paula Jofré,
Clare Worley,
Keith Hawkins,
Payel Das
Abstract:
The Milky Way is a mosaic of stars from different origins. In particular, metal-poor accreted star candidates offer a unique opportunity to better understand the accretion history of the Milky Way. In this work, we aim to explore the assembly history of the Milky Way by investigating accreted stars in terms of their ages, dynamical properties, and chemical abundances. We also aim to better charact…
▽ More
The Milky Way is a mosaic of stars from different origins. In particular, metal-poor accreted star candidates offer a unique opportunity to better understand the accretion history of the Milky Way. In this work, we aim to explore the assembly history of the Milky Way by investigating accreted stars in terms of their ages, dynamical properties, and chemical abundances. We also aim to better characterize the impact of incorporating asteroseismic information on age and chemical abundance calculations of metal-poor accreted stars for which TESS data is available. In this study, we conducted an in-depth examination of 30 metal-poor accreted star candidates, using TESS and Gaia data, as well as MIKE spectra. We find satisfactory agreement between seismic and predicted/spectroscopic surface gravity (log g) values, demonstrating the reliability of spectroscopic data from our methodology. We found that while age determination is highly dependent on the log g and asteroseismic information used, the overall chemical abundance distributions are similar for different log g. However, we found that calcium (Ca) abundances are more sensitive to the adopted log g. Our study reveals that the majority of our stars have properties compatible to those reported for the Gaia-Sausage-Enceladus, with a minority of stars that might be associated to Splash. We found an age distribution with a median of 11.3 Gyr with lower and upper uncertainties of 4.1 and 1.3 Gyr respectively when including asteroseismic information. As regarding some key chemical signatures we note that these stars are metal-poor ([Fe/H]) < -0.8), alpha-rich ([alpha]/Fe] > 0.2), copper-poor ([Cu/Fe] < 0 ) and with chemical abundances typical of accreted stars. These findings illustrate the importance of multi-dimensional analyses in unraveling the complex accretion history of the Milky Way.
△ Less
Submitted 26 July, 2024;
originally announced July 2024.
-
The Chemical Diversity of the Metal-Poor Milky Way
Authors:
Nicole Buckley,
Payel Das,
Paula Jofré,
Robert M. Yates,
Keith Hawkins
Abstract:
We present a detailed study of the chemical diversity of the metal-poor Milky Way (MW) using data from the GALAH DR3 survey. Considering 17 chemical abundances relative to iron ([X/Fe]) for 9,923 stars, we employ Principal Component Analysis (PCA) and Extreme Deconvolution (XD) to identify 10 distinct stellar groups. This approach, free from chemical or dynamical cuts, reveals known populations, i…
▽ More
We present a detailed study of the chemical diversity of the metal-poor Milky Way (MW) using data from the GALAH DR3 survey. Considering 17 chemical abundances relative to iron ([X/Fe]) for 9,923 stars, we employ Principal Component Analysis (PCA) and Extreme Deconvolution (XD) to identify 10 distinct stellar groups. This approach, free from chemical or dynamical cuts, reveals known populations, including the accreted halo, thick disc, thin disc, and in-situ halo. The thick disc is characterised by multiple substructures, suggesting it comprises stars formed in diverse environments. Our findings highlight the limited discriminatory power of magnesium in separating accreted and disc stars. Elements such as Ba, Al, Cu, and Sc are critical in distinguishing disc from accreted stars, while Ba, Y, Eu and Zn differentiate disc and accreted stars from the in-situ halo. This study demonstrates the potential power of combining a latent space representation of the data (PCA) with a clustering algorithm (XD) in Galactic archaeology, in providing new insights into the galaxy's assembly and evolutionary history.
△ Less
Submitted 26 July, 2024;
originally announced July 2024.
-
Generation Constraint Scaling Can Mitigate Hallucination
Authors:
Georgios Kollias,
Payel Das,
Subhajit Chaudhury
Abstract:
Addressing the issue of hallucinations in large language models (LLMs) is a critical challenge. As the cognitive mechanisms of hallucination have been related to memory, here we explore hallucination for LLM that is enabled with explicit memory mechanisms. We empirically demonstrate that by simply scaling the readout vector that constrains generation in a memory-augmented LLM decoder, hallucinatio…
▽ More
Addressing the issue of hallucinations in large language models (LLMs) is a critical challenge. As the cognitive mechanisms of hallucination have been related to memory, here we explore hallucination for LLM that is enabled with explicit memory mechanisms. We empirically demonstrate that by simply scaling the readout vector that constrains generation in a memory-augmented LLM decoder, hallucination mitigation can be achieved in a training-free manner. Our method is geometry-inspired and outperforms a state-of-the-art LLM editing method on the task of generation of Wikipedia-like biography entries both in terms of generation quality and runtime complexity.
△ Less
Submitted 23 July, 2024;
originally announced July 2024.
-
Hybrid physics-AI outperforms numerical weather prediction for extreme precipitation nowcasting
Authors:
Puja Das,
August Posch,
Nathan Barber,
Michael Hicks,
Thomas J. Vandal,
Kate Duffy,
Debjani Singh,
Katie van Werkhoven,
Auroop R. Ganguly
Abstract:
Precipitation nowcasting, critical for flood emergency and river management, has remained challenging for decades, although recent developments in deep generative modeling (DGM) suggest the possibility of improvements. River management centers, such as the Tennessee Valley Authority, have been using Numerical Weather Prediction (NWP) models for nowcasting but have struggled with missed detections…
▽ More
Precipitation nowcasting, critical for flood emergency and river management, has remained challenging for decades, although recent developments in deep generative modeling (DGM) suggest the possibility of improvements. River management centers, such as the Tennessee Valley Authority, have been using Numerical Weather Prediction (NWP) models for nowcasting but have struggled with missed detections even from best-in-class NWP models. While decades of prior research achieved limited improvements beyond advection and localized evolution, recent attempts have shown progress from physics-free machine learning (ML) methods and even greater improvements from physics-embedded ML approaches. Developers of DGM for nowcasting have compared their approaches with optical flow (a variant of advection) and meteorologists' judgment but not with NWP models. Further, they have not conducted independent co-evaluations with water resources and river managers. Here, we show that the state-of-the-art physics-embedded deep generative model, specifically NowcastNet, outperforms the High-Resolution Rapid Refresh (HRRR) model, the latest generation of NWP, along with advection and persistence, especially for heavy precipitation events. For grid-cell extremes over 16 mm/h, NowcastNet demonstrated a median critical success index (CSI) of 0.30, compared with a median CSI of 0.04 for HRRR. However, despite hydrologically relevant improvements in point-by-point forecasts from NowcastNet, caveats include the overestimation of spatially aggregated precipitation over longer lead times. Our co-evaluation with ML developers, hydrologists, and river managers suggests the possibility of improved flood emergency response and hydropower management.
△ Less
Submitted 15 July, 2024;
originally announced July 2024.
-
Bias Correction in Machine Learning-based Classification of Rare Events
Authors:
Luuk Gubbels,
Marco Puts,
Piet Daas
Abstract:
Online platform businesses can be identified by using web-scraped texts. This is a classification problem that combines elements of natural language processing and rare event detection. Because online platforms are rare, accurately identifying them with Machine Learning algorithms is challenging. Here, we describe the development of a Machine Learning-based text classification approach that reduce…
▽ More
Online platform businesses can be identified by using web-scraped texts. This is a classification problem that combines elements of natural language processing and rare event detection. Because online platforms are rare, accurately identifying them with Machine Learning algorithms is challenging. Here, we describe the development of a Machine Learning-based text classification approach that reduces the number of false positives as much as possible. It greatly reduces the bias in the estimates obtained by using calibrated probabilities and ensembles.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Needle in the Haystack for Memory Based Large Language Models
Authors:
Elliot Nelson,
Georgios Kollias,
Payel Das,
Subhajit Chaudhury,
Soham Dan
Abstract:
Current large language models (LLMs) often perform poorly on simple fact retrieval tasks. Here we investigate if coupling a dynamically adaptable external memory to a LLM can alleviate this problem. For this purpose, we test Larimar, a recently proposed language model architecture which uses an external associative memory, on long-context recall tasks including passkey and needle-in-the-haystack t…
▽ More
Current large language models (LLMs) often perform poorly on simple fact retrieval tasks. Here we investigate if coupling a dynamically adaptable external memory to a LLM can alleviate this problem. For this purpose, we test Larimar, a recently proposed language model architecture which uses an external associative memory, on long-context recall tasks including passkey and needle-in-the-haystack tests. We demonstrate that the external memory of Larimar, which allows fast write and read of an episode of text samples, can be used at test time to handle contexts much longer than those seen during training. We further show that the latent readouts from the memory (to which long contexts are written) control the decoder towards generating correct outputs, with the memory stored off of the GPU. Compared to existing transformer-based LLM architectures for long-context recall tasks that use larger parameter counts or modified attention mechanisms, a relatively smaller size Larimar is able to maintain strong performance without any task-specific training or training on longer contexts.
△ Less
Submitted 12 July, 2024; v1 submitted 1 July, 2024;
originally announced July 2024.
-
SABLE: Staging Blocked Evaluation of Sparse Matrix Computations
Authors:
Pratyush Das,
Adhitha Dias,
Anxhelo Xhebraj,
Artem Pelenitsyn,
Kirshanthan Sundararajah,
Milind Kulkarni
Abstract:
Sparse Matrices found in the real world often have some structure in how the dense elements are organized. While the inspector-executor model inspects matrices for structure, its generality can overlook further specialization. We propose a system that - if the sparse matrix is stored in a blocked storage format - can generate more efficient code by constructing regular loops over these blocks. Our…
▽ More
Sparse Matrices found in the real world often have some structure in how the dense elements are organized. While the inspector-executor model inspects matrices for structure, its generality can overlook further specialization. We propose a system that - if the sparse matrix is stored in a blocked storage format - can generate more efficient code by constructing regular loops over these blocks. Our system performs a specified computation over every element of the block instead of avoiding computing any sparse element at all and achieving regularity in specialized code. The system is extensible, providing a dense block iterator for the user to express any computation over these dense blocks. We show that this approach can significantly speed up SpMV and SpMM operations over the state-of-the-art systems Partially-Strided Codelets and Sparse Register Tiling.
△ Less
Submitted 3 April, 2024;
originally announced July 2024.
-
Discrete dark matter with light Dirac neutrinos
Authors:
Debasish Borah,
Pritam Das,
Biswajit Karmakar,
Satyabrata Mahapatra
Abstract:
We propose a new realisation of light Dirac neutrino mass and dark matter (DM) within the framework of a non-Abelian discrete flavour symmetry based on $A_4$ group. In addition to $A_4$, we also consider a $Z_2$ and an unbroken global lepton number symmetry $U(1)_L$ to keep unwanted terms away while guaranteeing the Dirac nature of light neutrinos. The field content, their transformations and flav…
▽ More
We propose a new realisation of light Dirac neutrino mass and dark matter (DM) within the framework of a non-Abelian discrete flavour symmetry based on $A_4$ group. In addition to $A_4$, we also consider a $Z_2$ and an unbroken global lepton number symmetry $U(1)_L$ to keep unwanted terms away while guaranteeing the Dirac nature of light neutrinos. The field content, their transformations and flavon vacuum alignments are chosen in such a way that the type-I Dirac seesaw generates only one light Dirac neutrino mass while the other two masses arise from scotogenic contributions at one-loop. This leads to the Dirac scoto-seesaw framework, a generalisation of the widely studied scoto-seesaw model to Dirac neutrinos. The symmetry breaking of $A_4$ leaves a remnant $\mathcal{Z}_2$ symmetry responsible for stabilising DM. Dirac nature of light neutrinos introduces additional relativistic degrees of freedom $ΔN_{\rm eff}$ within reach of cosmic microwave background experiments.
△ Less
Submitted 11 July, 2024; v1 submitted 25 June, 2024;
originally announced June 2024.
-
Populating Galaxies Into Halos Via Machine Learning on the Simba Simulation
Authors:
Pratyush Kumar Das,
Romeel Davé,
Weiguang Cui
Abstract:
We present machine learning (ML)-based pipelines designed to populate galaxies into dark matter halos from N-body simulations. These pipelines predict galaxy stellar mass ($M_*$), star formation rate (SFR), atomic and molecular gas contents, and metallicities, and can be easily extended to other galaxy properties and simulations. Our approach begins by categorizing galaxies into central and satell…
▽ More
We present machine learning (ML)-based pipelines designed to populate galaxies into dark matter halos from N-body simulations. These pipelines predict galaxy stellar mass ($M_*$), star formation rate (SFR), atomic and molecular gas contents, and metallicities, and can be easily extended to other galaxy properties and simulations. Our approach begins by categorizing galaxies into central and satellite classifications, followed by their ML classification into quenched (Q) and star-forming (SF) galaxies. We then develop regressors specifically for the SF galaxies within both central and satellite subgroups. We train the model on the $(100\mathrm{h^{-1}Mpc})^3$ Simba galaxy formation simulation at $z=0$. Our pipeline yields robust predictions for stellar mass and metallicity and offers significant improvements for SFR and gas properties compared to previous works, achieving an unbiased scatter of less than 0.2 dex around true Simba values for the halo-$M_{\rm HI}$ relation of central galaxies. We also show the effectiveness of the ML-based pipelines at $z=1,2$. Interestingly, we find that training on fraction-based properties (e.g. $M_{\rm HI}$/$M_{*}$) and then multiplying by the ML-predicted $M_{*}$ yields improved predictions versus directly training on the property value, for many quantities across redshifts. However, we find that the ML-predicted scatter around the mean is lower than the true scatter, leading to artificially suppressed distribution functions at high values. To alleviate this, we add a "ML scatter bias", finely tuned to recover the true distribution functions, critical for accurate predictions of integrated quantities such as $\rm{HI}$ intensity maps.
△ Less
Submitted 23 June, 2024;
originally announced June 2024.
-
Emergent Moiré fringes in direct-grown quasicrystal
Authors:
Jingwei Li,
Kejie Bao,
Honglin Sun,
Xingxu Yan,
Ting Huang,
Qicheng Zhang,
Yaoqiang Zhou,
Zhenjing Liu,
Paul Masih Das,
Jiawen You,
Jiong Zhao,
Jianbin Xu,
Xiaoqing Pan,
Yongli Mi,
Junyi Zhu,
Zhaoli Gao
Abstract:
Quasicrystals represent a category of rarely structured solids that challenge traditional periodicity in crystal materials. Recent advancements in the synthesis of two-dimensional (2D) van der Waals materials have paved the way for exploring the unique physical properties of these systems. Here, we report on the synthesis of 2D quasicrystals featuring 30° alternating twist angles between multiple…
▽ More
Quasicrystals represent a category of rarely structured solids that challenge traditional periodicity in crystal materials. Recent advancements in the synthesis of two-dimensional (2D) van der Waals materials have paved the way for exploring the unique physical properties of these systems. Here, we report on the synthesis of 2D quasicrystals featuring 30° alternating twist angles between multiple graphene layers, using chemical vapor deposition (CVD). Strikingly, we observed periodic Moiré patterns in the quasicrystal, a finding that has not been previously reported in traditional alloy-based quasicrystals. The Moiré periodicity, varying with the parity of the constituent layers, aligns with the theoretical predictions that suggest a stress cancellation mechanism in force. The emergence of Moiré fringes is attributed to the spontaneous mismatched lattice constant in the oriented graphene layers, proving the existence of atomic relaxation. This phenomenon, which has been largely understudied in graphene systems with large twist angles, has now been validated through our use of scanning transmission electron microscopy (STEM). Our CVD-grown Moiré quasicrystal provides an ideal platform for exploring the unusual physical properties that arise from Moiré periodicity within quasicrystals.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Electrically Tunable Magnetoconductance of Close-Packed CVD Bilayer Graphene Layer Stacking Walls
Authors:
Qicheng Zhang,
Sheng Wang,
Zhaoli Gao,
Sebastian Hurtado-Parra,
Joel Berry,
Zachariah Addison,
Paul Masih Das,
William M. Parkin,
Marija Drndic,
James M. Kikkawa,
Feng Wang,
Eugene J. Mele,
A. T. Charlie Johnson,
Zhengtang Luo
Abstract:
Quantum valley Hall (QVH) domain wall states are a new class of one-dimensional (1D) one-way conductors that are topologically protected in the absence of valley mixing. Development beyond a single QVH channel raises important new questions as to how QVH channels in close spatial proximity interact with each other, and how that interaction may be controlled. Scalable epitaxial bilayer graphene syn…
▽ More
Quantum valley Hall (QVH) domain wall states are a new class of one-dimensional (1D) one-way conductors that are topologically protected in the absence of valley mixing. Development beyond a single QVH channel raises important new questions as to how QVH channels in close spatial proximity interact with each other, and how that interaction may be controlled. Scalable epitaxial bilayer graphene synthesis produces layer stacking wall (LSW) bundles, where QVH channels are bound, providing an excellent platform to study QVH channel interactions. Here we show that distinct strain sources lead to the formation of both well-separated LSWs and close packed LSW bundles. Comparative studies of electronic transport in these two regimes reveal that close-packed LSW bundles support electrically tunable magnetoconductance. The coexistence of different strain sources offers a potential pathway to realize scalable quantum transport platform based on LSWs where electrically tunability enables programmable functionality.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Quantum hardware demonstrations of relativistic calculations of molecular electric dipole moments: from light to heavy systems using Variational Quantum Eigensolver
Authors:
Palak Chawla,
Shweta,
K. R. Swain,
Tushti Patel,
Renu Bala,
Disha Shetty,
Kenji Sugisaki,
Sudhindu Bikash Mandal,
Jordi Riu,
Jan Nogue,
V. S. Prasannaa,
B. P. Das
Abstract:
The quantum-classical hybrid Variational Quantum Eigensolver (VQE) algorithm is recognized to be the method of choice to obtain ground state energies of quantum many-body systems in the noisy intermediate scale quantum (NISQ) era. This study not only extends the VQE algorithm to the relativistic regime, but also calculates a property other than energy, namely the molecular permanent electric dipol…
▽ More
The quantum-classical hybrid Variational Quantum Eigensolver (VQE) algorithm is recognized to be the method of choice to obtain ground state energies of quantum many-body systems in the noisy intermediate scale quantum (NISQ) era. This study not only extends the VQE algorithm to the relativistic regime, but also calculates a property other than energy, namely the molecular permanent electric dipole moment (PDM). We carry out 18-qubit quantum simulations to obtain ground state energies as well as PDMs of single-valence diatomic molecules, ranging from the light BeH to the heavy radioactive RaH molecule. We investigate the correlation trends in these systems as well as access the precision in our results. Furthermore, we measure the PDM of the moderately heavy SrH and SrF molecules on the optimized unitary coupled cluster state, using the state-of-the-art IonQ Aria-I quantum computer in an active space of 6 qubits. The associated quantum circuits for these computations were extensively optimized in view of limitations imposed by NISQ hardware. To that end, we employ an array of techniques, including the use of point group symmetries, integrating ZX-Calculus into our pipeline-based circuit optimization, and energy sort VQE procedure. Through these methods, we compress our 6-qubit quantum circuit from 280 two-qubit gates to 37 two-qubit gates (with a marginal trade-off of 0.33 and 0.31 percent in the PDM for SrH and SrF in their respective 6-spin orbital active spaces). We anticipate that our proof-of-concept demonstration lays the groundwork for future quantum hardware calculations involving heavy atoms and molecules.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques
Authors:
Megh Thakkar,
Quentin Fournier,
Matthew D Riemer,
Pin-Yu Chen,
Amal Zouaq,
Payel Das,
Sarath Chandar
Abstract:
Large language models are first pre-trained on trillions of tokens and then instruction-tuned or aligned to specific preferences. While pre-training remains out of reach for most researchers due to the compute required, fine-tuning has become affordable thanks to parameter-efficient methods such as LoRA and QLoRA. Alignment is known to be sensitive to the many factors involved, including the quant…
▽ More
Large language models are first pre-trained on trillions of tokens and then instruction-tuned or aligned to specific preferences. While pre-training remains out of reach for most researchers due to the compute required, fine-tuning has become affordable thanks to parameter-efficient methods such as LoRA and QLoRA. Alignment is known to be sensitive to the many factors involved, including the quantity and quality of data, the alignment method, and the adapter rank. However, there has not yet been an extensive study of their effect on downstream performance. To address this gap, we conduct an in-depth investigation of the impact of popular choices for three crucial axes: (i) the alignment dataset (HH-RLHF and BeaverTails), (ii) the alignment technique (SFT and DPO), and (iii) the model (LLaMA-1, Vicuna-v1.3, Mistral-7b, and Mistral-7b-Instruct). Our extensive setup spanning over 300 experiments reveals consistent trends and unexpected findings. We observe how more informative data helps with preference alignment, cases where supervised fine-tuning outperforms preference optimization, and how aligning to a distinct preference boosts performance on downstream tasks. Through our in-depth analyses, we put forward key guidelines to help researchers perform more effective parameter-efficient LLM alignment.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Electron Confinement-Induced Plasmonic Breakdown in Metals
Authors:
Prasanna Das,
Sourav Rudra,
Dheemahi Rao,
Souvik Banerjee,
Ashalatha Indiradevi Kamalasanan Pillai,
Magnus Garbrecht,
Alexandra Boltasseva,
Igor V. Bondarev,
Vladimir M. Shalaev,
Bivas Saha
Abstract:
Plasmon resonance in metals represents the collective oscillation of the free electron gas density and enables enhanced light-matter interactions in nanoscale dimensions. Traditionally, the classical Drude model describes the plasmonic excitation, wherein the plasma frequency exhibits no spatial dispersion. Here, we show conclusive experimental evidence of the breakdown of the plasmon resonance an…
▽ More
Plasmon resonance in metals represents the collective oscillation of the free electron gas density and enables enhanced light-matter interactions in nanoscale dimensions. Traditionally, the classical Drude model describes the plasmonic excitation, wherein the plasma frequency exhibits no spatial dispersion. Here, we show conclusive experimental evidence of the breakdown of the plasmon resonance and a consequent photonic metal-insulator transition in an ultrathin archetypal refractory plasmonic material, hafnium nitride (HfN). Epitaxial HfN thick films exhibit a low-loss and high-quality Drude-like plasmon resonance in the visible spectral range. However, as the film thickness is reduced to nanoscale dimensions, the Coulomb interaction among electrons increases due to the electron confinement, leading to the spatial dispersion of the plasma frequency. Importantly, with the further decrease in thickness, electrons lose their ability to shield the incident electric field, turning the medium into a dielectric. The breakdown of the plasmon resonance in epitaxial ultrathin metals could be useful for fundamental physics studies in transdimensional regimes and novel photonic device applications.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Linear and nonlinear propagation of cylindrical vector beam through a non-degenerate four level atomic system
Authors:
Partha Das,
Tarak Nath Dey
Abstract:
We investigate the phase-induced susceptibilities for both components of the probe vector beam (PVB) within an atomic system. The atoms are prepared in a non-degenerate four-level configuration. The transitions are coupled by a $π$ polarized control field and two orthogonally polarized components of a PVB. We show that the linear susceptibility of the medium depends on the phase shift between the…
▽ More
We investigate the phase-induced susceptibilities for both components of the probe vector beam (PVB) within an atomic system. The atoms are prepared in a non-degenerate four-level configuration. The transitions are coupled by a $π$ polarized control field and two orthogonally polarized components of a PVB. We show that the linear susceptibility of the medium depends on the phase shift between the control field and PVB, characterizing loss or gain in the system. Additionally, the phase shift causes polarization rotation in the vector beams (VBs) as they propagate. We further study the effect of nonlinearity on the VB propagation through the medium for a couple of Rayleigh lengths. The self-focusing and defocusing phenomena are observed for radial, azimuthal, and spiral VBs. The special chain-like self-focusing and defocusing leads to the formation of consecutive smaller spot sizes with moderate gain. Therefore, the mechanism of control of susceptibility and self-focusing may hold promise for applications such as transitioning from an absorber to an amplifier, high-resolution microscopy, and optical trap systems.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
A short introduction on Angular momentum of Kerr Blackhole
Authors:
Sumit Panganti,
Siba Prasad Das
Abstract:
General relativity (GR) predicts the existence of black hole (BH). The rotating BH called as a Kerr Black hole and GR implies that there is an upper limit on the angular momentum per mass squared of black holes $\leq 1$, above which the event horizon of the Kerr BH is not exist. We find the radial equation for equatorial motion for Kerr BH in terms of the effective potential. We have shown the eff…
▽ More
General relativity (GR) predicts the existence of black hole (BH). The rotating BH called as a Kerr Black hole and GR implies that there is an upper limit on the angular momentum per mass squared of black holes $\leq 1$, above which the event horizon of the Kerr BH is not exist. We find the radial equation for equatorial motion for Kerr BH in terms of the effective potential. We have shown the effective potential profile for different rotation parameter ($a$). We find the solution of the radial equation of the Kerr metric and found the expression of the angular momentum per unit mass squared, $\tilde a = \frac{a}{M}$. We showed the profile of $\tilde a$ as a function of $\frac{r}{M}$. The solution also leads the energy per unit rest mass ($e$) and we showed its behavior as a function of $\frac{r}{M}$. We enumerated the maximum values of radius of innermost stable circular orbit ($r_{ISCO}$) for $\tilde a=1$.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Rare and Exotic Higgs decays at ATLAS and CMS
Authors:
Pallabi Das
Abstract:
After the Higgs boson discovery in 2012, the experiments at the LHC are continuing to study this particle and look for physics beyond the standard model. Some of the Higgs boson properties, such as the mass, has been measured with sub-percent level accuracy. Yet the present integrated luminosity is still a limiting factor for measuring the Higgs boson self-coupling or the first generation Yukawa c…
▽ More
After the Higgs boson discovery in 2012, the experiments at the LHC are continuing to study this particle and look for physics beyond the standard model. Some of the Higgs boson properties, such as the mass, has been measured with sub-percent level accuracy. Yet the present integrated luminosity is still a limiting factor for measuring the Higgs boson self-coupling or the first generation Yukawa couplings. The current constraints on the Higgs boson couplings would still allow for a sizeable branching fraction into undetected final states, which motivates the direct searches for rare and exotic decay modes. This presentation discusses several new results from these searches utilizing advanced online selection methods or analysis techniques with the entire Run 2 data.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Minimal Finite Model of Wedge Sum of Spheres
Authors:
Ponaki Das,
Sainkupar Marwein Mawiong
Abstract:
In \cite{Barmak(2007),Barmak(2011)}, Barmak extensively investigates the minimal finite models of the n-dimensional sphere $S^n$ for all $n\geq 0$ and establishes the minimal finite model of the wedge sum of unit circles $\bigvee\limits_{i= 1}^{n} S^{1}$. In this work, we demonstrate that the minimal finite model of the M$\ddot{\rm{o}}$bius band coincides with the minimal finite model of the unit…
▽ More
In \cite{Barmak(2007),Barmak(2011)}, Barmak extensively investigates the minimal finite models of the n-dimensional sphere $S^n$ for all $n\geq 0$ and establishes the minimal finite model of the wedge sum of unit circles $\bigvee\limits_{i= 1}^{n} S^{1}$. In this work, we demonstrate that the minimal finite model of the M$\ddot{\rm{o}}$bius band coincides with the minimal finite model of the unit circle $S^1$. Furthermore, we establish that the minimal finite models of the spaces $S^{2}\vee S^{1}$, $S^{2}\vee S^{2}$ consist of only seven points, while the minimal finite models of the spaces $S^{1}\vee S^{1}\vee S^{2}$ and $S^{2}\vee S^{2}\vee S^{1}$ contain eight points. Additionally, we thoroughly discuss all the necessary homotopy groups and homology groups of the aforementioned spaces to provide a comprehensive and self-contained presentation in the paper.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Testing neutrino mass hierarchy under type-II seesaw scenario in $U(1)_X$ from colliders
Authors:
Arindam Das,
Puja Das,
Nobuchika Okada
Abstract:
The origin of tiny neutrino mass is a long standing unsolved puzzle of the Standard Model (SM), which allows us to consider scenarios beyond the Standard Model (BSM) in a variety of ways. One of them being a gauge extension of the SM may be realized as in the form of an anomaly free, general $U(1)_X$ extension of the SM, where an $SU(2)_L$ triplet scalar with a $U(1)_X$ charge is introduced to hav…
▽ More
The origin of tiny neutrino mass is a long standing unsolved puzzle of the Standard Model (SM), which allows us to consider scenarios beyond the Standard Model (BSM) in a variety of ways. One of them being a gauge extension of the SM may be realized as in the form of an anomaly free, general $U(1)_X$ extension of the SM, where an $SU(2)_L$ triplet scalar with a $U(1)_X$ charge is introduced to have Dirac Yukawa couplings with the SM lepton doublets. Once the triplet scalar developes a Vacuum Expectation Value (VEV), light neutrinos acquire their tiny Majorana masses. Hence, the decay modes of the triplet scalar has a direct connection to the neutrino oscillation data for different neutrino mass hierarchies. After the breaking of the $U(1)_X$ gauge symmetry, a neutral $U(1)_X$ gauge boson $(Z^\prime)$ acquires mass, which interacts differently with the left and right handed SM fermions. Satisfying the recent LHC bounds on the triplet scalar and $Z^\prime$ boson productions, we study the pair production of the triplet scalar at LHC, 100 TeV proton proton collider FCC, $e^-e^+$ and $μ^-μ^+$ colliders followed by its decay into dominant dilepton modes whose flavor structure depend on the neutrino mass hierarchy. Generating the SM backgrounds, we study the possible signal significance of four lepton final states from the triplet scalar pair production. We also compare our results with the purely SM gauge mediated triplet scalar pair production followed by four lepton final states, which could be significant only in $μ^- μ^+$ collider.
△ Less
Submitted 16 July, 2024; v1 submitted 20 May, 2024;
originally announced May 2024.
-
Bimodal Plasmonic Refractive Index Sensors Based on SU-8 Waveguides
Authors:
Omkar Bhalerao,
Stephan Suckow,
Horst Windgassen,
Harry Biller,
Konstantinos Fotiadis,
Stelios Simos,
Evangelia Chatzianagnostou,
Dimosthenis Spasopoulos,
Pratyusha Das,
Laurent Markey,
Jean-Claude Weeber,
Nikos Pleros,
Matthias Schirmer,
Max C. Lemme
Abstract:
Plasmonic refractive index sensors are essential for detecting subtle variations in the ambient environment through surface plasmon interactions. Current efforts utilizing CMOS-compatible, plasmo-photonic Mach-Zehnder interferometers with active power balancing exhibit high sensitivities at the cost of fabrication and measurement complexity. Alternatively, passive bimodal plasmonic interferometers…
▽ More
Plasmonic refractive index sensors are essential for detecting subtle variations in the ambient environment through surface plasmon interactions. Current efforts utilizing CMOS-compatible, plasmo-photonic Mach-Zehnder interferometers with active power balancing exhibit high sensitivities at the cost of fabrication and measurement complexity. Alternatively, passive bimodal plasmonic interferometers based on SU-8 waveguides present a cost-effective solution with a smaller device footprint, though they currently lack opto-mechanical isolation due to exposed photonic waveguides. In this work, we introduce innovative polymer-core and polymer-cladded bimodal plasmonic refractive index sensors with high refractive index contrast. Our sensors feature an aluminum stripe, a bilayer SU-8 photonic waveguide core, and the experimental optical cladding polymer SX AR LWL 2.0. They achieve a sensitivity of (6300 $\pm$ 460) nm/RIU (refractive index unit), surpassing both traditional and polymer-based plasmo-photonic sensors. This approach enables integrated, wafer-scale, CMOS-compatible, and low-cost sensors and facilitates plasmonic refractive index sensing platforms for various applications.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
GP-MoLFormer: A Foundation Model For Molecular Generation
Authors:
Jerret Ross,
Brian Belgodere,
Samuel C. Hoffman,
Vijil Chenthamarakshan,
Youssef Mroueh,
Payel Das
Abstract:
Transformer-based models trained on large and general purpose datasets consisting of molecular strings have recently emerged as a powerful tool for successfully modeling various structure-property relations. Inspired by this success, we extend the paradigm of training chemical language transformers on large-scale chemical datasets to generative tasks in this work. Specifically, we propose GP-MoLFo…
▽ More
Transformer-based models trained on large and general purpose datasets consisting of molecular strings have recently emerged as a powerful tool for successfully modeling various structure-property relations. Inspired by this success, we extend the paradigm of training chemical language transformers on large-scale chemical datasets to generative tasks in this work. Specifically, we propose GP-MoLFormer, an autoregressive molecular string generator that is trained on more than 1.1B chemical SMILES. GP-MoLFormer uses a 46.8M parameter transformer decoder model with linear attention and rotary positional encodings as the base architecture. We explore the utility of GP-MoLFormer in generating novel, valid, and unique SMILES. Impressively, we find GP-MoLFormer is able to generate a significant fraction of novel, valid, and unique SMILES even when the number of generated molecules is in the 10 billion range and the reference set is over a billion. We also find strong memorization of training data in GP-MoLFormer generations, which has so far remained unexplored for chemical language models. Our analyses reveal that training data memorization and novelty in generations are impacted by the quality of the training data; duplication bias in training data can enhance memorization at the cost of lowering novelty. We evaluate GP-MoLFormer's utility and compare it with that of existing baselines on three different tasks: de novo generation, scaffold-constrained molecular decoration, and unconstrained property-guided optimization. While the first two are handled with no additional training, we propose a parameter-efficient fine-tuning method for the last task, which uses property-ordered molecular pairs as input. We call this new approach pair-tuning. Our results show GP-MoLFormer performs better or comparable with baselines across all three tasks, demonstrating its general utility.
△ Less
Submitted 4 April, 2024;
originally announced May 2024.
-
New Angular Momentum Conservation Laws for Gauge Fields in QED
Authors:
Farhad Khosravi,
Li-Ping Yang,
Pronoy Das,
Zubin Jacob
Abstract:
Quantum electrodynamics (QED) deals with the relativistic interaction of bosonic gauge fields and fermionic charged particles. In QED, global conservation laws of angular momentum for light-matter interactions are well-known. However, local conservation laws, i.e. the conservation law of angular momentum at every point in space, remain unexplored. Here, we use the QED Lagrangian and Noether's theo…
▽ More
Quantum electrodynamics (QED) deals with the relativistic interaction of bosonic gauge fields and fermionic charged particles. In QED, global conservation laws of angular momentum for light-matter interactions are well-known. However, local conservation laws, i.e. the conservation law of angular momentum at every point in space, remain unexplored. Here, we use the QED Lagrangian and Noether's theorem to derive a new local conservation law of angular momentum for Dirac-Maxwell fields in the form of the continuity relation for linear momentum. We separate this local conservation law into four coupled motion equations for spin and orbital angular momentum (OAM) densities. We introduce a helicity current tensor, OAM current tensor, and spin-orbit torque in the motion equations to shed light on on the local dynamics of spin-OAM interaction and angular momentum exchange between Maxwell-Dirac fields. We elucidate how our results translate to classical electrodynamics using the example of plane wave interference as well as a dual-mode optical fiber. Our results shine light on phenomena related to the spin of gauge bosons.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Statistically characterized subgroups related to some non-arithmetic sequence of integers
Authors:
Pratulananda Das,
Ayan Ghosh
Abstract:
Recently, in Das et al. (Mediterr. J. Math. 21 : 164, 2024), characterized subgroups are investigated for some special kind of non-arithmetic sequences. In this note, we study subsequent problems in case of ``statistically characterized subgroups" introduced in Dikranjan et al. (Fund. Math. 249 : 185-209, 2020). The entire investigation emphasizes that these statistically characterized subgroups a…
▽ More
Recently, in Das et al. (Mediterr. J. Math. 21 : 164, 2024), characterized subgroups are investigated for some special kind of non-arithmetic sequences. In this note, we study subsequent problems in case of ``statistically characterized subgroups" introduced in Dikranjan et al. (Fund. Math. 249 : 185-209, 2020). The entire investigation emphasizes that these statistically characterized subgroups are mostly larger in size, having cardinality $\mathfrak{c}$, and exhibit behavior that significantly differs from that of classically characterized subgroups. As a consequence, we solve an open problem raised in Dikranjan et al. (Fund. Math. 249 : 185-209, 2020).
△ Less
Submitted 2 September, 2024; v1 submitted 24 April, 2024;
originally announced April 2024.
-
Unified Map Handling for Robotic Systems: Enhancing Interoperability and Efficiency Across Diverse Environments
Authors:
James R. Heselden,
Gautham P. Das
Abstract:
Mapping is a time-consuming process for deploying robotic systems to new environments. The handling of maps is also risk-adverse when not managed effectively. We propose here, a standardised approach to handling such maps in a manner which focuses on the information contained wherein such as global location, object positions, topology, and occupancy. As part of this approach, associated management…
▽ More
Mapping is a time-consuming process for deploying robotic systems to new environments. The handling of maps is also risk-adverse when not managed effectively. We propose here, a standardised approach to handling such maps in a manner which focuses on the information contained wherein such as global location, object positions, topology, and occupancy. As part of this approach, associated management scripts are able to assist with generation of maps both through direct and indirect information restructuring, and with template and procedural generation of missing data. These approaches are able to, when combined, improve the handling of maps to enable more efficient deployments and higher interoperability between platforms. Alongside this, a collection of sample datasets of fully-mapped environments are included covering areas such as agriculture, urban roadways, and indoor environments.
△ Less
Submitted 20 April, 2024;
originally announced April 2024.
-
Language-Agnostic Modeling of Wikipedia Articles for Content Quality Assessment across Languages
Authors:
Paramita Das,
Isaac Johnson,
Diego Saez-Trumper,
Pablo Aragón
Abstract:
Wikipedia is the largest web repository of free knowledge. Volunteer editors devote time and effort to creating and expanding articles in more than 300 language editions. As content quality varies from article to article, editors also spend substantial time rating articles with specific criteria. However, keeping these assessments complete and up-to-date is largely impossible given the ever-changi…
▽ More
Wikipedia is the largest web repository of free knowledge. Volunteer editors devote time and effort to creating and expanding articles in more than 300 language editions. As content quality varies from article to article, editors also spend substantial time rating articles with specific criteria. However, keeping these assessments complete and up-to-date is largely impossible given the ever-changing nature of Wikipedia. To overcome this limitation, we propose a novel computational framework for modeling the quality of Wikipedia articles.
State-of-the-art approaches to model Wikipedia article quality have leveraged machine learning techniques with language-specific features. In contrast, our framework is based on language-agnostic structural features extracted from the articles, a set of universal weights, and a language version-specific normalization criterion. Therefore, we ensure that all language editions of Wikipedia can benefit from our framework, even those that do not have their own quality assessment scheme. Using this framework, we have built datasets with the feature values and quality scores of all revisions of all articles in the existing language versions of Wikipedia. We provide a descriptive analysis of these resources and a benchmark of our framework. In addition, we discuss possible downstream tasks to be addressed with these datasets, which are released for public use.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Formation and Microwave Losses of Hydrides in Superconducting Niobium Thin Films Resulting from Fluoride Chemical Processing
Authors:
Carlos G. Torres-Castanedo,
Dominic P. Goronzy,
Thang Pham,
Anthony McFadden,
Nicholas Materise,
Paul Masih Das,
Matthew Cheng,
Dmitry Lebedev,
Stephanie M. Ribet,
Mitchell J. Walker,
David A. Garcia-Wetten,
Cameron J. Kopas,
Jayss Marshall,
Ella Lachman,
Nikolay Zhelev,
James A. Sauls,
Joshua Y. Mutus,
Corey Rae H. McRae,
Vinayak P. Dravid,
Michael J. Bedzyk,
Mark C. Hersam
Abstract:
Superconducting Nb thin films have recently attracted significant attention due to their utility for quantum information technologies. In the processing of Nb thin films, fluoride-based chemical etchants are commonly used to remove surface oxides that are known to affect superconducting quantum devices adversely. However, these same etchants can also introduce hydrogen to form Nb hydrides, potenti…
▽ More
Superconducting Nb thin films have recently attracted significant attention due to their utility for quantum information technologies. In the processing of Nb thin films, fluoride-based chemical etchants are commonly used to remove surface oxides that are known to affect superconducting quantum devices adversely. However, these same etchants can also introduce hydrogen to form Nb hydrides, potentially negatively impacting microwave loss performance. Here, we present comprehensive materials characterization of Nb hydrides formed in Nb thin films as a function of fluoride chemical treatments. In particular, secondary-ion mass spectrometry, X-ray scattering, and transmission electron microscopy reveal the spatial distribution and phase transformation of Nb hydrides. The rate of hydride formation is determined by the fluoride solution acidity and the etch rate of Nb2O5, which acts as a diffusion barrier for hydrogen into Nb. The resulting Nb hydrides are detrimental to Nb superconducting properties and lead to increased power-independent microwave loss in coplanar waveguide resonators. However, Nb hydrides do not correlate with two-level system loss or device aging mechanisms. Overall, this work provides insight into the formation of Nb hydrides and their role in microwave loss, thus guiding ongoing efforts to maximize coherence time in superconducting quantum devices.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
Promatch: Extending the Reach of Real-Time Quantum Error Correction with Adaptive Predecoding
Authors:
Narges Alavisamani,
Suhas Vittal,
Ramin Ayanzadeh,
Poulami Das,
Moinuddin Qureshi
Abstract:
Fault-tolerant quantum computing relies on Quantum Error Correction, which encodes logical qubits into data and parity qubits. Error decoding is the process of translating the measured parity bits into types and locations of errors. To prevent a backlog of errors, error decoding must be performed in real-time. Minimum Weight Perfect Matching (MWPM) is an accurate decoding algorithm for surface cod…
▽ More
Fault-tolerant quantum computing relies on Quantum Error Correction, which encodes logical qubits into data and parity qubits. Error decoding is the process of translating the measured parity bits into types and locations of errors. To prevent a backlog of errors, error decoding must be performed in real-time. Minimum Weight Perfect Matching (MWPM) is an accurate decoding algorithm for surface code, and recent research has demonstrated real-time implementations of MWPM (RT-MWPM) for a distance of up to 9. Unfortunately, beyond d=9, the number of flipped parity bits in the syndrome, referred to as the Hamming weight of the syndrome, exceeds the capabilities of existing RT-MWPM decoders. In this work, our goal is to enable larger distance RT-MWPM decoders by using adaptive predecoding that converts high Hamming weight syndromes into low Hamming weight syndromes, which are accurately decoded by the RT-MWPM decoder. An effective predecoder must balance both accuracy and coverage. In this paper, we propose Promatch, a real-time adaptive predecoder that predecodes both simple and complex patterns using a locality-aware, greedy approach. Our approach ensures two crucial factors: 1) high accuracy in prematching flipped bits, ensuring that the decoding accuracy is not hampered by the predecoder, and 2) enough coverage adjusted based on the main decoder's capability given the time constraints. Promatch represents the first real-time decoding framework capable of decoding surface codes of distances 11 and 13, achieving an LER of $2.6\times 10^{-14}$ for distance 13. Moreover, we demonstrate that running Promatch concurrently with the recently proposed Astrea-G achieves LER equivalent to MWPM LER, $3.4\times10^{-15}$, for distance 13, representing the first real-time accurate decoder for up-to a distance of 13.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Interplay between the Lyapunov exponents and phase transitions of charged AdS black holes
Authors:
Bhaskar Shukla,
Pranaya Pratik Das,
David Dudal,
Subhash Mahapatra
Abstract:
We study the relationship between the standard or extended thermodynamic phase structure of various AdS black holes and the Lyapunov exponents associated with the null and time-like geodesics. We consider dyonic, Bardeen, Gauss-Bonnet, and Lorentz-symmetry breaking massive gravity black holes and calculate the Lyapunov exponents of massless and massive particles in unstable circular geodesics clos…
▽ More
We study the relationship between the standard or extended thermodynamic phase structure of various AdS black holes and the Lyapunov exponents associated with the null and time-like geodesics. We consider dyonic, Bardeen, Gauss-Bonnet, and Lorentz-symmetry breaking massive gravity black holes and calculate the Lyapunov exponents of massless and massive particles in unstable circular geodesics close to the black hole. We find that the thermal profile of the Lyapunov exponents exhibits distinct behaviour in the small and large black hole phases and can encompass certain aspects of the van der Waals type small/large black hole phase transition. We further analyse the properties of Lyapunov exponents as an order parameter and find that its critical exponent is $1/2$, near the critical point for all black holes considered here.
△ Less
Submitted 26 July, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models
Authors:
Amit Dhurandhar,
Tejaswini Pedapati,
Ronny Luss,
Soham Dan,
Aurelie Lozano,
Payel Das,
Georgios Kollias
Abstract:
Transformer-based Language Models have become ubiquitous in Natural Language Processing (NLP) due to their impressive performance on various tasks. However, expensive training as well as inference remains a significant impediment to their widespread applicability. While enforcing sparsity at various levels of the model architecture has found promise in addressing scaling and efficiency issues, the…
▽ More
Transformer-based Language Models have become ubiquitous in Natural Language Processing (NLP) due to their impressive performance on various tasks. However, expensive training as well as inference remains a significant impediment to their widespread applicability. While enforcing sparsity at various levels of the model architecture has found promise in addressing scaling and efficiency issues, there remains a disconnect between how sparsity affects network topology. Inspired by brain neuronal networks, we explore sparsity approaches through the lens of network topology. Specifically, we exploit mechanisms seen in biological networks, such as preferential attachment and redundant synapse pruning, and show that principled, model-agnostic sparsity approaches are performant and efficient across diverse NLP tasks, spanning both classification (such as natural language inference) and generation (summarization, machine translation), despite our sole objective not being optimizing performance. NeuroPrune is competitive with (or sometimes superior to) baselines on performance and can be up to $10$x faster in terms of training time for a given level of sparsity, simultaneously exhibiting measurable improvements in inference time in many cases.
△ Less
Submitted 5 June, 2024; v1 submitted 28 February, 2024;
originally announced April 2024.
-
What are the quantum commutation relations for the total angular momentum of light?
Authors:
Pronoy Das,
Li-Ping Yang,
Zubin Jacob
Abstract:
The total angular momentum of light has received attention for its application in a variety of phenomena such as optical communication, optical forces and sensing. However, the quantum behavior including the commutation relations have been relatively less explored. Here, we derive the correct commutation relation for the total angular momentum of light using both relativistic and non-relativistic…
▽ More
The total angular momentum of light has received attention for its application in a variety of phenomena such as optical communication, optical forces and sensing. However, the quantum behavior including the commutation relations have been relatively less explored. Here, we derive the correct commutation relation for the total angular momentum of light using both relativistic and non-relativistic approaches. An important outcome of our work is the proof that the widely-assumed quantum commutation relation for the total observable angular momentum of light is fundamentally incorrect. Our work will motivate experiments and leads to new insight on the quantum behavior of the angular momentum of light.
△ Less
Submitted 24 March, 2024;
originally announced March 2024.
-
Larimar: Large Language Models with Episodic Memory Control
Authors:
Payel Das,
Subhajit Chaudhury,
Elliot Nelson,
Igor Melnyk,
Sarath Swaminathan,
Sihui Dai,
Aurélie Lozano,
Georgios Kollias,
Vijil Chenthamarakshan,
Jiří,
Navrátil,
Soham Dan,
Pin-Yu Chen
Abstract:
Efficient and accurate updating of knowledge stored in Large Language Models (LLMs) is one of the most pressing research challenges today. This paper presents Larimar - a novel, brain-inspired architecture for enhancing LLMs with a distributed episodic memory. Larimar's memory allows for dynamic, one-shot updates of knowledge without the need for computationally expensive re-training or fine-tunin…
▽ More
Efficient and accurate updating of knowledge stored in Large Language Models (LLMs) is one of the most pressing research challenges today. This paper presents Larimar - a novel, brain-inspired architecture for enhancing LLMs with a distributed episodic memory. Larimar's memory allows for dynamic, one-shot updates of knowledge without the need for computationally expensive re-training or fine-tuning. Experimental results on multiple fact editing benchmarks demonstrate that Larimar attains accuracy comparable to most competitive baselines, even in the challenging sequential editing setup, but also excels in speed - yielding speed-ups of 8-10x depending on the base LLM - as well as flexibility due to the proposed architecture being simple, LLM-agnostic, and hence general. We further provide mechanisms for selective fact forgetting, information leakage prevention, and input context length generalization with Larimar and show their effectiveness. Our code is available at https://github.com/IBM/larimar
△ Less
Submitted 21 August, 2024; v1 submitted 18 March, 2024;
originally announced March 2024.
-
Certain observations on selection principles related to bornological covers using ideals
Authors:
D. Chandra,
P. Das,
S. Das
Abstract:
We study selection principles related to bornological covers using the notion of ideals. We consider ideals $\mathcal I$ and $\mathcal J$ on $ω$ and standard ideal orderings $KB, K$. Relations between cardinality of a base of a bornology with certain selection principles related to bornological covers are established using cardinal invariants such as modified pseudointersection number, the unbound…
▽ More
We study selection principles related to bornological covers using the notion of ideals. We consider ideals $\mathcal I$ and $\mathcal J$ on $ω$ and standard ideal orderings $KB, K$. Relations between cardinality of a base of a bornology with certain selection principles related to bornological covers are established using cardinal invariants such as modified pseudointersection number, the unbounding number and slaloms numbers. When $\mathcal I \leq_\square \mathcal J$ for ideals $\mathcal I, \mathcal J$ and $\square\in \{1\text{-}1,KB,K\}$, implications among various selection principles related to bornological covers are established. Under the assumption that ideal $\mathcal I$ has a pseudounion we show equivalences among certain selection principles related to bornological covers. Finally, the $\mathcal I\text{-}\mathfrak B^s$-Hurewicz property of $X$ is investigated. We prove that $\mathcal I\text{-}\mathfrak B^s$-Hurewicz property of $X$ coincides with the $\mathfrak B^s$-Hurewicz property of $X$ if $\mathcal I$ has a pseudounion. Implications or equivalences among selection principles, games and $\mathcal I\text{-}\mathfrak B^s$-Hurewicz property which are obtained from our investigations are described in diagrams.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
Quantum theory of orbital angular momentum in spatiotemporal optical vortices
Authors:
Pronoy Das,
Sathwik Bharadwaj,
Zubin Jacob
Abstract:
Spatiotemporal Optical Vortices (STOVs) are structured electromagnetic fields propagating in free space with phase singularities in the space-time domain. Depending on the tilt of the helical phase front, STOVs can carry both longitudinal and transverse orbital angular momentum (OAM). Although STOVs have gained significant interest in the recent years, the current understanding is limited to the s…
▽ More
Spatiotemporal Optical Vortices (STOVs) are structured electromagnetic fields propagating in free space with phase singularities in the space-time domain. Depending on the tilt of the helical phase front, STOVs can carry both longitudinal and transverse orbital angular momentum (OAM). Although STOVs have gained significant interest in the recent years, the current understanding is limited to the semi-classical picture. Here, we develop a quantum theory for STOVs with an arbitrary tilt, extending beyond the paraxial limit. We demonstrate that quantum STOV states, such as Fock and coherent twisted photon pulses, display non-vanishing longitudinal OAM fluctuations that are absent in conventional monochromatic twisted pulses. We show that these quantum fluctuations exhibit a unique texture, i.e. a spatial distribution which can be used to experimentally isolate these quantum effects. Our findings represent a step towards the exploitation of quantum effects of structured light for various applications such as OAM-based encoding protocols and platforms to explore novel light-matter interaction in 2D material systems.
△ Less
Submitted 24 March, 2024; v1 submitted 1 March, 2024;
originally announced March 2024.
-
Virial Equation of State for a Granular System
Authors:
Subhanker Howlader,
Prasenjit Das
Abstract:
The equation of state for an ideal gas is simple, which is $P=nk_{\rm B}T$. In the case of imperfect gases where mutual interactions among the constituents are important, pressure $P$ can be expressed as the series expansion of density $n$ with appropriate coefficients, known as virial coefficients $B_m$. In this paper, we have obtained the first four virial coefficients for a model interaction po…
▽ More
The equation of state for an ideal gas is simple, which is $P=nk_{\rm B}T$. In the case of imperfect gases where mutual interactions among the constituents are important, pressure $P$ can be expressed as the series expansion of density $n$ with appropriate coefficients, known as virial coefficients $B_m$. In this paper, we have obtained the first four virial coefficients for a model interaction potential $Φ(r)$ using multidimensional Monte-Carlo integration and importance sampling methods. Next, we perform molecular dynamics simulations with the same $Φ(r)$ for a many-particle system to obtain $P$ as a function of $T$ and $n$. We compare our numerical data with the virial equation of state.
△ Less
Submitted 16 March, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
Quantum Linear Magnetoresistance and Fermi Liquid Behavior in Kagome Metal Ni3In2S2
Authors:
P. Das,
P. Saha,
M. Singh,
P. Kumar,
S. Patnaik
Abstract:
Kagome metals gain attention as they manifest a spectrum of quantum phenomena, including superconductivity, charge order, frustrated magnetism, and intertwined correlated states of condensed matter. With regard to electronic band structure, several of the them exhibit non-trivial topological characteristics. Here, we present a thorough investigation on the growth and the physical properties of sin…
▽ More
Kagome metals gain attention as they manifest a spectrum of quantum phenomena, including superconductivity, charge order, frustrated magnetism, and intertwined correlated states of condensed matter. With regard to electronic band structure, several of the them exhibit non-trivial topological characteristics. Here, we present a thorough investigation on the growth and the physical properties of single crystals of Ni3In2S2 which is established to be a Dirac nodal line Kagome metal. Extensive characterization is attained through temperature and field-dependent resistivity, angle-dependent magnetoresistance and specific heat measurements. In most metals, the Fermi liquid behaviour is mostly restricted to a narrow range of temperature. In Ni3In2S2, this characteristic feature has been observed for an extensive temperature range of 82 K. This is attributed to the strong electron-electron correlation in the material. Specific heat measurements reveal a high Kadowaki-Woods ratio which is in good agreement with strongly correlated systems. Almost linear positive magnetoresistance follows the conventional Kohler scaling which depicts the applicability of semi-classical theories. The angle-dependent magneto-resistance been explained using the Voigt-Thomson formula. Furthermore, de-Haas van Alphen oscillations are observed in magnetization vs. magnetic field measurement which shed light on the topological features in the Shandite Ni3In2S2.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
Rationality of Learning Algorithms in Repeated Normal-Form Games
Authors:
Shivam Bajaj,
Pranoy Das,
Yevgeniy Vorobeychik,
Vijay Gupta
Abstract:
Many learning algorithms are known to converge to an equilibrium for specific classes of games if the same learning algorithm is adopted by all agents. However, when the agents are self-interested, a natural question is whether agents have a strong incentive to adopt an alternative learning algorithm that yields them greater individual utility. We capture such incentives as an algorithm's rational…
▽ More
Many learning algorithms are known to converge to an equilibrium for specific classes of games if the same learning algorithm is adopted by all agents. However, when the agents are self-interested, a natural question is whether agents have a strong incentive to adopt an alternative learning algorithm that yields them greater individual utility. We capture such incentives as an algorithm's rationality ratio, which is the ratio of the highest payoff an agent can obtain by deviating from a learning algorithm to its payoff from following it. We define a learning algorithm to be $c$-rational if its rationality ratio is at most $c$ irrespective of the game. We first establish that popular learning algorithms such as fictitious play and regret matching are not $c$-rational for any constant $c\geq 1$. We then propose and analyze two algorithms that are provably $1$-rational under mild assumptions, and have the same properties as (a generalized version of) fictitious play and regret matching, respectively, if all agents follow them. Finally, we show that if an assumption of perfect monitoring is not satisfied, there are games for which $c$-rational algorithms do not exist, and illustrate our results with numerical case studies.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
Quantum Computing-Enhanced Algorithm Unveils Novel Inhibitors for KRAS
Authors:
Mohammad Ghazi Vakili,
Christoph Gorgulla,
AkshatKumar Nigam,
Dmitry Bezrukov,
Daniel Varoli,
Alex Aliper,
Daniil Polykovsky,
Krishna M. Padmanabha Das,
Jamie Snider,
Anna Lyakisheva,
Ardalan Hosseini Mansob,
Zhong Yao,
Lela Bitar,
Eugene Radchenko,
Xiao Ding,
Jinxin Liu,
Fanye Meng,
Feng Ren,
Yudong Cao,
Igor Stagljar,
Alán Aspuru-Guzik,
Alex Zhavoronkov
Abstract:
The discovery of small molecules with therapeutic potential is a long-standing challenge in chemistry and biology. Researchers have increasingly leveraged novel computational techniques to streamline the drug development process to increase hit rates and reduce the costs associated with bringing a drug to market. To this end, we introduce a quantum-classical generative model that seamlessly integr…
▽ More
The discovery of small molecules with therapeutic potential is a long-standing challenge in chemistry and biology. Researchers have increasingly leveraged novel computational techniques to streamline the drug development process to increase hit rates and reduce the costs associated with bringing a drug to market. To this end, we introduce a quantum-classical generative model that seamlessly integrates the computational power of quantum algorithms trained on a 16-qubit IBM quantum computer with the established reliability of classical methods for designing small molecules. Our hybrid generative model was applied to designing new KRAS inhibitors, a crucial target in cancer therapy. We synthesized 15 promising molecules during our investigation and subjected them to experimental testing to assess their ability to engage with the target. Notably, among these candidates, two molecules, ISM061-018-2 and ISM061-22, each featuring unique scaffolds, stood out by demonstrating effective engagement with KRAS. ISM061-018-2 was identified as a broad-spectrum KRAS inhibitor, exhibiting a binding affinity to KRAS-G12D at $1.4 μM$. Concurrently, ISM061-22 exhibited specific mutant selectivity, displaying heightened activity against KRAS G12R and Q61H mutants. To our knowledge, this work shows for the first time the use of a quantum-generative model to yield experimentally confirmed biological hits, showcasing the practical potential of quantum-assisted drug discovery to produce viable therapeutics. Moreover, our findings reveal that the efficacy of distribution learning correlates with the number of qubits utilized, underlining the scalability potential of quantum computing resources. Overall, we anticipate our results to be a stepping stone towards developing more advanced quantum generative models in drug discovery.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
ProtIR: Iterative Refinement between Retrievers and Predictors for Protein Function Annotation
Authors:
Zuobai Zhang,
Jiarui Lu,
Vijil Chenthamarakshan,
Aurélie Lozano,
Payel Das,
Jian Tang
Abstract:
Protein function annotation is an important yet challenging task in biology. Recent deep learning advancements show significant potential for accurate function prediction by learning from protein sequences and structures. Nevertheless, these predictor-based methods often overlook the modeling of protein similarity, an idea commonly employed in traditional approaches using sequence or structure ret…
▽ More
Protein function annotation is an important yet challenging task in biology. Recent deep learning advancements show significant potential for accurate function prediction by learning from protein sequences and structures. Nevertheless, these predictor-based methods often overlook the modeling of protein similarity, an idea commonly employed in traditional approaches using sequence or structure retrieval tools. To fill this gap, we first study the effect of inter-protein similarity modeling by benchmarking retriever-based methods against predictors on protein function annotation tasks. Our results show that retrievers can match or outperform predictors without large-scale pre-training. Building on these insights, we introduce a novel variational pseudo-likelihood framework, ProtIR, designed to improve function predictors by incorporating inter-protein similarity modeling. This framework iteratively refines knowledge between a function predictor and retriever, thereby combining the strengths of both predictors and retrievers. ProtIR showcases around 10% improvement over vanilla predictor-based methods. Besides, it achieves performance on par with protein language model-based methods, yet without the need for massive pre-training, highlighting the efficacy of our framework. Code will be released upon acceptance.
△ Less
Submitted 10 February, 2024;
originally announced February 2024.