-
Observation of muonic Dalitz decays of $χ_{b}$ mesons and precise spectroscopy of hidden-beauty states
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1114 additional authors not shown)
Abstract:
The decays of the $χ_{b1}(1P)$, $χ_{b2}(1P)$, $χ_{b1}(2P)$ and $χ_{b2}(2P)$ mesons into the $Υ(1S)μ^+μ^-$ final state are observed with high significance using proton-proton collision data collected with the LHCb detector and corresponding to an integrated luminosity of 9 fb$^{-1}$. The newly observed decays together with the $Υ(2S)\rightarrow Υ(1S)π^+π^-$ and $Υ(3S)\rightarrow Υ(2S)π^+π^-$ decay modes are used for precision measurements of the mass and mass splittings for the hidden-beauty states.
Submitted 9 August, 2024;
originally announced August 2024.
-
Optimizing Cox Models with Stochastic Gradient Descent: Theoretical Foundations and Practical Guidances
Authors:
Lang Zeng,
Weijing Tang,
Zhao Ren,
Ying Ding
Abstract:
Optimizing Cox regression and its neural network variants poses substantial computational challenges in large-scale studies. Stochastic gradient descent (SGD), known for its scalability in model optimization, has recently been adapted to optimize Cox models. Unlike its conventional application, which typically targets a sum of independent individual losses, SGD for Cox models updates parameters based on the partial likelihood of a subset of data. Despite its empirical success, the theoretical foundation for optimizing Cox partial likelihood with SGD is largely underexplored. In this work, we demonstrate that the SGD estimator targets an objective function that is batch-size-dependent. We establish that the SGD estimator for the Cox neural network (Cox-NN) is consistent and achieves the optimal minimax convergence rate up to a polylogarithmic factor. For Cox regression, we further prove the $\sqrt{n}$-consistency and asymptotic normality of the SGD estimator, with variance depending on the batch size. Furthermore, we quantify the impact of batch size on Cox-NN training and its effect on the SGD estimator's asymptotic efficiency in Cox regression. These findings are validated by extensive numerical experiments and provide guidance for selecting batch sizes in SGD applications. Finally, we demonstrate the effectiveness of SGD in a real-world application where full-batch gradient descent (GD) is infeasible due to the large scale of the data.
Submitted 5 August, 2024;
originally announced August 2024.
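The idea in this abstract of updating parameters from the partial likelihood of a subset of the data can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a linear Cox model, no tied event times, and plain NumPy, and the function names `batch_cox_loss_and_grad` and `sgd_cox` are hypothetical. Because each risk set is formed inside the mini-batch only, the objective being minimized depends on the batch size, which is the point the abstract makes.

```python
import numpy as np

def batch_cox_loss_and_grad(beta, x, time, event):
    """Negative log partial likelihood of ONE mini-batch and its gradient.

    Assumes no tied event times. Risk sets are built from the batch only.
    """
    order = np.argsort(-time)            # sort by descending time
    x, event = x[order], event[order]
    eta = x @ beta                        # linear predictor
    shift = eta.max()                     # for numerical stability
    w = np.exp(eta - shift)
    cum_w = np.cumsum(w)                  # sum over {j : t_j >= t_i}
    cum_wx = np.cumsum(w[:, None] * x, axis=0)
    log_risk = np.log(cum_w) + shift
    n_events = max(event.sum(), 1.0)      # avoid division by zero
    loss = -np.sum(event * (eta - log_risk)) / n_events
    grad = -np.sum(event[:, None] * (x - cum_wx / cum_w[:, None]), axis=0) / n_events
    return loss, grad

def sgd_cox(x, time, event, batch_size=32, lr=0.1, epochs=20, seed=0):
    """Plain mini-batch SGD on the batch-wise partial likelihood (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    beta = np.zeros(x.shape[1])
    n = len(time)
    for _ in range(epochs):
        perm = rng.permutation(n)
        for start in range(0, n, batch_size):
            idx = perm[start:start + batch_size]
            _, g = batch_cox_loss_and_grad(beta, x[idx], time[idx], event[idx])
            beta -= lr * g
    return beta
```

Calling `sgd_cox(x, time, event, batch_size=b)` for several values of `b` is one way to see empirically how the estimate changes with batch size; `event` is expected as a float array of 0/1 indicators.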
-
Spatio-Temporal Communication Compression in Distributed Prime-Dual Flows
Authors:
Zihao Ren,
Lei Wang,
Deming Yuan,
Hongye Su,
Guodong Shi
Abstract:
In this paper, we study distributed prime-dual flows for multi-agent optimization with spatio-temporal compressions. The central aim of multi-agent optimization is for a network of agents to collaboratively solve a system-level optimization problem with local objective functions and node-to-node communication by distributed algorithms. The scalability of such algorithms crucially depends on the complexity of the communication messages, and a number of communication compressors for distributed optimization have recently been proposed in the literature. First of all, we introduce a general spatio-temporal compressor characterized by the stability of the resulting dynamical system along the vector field of the compressor. We show that several important distributed optimization compressors such as the greedy sparsifier, the uniform quantizer, and the scalarizer all fall into the category of this spatio-temporal compressor. Next, we propose two distributed prime-dual flows with the spatio-temporal compressors being applied to local node states and local error states, respectively, and prove (exponential) convergence of the node trajectories to the global optimizer for (strongly) convex cost functions. Finally, a few numerical examples are presented to illustrate our theoretical results.
Submitted 5 August, 2024;
originally announced August 2024.
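Two of the compressors named in this abstract have widely used textbook forms, sketched below. This is an assumption about what the names refer to: the paper's own definitions may differ in detail, the scalarizer is omitted because its form is not given in the abstract, and the function names are hypothetical.

```python
import numpy as np

def greedy_sparsify(x, k):
    """Keep the k largest-magnitude entries of x and zero the rest (top-k).

    Assumes 1 <= k <= len(x).
    """
    x = np.asarray(x, dtype=float)
    out = np.zeros_like(x)
    keep = np.argpartition(np.abs(x), -k)[-k:]  # indices of the k largest |x_i|
    out[keep] = x[keep]
    return out

def uniform_quantize(x, step):
    """Round every entry of x to the nearest multiple of `step`."""
    return step * np.round(np.asarray(x, dtype=float) / step)
```

Both maps reduce what a node has to transmit: the sparsifier sends `k` values with their indices, and the quantizer sends integers on a fixed grid.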
-
Generative Retrieval with Few-shot Indexing
Authors:
Arian Askari,
Chuan Meng,
Mohammad Aliannejadi,
Zhaochun Ren,
Evangelos Kanoulas,
Suzan Verberne
Abstract:
Existing generative retrieval (GR) approaches rely on training-based indexing, i.e., fine-tuning a model to memorise the associations between a query and the document identifier (docid) of a relevant document. Training-based indexing has three limitations: high training overhead, under-utilization of the pre-trained knowledge of large language models (LLMs), and challenges in adapting to a dynamic document corpus. To address the above issues, we propose a novel few-shot indexing-based GR framework (Few-Shot GR). It has a novel few-shot indexing process, where we prompt an LLM to generate docids for all documents in a corpus, ultimately creating a docid bank for the entire corpus. During retrieval, we feed a query to the same LLM and constrain it to generate a docid within the docid bank created during indexing, and then map the generated docid back to its corresponding document. Few-Shot GR relies solely on prompting an LLM without requiring any training, making it more efficient. Moreover, we devise few-shot indexing with one-to-many mapping to further enhance Few-Shot GR. Experiments show that Few-Shot GR achieves superior performance to state-of-the-art GR methods that require heavy training.
Submitted 4 August, 2024;
originally announced August 2024.
-
ChipExpert: The Open-Source Integrated-Circuit-Design-Specific Large Language Model
Authors:
Ning Xu,
Zhaoyang Zhang,
Lei Qi,
Wensuo Wang,
Chao Zhang,
Zihao Ren,
Huaiyuan Zhang,
Xin Cheng,
Yanqi Zhang,
Zhichao Liu,
Qingwen Wei,
Shiyang Wu,
Lanlan Yang,
Qianfeng Lu,
Yiqun Ma,
Mengyao Zhao,
Junbo Liu,
Yufan Song,
Xin Geng,
Jun Yang
Abstract:
The field of integrated circuit (IC) design is highly specialized, presenting significant barriers to entry and research and development challenges. Although large language models (LLMs) have achieved remarkable success in various domains, existing LLMs often fail to meet the specific needs of students, engineers, and researchers. Consequently, the potential of LLMs in the IC design domain remains largely unexplored. To address these issues, we introduce ChipExpert, the first open-source, instructional LLM specifically tailored for the IC design field. ChipExpert is trained on one of the current best open-source base models (Llama-3 8B). The entire training process encompasses several key stages, including data preparation, continued pre-training, instruction-guided supervised fine-tuning, preference alignment, and evaluation. In the data preparation stage, we construct multiple high-quality custom datasets through manual selection and data synthesis techniques. In the subsequent two stages, ChipExpert acquires a vast amount of IC design knowledge and learns how to respond to user queries professionally. ChipExpert also undergoes an alignment phase, using Direct Preference Optimization, to achieve a high standard of ethical performance. Finally, to mitigate the hallucinations of ChipExpert, we have developed a Retrieval-Augmented Generation (RAG) system based on the IC design knowledge base. We also release the first IC design benchmark, ChipICD-Bench, to evaluate the capabilities of LLMs across multiple IC design sub-domains. Through comprehensive experiments conducted on this benchmark, ChipExpert demonstrated a high level of expertise in IC design knowledge Question-and-Answer tasks.
Submitted 26 July, 2024;
originally announced August 2024.
-
MetaOpenFOAM: an LLM-based multi-agent framework for CFD
Authors:
Yuxuan Chen,
Xu Zhu,
Hua Zhou,
Zhuyin Ren
Abstract:
Remarkable progress has been made in automated problem solving through societies of agents based on large language models (LLMs). Computational fluid dynamics (CFD), as a complex problem, presents unique challenges in automated simulations that require sophisticated solutions. MetaOpenFOAM, as a novel multi-agent collaboration framework, aims to complete CFD simulation tasks with only natural language as input. These simulation tasks include mesh pre-processing, simulation and so on. MetaOpenFOAM harnesses the power of MetaGPT's assembly line paradigm, which assigns diverse roles to various agents, efficiently breaking down complex CFD tasks into manageable subtasks. Langchain further complements MetaOpenFOAM by integrating Retrieval-Augmented Generation (RAG) technology, which enhances the framework's ability by integrating a searchable database of OpenFOAM tutorials for LLMs. Tests on a benchmark for natural language-based CFD solvers, consisting of eight CFD simulation tasks, have shown that MetaOpenFOAM achieved a high pass rate per test (85%), with each test case costing only $0.22 on average. The eight CFD simulation tasks encompass a range of multidimensional flow problems, covering compressible and incompressible flows with different physical processes. This demonstrates the capability to automate CFD simulations using only natural language input, iteratively correcting errors to achieve the desired simulations. An ablation study was conducted to verify the necessity of each component in the multi-agent system and the RAG technology. A sensitivity study on the randomness of the LLM showed that an LLM with low randomness can obtain more stable and accurate results. Additionally, MetaOpenFOAM has the ability to identify and modify key parameters in user requirements, and excels in correcting bugs when failures occur, which demonstrates the generalization of MetaOpenFOAM.
Submitted 7 August, 2024; v1 submitted 31 July, 2024;
originally announced July 2024.
-
Apple Intelligence Foundation Language Models
Authors:
Tom Gunter,
Zirui Wang,
Chong Wang,
Ruoming Pang,
Andy Narayanan,
Aonan Zhang,
Bowen Zhang,
Chen Chen,
Chung-Cheng Chiu,
David Qiu,
Deepak Gopinath,
Dian Ang Yap,
Dong Yin,
Feng Nan,
Floris Weers,
Guoli Yin,
Haoshuo Huang,
Jianyu Wang,
Jiarui Lu,
John Peebles,
Ke Ye,
Mark Lee,
Nan Du,
Qibin Chen,
Quentin Keunebroek
, et al. (130 additional authors not shown)
Abstract:
We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used to train the model, the training process, how the models are optimized for inference, and the evaluation results. We highlight our focus on Responsible AI and how the principles are applied throughout the model development.
Submitted 29 July, 2024;
originally announced July 2024.
-
Absence of BCS-BEC Crossover in FeSe0.45Te0.55 Superconductor
Authors:
Junjie Jia,
Yadong Gu,
Chaohui Yin,
Yingjie Shu,
Yiwen Chen,
Jumin Shi,
Xing Zhang,
Hao Chen,
Taimin Miao,
Xiaolin Ren,
Bo Liang,
Wenpei Zhu,
Neng Cai,
Fengfeng Zhang,
Shenjin Zhang,
Feng Yang,
Zhimin Wang,
Qinjun Peng,
Zuyan Xu,
Hanqing Mao,
Guodong Liu,
Zhian Ren,
Lin Zhao,
X. J. Zhou
Abstract:
In the iron-based superconductor Fe(Se,Te), a flat band-like feature near the Fermi level was observed around the Brillouin zone center in the superconducting state. It is under debate whether this is evidence for the presence of the BCS-BEC crossover in the superconductor. High-resolution laser-based angle-resolved photoemission measurements are carried out on high-quality single crystals of the FeSe0.45Te0.55 superconductor to address the issue. By employing different polarization geometries, we have resolved and isolated the dyz band and the topological surface band, making it possible to study their superconducting behaviors separately. The dyz band alone does not form a flat band-like feature in the superconducting state and the measured dispersion can be well described by the BCS picture. We find that the flat band-like feature is formed from the combination of the dyz band and the topological surface state band in the superconducting state. These results reveal the origin of the flat band-like feature and rule out the presence of BCS-BEC crossover in the Fe(Se,Te) superconductor.
Submitted 30 July, 2024;
originally announced July 2024.
-
From Complexity to Clarity: Kolmogorov-Arnold Networks in Nuclear Binding Energy Prediction
Authors:
Hao Liu,
Jin Lei,
Zhongzhou Ren
Abstract:
This study explores the application of Kolmogorov-Arnold Networks (KANs) in predicting nuclear binding energies, leveraging their ability to decompose complex multi-parameter systems into simpler univariate functions. By utilizing data from the Atomic Mass Evaluation (AME2020) and incorporating features such as atomic number, neutron number, and shell effects, KANs demonstrated significant predictive accuracy, surpassing traditional models. The symbolic regression analysis yielded simplified analytical expressions for binding energies, aligning with classical models like the liquid drop model and the Bethe-Weizsäcker formula. These results highlight KANs' potential in enhancing the interpretability and understanding of nuclear phenomena, paving the way for future applications in nuclear physics and beyond.
Submitted 30 July, 2024;
originally announced July 2024.
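For readers unfamiliar with the classical baseline named in this abstract, the Bethe-Weizsäcker (semi-empirical mass) formula can be written as a short function. This is a sketch of the textbook formula only, not of the KAN model or the paper's symbolic-regression result. The coefficients are one commonly quoted textbook set in MeV; published fits vary, so the defaults are assumptions, and the function name is hypothetical.

```python
def semi_empirical_binding_energy(Z, N, a_v=15.75, a_s=17.8, a_c=0.711,
                                  a_a=23.7, a_p=11.18):
    """Binding energy in MeV from the Bethe-Weizsaecker formula.

    Default coefficients are one textbook parameter set (an assumption);
    other fits give somewhat different values.
    """
    A = Z + N
    # Pairing term: positive for even-even, negative for odd-odd, zero otherwise.
    if Z % 2 == 0 and N % 2 == 0:
        delta = a_p / A ** 0.5
    elif Z % 2 == 1 and N % 2 == 1:
        delta = -a_p / A ** 0.5
    else:
        delta = 0.0
    return (a_v * A                               # volume term
            - a_s * A ** (2 / 3)                  # surface term
            - a_c * Z * (Z - 1) / A ** (1 / 3)    # Coulomb term
            - a_a * (N - Z) ** 2 / A              # asymmetry term
            + delta)
```

For iron-56, `semi_empirical_binding_energy(26, 30)` lands within a few percent of the measured value of roughly 8.8 MeV per nucleon, which is the level of accuracy that data-driven models such as the one in this abstract aim to improve on.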
-
Measurement of $D^0-\overline{D}^0$ mixing and search for $CP$ violation with $D^0\rightarrow K^+π^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1065 additional authors not shown)
Abstract:
A measurement of the time-dependent ratio of the $D^0\rightarrow K^+π^-$ to $\overline{D}^0\rightarrow K^+π^-$ decay rates is reported. The analysis uses a sample of proton-proton collisions corresponding to an integrated luminosity of 6 fb$^{-1}$ recorded by the LHCb experiment from 2015 through 2018 at a center-of-mass energy of 13 TeV. The $D^0$ meson is required to originate from a $D^{*+}\rightarrow D^0π^+$ decay, such that its flavor at production is inferred from the charge of the accompanying pion. The measurement is performed simultaneously for the $K^+π^-$ and $K^-π^+$ final states, allowing both mixing and $CP$-violation parameters to be determined. The value of the ratio of the decay rates at production is determined to be $R_{Kπ} = (343.1 \pm 2.0) \times 10^{-5}$. The mixing parameters are measured to be $c_{Kπ} = (51.4 \pm 3.5) \times 10^{-4}$ and $c_{Kπ}^{\prime} = (13 \pm 4) \times 10^{-6}$, where $\sqrt{R_{Kπ}}c_{Kπ}$ is the linear coefficient of the expansion of the ratio as a function of decay time in units of the $D^0$ lifetime, and $c_{Kπ}^{\prime}$ is the quadratic coefficient, both averaged between the $K^+π^-$ and $K^-π^+$ final states. The precision is improved relative to the previous best measurement by approximately 60%. No evidence for $CP$ violation is found.
Submitted 25 July, 2024;
originally announced July 2024.
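The expansion described in this abstract can be evaluated numerically. The sketch below assumes the ratio is truncated at second order, $R(t) \approx R_{Kπ} + \sqrt{R_{Kπ}}\,c_{Kπ}\,t + c_{Kπ}^{\prime}\,t^2$ with $t$ in units of the $D^0$ lifetime, and uses only the quoted central values. Uncertainties, correlations and $CP$-violating differences between the two final states are ignored, and the function name is hypothetical.

```python
import math

# Central values quoted in the abstract (uncertainties ignored).
R_KPI = 343.1e-5        # ratio of decay rates at production
C_KPI = 51.4e-4         # linear mixing parameter
C_PRIME_KPI = 13e-6     # quadratic mixing parameter

def decay_rate_ratio(t):
    """Second-order expansion of the decay-rate ratio; t in D0 lifetimes."""
    return R_KPI + math.sqrt(R_KPI) * C_KPI * t + C_PRIME_KPI * t ** 2
```

Under these assumptions the ratio grows by roughly nine percent between `t = 0` and `t = 1`, with the linear term dominating the change.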
-
Three-Photon Polarization Entanglement of Green Light
Authors:
Yan-Chao Lou,
Zhi-Cheng Ren,
Chao Chen,
Pei Wan,
Wen-Zheng Zhu,
Jing Wang,
Shu-Tian Xue,
Bo-Wen Dong,
Jianping Ding,
Xi-Lin Wang,
Hui-Tian Wang
Abstract:
Recently, great progress has been made in the entanglement of multiple photons at various wavelengths and in different degrees of freedom for optical quantum information applied in diverse scenarios. However, multi-photon entanglement in the underwater transmission window of green light has not been reported yet. Here, by combining femtosecond-laser-based multi-photon entanglement and entanglement-maintaining frequency upconversion techniques, we successfully generate a green two-photon polarization-entangled Bell state and a green three-photon Greenberger-Horne-Zeilinger (GHZ) state, whose state fidelities are 0.893$\pm$0.002 and 0.595$\pm$0.023, respectively. Our result provides a scalable method to prepare green multi-photon entanglement, which may have wide applications in underwater quantum information.
Submitted 24 July, 2024;
originally announced July 2024.
-
The Hidden Variables: Harnessing Half-Shell Potentials for Enhanced Precision in Nuclear Reaction Calculations
Authors:
Hao Liu,
Jin Lei,
Zhongzhou Ren
Abstract:
We explore the impact of half-shell components on nuclear reaction calculations, focusing on nonelastic breakup cross sections within the Ichimura-Austern-Vincent (IAV) model. By advocating for the use of a consistent Single Folding Model (SFM) for all optical potentials in IAV calculations, we aim to reduce the uncertainties associated with half-shell components and enhance agreement with experimental data. We present results from deuteron-induced reactions on $^{60}$Ni and $^{208}$Pb, which serve as surrogate targets for neutron-induced reactions on short-lived nuclei. The application of consistent optical potentials derived from the SFM shows improved alignment with experimental data compared to traditional global phenomenological potentials. Furthermore, we investigate the $^{59}$Co($^6$Li,$αX$) reaction, which reveals that the half-shell $T$-matrix plays a pivotal role in accurately modeling nuclear reactions. Our findings suggest that a unified approach to optical potentials, accounting for half-shell effects, is critical for a precise understanding of complex nuclear reactions. This work highlights the significance of the internal dynamics of the wave function, particularly in lighter targets, and underscores the importance of the half-shell $T$-matrix as a previously underappreciated variable in reaction calculations.
Submitted 23 July, 2024;
originally announced July 2024.
-
COLOSS: Complex-scaled Optical and couLOmb Scattering Solver
Authors:
Junzhe Liu,
Jin Lei,
Zhongzhou Ren
Abstract:
We introduce COLOSS, a program designed to address the scattering problem using a bound-state technique known as complex scaling. In this method, the oscillatory boundary conditions of the wave function are transformed into exponentially decaying ones, accommodating the long-range Coulomb interaction. The program implements the Woods-Saxon form of a realistic optical potential, with all potential parameters included in a well-designed input format for ease of use. This design offers users straightforward access to compute \(S\)-matrices and cross-sections of the scattering process. We provide thorough discussions on the precision of Lagrange functions and their benefits in evaluating matrix elements. Additionally, COLOSS incorporates two distinct rotation methods, making it adaptable to potentials without analytical expressions. Comparative results demonstrate that COLOSS achieves high accuracy when compared with the direct integration method, Numerov, underscoring its utility and effectiveness in scattering calculations.
Submitted 23 July, 2024;
originally announced July 2024.
-
Heralded High-Dimensional Photon-Photon Quantum Gate
Authors:
Zhi-Feng Liu,
Zhi-Cheng Ren,
Pei Wan,
Wen-Zheng Zhu,
Zi-Mo Cheng,
Jing Wang,
Yu-Peng Shi,
Han-Bing Xi,
Marcus Huber,
Nicolai Friis,
Xiaoqin Gao,
Xi-Lin Wang,
Hui-Tian Wang
Abstract:
High-dimensional encoding of quantum information holds the potential to greatly increase the computational power of existing devices by enlarging the accessible state space for fixed register size and by reducing the number of required entangling gates. However, qudit-based quantum computation remains far less developed than conventional qubit-based approaches, in particular for photons, which represent natural multi-level information carriers that play a crucial role in the development of quantum networks. A major obstacle for realizing quantum gates between two individual photons is the restriction of direct interaction between photons in linear media. In particular, essential logic components for quantum operations such as native qudit-qudit entangling gates are still missing for optical quantum information processing. Here we address this challenge by presenting a protocol for realizing an entangling gate -- the controlled phase-flip (CPF) gate -- for two photonic qudits in arbitrary dimension. We experimentally demonstrate this protocol by realizing a four-dimensional qudit-qudit CPF gate, whose decomposition would require at least 13 two-qubit entangling gates. Our photonic qudits are encoded in orbital angular momentum (OAM) and we have developed a new active high-precision phase-locking technology to construct a high-dimensional OAM beam splitter that increases the stability of the CPF gate, resulting in a process fidelity within a range of $[0.64 \pm 0.01, 0.82 \pm 0.01]$. Our experiment represents a significant advance for high-dimensional optical quantum information processing and has the potential for wider applications beyond optical systems.
Submitted 23 July, 2024;
originally announced July 2024.
-
Observation of exotic $J/ψφ$ resonances in diffractive processes in proton-proton collisions
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1068 additional authors not shown)
Abstract:
The first study of $J/ψφ$ production in diffractive processes in proton-proton collisions is presented. The study is based on an LHCb dataset recorded at a centre-of-mass energy of 13 TeV, corresponding to an integrated luminosity of 5 fb$^{-1}$. The data disfavour nonresonant $J/ψφ$ production but are consistent with a resonant model including several resonant states observed previously only in $B^+ \to J/ψφK^+$ decays. The $χ_{c0}(4500)$ state is observed with a significance over $5σ$ and the $χ_{c1}(4274)$ is confirmed with a significance of more than $4σ$.
Submitted 19 July, 2024;
originally announced July 2024.
-
Study of charmonium production via the decay to $p\bar{p}$ at $\sqrt{s} = 13 TeV$
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1060 additional authors not shown)
Abstract:
The charmonium production cross-section in proton-proton collisions is measured at a centre-of-mass energy of $\sqrt{s}=13\,TeV$ using decays to the $p\bar{p}$ final state. The study is performed using a data sample corresponding to an integrated luminosity of $2.2\,{fb}^{-1}$ collected in 2018 with the $LHCb$ detector. The production cross-section of the $η_c$ meson is measured in a rapidity range of $2.0 < y < 4.0$ and in a transverse momentum range of $5.0 < p_{T} < 20.0\,{GeV/\it{c}}$, which is extended compared with previous $LHCb$ analyses. The differential cross-section is measured in bins of $p_{T}$ and, for the first time, of $y$. Upper limits, at 90% and 95% confidence levels, on the $η_c(2S)$ and $h_c(1P)$ prompt production cross-sections are determined for the first time.
Submitted 19 July, 2024;
originally announced July 2024.
-
Convergence of Sinkhorn's Algorithm for Entropic Martingale Optimal Transport Problem
Authors:
Fan Chen,
Giovanni Conforti,
Zhenjie Ren,
Xiaozhen Wang
Abstract:
In this paper, we study the Entropic Martingale Optimal Transport (EMOT) problem on R. We begin by introducing the dual formulation and prove the exponential convergence of Sinkhorn's algorithm on the dual potential coefficients. Our analysis does not require prior knowledge of the optimal potential and confirms that there is no primal-dual gap. Our findings provide a theoretical guarantee for solving the EMOT problem using Sinkhorn's algorithm. In applications, our result provides insight into the calibration of stochastic volatility models, as proposed by Henry-Labordere.
Submitted 19 July, 2024;
originally announced July 2024.
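As background for this abstract, the classical Sinkhorn iteration for entropic optimal transport between two discrete marginals is sketched below. It deliberately omits the martingale constraint, so it is not the EMOT algorithm analysed in the paper. The function name, the default regularization `eps` and the iteration count are assumptions chosen for illustration.

```python
import numpy as np

def sinkhorn(mu, nu, cost, eps=0.5, n_iter=200):
    """Classical Sinkhorn scaling for entropic OT (no martingale constraint).

    mu, nu : 1-D float arrays of marginal weights, each summing to 1.
    cost   : matrix of pairwise costs with shape (len(mu), len(nu)).
    Returns the transport plan diag(u) @ K @ diag(v).
    """
    kernel = np.exp(-cost / eps)          # Gibbs kernel
    u = np.ones_like(mu)
    v = np.ones_like(nu)
    for _ in range(n_iter):
        u = mu / (kernel @ v)             # match the row marginals
        v = nu / (kernel.T @ u)           # match the column marginals
    return u[:, None] * kernel * v[None, :]
```

Each pass alternately rescales rows and columns so that the plan matches one marginal exactly; the scaling vectors `u` and `v` are the exponentials of the dual potentials, which is the object whose convergence the abstract refers to in the martingale setting.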
-
Amplitude analysis of $B^+ \to ψ(2S) K^+ π^+ π^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1092 additional authors not shown)
Abstract:
The first full amplitude analysis of $B^+ \to ψ(2S) K^+ π^+ π^-$ decays is performed using proton-proton collision data corresponding to an integrated luminosity of $9\,\text{fb}^{-1}$ recorded with the LHCb detector. The rich $K^+ π^+ π^-$ spectrum is studied and the branching fractions of the resonant substructure associated with the prominent $K_1(1270)^+$ contribution are measured. The data cannot be described by conventional strange and charmonium resonances only. An amplitude model with 53 components is developed comprising 11 hidden-charm exotic hadrons. New production mechanisms for charged charmonium-like states are observed. Significant resonant activity with spin-parity $J^P = 1^+$ in the $ψ(2S) π^+$ system is confirmed and a multi-pole structure is demonstrated. The spectral decomposition of the $ψ(2S) π^+ π^-$ invariant-mass structure, dominated by $X^0 \to ψ(2S) ρ(770)^0$ decays, broadly resembles the $J/ψφ$ spectrum observed in $B^+ \to J/ψφK^+$ decays. Exotic $ψ(2S) K^+ π^-$ resonances are observed for the first time.
Submitted 17 July, 2024;
originally announced July 2024.
-
Motion and Structure from Event-based Normal Flow
Authors:
Zhongyang Ren,
Bangyan Liao,
Delei Kong,
Jinghang Li,
Peidong Liu,
Laurent Kneip,
Guillermo Gallego,
Yi Zhou
Abstract:
Recovering the camera motion and scene geometry from visual data is a fundamental problem in the field of computer vision. Its success in standard vision is attributed to the maturity of feature extraction, data association and multi-view geometry. The recent emergence of neuromorphic event-based cameras places great demands on approaches that use raw event data as input to solve this fundamental problem. Existing state-of-the-art solutions typically infer data association implicitly by iteratively reversing the event data generation process. However, the nonlinear nature of these methods limits their applicability in real-time tasks, and the constant-motion assumption leads to unstable results under agile motion. To this end, we rethink the problem formulation in a way that aligns better with the differential working principle of event cameras. We show that the event-based normal flow can be used, via the proposed geometric error term, as an alternative to the full flow in solving a family of geometric problems that involve instantaneous first-order kinematics and scene geometry. Furthermore, we develop a fast linear solver and a continuous-time nonlinear solver on top of the proposed geometric error term. Experiments on both synthetic and real data show the superiority of our linear solver in terms of accuracy and efficiency, and indicate its complementary role as an initialization method for existing nonlinear solvers. In addition, our continuous-time nonlinear solver exhibits exceptional capability in accommodating sudden variations in motion since it does not rely on the constant-motion assumption.
Submitted 19 July, 2024; v1 submitted 16 July, 2024;
originally announced July 2024.
-
Multi-reservoir enhanced loading of tweezer atom arrays
Authors:
Xu Yan,
Chengdong He,
Kai Wen,
Zejian Ren,
Preston Tsz Fung Wong,
Elnur Hajiyev,
Gyu-Boong Jo
Abstract:
We introduce a species-independent method for improved loading into a single-atom optical tweezer array, utilizing iterative loading with multiple reservoir tweezers. Demonstrated with dual wavelength tweezer arrays of $^{88}$Sr atoms, our approach achieves a 96$\%$ loading rate after four reload cycles. This method can significantly enhance existing tweezer rearrangement protocols, potentially reducing iteration time and optical power consumption, thereby enabling a larger number of atoms in a quantum logic device.
Submitted 16 July, 2024;
originally announced July 2024.
-
Finite State Machines-Based Path-Following Collaborative Computing Strategy for Emergency UAV Swarms
Authors:
Jialin Hu,
Zhiyuan Ren,
Wenchi Cheng
Abstract:
Offloading services to UAV swarms for delay-sensitive tasks in Emergency UAV Networks (EUN) can greatly enhance rescue efficiency. Most task-offloading strategies assumed that UAVs were location-fixed and capable of handling all tasks. However, in complex disaster environments, UAV locations often change dynamically, and the heterogeneity of on-board resources presents a significant challenge in optimizing task scheduling in EUN to minimize latency. To address these problems, a Finite state machines-based Path-following Collaborative computation strategy (FPC) for emergency UAV swarms is proposed. First, an Extended Finite State Machine Space-time Graph (EFSMSG) model is constructed to accurately characterize on-board resources and state transitions while shielding the EUN dynamic characteristic. Based on the EFSMSG, a mathematical model is formulated for the FPC strategy to minimize task processing delay while facilitating computation during transmission. Finally, the Constraint Selection Adaptive Binary Particle Swarm Optimization (CSABPSO) algorithm is proposed for the solution. Simulation results demonstrate that the proposed FPC strategy effectively reduces task processing delay, meeting the requirements of delay-sensitive tasks in emergency situations.
Submitted 16 July, 2024;
originally announced July 2024.
-
Performance Analysis of Internet of Vehicles Mesh Networks Based on Actual Switch Models
Authors:
Jialin Hu,
Zhiyuan Ren,
Wenchi Cheng,
Zhiliang Shuai,
Zhao Li
Abstract:
The rapid growth of the automotive industry has exacerbated the conflict between the complex traffic environment, increasing communication demands, and limited resources. Given the imperative to mitigate traffic and network congestion, analyzing the performance of Internet of Vehicles (IoV) mesh networks is of great practical significance. Most studies focus solely on individual performance metrics and influencing factors, and the adopted simulation tools, such as OPNET, cannot achieve the dynamic link generation of IoV mesh networks. To address these problems, a network performance analysis model based on actual switches is proposed. First, a typical IoV mesh network architecture is constructed and abstracted into a mathematical model that describes how the links and topology change over time. Then, the task generation model and the task forwarding model based on actual switches are proposed to obtain the real traffic distribution of the network. Finally, a scientific network performance indicator system is constructed. Simulation results demonstrate that, with rising task traffic and decreasing node caching capacity, the packet loss rate increases, and the task arrival rate decreases in the network. The proposed model can effectively evaluate the network performance across various traffic states and provide valuable insights for network construction and enhancement.
Submitted 16 July, 2024;
originally announced July 2024.
-
Search for the rare $Λ_c^+ \to p μ^+ μ^-$ decay
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1062 additional authors not shown)
Abstract:
A search for the nonresonant $Λ_c^+ \to p μ^+ μ^-$ decay is performed using proton-proton collision data recorded at a centre-of-mass energy of 13 TeV by the LHCb experiment, corresponding to an integrated luminosity of 5.4 fb$^{-1}$. No evidence for the decay is found in the dimuon invariant-mass regions where the expected contribution of resonances is subdominant. The upper limit on the branching fraction of the $Λ_c^+ \to p μ^+ μ^-$ decay is determined to be $2.9~(3.2) \times 10^{-8}$ at 90% (95%) confidence level. The branching fractions in the dimuon invariant-mass regions dominated by the $η$, $ρ$ and $ω$ resonances are also determined.
Submitted 16 July, 2024;
originally announced July 2024.
-
Nuclear contacts of unstable nuclei
Authors:
Tongqi Liang,
Dong Bai,
Zhongzhou Ren
Abstract:
The nuclear contact is a key quantity for describing nucleon-nucleon short-range correlations (SRCs). While nuclear contacts have been determined by electron scattering experiments for selected stable nuclei, they are largely unknown for unstable nuclei. In this work, we study nuclear contacts for a number of nuclei in the vicinity of the doubly magic $^{132}$Sn from the theoretical perspective, with special emphasis on unstable nuclei. We find that the proton-proton contact generally gets suppressed by the excess neutrons for the Sn isotopes, resembling the suppression of $α$-cluster formation reported recently for the same isotopic chain [J. Tanaka $\textit{et al}.$, Science $\textbf{371}$, 260 (2021)]. This indicates a hidden universal aspect of SRCs and $α$ clustering, two different kinds of nuclear correlations. Meanwhile, a linear relation is found between the proton-proton contact and the proton number for the $N=82$ isotones. Our results can be helpful for future experimental studies of SRCs in unstable nuclei at advanced facilities worldwide.
Submitted 14 July, 2024;
originally announced July 2024.
-
Bulk high-temperature superconductivity in the high-pressure tetragonal phase of bilayer La2PrNi2O7
Authors:
Ningning Wang,
Gang Wang,
Xiaoling Shen,
Jun Hou,
Jun Luo,
Xiaoping Ma,
Huaixin Yang,
Lifen Shi,
Jie Dou,
Jie Feng,
Jie Yang,
Yunqing Shi,
Zhian Ren,
Hanming Ma,
Pengtao Yang,
Ziyi Liu,
Yue Liu,
Hua Zhang,
Xiaoli Dong,
Yuxin Wang,
Kun Jiang,
Jiangping Hu,
Stuart Calder,
Jiaqiang Yan,
Jianping Sun
, et al. (4 additional authors not shown)
Abstract:
The Ruddlesden-Popper (R-P) bilayer nickelate, La3Ni2O7, was recently found to show signatures of high-temperature superconductivity (HTSC) at pressures above 14 GPa. Subsequent investigations achieved zero resistance in single- and poly-crystalline samples under hydrostatic pressure conditions. Yet, obvious diamagnetic signals, the other hallmark of superconductors, are still lacking owing to the filamentary nature of the superconductivity, with its low superconducting volume fraction. The presence of a novel "1313" polymorph and competing R-P phases has obscured proper identification of the phase responsible for HTSC. Thus, achieving bulk HTSC and identifying the phase at play are the most prominent tasks at present. Here, we address these issues in praseodymium (Pr)-doped La2PrNi2O7 polycrystalline samples. We find that the substitution of Pr for La effectively inhibits the intergrowth of different R-P phases, resulting in a nearly pure bilayer structure. For La2PrNi2O7, a pressure-induced orthorhombic-to-tetragonal structural transition takes place at Pc ~ 11 GPa, above which HTSC emerges gradually upon further compression. The superconducting transition temperatures at 18-20 GPa reach $T_c^{\mathrm{onset}}$ = 82.5 K and $T_c^{\mathrm{zero}}$ = 60 K, which are the highest values among known nickelate superconductors. More importantly, bulk HTSC was verified by detecting clear diamagnetic signals below ~75 K, corresponding to an estimated superconducting volume fraction of ~57(5)% at 20 GPa. Our results not only resolve the existing controversies but also illuminate directions for exploring bulk HTSC in the bilayer nickelates.
Submitted 8 July, 2024;
originally announced July 2024.
-
Ferromagnetic inter-layer coupling in FeSe$_{1-x}$S$_{x}$ superconductors revealed by inelastic neutron scattering
Authors:
Mingwei Ma,
Philippe Bourges,
Yvan Sidis,
Jinzhao Sun,
Guoqing Wang,
Kazuki Iida,
Kazuya Kamazawa,
Jitae T. Park,
Frederic Bourdarot,
Zhian Ren,
Yuan Li
Abstract:
FeSe$_{1-x}$S$_{x}$ superconductors are commonly considered layered van der Waals materials with negligible inter-layer coupling. Here, using inelastic neutron scattering to study spin excitations in single-crystal samples, we reveal that the magnetic coupling between adjacent Fe layers is not only significant, as it affects excitations up to 15 meV, but also ferromagnetic in nature, making the system different from most unconventional superconductors including iron pnictides. Our observation provides a new standpoint to understand the absence of magnetic order in FeSe$_{1-x}$S$_{x}$. Since intercalating between the Fe layers is known to enhance superconductivity and suppress the inter-layer coupling, superconductivity appears to be a more robust phenomenon in the two-dimensional limit than antiferromagnetic order.
Submitted 7 July, 2024;
originally announced July 2024.
-
Wideband Coherent Microwave Conversion via Magnon Nonlinearity in Hybrid Quantum System
Authors:
Jiahao Wu,
Jiacheng Liu,
Zheyu Ren,
Man Yin Leung,
Wai Kuen Leung,
Kin On Ho,
Xiangrong Wang,
Qiming Shao,
Sen Yang
Abstract:
Frequency conversion is a widely realized physical process in nonlinear systems of optics and electronics. As an emerging nonlinear platform, spintronic devices have the potential to achieve stronger frequency conversion. Here, we demonstrate a microwave frequency conversion method in a hybrid quantum system that integrates nitrogen-vacancy centers in diamond with a CoFeB magnetic thin film. We achieve a conversion bandwidth ranging from 0.1 to 12 GHz, with frequency conversion up to the $\mathrm{25^{th}}$ order, and further show the application of this method to frequency detection and coherent qubit control. Distinct from traditional frequency conversion techniques based on nonlinear electric response, our approach employs the nonlinear magnetic response of spintronic devices. The nonlinearity, which originates from symmetry breaking such as domain walls in magnetic films, suggests that our method can be adapted to hybrid systems of other spintronic devices and spin qubits, expanding the application scope of spintronic devices and providing a promising on-chip platform for coupling quantum systems.
Submitted 3 July, 2024;
originally announced July 2024.
-
PWTO: A Heuristic Approach for Trajectory Optimization in Complex Terrains
Authors:
Yilin Cai,
Zhongqiang Ren
Abstract:
This paper considers a trajectory planning problem for a robot navigating complex terrains, which arises in applications ranging from autonomous mining vehicles to planetary rovers. The problem seeks to find a low-cost dynamically feasible trajectory for the robot. The problem is challenging as it requires solving a non-linear optimization problem that often has many local minima due to the complex terrain. To address the challenge, we propose a method called Pareto-optimal Warm-started Trajectory Optimization (PWTO) that attempts to combine the benefits of graph search and trajectory optimization, two very different approaches to planning. PWTO first creates a state lattice using simplified dynamics of the robot and leverages a multi-objective graph search method to obtain a set of paths. Each of the paths is then used to warm-start a local trajectory optimization process, so that different local minima are explored to find a globally low-cost solution. In our tests, the solution cost computed by PWTO is often less than half of the costs computed by the baselines. In addition, we verify the trajectories generated by PWTO in Gazebo simulation in complex terrains with both wheeled and quadruped robots. The code of this paper is open sourced and can be found at https://github.com/rap-lab-org/public_pwto.
Submitted 2 July, 2024;
originally announced July 2024.
-
Towards Hypernuclei from Nuclear Lattice Effective Field Theory
Authors:
Fabian Hildenbrand,
Serdar Elhatisari,
Zhengxue Ren,
Ulf-G. Meißner
Abstract:
Understanding the strong interactions within baryonic systems beyond the up and down quark sector is pivotal for a comprehensive description of nuclear forces. This study explores the interactions involving hyperons, particularly the $Λ$ particle, within the framework of nuclear lattice effective field theory (NLEFT). By incorporating $Λ$ hyperons into the NLEFT framework, we extend our investigation into the $S = -1$ sector, allowing us to probe the third dimension of the nuclear chart. We calculate the $Λ$ separation energies ($B_Λ$) of hypernuclei up to the medium-mass region, providing valuable insights into hyperon-nucleon ($YN$) and hyperon-nucleon-nucleon ($YNN$) interactions. Our calculations employ high-fidelity chiral interactions at N${}^3$LO for nucleons and extend them to $Λ$ hyperons with leading-order S-wave $YN$ interactions as well as $YNN$ forces constrained only by the $A=4,5$ systems. Our results contribute to a deeper understanding of SU(3) symmetry breaking and establish a foundation for future improvements in hypernuclear calculations.
Submitted 25 June, 2024;
originally announced June 2024.
-
One-step synthesis of Cu-doped Pb$_{10}$(PO$_{4}$)$_{6}$Cl$_{2}$ apatite: A wide-gap semiconductor
Authors:
W. Z. Yang,
Z. H. Pang,
Z. Ren
Abstract:
The recent claim of potential room-temperature superconductivity in Pb$_{10-x}$Cu$_{x}$(PO$_{4}$)$_{6}$O has attracted widespread attention. However, the signature of superconductivity was later attributed to the Cu$_{2}$S impurity formed during the multiple-step synthesis procedure. Here we report a simple one-step approach to synthesize the single-phase chloride analogue, Cu-doped Pb$_{10}$(PO$_{4}$)$_{6}$Cl$_{2}$, using PbO, PbCl$_{2}$, CuCl$_{2}$, and NH$_{4}$H$_{2}$PO$_{4}$ as starting materials. Irrespective of the initial stoichiometry, the Cu doping always leads to a lattice expansion in Pb$_{10}$(PO$_{4}$)$_{6}$Cl$_{2}$. This indicates that Cu prefers to reside in the hexagonal channels rather than substituting at the Pb site, so the chemical formula is expressed as Pb$_{10}$(PO$_{4}$)$_{6}$Cu$_{x}$Cl$_{2}$. All the Pb$_{10}$(PO$_{4}$)$_{6}$Cu$_{x}$Cl$_{2}$ (0 $\leq$ $x$ $\leq$ 1.0) samples are found to be semiconductors with wide band gaps of 4.46-4.59 eV, and the Cu-doped ones ($x$ = 0.5 and 1.0) exhibit paramagnetic behavior without any phase transition between 400 and 1.8 K. Our study calls for a reinvestigation of the Cu location in Pb$_{10-x}$Cu$_{x}$(PO$_{4}$)$_{6}$O, and supports the absence of superconductivity in this oxyapatite.
Submitted 24 June, 2024;
originally announced June 2024.
-
Probing the nature of the $χ_{c1}(3872)$ state using radiative decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1094 additional authors not shown)
Abstract:
The radiative decays $χ_{c1}(3872)\rightarrow ψ(2S)γ$ and $χ_{c1}(3872)\rightarrow J/ψγ$ are used to probe the nature of the $χ_{c1}(3872)$ state using proton-proton collision data collected with the LHCb detector, corresponding to an integrated luminosity of 9 fb$^{-1}$. Using the $B^+\rightarrow χ_{c1}(3872)K^+$ decay, the $χ_{c1}(3872)\rightarrow ψ(2S)γ$ process is observed for the first time and the ratio of its partial width to that of the $χ_{c1}(3872)\rightarrow J/ψγ$ decay is measured to be $$ \frac{Γ_{χ_{c1}(3872)\rightarrow ψ(2S)γ}}{Γ_{χ_{c1}(3872)\rightarrow J/ψγ}} = 1.67 \pm 0.21 \pm 0.12 \pm 0.04 , $$ where the first uncertainty is statistical, the second systematic and the third is due to the uncertainties on the branching fractions of the $ψ(2S)$ and $J/ψ$ mesons. The measured ratio makes the interpretation of the $χ_{c1}(3872)$ state as a pure $D^0\bar{D}^{*0}+\bar{D}^0D^{*0}$ molecule questionable and strongly indicates a sizeable compact charmonium or tetraquark component within the $χ_{c1}(3872)$ state.
Submitted 24 June, 2024;
originally announced June 2024.
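As a rough worked example of reading the ratio quoted above, the three uncertainties can be combined into a single figure by adding them in quadrature. This assumes the three sources are independent, which is an assumption made for illustration and is not stated in the abstract; the symbol $\sigma_{\mathrm{tot}}$ is likewise introduced here.

```latex
\sigma_{\mathrm{tot}}
  = \sqrt{0.21^2 + 0.12^2 + 0.04^2}
  = \sqrt{0.0441 + 0.0144 + 0.0016}
  = \sqrt{0.0601}
  \approx 0.25
```

Under that assumption the measurement would read roughly $1.67 \pm 0.25$, with the statistical term dominating the total.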
-
Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy
Authors:
Chen Wang,
Kaiyi Ji,
Junyi Geng,
Zhongqiang Ren,
Taimeng Fu,
Fan Yang,
Yifan Guo,
Haonan He,
Xiangyu Chen,
Zitong Zhan,
Qiwei Du,
Shaoshu Su,
Bowen Li,
Yuheng Qiu,
Yi Du,
Qihang Li,
Yifan Yang,
Xiao Lin,
Zhipeng Zhao
Abstract:
Data-driven methods such as reinforcement and imitation learning have achieved remarkable success in robot autonomy. However, their data-centric nature still hinders them from generalizing well to ever-changing environments. Moreover, collecting large datasets for robotic tasks is often impractical and expensive. To overcome these challenges, we introduce a new self-supervised neural-symbolic (NeSy) computational framework, imperative learning (IL), for robot autonomy, leveraging the generalization abilities of symbolic reasoning. The framework of IL consists of three primary components: a neural module, a reasoning engine, and a memory system. We formulate IL as a special bilevel optimization (BLO), which enables reciprocal learning over the three modules. This overcomes the label-intensive obstacles associated with data-driven approaches and takes advantage of symbolic reasoning concerning logical reasoning, physical principles, geometric analysis, etc. We discuss several optimization techniques for IL and verify their effectiveness in five distinct robot autonomy tasks including path planning, rule induction, optimal control, visual odometry, and multi-robot routing. Through various experiments, we show that IL can significantly enhance robot autonomy capabilities and we anticipate that it will catalyze further research across diverse domains.
Submitted 6 August, 2024; v1 submitted 23 June, 2024;
originally announced June 2024.
-
Speech Emotion Recognition under Resource Constraints with Data Distillation
Authors:
Yi Chang,
Zhao Ren,
Zhonghao Zhao,
Thanh Tam Nguyen,
Kun Qian,
Tanja Schultz,
Björn W. Schuller
Abstract:
Speech emotion recognition (SER) plays a crucial role in human-computer interaction. The emergence of edge devices in the Internet of Things (IoT) presents challenges in constructing intricate deep learning models due to constraints in memory and computational resources. Moreover, emotional speech data often contains private information, raising concerns about privacy leakage during the deployment of SER models. To address these challenges, we propose a data distillation framework to facilitate efficient development of SER models in IoT applications using a synthesised, smaller, and distilled dataset. Our experiments demonstrate that the distilled dataset can be effectively utilised to train SER models with fixed initialisation, achieving performances comparable to those developed using the original full emotional speech dataset.
Submitted 21 June, 2024;
originally announced June 2024.
-
Evidence for Three-$α$ Breathing Modes Uncovered by Control Neural Network
Authors:
Zheng Cheng,
Mengjiao Lyu,
Takayuki Myo,
Hisashi Horiuchi,
Hiroshi Toki,
Zhongzhou Ren,
Masahiro Isaka,
Mengyun Mao,
Hiroki Takemoto,
Niu Wan,
Wenlong You,
Qing Zhao
Abstract:
This work introduces a new Control Neural Network (Ctrl.NN) method to uncover evidence of an exotic quantum state, \textit{i.e.}, the breathing modes in the 3-$α$ resonant states of the $^{12}$C nucleus. We provide the most precise microscopic description to date for the $^{12}$C energy spectrum, identify two new exotic breathing states, and uncover strong evidence that directly connects the recent experimental observations to the breathing modes. The Ctrl.NN method significantly simplifies numerical calculations of quantum systems under multiple constraints and offers a new perspective for solving the nuclear many-body problem.
Submitted 21 June, 2024;
originally announced June 2024.
-
Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop Question Answering
Authors:
Zhengliang Shi,
Shuo Zhang,
Weiwei Sun,
Shen Gao,
Pengjie Ren,
Zhumin Chen,
Zhaochun Ren
Abstract:
Multi-Hop Question Answering (MHQA) tasks present a significant challenge for large language models (LLMs) due to the intensive knowledge required. Current solutions, like Retrieval-Augmented Generation, typically retrieve potential documents from an external corpus to read an answer. However, the performance of this retrieve-then-read paradigm is constrained by the retriever and the inevitable noise in the retrieved documents. To mitigate these challenges, we introduce a novel generate-then-ground (GenGround) framework, synergizing the parametric knowledge of LLMs and external documents to solve a multi-hop question. GenGround empowers LLMs to alternate two phases until the final answer is derived: (1) formulate a simpler, single-hop question and directly generate the answer; (2) ground the question-answer pair in retrieved documents, amending any wrong predictions in the answer. We also propose an instructional grounding distillation method to generalize our method into smaller models. Extensive experiments conducted on four datasets illustrate the superiority of our method.
Submitted 21 June, 2024;
originally announced June 2024.
-
Q-SNNs: Quantized Spiking Neural Networks
Authors:
Wenjie Wei,
Yu Liang,
Ammar Belatreche,
Yichen Xiao,
Honglin Cao,
Zhenbang Ren,
Guoqing Wang,
Malu Zhang,
Yang Yang
Abstract:
Brain-inspired Spiking Neural Networks (SNNs) leverage sparse spikes to represent information and process them in an asynchronous event-driven manner, offering an energy-efficient paradigm for the next generation of machine intelligence. However, the current focus within the SNN community prioritizes accuracy optimization through the development of large-scale models, limiting their viability in resource-constrained and low-power edge devices. To address this challenge, we introduce a lightweight and hardware-friendly Quantized SNN (Q-SNN) that applies quantization to both synaptic weights and membrane potentials. By significantly compressing these two key elements, the proposed Q-SNNs substantially reduce both memory usage and computational complexity. Moreover, to prevent the performance degradation caused by this compression, we present a new Weight-Spike Dual Regulation (WS-DR) method inspired by information entropy theory. Experimental evaluations on various datasets, including static and neuromorphic, demonstrate that our Q-SNNs outperform existing methods in terms of both model size and accuracy. These state-of-the-art results in efficiency and efficacy suggest that the proposed method can significantly improve edge intelligent computing.
Submitted 19 June, 2024;
originally announced June 2024.
-
Ask-before-Plan: Proactive Language Agents for Real-World Planning
Authors:
Xuan Zhang,
Yang Deng,
Zifeng Ren,
See-Kiong Ng,
Tat-Seng Chua
Abstract:
The evolution of large language models (LLMs) has enhanced the planning capabilities of language agents in diverse real-world scenarios. Despite these advancements, the potential of LLM-powered agents to comprehend ambiguous user instructions for reasoning and decision-making is still under exploration. In this work, we introduce a new task, Proactive Agent Planning, which requires language agents to predict clarification needs based on user-agent conversation and agent-environment interaction, invoke external tools to collect valid information, and generate a plan to fulfill the user's demands. To study this practical problem, we establish a new benchmark dataset, Ask-before-Plan. To tackle the deficiency of LLMs in proactive planning, we propose a novel multi-agent framework, Clarification-Execution-Planning (\texttt{CEP}), which consists of three agents specialized in clarification, execution, and planning. We introduce the trajectory tuning scheme for the clarification agent and static execution agent, as well as the memory recollection mechanism for the dynamic execution agent. Extensive evaluations and comprehensive analyses conducted on the Ask-before-Plan dataset validate the effectiveness of our proposed framework.
Submitted 18 June, 2024;
originally announced June 2024.
-
Precision measurement of the $Ξ^-_b$ baryon lifetime
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1064 additional authors not shown)
Abstract:
A sample of $pp$ collision data, corresponding to an integrated luminosity of 5.5 fb$^{-1}$ and collected by the LHCb experiment during Run 2, is used to measure the ratio of the lifetime of the $Ξ^-_b$ baryon to that of the $Λ^0_b$ baryon, $r_τ\equivτ_{Ξ^-_b}/τ_{Λ^0_b}$. The value ${r_τ^{\rm Run\,2}=1.076\pm0.013\pm0.006}$ is obtained, where the first uncertainty is statistical and the second systematic. This value is averaged with the corresponding value from Run 1 to obtain ${r_τ^{\rm Run\,1,2} = 1.078\pm0.012\pm0.007}$. Multiplying by the world-average value of the $Λ^0_b$ lifetime yields $τ_{Ξ^-_b}^{\rm Run~1,2} = 1.578\pm0.018\pm0.010\pm0.011$ ps, where the uncertainties are statistical, systematic, and due to the limited knowledge of the $Λ^0_b$ lifetime. This measurement improves the precision of the current world average of the $Ξ^-_b$ lifetime by about a factor of two, and is in good agreement with the most recent theoretical predictions.
Submitted 17 June, 2024;
originally announced June 2024.
-
Propagative Distance Optimization for Constrained Inverse Kinematics
Authors:
Yu Chen,
Yilin Cai,
Jinyun Xu,
Zhongqiang Ren,
Guanya Shi,
Howie Choset
Abstract:
This paper investigates a constrained inverse kinematic (IK) problem that seeks a feasible configuration of an articulated robot under various constraints such as joint limits and obstacle collision avoidance. Due to the high-dimensionality and complex constraints, this problem is often solved numerically via iterative local optimization. Classic local optimization methods take joint angles as the decision variable, which suffers from non-linearity caused by the trigonometric constraints. Recently, distance-based IK methods have been developed as an alternative approach that formulates IK as an optimization over the distances among points attached to the robot and the obstacles. Although distance-based methods have demonstrated unique advantages, they still suffer from low computational efficiency, since these approaches usually ignore the chain structure in the kinematics of serial robots. This paper proposes a new method called propagative distance optimization for constrained inverse kinematics (PDO-IK), which captures and leverages the chain structure in the distance-based formulation and expedites the optimization by computing forward kinematics and the Jacobian propagatively along the kinematic chain. Test results show that PDO-IK runs up to two orders of magnitude faster than the existing distance-based methods under joint limits constraints and obstacle avoidance constraints. It also achieves up to three times higher success rates than the conventional joint-angle-based optimization methods for IK problems. The high runtime efficiency of PDO-IK allows the real-time computation (10$-$1500 Hz) and enables a simulated humanoid robot with 19 degrees of freedom (DoFs) to avoid moving obstacles, which is otherwise hard to achieve with the baselines.
Submitted 17 June, 2024;
originally announced June 2024.
-
NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows
Authors:
Zhenggang Tang,
Zhongzheng Ren,
Xiaoming Zhao,
Bowen Wen,
Jonathan Tremblay,
Stan Birchfield,
Alexander Schwing
Abstract:
We present a method for automatically modifying a NeRF representation based on a single observation of a non-rigid transformed version of the original scene. Our method defines the transformation as a 3D flow, specifically as a weighted linear blending of rigid transformations of 3D anchor points that are defined on the surface of the scene. In order to identify anchor points, we introduce a novel correspondence algorithm that first matches RGB-based pairs, then leverages multi-view information and 3D reprojection to robustly filter false positives in two steps. We also introduce a new dataset for exploring the problem of modifying a NeRF scene through a single observation. Our dataset ( https://github.com/nerfdeformer/nerfdeformer ) contains 113 synthetic scenes leveraging 47 3D assets. We show that our proposed method outperforms NeRF editing methods as well as diffusion-based methods, and we also explore different methods for filtering correspondences.
Submitted 15 June, 2024;
originally announced June 2024.
-
OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control
Authors:
Yuzhong Huang,
Zhong Li,
Zhang Chen,
Zhiyuan Ren,
Guosheng Lin,
Fred Morstatter,
Yi Xu
Abstract:
In the evolving landscape of text-to-3D technology, Dreamfusion has showcased its proficiency by utilizing Score Distillation Sampling (SDS) to optimize implicit representations such as NeRF. This process is achieved through the distillation of pretrained large-scale text-to-image diffusion models. However, Dreamfusion encounters fidelity and efficiency constraints: it faces the multi-head Janus issue and exhibits a relatively slow optimization process. To circumvent these challenges, we introduce OrientDream, a camera orientation conditioned framework designed for efficient and multi-view consistent 3D generation from textual prompts. Our strategy emphasizes the implementation of an explicit camera orientation conditioned feature in the pre-training of a 2D text-to-image diffusion module. This feature effectively utilizes data from MVImgNet, an extensive external multi-view dataset, to refine and bolster its functionality. Subsequently, we utilize the pre-conditioned 2D images as a basis for optimizing a randomly initialized implicit representation (NeRF). This process is significantly expedited by a decoupled back-propagation technique, allowing for multiple updates of implicit parameters per optimization cycle. Our experiments reveal that our method not only produces high-quality NeRF models with consistent multi-view properties but also achieves an optimization speed significantly greater than existing methods, as quantified by comparative metrics.
Submitted 14 June, 2024;
originally announced June 2024.
-
Manipulating magnetism and transport properties of EuCd$_2$P$_2$ with a low carrier concentration
Authors:
Xiyu Chen,
Ziwen Wang,
Zhiyu Zhou,
Wuzhang Yang,
Yi Liu,
Jia-Yi Lu,
Zhi Ren,
Guang-Han Cao,
Fazel Tafti,
Shuai Dong,
Zhi-Cheng Wang
Abstract:
Materials that exhibit strongly coupled magnetic order and electronic properties are crucial for both fundamental research and technological applications. However, finding a material that not only shows remarkable magnetoresistive responses but also has an easily tunable ground state remains a challenge. Here, we report successful manipulation of the magnetic and transport properties of EuCd$_2$P$_2$, which is transformed from an A-type antiferromagnet ($T_\mathrm{N}$ = 11 K) exhibiting colossal magnetoresistance into a ferromagnet ($T_\mathrm{C}$ = 47 K) with metallic behavior. The dramatic alteration results from a low hole concentration of $10^{19}$ cm$^{-3}$ induced by changing the growth conditions. Electronic structure and total energy calculations confirm the tunability of magnetism with a small carrier concentration for EuCd$_2$P$_2$. It is feasible to switch between the magnetic states by using field-effect to control the carrier density, thereby changing the magneto-electronic response. The controllable magnetism and electrical transport of EuCd$_2$P$_2$ make it a potential candidate for spintronics.
Submitted 9 June, 2024;
originally announced June 2024.
-
Carrier-induced transition from antiferromagnetic insulator to ferromagnetic metal in the layered phosphide EuZn$_2$P$_2$
Authors:
Xiyu Chen,
Wuzhang Yang,
Jia-Yi Lu,
Zhiyu Zhou,
Zhi Ren,
Guang-Han Cao,
Shuai Dong,
Zhi-Cheng Wang
Abstract:
EuZn$_2$P$_2$ was reported to be an insulating antiferromagnet with a $T_\mathrm{N}$ of 23.5 K. In this study, single crystals of EuZn$_2$P$_2$ exhibiting metallic behavior and ferromagnetic order below $T_\mathrm{C}$ = 72 K are successfully synthesized via a salt flux method. The presence of hole carriers induced by the Eu vacancies in the lattice is found to be crucial for the drastic changes in magnetism and electrical transport. The carriers mediate the interlayer ferromagnetic interaction, and the coupling strength is directly related to $T_\mathrm{C}$, as evidenced by the linear dependence of $T_\mathrm{C}$ and the fitted Curie-Weiss temperatures on the Eu-layer distances for ferromagnetic Eu$M_2X_2$ ($M$ = Zn, Cd; $X$ = P, As). The ferromagnetic EuZn$_2$P$_2$ shows conspicuous negative magnetoresistance (MR) near $T_\mathrm{C}$, owing to strong magnetic scattering. The MR behavior is consistent with the Majumdar-Littlewood model, indicating that the MR can be enhanced by decreasing the carrier density. Our findings suggest that Eu$M_2X_2$ has highly tunable magnetism and charge transport, making it a promising material family for potential applications in spintronics.
Submitted 9 June, 2024;
originally announced June 2024.
-
OAM-SWIPT for IoE-Driven 6G
Authors:
Runyu Lyu,
Wenchi Cheng,
Bazhong Shen,
Zhiyuan Ren,
Hailin Zhang
Abstract:
Simultaneous wireless information and power transfer (SWIPT), which achieves both wireless energy transfer (WET) and information transfer, is an attractive technique for the future Internet of Everything (IoE) in sixth-generation (6G) mobile communications. With SWIPT, battery-less IoE devices can be powered while communicating with other devices. Line-of-sight (LOS) RF transmission and near-field inductive-coupling-based transmission are typical SWIPT scenarios; both are LOS channels that lack enough degrees of freedom for high spectrum efficiency and high energy efficiency. Owing to its orthogonal wavefronts, orbital angular momentum (OAM) can facilitate SWIPT in LOS channels. In this article, we introduce OAM-based SWIPT and discuss some of its basic advantages and challenges. After introducing OAM-based SWIPT for IoE, we first propose an OAM-based SWIPT system model with OAM-mode-assisted dynamic power splitting (DPS). Then, four basic advantages of OAM-based SWIPT are reviewed, with numerical analyses that further demonstrate them. Next, four challenges in integrating OAM into SWIPT are discussed, along with possible solutions. OAM technology provides multiple orthogonal streams to increase both spectrum and energy efficiency for SWIPT, thus creating many opportunities for future WET and SWIPT research.
Submitted 8 June, 2024;
originally announced June 2024.
-
Ubiquitous Flat Bands in a Cr-based Kagome Superconductor
Authors:
Yucheng Guo,
Zehao Wang,
Fang Xie,
Yuefei Huang,
Bin Gao,
Ji Seop Oh,
Han Wu,
Zhaoyu Liu,
Zheng Ren,
Yuan Fang,
Ananya Biswas,
Yichen Zhang,
Ziqin Yue,
Cheng Hu,
Chris Jozwiak,
Aaron Bostwick,
Eli Rotenberg,
Makoto Hashimoto,
Donghui Lu,
Junichiro Kono,
Jiun-Haw Chu,
Boris I Yakobson,
Robert J Birgeneau,
Qimiao Si,
Pengcheng Dai
, et al. (1 additional author not shown)
Abstract:
In the quest for novel quantum states driven by topology and correlation, kagome lattice materials have garnered significant interest due to their distinctive electronic band structures, featuring flat bands (FBs) arising from the quantum destructive interference of the electronic wave function. The tuning of the FBs to the chemical potential would lead to the possibility of liberating electronic instabilities that lead to emergent electronic orders. Despite extensive studies, direct evidence of FBs tuned to the chemical potential and of their participation in emergent electronic orders has been lacking in bulk quantum materials. Here, using a combination of Angle-Resolved Photoemission Spectroscopy (ARPES) and Density Functional Theory (DFT), we reveal that the low-energy electronic structure of the recently discovered Cr-based kagome metal superconductor CsCr$_3$Sb$_5$ is dominated by a pervasive FB in close proximity to, and below, the Fermi level. A comparative analysis with orbital-projected DFT and polarization-dependent measurements reveals that an orbital-selective renormalization mechanism is needed to reconcile the discrepancy with the DFT calculations, which predict the FB to appear 200 meV above the Fermi level. Furthermore, we observe the FB to shift away from the Fermi level by 20 meV in the low-temperature density wave-ordered phase, highlighting the role of the FB in the emergent electronic order. Our results show that CsCr$_3$Sb$_5$ stands out as a promising platform for further exploration into the effects of FBs near the Fermi level on kagome lattices, and their role in emergent orders in bulk quantum materials.
Submitted 12 June, 2024; v1 submitted 7 June, 2024;
originally announced June 2024.
-
MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter
Authors:
Jitai Hao,
WeiWei Sun,
Xin Xin,
Qi Meng,
Zhumin Chen,
Pengjie Ren,
Zhaochun Ren
Abstract:
Parameter-Efficient Fine-tuning (PEFT) facilitates the fine-tuning of Large Language Models (LLMs) under limited resources. However, the fine-tuning performance with PEFT on complex, knowledge-intensive tasks is limited due to the constrained model capacity, which originates from the limited number of additional trainable parameters. To overcome this limitation, we introduce a novel mechanism that fine-tunes LLMs with adapters that are larger yet memory-efficient. This is achieved by leveraging the inherent activation sparsity in the Feed-Forward Networks (FFNs) of LLMs and utilizing the larger capacity of Central Processing Unit (CPU) memory compared to Graphics Processing Unit (GPU) memory. We store and update the parameters of larger adapters on the CPU. Moreover, we employ a Mixture of Experts (MoE)-like architecture to mitigate unnecessary CPU computations and reduce the communication volume between the GPU and CPU. This is particularly beneficial over the limited bandwidth of PCI Express (PCIe). Our method can achieve fine-tuning results comparable to those obtained with larger memory capacities, even when operating under more limited resources such as a single GPU with 24 GB of memory, with an acceptable loss in training efficiency. Our code is available at https://github.com/CURRENTF/MEFT.
Submitted 7 June, 2024;
originally announced June 2024.
-
Measurement of the branching fraction ratios $R(D^{+})$ and $R(D^{*+})$ using muonic $τ$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1063 additional authors not shown)
Abstract:
The branching fraction ratios of $\overline{B}^0\to D^+τ^-\overlineν_τ$ and $\overline{B}^0\to D^{*+}τ^-\overlineν_τ$ decays are measured with respect to their muonic counterparts, using a data sample corresponding to an integrated luminosity of 2.0 fb$^{-1}$ collected by the LHCb experiment in proton-proton collisions at $\sqrt{s} = 13$ TeV. The reconstructed final states are formed by combining $D^+$ mesons with $τ^-\toμ^-\overlineν_μν_τ$ candidates, where the $D^+$ is reconstructed via the $D^+\to K^-π^+π^+$ decay. The results are
\begin{align*}
R(D^{+}) &= 0.249 \pm 0.043 \pm 0.047, \\
R(D^{*+}) &= 0.402 \pm 0.081\pm 0.085,
\end{align*}
where the first uncertainties are statistical and the second systematic. The two measurements have a correlation coefficient of $-0.39$ and are compatible with the Standard Model.
Submitted 5 June, 2024;
originally announced June 2024.
-
Observation of new charmonium(-like) states in $B^+ \to D^{*\pm} D^{\mp} K^+$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1062 additional authors not shown)
Abstract:
A study of resonant structures in $B^{+}\rightarrow{D^{\ast+}D^{-}K^{+}}$ and $B^{+}\rightarrow{D^{\ast-}D^{+}K^{+}}$ decays is performed, using proton-proton collision data at centre-of-mass energies of $\sqrt{s}=7, 8$, and $13$ TeV recorded by the LHCb experiment, corresponding to an integrated luminosity of 9 fb$^{-1}$. A simultaneous amplitude fit is performed to the two channels with contributions from resonances decaying to $D^{\ast-}D^{+}$ and $D^{\ast+}D^{-}$ states linked by $C$ parity. This procedure allows the $C$-parities of resonances in the $D^{\ast\pm}D^{\mp}$ mass spectra to be determined. Four charmonium(-like) states are observed decaying into $D^{\ast\pm}D^{\mp}$: $η_c(3945)$, $h_c(4000)$, $χ_{c1}(4010)$ and $h_c(4300)$, with quantum numbers $J^{PC}$ equal to $0^{-+}$, $1^{+-}$, $1^{++}$ and $1^{+-}$, respectively. At least three of these states have not been observed previously. In addition, the existence of the $T_{\bar{c}\bar{s}0}^{*}(2870)^{0}$ and $T_{\bar{c}\bar{s}1}^{*}(2900)^{0}$ resonances in the $D^-K^+$ mass spectrum, already observed in the $B^+ \to D^+ D^- K^+$ decay, is confirmed in a different production channel.
Submitted 5 June, 2024;
originally announced June 2024.
-
E-ICL: Enhancing Fine-Grained Emotion Recognition through the Lens of Prototype Theory
Authors:
Zhou Yang,
Zhaochun Ren,
Chenglong Ye,
Yufeng Wang,
Haizhou Sun,
Chao Chen,
Xiaofei Zhu,
Yunbing Wu,
Xiangwen Liao
Abstract:
In-context learning (ICL) achieves remarkable performance in various domains such as knowledge acquisition, commonsense reasoning, and semantic understanding. However, its performance significantly deteriorates for emotion detection tasks, especially fine-grained emotion recognition. The underlying reasons for this remain unclear. In this paper, we identify the reasons behind ICL's poor performance from the perspective of prototype theory and propose a method to address this issue. Specifically, we conduct extensive pilot experiments and find that ICL conforms to the prototype theory on fine-grained emotion recognition. Based on this theory, we uncover the following deficiencies in ICL: (1) It relies on prototypes (example-label pairs) that are semantically similar but emotionally inaccurate to predict emotions. (2) It is prone to interference from irrelevant categories, affecting the accuracy and robustness of the predictions. To address these issues, we propose an Emotion Context Learning method (E-ICL) on fine-grained emotion recognition. E-ICL relies on more emotionally accurate prototypes to predict categories by referring to emotionally similar examples with dynamic labels. Simultaneously, E-ICL employs an exclusionary emotion prediction strategy to avoid interference from irrelevant categories, thereby increasing its accuracy and robustness. Note that the entire process is accomplished with the assistance of a plug-and-play emotion auxiliary model, without additional training. Experiments on the fine-grained emotion datasets EDOS, Empathetic-Dialogues, EmpatheticIntent, and GoEmotions show that E-ICL achieves superior emotion prediction performance. Furthermore, even when the emotion auxiliary model used is lower than 10% of the LLMs, E-ICL can still boost the performance of LLMs by over 4% on multiple datasets.
Submitted 4 June, 2024;
originally announced June 2024.
-
Full-Atom Peptide Design based on Multi-modal Flow Matching
Authors:
Jiahan Li,
Chaoran Cheng,
Zuofan Wu,
Ruihan Guo,
Shitong Luo,
Zhizhou Ren,
Jian Peng,
Jianzhu Ma
Abstract:
Peptides, short chains of amino acid residues, play a vital role in numerous biological processes by interacting with other target molecules, offering substantial potential in drug discovery. In this work, we present PepFlow, the first multi-modal deep generative model grounded in the flow-matching framework for the design of full-atom peptides that target specific protein receptors. Drawing inspiration from the crucial roles of residue backbone orientations and side-chain dynamics in protein-peptide interactions, we characterize the peptide structure using rigid backbone frames within the $\mathrm{SE}(3)$ manifold and side-chain angles on high-dimensional tori. Furthermore, we represent discrete residue types in the peptide sequence as categorical distributions on the probability simplex. By learning the joint distributions of each modality using derived flows and vector fields on corresponding manifolds, our method excels in the fine-grained design of full-atom peptides. Harnessing the multi-modal paradigm, our approach adeptly tackles various tasks such as fix-backbone sequence design and side-chain packing through partial sampling. Through meticulously crafted experiments, we demonstrate that PepFlow exhibits superior performance in comprehensive benchmarks, highlighting its significant potential in computational peptide design and analysis.
Submitted 2 June, 2024;
originally announced June 2024.