Search | arXiv e-print repository

EXAONE 3.0 7.8B Instruction Tuned Language Model

Authors: LG AI Research, :, Soyoung An, Kyunghoon Bae, Eunbi Choi, Stanley Jungkyu Choi, Yemuk Choi, Seokhee Hong, Yeonjung Hong, Junwon Hwang, Hyojin Jeon, Gerrard Jeongwon Jo, Hyunjik Jo, Jiyeon Jung, Yountae Jung, Euisoon Kim, Hyosang Kim, Joonkee Kim, Seonghwan Kim, Soyeon Kim, Sunkyoung Kim, Yireun Kim, Youchul Kim, Edward Hwayoung Lee, Haeju Lee , et al. (14 additional authors not shown)

Abstract: We introduce EXAONE 3.0 instruction-tuned language model, the first open model in the family of Large Language Models (LLMs) developed by LG AI Research. Among different model sizes, we publicly release the 7.8B instruction-tuned model to promote open research and innovations. Through extensive evaluations across a wide range of public and in-house benchmarks, EXAONE 3.0 demonstrates highly compet… ▽ More We introduce EXAONE 3.0 instruction-tuned language model, the first open model in the family of Large Language Models (LLMs) developed by LG AI Research. Among different model sizes, we publicly release the 7.8B instruction-tuned model to promote open research and innovations. Through extensive evaluations across a wide range of public and in-house benchmarks, EXAONE 3.0 demonstrates highly competitive real-world performance with instruction-following capability against other state-of-the-art open models of similar size. Our comparative analysis shows that EXAONE 3.0 excels particularly in Korean, while achieving compelling performance across general tasks and complex reasoning. With its strong real-world effectiveness and bilingual proficiency, we hope that EXAONE keeps contributing to advancements in Expert AI. Our EXAONE 3.0 instruction-tuned model is available at https://huggingface.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct △ Less

Submitted 8 August, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

arXiv:2407.15778 [pdf, other]

Violating Bell's inequality in gate-defined quantum dots

Authors: Paul Steinacker, Tuomo Tanttu, Wee Han Lim, Nard Dumoulin Stuyck, MengKe Feng, Santiago Serrano, Ensar Vahapoglu, Rocky Y. Su, Jonathan Y. Huang, Cameron Jones, Kohei M. Itoh, Fay E. Hudson, Christopher C. Escott, Andrea Morello, Andre Saraiva, Chih Hwan Yang, Andrew S. Dzurak, Arne Laucht

Abstract: The superior computational power promised by quantum computers utilises the fundamental quantum mechanical principle of entanglement. However, achieving entanglement and verifying that the generated state does not follow the principle of local causality has proven difficult for spin qubits in gate-defined quantum dots, as it requires simultaneously high concurrence values and readout fidelities to… ▽ More The superior computational power promised by quantum computers utilises the fundamental quantum mechanical principle of entanglement. However, achieving entanglement and verifying that the generated state does not follow the principle of local causality has proven difficult for spin qubits in gate-defined quantum dots, as it requires simultaneously high concurrence values and readout fidelities to break the classical bound imposed by Bell's inequality. Here we employ advanced operational protocols for spin qubits in silicon, such as heralded initialization and calibration via gate set tomography (GST), to reduce all relevant errors and push the fidelities of the full 2-qubit gate set above 99%. We demonstrate a 97.17% Bell state fidelity without correcting for readout errors and violate Bell's inequality with a Bell signal of S = 2.731 close to the theoretical maximum of 2{\sqrt{2}}. Our measurements exceed the classical limit even at elevated temperatures of 1.1K or entanglement lifetimes of 100 μみゅーs. △ Less

Submitted 22 July, 2024; originally announced July 2024.

Comments: 19 pages, 5 main figures, 9 extended data figures

MSC Class: 81P68; 81-05

arXiv:2407.15151 [pdf, other]

Spin Qubits with Scalable milli-kelvin CMOS Control

Authors: Samuel K. Bartee, Will Gilbert, Kun Zuo, Kushal Das, Tuomo Tanttu, Chih Hwan Yang, Nard Dumoulin Stuyck, Sebastian J. Pauka, Rocky Y. Su, Wee Han Lim, Santiago Serrano, Christopher C. Escott, Fay E. Hudson, Kohei M. Itoh, Arne Laucht, Andrew S. Dzurak, David J. Reilly

Abstract: A key virtue of spin qubits is their sub-micron footprint, enabling a single silicon chip to host the millions of qubits required to execute useful quantum algorithms with error correction. With each physical qubit needing multiple control lines however, a fundamental barrier to scale is the extreme density of connections that bridge quantum devices to their external control and readout hardware.… ▽ More A key virtue of spin qubits is their sub-micron footprint, enabling a single silicon chip to host the millions of qubits required to execute useful quantum algorithms with error correction. With each physical qubit needing multiple control lines however, a fundamental barrier to scale is the extreme density of connections that bridge quantum devices to their external control and readout hardware. A promising solution is to co-locate the control system proximal to the qubit platform at milli-kelvin temperatures, wired-up via miniaturized interconnects. Even so, heat and crosstalk from closely integrated control have potential to degrade qubit performance, particularly for two-qubit entangling gates based on exchange coupling that are sensitive to electrical noise. Here, we benchmark silicon MOS-style electron spin qubits controlled via heterogeneously-integrated cryo-CMOS circuits with a low enough power density to enable scale-up. Demonstrating that cryo-CMOS can efficiently enable universal logic operations for spin qubits, we go on to show that mill-kelvin control has little impact on the performance of single- and two-qubit gates. Given the complexity of our milli-kelvin CMOS platform, with some 100-thousand transistors, these results open the prospect of scalable control based on the tight packaging of spin qubits with a chiplet style control architecture. △ Less

Submitted 21 July, 2024; originally announced July 2024.

arXiv:2407.09514 [pdf]

Machine Learning Based Prediction of Proton Conductivity in Metal-Organic Frameworks

Authors: Seunghee Han, Byeong Gwan Lee, Dae Woon Lim, Jihan Kim

Abstract: Recently, metal-organic frameworks (MOFs) have demonstrated their potential as solid-state electrolytes in proton exchange membrane fuel cells. However, the number of MOFs reported to exhibit proton conductivity remains limited, and the mechanisms underlying this phenomenon are not fully elucidated, complicating the design of proton-conductive MOFs. In response, we developed a comprehensive databa… ▽ More Recently, metal-organic frameworks (MOFs) have demonstrated their potential as solid-state electrolytes in proton exchange membrane fuel cells. However, the number of MOFs reported to exhibit proton conductivity remains limited, and the mechanisms underlying this phenomenon are not fully elucidated, complicating the design of proton-conductive MOFs. In response, we developed a comprehensive database of proton-conductive MOFs and applied machine learning techniques to predict their proton conductivity. Our approach included the construction of both descriptor-based and transformer-based models. Notably, the transformer-based transfer learning (Freeze) model performed the best with a mean absolute error (MAE) of 0.91, suggesting that the proton conductivity of MOFs can be estimated within one order of magnitude using this model. Additionally, we employed feature importance and principal component analysis to explore the factors influencing proton conductivity. The insights gained from our database and machine learning model are expected to facilitate the targeted design of proton-conductive MOFs. △ Less

Submitted 17 July, 2024; v1 submitted 18 June, 2024; originally announced July 2024.

arXiv:2407.04903 [pdf, other]

MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension

Authors: Zekun Li, Xianjun Yang, Kyuri Choi, Wanrong Zhu, Ryan Hsieh, HyeonJung Kim, Jin Hyuk Lim, Sungyoung Ji, Byungju Lee, Xifeng Yan, Linda Ruth Petzold, Stephen D. Wilson, Woosang Lim, William Yang Wang

Abstract: The rapid advancement of Large Language Models (LLMs) and Large Multimodal Models (LMMs) has heightened the demand for AI-based scientific assistants capable of understanding scientific articles and figures. Despite progress, there remains a significant gap in evaluating models' comprehension of professional, graduate-level, and even PhD-level scientific content. Current datasets and benchmarks pr… ▽ More The rapid advancement of Large Language Models (LLMs) and Large Multimodal Models (LMMs) has heightened the demand for AI-based scientific assistants capable of understanding scientific articles and figures. Despite progress, there remains a significant gap in evaluating models' comprehension of professional, graduate-level, and even PhD-level scientific content. Current datasets and benchmarks primarily focus on relatively simple scientific tasks and figures, lacking comprehensive assessments across diverse advanced scientific disciplines. To bridge this gap, we collected a multimodal, multidisciplinary dataset from open-access scientific articles published in Nature Communications journals. This dataset spans 72 scientific disciplines, ensuring both diversity and quality. We created benchmarks with various tasks and settings to comprehensively evaluate LMMs' capabilities in understanding scientific figures and content. Our evaluation revealed that these tasks are highly challenging: many open-source models struggled significantly, and even GPT-4V and GPT-4o faced difficulties. We also explored using our dataset as training resources by constructing visual instruction-following data, enabling the 7B LLaVA model to achieve performance comparable to GPT-4V/o on our benchmark. Additionally, we investigated the use of our interleaved article texts and figure images for pre-training LMMs, resulting in improvements on the material generation task. The source dataset, including articles, figures, constructed benchmarks, and visual instruction-following data, is open-sourced. △ Less

Submitted 5 July, 2024; originally announced July 2024.

Comments: Code and data are available at https://github.com/Leezekun/MMSci

arXiv:2407.04383 [pdf]

Challenges for Real-Time Toxicity Detection in Online Games

Authors: Lynnette Hui Xian Ng, Adrian Xuan Wei Lim, Michael Miller Yoder

Abstract: Online multiplayer games like League of Legends, Counter Strike, and Skribbl.io create experiences through community interactions. Providing players with the ability to interact with each other through multiple modes also opens a Pandora box. Toxic behaviour and malicious players can ruin the experience, reduce the player base and potentially harming the success of the game and the studio. This ar… ▽ More Online multiplayer games like League of Legends, Counter Strike, and Skribbl.io create experiences through community interactions. Providing players with the ability to interact with each other through multiple modes also opens a Pandora box. Toxic behaviour and malicious players can ruin the experience, reduce the player base and potentially harming the success of the game and the studio. This article will give a brief overview of the challenges faced in toxic content detection in terms of text, audio and image processing problems, and behavioural toxicity. It also discusses the current practices in company-directed and user-directed content detection and discuss the values and limitations of automated content detection in the age of artificial intelligence. △ Less

Submitted 5 July, 2024; originally announced July 2024.

Comments: ACM Ethical Games Conference, Jan 2024

arXiv:2406.11570 [pdf, other]

doi 10.1145/3641234.3671036

Projecting Radiance Fields to Mesh Surfaces

Authors: Adrian Xuan Wei Lim, Lynnette Hui Xian Ng, Nicholas Kyger, Tomo Michigami, Faraz Baghernezhad

Abstract: Radiance fields produce high fidelity images with high rendering speed, but are difficult to manipulate. We effectively perform avatar texture transfer across different appearances by combining benefits from radiance fields and mesh surfaces. We represent the source as a radiance field using 3D Gaussian Splatter, then project the Gaussians on the target mesh. Our pipeline consists of Source Precon… ▽ More Radiance fields produce high fidelity images with high rendering speed, but are difficult to manipulate. We effectively perform avatar texture transfer across different appearances by combining benefits from radiance fields and mesh surfaces. We represent the source as a radiance field using 3D Gaussian Splatter, then project the Gaussians on the target mesh. Our pipeline consists of Source Preconditioning, Target Vectorization and Texture Projection. The projection completes in 1.12s in a pure CPU compute, compared to baselines techniques of Per Face Texture Projection and Ray Casting (31s, 4.1min). This method lowers the computational requirements, which makes it applicable to a broader range of devices from low-end mobiles to high end computers. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: SIGGRAPH Posteres 2024

arXiv:2406.05967 [pdf, other]

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

Authors: David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, Aishik Mandal, Alina Dragonetti, Artem Abzaliev, Atnafu Lambebo Tonja, Bontu Fufa Balcha, Chenxi Whitehouse, Christian Salamea, Dan John Velasco, David Ifeoluwa Adelani, David Le Meur, Emilio Villa-Cueva, Fajri Koto, Fauzan Farooqui, Frederico Belcavello, Ganzorig Batnasan, Gisela Vallejo, Grainne Caulfield, Guido Ivetta, Haiyue Song , et al. (50 additional authors not shown)

Abstract: Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recen… ▽ More Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recent efforts have tried to increase the number of languages covered on VQA datasets, they still lack diversity in low-resource languages. More importantly, although these datasets often extend their linguistic range via translation or some other approaches, they usually keep images the same, resulting in narrow cultural representation. To address these limitations, we construct CVQA, a new Culturally-diverse multilingual Visual Question Answering benchmark, designed to cover a rich set of languages and cultures, where we engage native speakers and cultural experts in the data collection process. As a result, CVQA includes culturally-driven images and questions from across 28 countries on four continents, covering 26 languages with 11 scripts, providing a total of 9k questions. We then benchmark several Multimodal Large Language Models (MLLMs) on CVQA, and show that the dataset is challenging for the current state-of-the-art models. This benchmark can serve as a probing evaluation suite for assessing the cultural capability and bias of multimodal models and hopefully encourage more research efforts toward increasing cultural awareness and linguistic diversity in this field. △ Less

Submitted 9 June, 2024; originally announced June 2024.

arXiv:2406.00784 [pdf, other]

Multidimensional optical singularities and their applications

Authors: Soon Wei Daniel Lim, Christina M. Spaegele, Federico Capasso

Abstract: Optical singularities, which are positions within an electromagnetic field where certain field parameters become undefined, hold significant potential for applications in areas such as super-resolution microscopy, sensing, and communication. This potential stems from their high field confinement and characteristic rapidly-changing field distributions. Although the systematic characterization of th… ▽ More Optical singularities, which are positions within an electromagnetic field where certain field parameters become undefined, hold significant potential for applications in areas such as super-resolution microscopy, sensing, and communication. This potential stems from their high field confinement and characteristic rapidly-changing field distributions. Although the systematic characterization of the first singularities dates back many decades, recent advancements in sub-wavelength wavefront control at optical frequencies have led to a renewed interest in the field, and have substantially expanded the range of known optical singularities and singular structures. However, the diversity in descriptions, mathematical formulations, and naming conventions can create confusion and impede accessibility to the field. This review aims to clarify the nomenclature by demonstrating that any singular field can be conceptualized as a collection of a finite set of principal, 'generic' singularities. These singularities are robust against small perturbations due to their topological nature. We underscore that the control over the principal properties of those singularities, namely, their protection against perturbations and their dimension, utilizes a consistent mathematical framework. Additionally, we provide an overview of current design techniques for both stable and approximate singularities and discuss their applications across various disciplines. △ Less

Submitted 2 June, 2024; originally announced June 2024.

arXiv:2405.18802 [pdf, other]

Enhancing Security and Privacy in Federated Learning using Update Digests and Voting-Based Defense

Authors: Wenjie Li, Kai Fan, Jingyuan Zhang, Hui Li, Wei Yang Bryan Lim, Qiang Yang

Abstract: Federated Learning (FL) is a promising privacy-preserving machine learning paradigm that allows data owners to collaboratively train models while keeping their data localized. Despite its potential, FL faces challenges related to the trustworthiness of both clients and servers, especially in the presence of curious or malicious adversaries. In this paper, we introduce a novel framework named \unde… ▽ More Federated Learning (FL) is a promising privacy-preserving machine learning paradigm that allows data owners to collaboratively train models while keeping their data localized. Despite its potential, FL faces challenges related to the trustworthiness of both clients and servers, especially in the presence of curious or malicious adversaries. In this paper, we introduce a novel framework named \underline{\textbf{F}}ederated \underline{\textbf{L}}earning with \underline{\textbf{U}}pdate \underline{\textbf{D}}igest (FLUD), which addresses the critical issues of privacy preservation and resistance to Byzantine attacks within distributed learning environments. FLUD utilizes an innovative approach, the $\mathsf{LinfSample}$ method, allowing clients to compute the $l_{\infty}$ norm across sliding windows of updates as an update digest. This digest enables the server to calculate a shared distance matrix, significantly reducing the overhead associated with Secure Multi-Party Computation (SMPC) by three orders of magnitude while effectively distinguishing between benign and malicious updates. Additionally, FLUD integrates a privacy-preserving, voting-based defense mechanism that employs optimized SMPC protocols to minimize communication rounds. Our comprehensive experiments demonstrate FLUD's effectiveness in countering Byzantine adversaries while incurring low communication and runtime overhead. FLUD offers a scalable framework for secure and reliable FL in distributed environments, facilitating its application in scenarios requiring robust data management and security. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: 14 pages

arXiv:2405.09008 [pdf, other]

Hyperbolicity of renormalization of critical quasicircle maps

Authors: Willie Rush Lim

Abstract: There is a well developed renormalization theory of real analytic critical circle maps by de Faria, de Melo, and Yampolsky. In this paper, we extend Yampolsky's result on hyperbolicity of renormalization periodic points to a larger class of dynamical objects, namely critical quasicircle maps, i.e. analytic self homeomorphisms of a quasicircle with a single critical point. Unlike critical circle ma… ▽ More There is a well developed renormalization theory of real analytic critical circle maps by de Faria, de Melo, and Yampolsky. In this paper, we extend Yampolsky's result on hyperbolicity of renormalization periodic points to a larger class of dynamical objects, namely critical quasicircle maps, i.e. analytic self homeomorphisms of a quasicircle with a single critical point. Unlike critical circle maps, the inner and outer criticalities of critical quasicircle maps can be distinct. We develop a compact analytic renormalization operator called Corona Renormalization with a hyperbolic fixed point whose stable manifold has codimension one and consists of critical quasicircle maps of the same criticality and periodic type rotation number. Our proof is an adaptation of Pacman Renormalization Theory for Siegel disks as well as rigidity results on the escaping dynamics of transcendental entire functions. △ Less

Submitted 14 May, 2024; originally announced May 2024.

Comments: 86 pages, 14 figures

MSC Class: 37E20; 37F25; 37F44; 37F10

arXiv:2405.07414 [pdf, other]

Binning as a Pretext Task: Improving Self-Supervised Learning in Tabular Domains

Authors: Kyungeun Lee, Ye Seul Sim, Hye-Seung Cho, Moonjung Eo, Suhee Yoon, Sanghyu Yoon, Woohyung Lim

Abstract: The ability of deep networks to learn superior representations hinges on leveraging the proper inductive biases, considering the inherent properties of datasets. In tabular domains, it is critical to effectively handle heterogeneous features (both categorical and numerical) in a unified manner and to grasp irregular functions like piecewise constant functions. To address the challenges in the self… ▽ More The ability of deep networks to learn superior representations hinges on leveraging the proper inductive biases, considering the inherent properties of datasets. In tabular domains, it is critical to effectively handle heterogeneous features (both categorical and numerical) in a unified manner and to grasp irregular functions like piecewise constant functions. To address the challenges in the self-supervised learning framework, we propose a novel pretext task based on the classical binning method. The idea is straightforward: reconstructing the bin indices (either orders or classes) rather than the original values. This pretext task provides the encoder with an inductive bias to capture the irregular dependencies, mapping from continuous inputs to discretized bins, and mitigates the feature heterogeneity by setting all features to have category-type targets. Our empirical investigations ascertain several advantages of binning: capturing the irregular function, compatibility with encoder architecture and additional modifications, standardizing all features into equal sets, grouping similar values within a feature, and providing ordering information. Comprehensive evaluations across diverse tabular datasets corroborate that our method consistently improves tabular representation learning performance for a wide range of downstream tasks. The codes are available in https://github.com/kyungeun-lee/tabularbinning. △ Less

Submitted 13 May, 2024; v1 submitted 12 May, 2024; originally announced May 2024.

Comments: ICML 2024, 18 pages (including supplementary materials)

arXiv:2405.01815 [pdf, other]

Toward end-to-end interpretable convolutional neural networks for waveform signals

Authors: Linh Vu, Thu Tran, Wern-Han Lim, Raphael Phan

Abstract: This paper introduces a novel convolutional neural networks (CNN) framework tailored for end-to-end audio deep learning models, presenting advancements in efficiency and explainability. By benchmarking experiments on three standard speech emotion recognition datasets with five-fold cross-validation, our framework outperforms Mel spectrogram features by up to seven percent. It can potentially repla… ▽ More This paper introduces a novel convolutional neural networks (CNN) framework tailored for end-to-end audio deep learning models, presenting advancements in efficiency and explainability. By benchmarking experiments on three standard speech emotion recognition datasets with five-fold cross-validation, our framework outperforms Mel spectrogram features by up to seven percent. It can potentially replace the Mel-Frequency Cepstral Coefficients (MFCC) while remaining lightweight. Furthermore, we demonstrate the efficiency and interpretability of the front-end layer using the PhysioNet Heart Sound Database, illustrating its ability to handle and capture intricate long waveform patterns. Our contributions offer a portable solution for building efficient and interpretable models for raw waveform data. △ Less

Submitted 2 May, 2024; originally announced May 2024.

arXiv:2404.13888 [pdf, other]

Functions of Direct and Indirect Pathways for Action Selection Are Quantitatively Analyzed in A Spiking Neural Network of The Basal Ganglia

Authors: Sang-Yoon Kim, Woochang Lim

Abstract: We are concerned about action selection in the basal ganglia (BG). We quantitatively analyze functions of direct pathway (DP) and indirect pathway (IP) for action selection in a spiking neural network with 3 competing channels. For such quantitative analysis, in each channel, we obtain the competition degree ${\cal C}_d$, given by the ratio of strength of DP (${\cal S}_{DP}$) to strength of IP (… ▽ More We are concerned about action selection in the basal ganglia (BG). We quantitatively analyze functions of direct pathway (DP) and indirect pathway (IP) for action selection in a spiking neural network with 3 competing channels. For such quantitative analysis, in each channel, we obtain the competition degree ${\cal C}_d$, given by the ratio of strength of DP (${\cal S}_{DP}$) to strength of IP (${\cal S}_{IP}$) (i.e., ${\cal C}_d = {\cal S}_{DP} / {\cal S}_{IP}$). Then, a desired action is selected in the channel with the largest ${\cal C}_d$. Desired action selection is made mainly due to strong focused inhibitory projection to the output nucleus, SNr (substantia nigra pars reticulata) via the DP in the corresponding channel. Unlike the case of DP, there are two types of IPs; intra-channel IP and inter-channel IP, due to widespread diffusive excitation from the STN (subthalamic nucleus). The intra-channel IP serves a function of brake to suppress the desired action selection. In contrast, the inter-channel IP to the SNr in the neighboring channels suppresses competing actions, leading to highlight the desired action selection. In this way, function of the inter-channel IP is opposite to that of the intra-channel IP. However, to the best of our knowledge, no quantitative analysis for such functions of the DP and the two IPs was made. Here, through direct calculations of the DP and the intra- and the inter-channel IP presynaptic currents into the SNr in each channel, we obtain the competition degree of each channel to determine a desired action, and then functions of the DP and the intra- and inter-channel IPs are quantitatively made clear. △ Less

Submitted 3 July, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.00791 [pdf, other]

doi 10.1109/ICASSP48485.2024.10446067

Personalized Neural Speech Codec

Authors: Inseon Jang, Haici Yang, Wootaek Lim, Seungkwon Beack, Minje Kim

Abstract: In this paper, we propose a personalized neural speech codec, envisioning that personalization can reduce the model complexity or improve perceptual speech quality. Despite the common usage of speech codecs where only a single talker is involved on each side of the communication, personalizing a codec for the specific user has rarely been explored in the literature. First, we assume speakers can b… ▽ More In this paper, we propose a personalized neural speech codec, envisioning that personalization can reduce the model complexity or improve perceptual speech quality. Despite the common usage of speech codecs where only a single talker is involved on each side of the communication, personalizing a codec for the specific user has rarely been explored in the literature. First, we assume speakers can be grouped into smaller subsets based on their perceptual similarity. Then, we also postulate that a group-specific codec can focus on the group's speech characteristics to improve its perceptual quality and computational efficiency. To this end, we first develop a Siamese network that learns the speaker embeddings from the LibriSpeech dataset, which are then grouped into underlying speaker clusters. Finally, we retrain the LPCNet-based speech codec baselines on each of the speaker clusters. Subjective listening tests show that the proposed personalization scheme introduces model compression while maintaining speech quality. In other words, with the same model complexity, personalized codecs produce better speech quality. △ Less

Submitted 31 March, 2024; originally announced April 2024.

Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024, pp. 991-995

arXiv:2403.13982 [pdf, ps, other]

Virasoro constraints and representations for quiver moduli spaces

Authors: Woonam Lim, Miguel Moreira

Abstract: We study the Virasoro constraints for moduli spaces of representations of quiver with relations by Joyce's vertex algebras. Using the framed Virasoro constraints, we construct a representation of half of the Virasoro algebra on the cohomology of moduli stacks of quiver representations under smoothness assumption. By exploiting the non-commutative nature of the Virasoro operators, we apply our theo… ▽ More We study the Virasoro constraints for moduli spaces of representations of quiver with relations by Joyce's vertex algebras. Using the framed Virasoro constraints, we construct a representation of half of the Virasoro algebra on the cohomology of moduli stacks of quiver representations under smoothness assumption. By exploiting the non-commutative nature of the Virasoro operators, we apply our theory for quivers to del Pezzo surfaces using exceptional collections. In particular, the Virasoro constraints and representations are proven for moduli of sheaves on $\mathbb{P}^2$, $\mathbb{P}^1\times \mathbb{P}^1$ and $\text{Bl}_{\mathsf{pt}}(\mathbb{P}^2)$. Lastly, we unravel the Virasoro constraints for Grassmannians in terms of symmetric polynomials and Hecke operators. △ Less

Submitted 25 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

arXiv:2403.06277 [pdf, ps, other]

Cohomology rings of the moduli of one-dimensional sheaves on the projective plane

Authors: Yakov Kononov, Woonam Lim, Miguel Moreira, Weite Pi

Abstract: We initiate a systematic study on the cohomology rings of the moduli stack $\mathfrak{M}_{d,χかい}$ of semistable one-dimensional sheaves on the projective plane. We introduce a set of tautological relations of geometric origin, including Mumford-type relations, and prove that their ideal is generated by certain primitive relations via the Virasoro operators. Using BPS integrality and the computationa… ▽ More We initiate a systematic study on the cohomology rings of the moduli stack $\mathfrak{M}_{d,χかい}$ of semistable one-dimensional sheaves on the projective plane. We introduce a set of tautological relations of geometric origin, including Mumford-type relations, and prove that their ideal is generated by certain primitive relations via the Virasoro operators. Using BPS integrality and the computational efficiency of Virasoro operators, we show that our geometric relations completely determine the cohomology rings of the moduli stacks up to degree 5. As an application, we verify the refined Gopakumar--Vafa/Pandharipande--Thomas correspondence for local $\mathbb{P}^2$ in degree 5. Furthermore, we propose a substantially strengthened version of the $P=C$ conjecture, originally introduced by Shen and two of the authors. This can be viewed as an analogue of the $P=W$ conjecture in a compact and Fano setting. △ Less

Submitted 23 June, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

Comments: Revised introduction. 63 pages, comments are welcome!

arXiv:2403.02253 [pdf, other]

KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection

Authors: Yuexin Li, Chengyu Huang, Shumin Deng, Mei Lin Lock, Tri Cao, Nay Oo, Hoon Wei Lim, Bryan Hooi

Abstract: Phishing attacks have inflicted substantial losses on individuals and businesses alike, necessitating the development of robust and efficient automated phishing detection approaches. Reference-based phishing detectors (RBPDs), which compare the logos on a target webpage to a known set of logos, have emerged as the state-of-the-art approach. However, a major limitation of existing RBPDs is that the… ▽ More Phishing attacks have inflicted substantial losses on individuals and businesses alike, necessitating the development of robust and efficient automated phishing detection approaches. Reference-based phishing detectors (RBPDs), which compare the logos on a target webpage to a known set of logos, have emerged as the state-of-the-art approach. However, a major limitation of existing RBPDs is that they rely on a manually constructed brand knowledge base, making it infeasible to scale to a large number of brands, which results in false negative errors due to the insufficient brand coverage of the knowledge base. To address this issue, we propose an automated knowledge collection pipeline, using which we collect a large-scale multimodal brand knowledge base, KnowPhish, containing 20k brands with rich information about each brand. KnowPhish can be used to boost the performance of existing RBPDs in a plug-and-play manner. A second limitation of existing RBPDs is that they solely rely on the image modality, ignoring useful textual information present in the webpage HTML. To utilize this textual information, we propose a Large Language Model (LLM)-based approach to extract brand information of webpages from text. Our resulting multimodal phishing detection approach, KnowPhish Detector (KPD), can detect phishing webpages with or without logos. We evaluate KnowPhish and KPD on a manually validated dataset, and a field study under Singapore's local context, showing substantial improvements in effectiveness and efficiency compared to state-of-the-art baselines. △ Less

Submitted 15 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

Comments: Accepted by USENIX Security 2024

arXiv:2402.12690 [pdf, other]

Simpson's Paradox and the Accuracy-Fluency Tradeoff in Translation

Authors: Zheng Wei Lim, Ekaterina Vylomova, Trevor Cohn, Charles Kemp

Abstract: A good translation should be faithful to the source and should respect the norms of the target language. We address a theoretical puzzle about the relationship between these objectives. On one hand, intuition and some prior work suggest that accuracy and fluency should trade off against each other, and that capturing every detail of the source can only be achieved at the cost of fluency. On the ot… ▽ More A good translation should be faithful to the source and should respect the norms of the target language. We address a theoretical puzzle about the relationship between these objectives. On one hand, intuition and some prior work suggest that accuracy and fluency should trade off against each other, and that capturing every detail of the source can only be achieved at the cost of fluency. On the other hand, quality assessment researchers often suggest that accuracy and fluency are highly correlated and difficult for human raters to distinguish (Callison-Burch et al., 2007). We show that the tension between these views is an instance of Simpson's paradox, and that accuracy and fluency are positively correlated at the level of the corpus but trade off at the level of individual source segments. We further suggest that the relationship between accuracy and fluency is best evaluated at the segment (or sentence) level, and that the trade off between these dimensions has implications both for assessing translation quality and developing improved MT systems. △ Less

Submitted 10 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

arXiv:2401.14654 [pdf, other]

A Korean Legal Judgment Prediction Dataset for Insurance Disputes

Authors: Alice Saebom Kwak, Cheonkam Jeong, Ji Weon Lim, Byeongcheol Min

Abstract: This paper introduces a Korean legal judgment prediction (LJP) dataset for insurance disputes. Successful LJP models on insurance disputes can benefit insurance companies and their customers. It can save both sides' time and money by allowing them to predict how the result would come out if they proceed to the dispute mediation process. As is often the case with low-resource languages, there is a… ▽ More This paper introduces a Korean legal judgment prediction (LJP) dataset for insurance disputes. Successful LJP models on insurance disputes can benefit insurance companies and their customers. It can save both sides' time and money by allowing them to predict how the result would come out if they proceed to the dispute mediation process. As is often the case with low-resource languages, there is a limitation on the amount of data available for this specific task. To mitigate this issue, we investigate how one can achieve a good performance despite the limitation in data. In our experiment, we demonstrate that Sentence Transformer Fine-tuning (SetFit, Tunstall et al., 2022) is a good alternative to standard fine-tuning when training data are limited. The models fine-tuned with the SetFit approach on our data show similar performance to the Korean LJP benchmark models (Hwang et al., 2022) despite the much smaller data size. △ Less

Submitted 26 January, 2024; originally announced January 2024.

Comments: 5 pages, 1 figure

arXiv:2401.11560 [pdf, other]

Polarized Light from Massive Protoclusters (POLIMAP). I. Dissecting the role of magnetic fields in the massive infrared dark cloud G28.37+0.07

Authors: C-Y Law, Jonathan C. Tan, Raphael Skalidis, Larry Morgan, Duo Xu, Felipe de Oliveira Alves, Ashley T. Barnes, Natalie Butterfield, Paola Caselli, Giuliana Cosentino, Francesco Fontani, Jonathan D. Henshaw, Izaskun Jimenez-Serra, Wanggi Lim

Abstract: Magnetic fields may play a crucial role in setting the initial conditions of massive star and star cluster formation. To investigate this, we report SOFIA-HAWC+ $214\:μみゅー$m observations of polarized thermal dust emission and high-resolution GBT-Argus C$^{18}$O(1-0) observations toward the massive Infrared Dark Cloud (IRDC) G28.37+0.07. Considering the local dispersion of $B$-field orientations, we p… ▽ More Magnetic fields may play a crucial role in setting the initial conditions of massive star and star cluster formation. To investigate this, we report SOFIA-HAWC+ $214\:μみゅー$m observations of polarized thermal dust emission and high-resolution GBT-Argus C$^{18}$O(1-0) observations toward the massive Infrared Dark Cloud (IRDC) G28.37+0.07. Considering the local dispersion of $B$-field orientations, we produce a map of $B$-field strength of the IRDC, which exhibits values between $\sim0.03 - 1\:$mG based on a refined Davis-Chandrasekhar-Fermi (r-DCF) method proposed by Skalidis \& Tassis. Comparing to a map of inferred density, the IRDC exhibits a $B-n$ relation with a power law index of $0.51\pm0.02$, which is consistent with a scenario of magnetically-regulated anisotropic collapse. Consideration of the mass-to-flux ratio map indicates that magnetic fields are dynamically important in most regions of the IRDC. A virial analysis of a sample of massive, dense cores in the IRDC, including evaluation of magnetic and kinetic internal and surface terms, indicates consistency with virial equilibrium, sub-Alfvénic conditions and a dominant role for $B-$fields in regulating collapse. A clear alignment of magnetic field morphology with direction of steepest column density gradient is also detected. However, there is no preferred orientation of protostellar outflow directions with the $B-$field. Overall, these results indicate that magnetic fields play a crucial role in regulating massive star and star cluster formation and so need to be accounted for in theoretical models of these processes. △ Less

Submitted 21 January, 2024; originally announced January 2024.

Comments: Submitted to ApJ, comments welcome

arXiv:2401.05593 [pdf, other]

doi 10.1145/3588028.3603653

Reverse Projection: Real-Time Local Space Texture Mapping

Authors: Adrian Xuan Wei Lim, Lynnette Hui Xian Ng, Conor Griffin, Nicholas Kyger, Faraz Baghernezhad

Abstract: We present Reverse Projection, a novel projective texture mapping technique for painting a decal directly to the texture of a 3D object. Designed to be used in games, this technique works in real-time. By using projection techniques that are computed in local space textures and outward-looking, users using low-end android devices to high-end gaming desktops are able to enjoy the personalization of… ▽ More We present Reverse Projection, a novel projective texture mapping technique for painting a decal directly to the texture of a 3D object. Designed to be used in games, this technique works in real-time. By using projection techniques that are computed in local space textures and outward-looking, users using low-end android devices to high-end gaming desktops are able to enjoy the personalization of their assets. We believe our proposed pipeline is a step in improving the speed and versatility of model painting. △ Less

Submitted 10 January, 2024; originally announced January 2024.

Comments: SIGGRAPH 2023

arXiv:2401.01985 [pdf, other]

Surveying the Giant HII Regions of the Milky Way with SOFIA: VI. NGC 3603

Authors: James M. De Buizer, Wanggi Lim, Nicole Karnath, James T. Radomski

Abstract: We present our sixth set of results from our mid-infrared imaging survey of Milky Way Giant HII regions with our detailed analysis of NGC 3603, the most luminous GHII region in the Galaxy. We used imaging data from the FORCAST instrument on the Stratospheric Observatory For Infrared Astronomy (SOFIA) at 20 and 37 microns which mapped the central ~8.5'x8.5' infrared-emitting area of NGC 3603 at a s… ▽ More We present our sixth set of results from our mid-infrared imaging survey of Milky Way Giant HII regions with our detailed analysis of NGC 3603, the most luminous GHII region in the Galaxy. We used imaging data from the FORCAST instrument on the Stratospheric Observatory For Infrared Astronomy (SOFIA) at 20 and 37 microns which mapped the central ~8.5'x8.5' infrared-emitting area of NGC 3603 at a spatial resolution of <~3". Utilizing these SOFIA data in conjunction with multi-wavelength observations from the near-infrared to radio, including Spitzer-IRAC and Herschel-PACS archival data, we investigate the physical nature of individual infrared sources and sub-components within NGC 3603. For individual compact sources we used the multi-wavelength photometry data to construct spectral energy distributions (SEDs) and fit them with massive young stellar object (MYSO) SED models, and find 14 sources that are likely to be MYSOs. We also detect dust emission from the 3 massive proplyd candidates, as well as from the disk and outflow of the evolved blue supergiant, Sher 25. Utilizing multi-wavelength data, we derived luminosity-to-mass ratio and virial parameters for the star-forming clumps within NGC 3603, estimating their relative ages and finding that NGC 3603 is an older GHII region overall, compared to our previously studied GHII regions. We discuss how NGC 3603, which we categorize as a 'cavity-type' GHII region, exhibits a more modest number of MYSOs and molecular clumps when compared to the 'distributed-type' GHII regions that share similar Lyman continuum photon rates. △ Less

Submitted 3 January, 2024; originally announced January 2024.

Comments: 32 pages, 15 figures, accepted for publication in ApJ

arXiv:2312.11852 [pdf, other]

Predicting Human Translation Difficulty with Neural Machine Translation

Authors: Zheng Wei Lim, Ekaterina Vylomova, Charles Kemp, Trevor Cohn

Abstract: Human translators linger on some words and phrases more than others, and predicting this variation is a step towards explaining the underlying cognitive processes. Using data from the CRITT Translation Process Research Database, we evaluate the extent to which surprisal and attentional features derived from a Neural Machine Translation (NMT) model account for reading and production times of human… ▽ More Human translators linger on some words and phrases more than others, and predicting this variation is a step towards explaining the underlying cognitive processes. Using data from the CRITT Translation Process Research Database, we evaluate the extent to which surprisal and attentional features derived from a Neural Machine Translation (NMT) model account for reading and production times of human translators. We find that surprisal and attention are complementary predictors of translation difficulty, and that surprisal derived from a NMT model is the single most successful predictor of production duration. Our analyses draw on data from hundreds of translators operating across 13 language pairs, and represent the most comprehensive investigation of human translation difficulty to date. △ Less

Submitted 18 December, 2023; originally announced December 2023.

arXiv:2312.05801 [pdf, other]

Stability and Character of Zero Field Skyrmionic States in Hybrid Magnetic Multilayer Nanodots

Authors: Alexander Kang-Jun Toh, McCoy W. Lim, T. S. Suraj, Xiaoye Chen, Hang Khume Tan, Royston Lim, Xuan Min Cheng, Nelson Lim, Sherry Yap, Durgesh Kumar, S. N. Piramanayagam, Pin Ho, Anjan Soumyanarayanan

Abstract: Ambient magnetic skyrmions stabilized in multilayer nanostructures are of immense interest due to their relevance to magnetic tunnel junction (MTJ) devices for memory and unconventional computing applications. However, existing skyrmionic nanostructures built using conventional metallic or oxide multilayer nanodots are unable to concurrently fulfill the requirements of nanoscale skyrmion stability… ▽ More Ambient magnetic skyrmions stabilized in multilayer nanostructures are of immense interest due to their relevance to magnetic tunnel junction (MTJ) devices for memory and unconventional computing applications. However, existing skyrmionic nanostructures built using conventional metallic or oxide multilayer nanodots are unable to concurrently fulfill the requirements of nanoscale skyrmion stability and feasibility of all-electrical readout and manipulation. Here, we develop a few-repeat hybrid multilayer platform consisting of metallic [Pt/CoB/Ir]3 and oxide [Pt/CoB/MgO] components that are coupled to evolve together as a single, composite stack. Zero-field (ZF) skyrmions with sizes as small as 50 nm are stabilized in the hybrid multilayer nanodots, which are smoothly modulated by up to 2.5x by varying CoB thickness and dot sizes. Meanwhile, skyrmion multiplets are also stabilized by small bias fields. Crucially, we observe higher order 'target' skyrmions with varying magnetization rotations in moderately-sized, low anisotropy nanodots. These results provide a viable route to realize long-sought skyrmionic MTJ devices and new possibilities for multi-state skyrmionic device concepts. △ Less

Submitted 10 December, 2023; originally announced December 2023.

arXiv:2312.03285 [pdf, ps, other]

doi 10.1088/1361-6382/ad0b9f

Periodic boundary conditions and $G_2$ cosmology

Authors: Alan Coley, Woei Chet Lim

Abstract: In the standard concordance cosmology the spatial curvature is assumed to be constant and zero (or at least very small). In particular, in numerical computations of the structure of the universe using N-body simulations, exact periodic boundary conditions are assumed which constrains the spatial curvature. In order to confirm this qualitatively, we numerically evolve a special class of spatially i… ▽ More In the standard concordance cosmology the spatial curvature is assumed to be constant and zero (or at least very small). In particular, in numerical computations of the structure of the universe using N-body simulations, exact periodic boundary conditions are assumed which constrains the spatial curvature. In order to confirm this qualitatively, we numerically evolve a special class of spatially inhomogeneous $G_2$ models with both periodic initial data and non periodic initial data using zooming techniques. We consequently demonstrate that in these models periodic initial conditions do indeed suppress the growth of the spatial curvature as the models evolve away from their initial isotropic and spatially homogeneous state, thereby verifying that the spatial curvature is necessarily very small in standard cosmology. △ Less

Submitted 5 December, 2023; originally announced December 2023.

Comments: 10 pages, 3 figures. Matches published version

Journal ref: Class. Quantum Grav. 41 015009 (2024)

arXiv:2311.14464 [pdf, other]

Finite Volume Features, Global Geometry Representations, and Residual Training for Deep Learning-based CFD Simulation

Authors: Loh Sher En Jessica, Naheed Anjum Arafat, Wei Xian Lim, Wai Lee Chan, Adams Wai Kin Kong

Abstract: Computational fluid dynamics (CFD) simulation is an irreplaceable modelling step in many engineering designs, but it is often computationally expensive. Some graph neural network (GNN)-based CFD methods have been proposed. However, the current methods inherit the weakness of traditional numerical simulators, as well as ignore the cell characteristics in the mesh used in the finite volume method, a… ▽ More Computational fluid dynamics (CFD) simulation is an irreplaceable modelling step in many engineering designs, but it is often computationally expensive. Some graph neural network (GNN)-based CFD methods have been proposed. However, the current methods inherit the weakness of traditional numerical simulators, as well as ignore the cell characteristics in the mesh used in the finite volume method, a common method in practical CFD applications. Specifically, the input nodes in these GNN methods have very limited information about any object immersed in the simulation domain and its surrounding environment. Also, the cell characteristics of the mesh such as cell volume, face surface area, and face centroid are not included in the message-passing operations in the GNN methods. To address these weaknesses, this work proposes two novel geometric representations: Shortest Vector (SV) and Directional Integrated Distance (DID). Extracted from the mesh, the SV and DID provide global geometry perspective to each input node, thus removing the need to collect this information through message-passing. This work also introduces the use of Finite Volume Features (FVF) in the graph convolutions as node and edge attributes, enabling its message-passing operations to adjust to different nodes. Finally, this work is the first to demonstrate how residual training, with the availability of low-resolution data, can be adopted to improve the flow field prediction accuracy. Experimental results on two datasets with five different state-of-the-art GNN methods for CFD indicate that SV, DID, FVF and residual training can effectively reduce the predictive error of current GNN-based methods by as much as 41%. △ Less

Submitted 24 November, 2023; originally announced November 2023.

arXiv:2311.11212 [pdf, other]

Can We Utilize Pre-trained Language Models within Causal Discovery Algorithms?

Authors: Chanhui Lee, Juhyeon Kim, Yongjun Jeong, Juhyun Lyu, Junghee Kim, Sangmin Lee, Sangjun Han, Hyeokjun Choe, Soyeon Park, Woohyung Lim, Sungbin Lim, Sanghack Lee

Abstract: Scaling laws have allowed Pre-trained Language Models (PLMs) into the field of causal reasoning. Causal reasoning of PLM relies solely on text-based descriptions, in contrast to causal discovery which aims to determine the causal relationships between variables utilizing data. Recently, there has been current research regarding a method that mimics causal discovery by aggregating the outcomes of r… ▽ More Scaling laws have allowed Pre-trained Language Models (PLMs) into the field of causal reasoning. Causal reasoning of PLM relies solely on text-based descriptions, in contrast to causal discovery which aims to determine the causal relationships between variables utilizing data. Recently, there has been current research regarding a method that mimics causal discovery by aggregating the outcomes of repetitive causal reasoning, achieved through specifically designed prompts. It highlights the usefulness of PLMs in discovering cause and effect, which is often limited by a lack of data, especially when dealing with multiple variables. Conversely, the characteristics of PLMs which are that PLMs do not analyze data and they are highly dependent on prompt design leads to a crucial limitation for directly using PLMs in causal discovery. Accordingly, PLM-based causal reasoning deeply depends on the prompt design and carries out the risk of overconfidence and false predictions in determining causal relationships. In this paper, we empirically demonstrate the aforementioned limitations of PLM-based causal reasoning through experiments on physics-inspired synthetic data. Then, we propose a new framework that integrates prior knowledge obtained from PLM with a causal discovery algorithm. This is accomplished by initializing an adjacency matrix for causal discovery and incorporating regularization using prior knowledge. Our proposed framework not only demonstrates improved performance through the integration of PLM and causal discovery but also suggests how to leverage PLM-extracted prior knowledge with existing causal discovery algorithms. △ Less

Submitted 18 November, 2023; originally announced November 2023.

ACM Class: I.2

arXiv:2311.09567 [pdf, other]

Entangling gates on degenerate spin qubits dressed by a global field

Authors: Ingvild Hansen, Amanda E. Seedhouse, Santiago Serrano, Andreas Nickl, MengKe Feng, Jonathan Y. Huang, Tuomo Tanttu, Nard Dumoulin Stuyck, Wee Han Lim, Fay E. Hudson, Kohei M. Itoh, Andre Saraiva, Arne Laucht, Andrew S. Dzurak, Chih Hwan Yang

Abstract: Coherently dressed spins have shown promising results as building blocks for future quantum computers owing to their resilience to environmental noise and their compatibility with global control fields. This mode of operation allows for more amenable qubit architecture requirements and simplifies signal routing on the chip. However, multi-qubit operations, such as qubit addressability and two-qubi… ▽ More Coherently dressed spins have shown promising results as building blocks for future quantum computers owing to their resilience to environmental noise and their compatibility with global control fields. This mode of operation allows for more amenable qubit architecture requirements and simplifies signal routing on the chip. However, multi-qubit operations, such as qubit addressability and two-qubit gates, are yet to be demonstrated to establish global control in combination with dressed qubits as a viable path to universal quantum computing. Here we demonstrate simultaneous on-resonance driving of degenerate qubits using a global field while retaining addressability for qubits with equal Larmor frequencies. Furthermore, we implement SWAP oscillations during on-resonance driving, constituting the demonstration of driven two-qubit gates. Significantly, our findings highlight the fragility of entangling gates between superposition states and how dressing can increase the noise robustness. These results represent a crucial milestone towards global control operation with dressed qubits. It also opens a door to interesting spin physics on degenerate spins. △ Less

Submitted 30 November, 2023; v1 submitted 16 November, 2023; originally announced November 2023.

arXiv:2311.01390 [pdf, ps, other]

On $S$-Noetherian modules and $S$-strong Mori modules

Authors: Hyungtae Baek, Jung Wook Lim

Abstract: In this paper, we study some properties of $S$-Noetherian modules and $S$-strong Mori modules. Among other things, we prove the Hilbert basis theorem for $S$-Noetherian modules and $S$-strong Mori modules. In this paper, we study some properties of $S$-Noetherian modules and $S$-strong Mori modules. Among other things, we prove the Hilbert basis theorem for $S$-Noetherian modules and $S$-strong Mori modules. △ Less

Submitted 2 November, 2023; originally announced November 2023.

Comments: 15 pages

MSC Class: 13A15; 13B25; 13E99

arXiv:2310.19201 [pdf, ps, other]

Open Problems in DAOs

Authors: Joshua Tan, Tara Merk, Sarah Hubbard, Eliza R. Oak, Helena Rong, Joni Pirovich, Ellie Rennie, Rolf Hoefer, Michael Zargham, Jason Potts, Chris Berg, Reuben Youngblom, Primavera De Filippi, Seth Frey, Jeff Strnad, Morshed Mannan, Kelsie Nabben, Silke Noa Elrifai, Jake Hartnell, Benjamin Mako Hill, Tobin South, Ryan L. Thomas, Jonathan Dotan, Ariana Spring, Alexia Maddox , et al. (4 additional authors not shown)

Abstract: Decentralized autonomous organizations (DAOs) are a new, rapidly-growing class of organizations governed by smart contracts. Here we describe how researchers can contribute to the emerging science of DAOs and other digitally-constituted organizations. From granular privacy primitives to mechanism designs to model laws, we identify high-impact problems in the DAO ecosystem where existing gaps might… ▽ More Decentralized autonomous organizations (DAOs) are a new, rapidly-growing class of organizations governed by smart contracts. Here we describe how researchers can contribute to the emerging science of DAOs and other digitally-constituted organizations. From granular privacy primitives to mechanism designs to model laws, we identify high-impact problems in the DAO ecosystem where existing gaps might be tackled through a new data set or by applying tools and ideas from existing research fields such as political science, computer science, economics, law, and organizational science. Our recommendations encompass exciting research questions as well as promising business opportunities. We call on the wider research community to join the global effort to invent the next generation of organizations. △ Less

Submitted 12 June, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

Comments: includes major coordination problems

arXiv:2310.18032 [pdf, ps, other]

On $S$-$n$-absorbing ideals

Authors: Hyungtae Baek, Hyun Seung Choi, Jung Wook Lim

Abstract: Let $R$ be a commutative ring with identity, $S$ a multiplicative subset of $R$ and $I$ an ideal of $R$ disjoint from $S$. In this paper, we introduce the notion of an $S$-$n$-absorbing ideal which is a generalization of both the $S$-prime ideals and $n$-absorbing ideals. Moreover, we investigate the basic properties, quotient extension, existence and amalgamation of $S$-$n$-absorbing ideals. Let $R$ be a commutative ring with identity, $S$ a multiplicative subset of $R$ and $I$ an ideal of $R$ disjoint from $S$. In this paper, we introduce the notion of an $S$-$n$-absorbing ideal which is a generalization of both the $S$-prime ideals and $n$-absorbing ideals. Moreover, we investigate the basic properties, quotient extension, existence and amalgamation of $S$-$n$-absorbing ideals. △ Less

Submitted 27 October, 2023; originally announced October 2023.

Comments: 18pages

MSC Class: 13A15

arXiv:2310.11635 [pdf, other]

Break-up and Recovery of Harmony between Direct and Indirect Pathways in The Basal Ganglia; Huntington's Disease and Treatment

Authors: Sang-Yoon Kim, Woochang Lim

Abstract: The basal ganglia (BG) in the brain exhibit diverse functions for motor, cognition, and emotion. Such BG functions could be made via competitive harmony between the two competing pathways, direct pathway (DP) (facilitating movement) and indirect pathway (IP) (suppressing movement). As a result of break-up of harmony between DP and IP, there appear pathological states with disorder for movement, co… ▽ More The basal ganglia (BG) in the brain exhibit diverse functions for motor, cognition, and emotion. Such BG functions could be made via competitive harmony between the two competing pathways, direct pathway (DP) (facilitating movement) and indirect pathway (IP) (suppressing movement). As a result of break-up of harmony between DP and IP, there appear pathological states with disorder for movement, cognition, and psychiatry. In this paper, we are concerned about the Huntington's disease (HD), which is a genetic neurodegenerative disorder causing involuntary movement and severe cognitive and psychiatric symptoms. For the HD, the number of D2 SPNs ($N_{\rm D2}$) is decreased due to degenerative loss, and hence, by decreasing $x_{\rm D2}$ (fraction of $N_{\rm D2}$), we investigate break-up of harmony between DP and IP in terms of their competition degree ${\cal C}_d$, given by the ratio of strength of DP (${\cal S}_{DP}$) to strength of IP (${\cal S}_{IP}$) (i.e., ${\cal C}_d = {\cal S}_{DP} / {\cal S}_{IP}$). In the case of HD, the IP is under-active, in contrast to the case of Parkinson's disease with over-active IP, which results in increase in ${\cal C}_d$ (from the normal value). Thus, hyperkinetic dyskinesia such as chorea (involuntary jerky movement) occurs. We also investigate treatment of HD, based on optogenetics and GP ablation, by increasing strength of IP, resulting in recovery of harmony between DP and IP. Finally, we study effect of loss of healthy synapses of all the BG cells on HD. Due to loss of healthy synapses, disharmony between DP and IP increases, leading to worsen symptoms of the HD. △ Less

Submitted 18 April, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2309.12540

arXiv:2310.09722 [pdf, other]

A singlet-triplet hole-spin qubit in MOS silicon

Authors: S. D. Liles, D. J. Halverson, Z. Wang, A. Shamim, R. S. Eggli, I. K. Jin, J. Hillier, K. Kumar, I. Vorreiter, M. Rendell, J. H. Huang, C. C. Escott, F. E. Hudson, W. H. Lim, D. Culcer, A. S. Dzurak, A. R. Hamilton

Abstract: Holes in silicon quantum dots are promising for spin qubit applications due to the strong intrinsic spin-orbit coupling. The spin-orbit coupling produces complex hole-spin dynamics, providing opportunities to further optimize spin qubits. Here, we demonstrate a singlet-triplet qubit using hole states in a planar metal-oxide-semiconductor double quantum dot. We observe rapid qubit control with sing… ▽ More Holes in silicon quantum dots are promising for spin qubit applications due to the strong intrinsic spin-orbit coupling. The spin-orbit coupling produces complex hole-spin dynamics, providing opportunities to further optimize spin qubits. Here, we demonstrate a singlet-triplet qubit using hole states in a planar metal-oxide-semiconductor double quantum dot. We observe rapid qubit control with singlet-triplet oscillations up to 400 MHz. The qubit exhibits promising coherence, with a maximum dephasing time of 600 ns, which is enhanced to 1.3 us using refocusing techniques. We investigate the magnetic field anisotropy of the eigenstates, and determine a magnetic field orientation to improve the qubit initialisation fidelity. These results present a step forward for spin qubit technology, by implementing a high quality singlet-triplet hole-spin qubit in planar architecture suitable for scaling up to 2D arrays of coupled qubits. △ Less

Submitted 14 October, 2023; originally announced October 2023.

arXiv:2310.06369 [pdf, other]

Geometrically Aligned Transfer Encoder for Inductive Transfer in Regression Tasks

Authors: Sung Moon Ko, Sumin Lee, Dae-Woong Jeong, Woohyung Lim, Sehui Han

Abstract: Transfer learning is a crucial technique for handling a small amount of data that is potentially related to other abundant data. However, most of the existing methods are focused on classification tasks using images and language datasets. Therefore, in order to expand the transfer learning scheme to regression tasks, we propose a novel transfer technique based on differential geometry, namely the… ▽ More Transfer learning is a crucial technique for handling a small amount of data that is potentially related to other abundant data. However, most of the existing methods are focused on classification tasks using images and language datasets. Therefore, in order to expand the transfer learning scheme to regression tasks, we propose a novel transfer technique based on differential geometry, namely the Geometrically Aligned Transfer Encoder (GATE). In this method, we interpret the latent vectors from the model to exist on a Riemannian curved manifold. We find a proper diffeomorphism between pairs of tasks to ensure that every arbitrary point maps to a locally flat coordinate in the overlapping region, allowing the transfer of knowledge from the source to the target data. This also serves as an effective regularizer for the model to behave in extrapolation regions. In this article, we demonstrate that GATE outperforms conventional methods and exhibits stable behavior in both the latent space and extrapolation regions for various molecular graph datasets. △ Less

Submitted 10 October, 2023; originally announced October 2023.

Comments: 12+11 pages, 6+1 figures, 0+7 tables

arXiv:2309.12542 [pdf, other]

Spatio-temporal correlations of noise in MOS spin qubits

Authors: Amanda E. Seedhouse, Nard Dumoulin Stuyck, Santiago Serrano, Tuomo Tanttu, Will Gilbert, Jonathan Yue Huang, Fay E. Hudson, Kohei M. Itoh, Arne Laucht, Wee Han Lim, Chih Hwan Yang, Andrew S. Dzurak, Andre Saraiva

Abstract: In quantum computing, characterising the full noise profile of qubits can aid the efforts towards increasing coherence times and fidelities by creating error mitigating techniques specific to the type of noise in the system, or by completely removing the sources of noise. Spin qubits in MOS quantum dots are exposed to noise originated from the complex glassy behaviour of two-level fluctuators, lea… ▽ More In quantum computing, characterising the full noise profile of qubits can aid the efforts towards increasing coherence times and fidelities by creating error mitigating techniques specific to the type of noise in the system, or by completely removing the sources of noise. Spin qubits in MOS quantum dots are exposed to noise originated from the complex glassy behaviour of two-level fluctuators, leading to non-trivial correlations between qubit properties both in space and time. With recent engineering progress, large amounts of data are being collected in typical spin qubit device experiments, and it is beneficiary to explore data analysis options inspired from fields of research that are experienced in managing large data sets, examples include astrophysics, finance and climate science. Here, we propose and demonstrate wavelet-based analysis techniques to decompose signals into both frequency and time components to gain a deeper insight into the sources of noise in our systems. We apply the analysis to a long feedback experiment performed on a state-of-the-art two-qubit system in a pair of SiMOS quantum dots. The observed correlations serve to identify common microscopic causes of noise, as well as to elucidate pathways for multi-qubit operation with a more scalable feedback system. △ Less

Submitted 24 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

Comments: updated reference

arXiv:2309.12541 [pdf, other]

doi 10.1063/5.0179958

Real-time feedback protocols for optimizing fault-tolerant two-qubit gate fidelities in a silicon spin system

Authors: Nard Dumoulin Stuyck, Amanda E. Seedhouse, Santiago Serrano, Tuomo Tanttu, Will Gilbert, Jonathan Yue Huang, Fay Hudson, Kohei M. Itoh, Arne Laucht, Wee Han Lim, Chih Hwan Yang, Andre Saraiva, Andrew S. Dzurak

Abstract: Recently, several groups have demonstrated two-qubit gate fidelities in semiconductor spin qubit systems above 99%. Achieving this regime of fault-tolerant compatible high fidelities is nontrivial and requires exquisite stability and precise control over the different qubit parameters over an extended period of time. This can be done by efficiently calibrating qubit control parameters against diff… ▽ More Recently, several groups have demonstrated two-qubit gate fidelities in semiconductor spin qubit systems above 99%. Achieving this regime of fault-tolerant compatible high fidelities is nontrivial and requires exquisite stability and precise control over the different qubit parameters over an extended period of time. This can be done by efficiently calibrating qubit control parameters against different sources of micro- and macroscopic noise. Here, we present several single- and two-qubit parameter feedback protocols, optimised for and implemented in state-of-the-art fast FPGA hardware. Furthermore, we use wavelet-based analysis on the collected feedback data to gain insight into the different sources of noise in the system. Scalable feedback is an outstanding challenge and the presented implementation and analysis gives insight into the benefits and drawbacks of qubit parameter feedback, as feedback related overhead increases. This work demonstrates a pathway towards robust qubit parameter feedback and systematic noise analysis, crucial for mitigation strategies towards systematic high-fidelity qubit operation compatible with quantum error correction protocols. △ Less

Submitted 21 September, 2023; originally announced September 2023.

arXiv:2309.12540 [pdf, other]

Quantifying Harmony between Direct and Indirect Pathways in The Basal Ganglia; Healthy and Parkinsonian States

Authors: Sang-Yoon Kim, Woochang Lim

Abstract: The basal ganglia (BG) show a variety of functions for motor and cognition. There are two competitive pathways in the BG; direct pathway (DP) which facilitates movement and indirect pathway (IP) which suppresses movement. It is well known that diverse functions of the BG may be made through "balance" between DP and IP. But, to the best of our knowledge, so far no quantitative analysis for such bal… ▽ More The basal ganglia (BG) show a variety of functions for motor and cognition. There are two competitive pathways in the BG; direct pathway (DP) which facilitates movement and indirect pathway (IP) which suppresses movement. It is well known that diverse functions of the BG may be made through "balance" between DP and IP. But, to the best of our knowledge, so far no quantitative analysis for such balance was done. In this paper, as a first time, we introduce the competition degree ${\cal C}_d$ between DP and IP. Then, by employing ${\cal C}_d$, we quantify their competitive harmony (i.e., competition and cooperative interplay), which could lead to improving our understanding of the traditional "balance" so clearly and quantitatively. We first consider the case of normal dopamine (DA) level of $φふぁい^*=0.3$. In the case of phasic cortical input (10 Hzへるつ), a healthy state with ${\cal C}_d^* = 2.82$ (i.e., DP is 2.82 times stronger than IP) appears. In this case, normal movement occurs via harmony between DP and IP. Next, we consider the case of decreased DA level, $φふぁい= φふぁい^*(=0.3)~x_{DA}$ ($1 > x_{DA} \geq 0$). With decreasing $x_{DA}$ from 1, the competition degree ${\cal C}_d$ between DP and IP decreases monotonically from ${\cal C}_d^*$, which results in appearance of a pathological Parkinsonian state with reduced ${\cal C}_d$. In this Parkinsonian state, strength of IP is much increased than that in the case of normal healthy state, leading to disharmony between DP and IP. Due to such break-up of harmony between DP and IP, impaired movement occurs. Finally, we also study treatment of the pathological Parkinsonian state via recovery of harmony between DP and IP. △ Less

Submitted 12 April, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

arXiv:2309.01849 [pdf, other]

Impact of electrostatic crosstalk on spin qubits in dense CMOS quantum dot arrays

Authors: Jesus D. Cifuentes, Tuomo Tanttu, Paul Steinacker, Santiago Serrano, Ingvild Hansen, James P. Slack-Smith, Will Gilbert, Jonathan Y. Huang, Ensar Vahapoglu, Ross C. C. Leon, Nard Dumoulin Stuyck, Kohei Itoh, Nikolay Abrosimov, Hans-Joachim Pohl, Michael Thewalt, Arne Laucht, Chih Hwan Yang, Christopher C. Escott, Fay E. Hudson, Wee Han Lim, Rajib Rahman, Andrew S. Dzurak, Andre Saraiva

Abstract: Quantum processors based on integrated nanoscale silicon spin qubits are a promising platform for highly scalable quantum computation. Current CMOS spin qubit processors consist of dense gate arrays to define the quantum dots, making them susceptible to crosstalk from capacitive coupling between a dot and its neighbouring gates. Small but sizeable spin-orbit interactions can transfer this electros… ▽ More Quantum processors based on integrated nanoscale silicon spin qubits are a promising platform for highly scalable quantum computation. Current CMOS spin qubit processors consist of dense gate arrays to define the quantum dots, making them susceptible to crosstalk from capacitive coupling between a dot and its neighbouring gates. Small but sizeable spin-orbit interactions can transfer this electrostatic crosstalk to the spin g-factors, creating a dependence of the Larmor frequency on the electric field created by gate electrodes positioned even tens of nanometers apart. By studying the Stark shift from tens of spin qubits measured in nine different CMOS devices, we developed a theoretical frawework that explains how electric fields couple to the spin of the electrons in increasingly complex arrays, including those electric fluctuations that limit qubit dephasing times $T_2^*$. The results will aid in the design of robust strategies to scale CMOS quantum technology. △ Less

Submitted 4 September, 2023; originally announced September 2023.

Comments: 9 pages, 4 figures

arXiv:2308.12626 [pdf, other]

Methods for transverse and longitudinal spin-photon coupling in silicon quantum dots with intrinsic spin-orbit effect

Authors: Kevin S. Guo, MengKe Feng, Jonathan Y. Huang, Will Gilbert, Kohei M. Itoh, Fay E. Hudson, Kok Wai Chan, Wee Han Lim, Andrew S. Dzurak, Andre Saraiva

Abstract: In a full-scale quantum computer with a fault-tolerant architecture, having scalable, long-range interaction between qubits is expected to be a highly valuable resource. One promising method of achieving this is through the light-matter interaction between spins in semiconductors and photons in superconducting cavities. This paper examines the theory of both transverse and longitudinal spin-photon… ▽ More In a full-scale quantum computer with a fault-tolerant architecture, having scalable, long-range interaction between qubits is expected to be a highly valuable resource. One promising method of achieving this is through the light-matter interaction between spins in semiconductors and photons in superconducting cavities. This paper examines the theory of both transverse and longitudinal spin-photon coupling and their applications in the silicon metal-oxide-semiconductor (SiMOS) platform. We propose a method of coupling which uses the intrinsic spin-orbit interaction arising from orbital degeneracies in SiMOS qubits. Using theoretical analysis and experimental data, we show that the strong coupling regime is achievable in the transverse scheme. We also evaluate the feasibility of a longitudinal coupling driven by an AC modulation on the qubit. These coupling methods eschew the requirement for an external micromagnet, enhancing prospects for scalability and integration into a large-scale quantum computer. △ Less

Submitted 24 August, 2023; originally announced August 2023.

arXiv:2308.08245 [pdf, other]

doi 10.1109/TQE.2024.3386753

Multi-Objective Optimization and Network Routing with Near-Term Quantum Computers

Authors: Shao-Hen Chiew, Kilian Poirier, Rajesh Mishra, Ulrike Bornheimer, Ewan Munro, Si Han Foon, Christopher Wanru Chen, Wei Sheng Lim, Chee Wei Nga

Abstract: Multi-objective optimization is a ubiquitous problem that arises naturally in many scientific and industrial areas. Network routing optimization with multi-objective performance demands falls into this problem class, and finding good quality solutions at large scales is generally challenging. In this work, we develop a scheme with which near-term quantum computers can be applied to solve multi-obj… ▽ More Multi-objective optimization is a ubiquitous problem that arises naturally in many scientific and industrial areas. Network routing optimization with multi-objective performance demands falls into this problem class, and finding good quality solutions at large scales is generally challenging. In this work, we develop a scheme with which near-term quantum computers can be applied to solve multi-objective combinatorial optimization problems. We study the application of this scheme to the network routing problem in detail, by first mapping it to the multi-objective shortest path problem. Focusing on an implementation based on the quantum approximate optimization algorithm (QAOA) -- the go-to approach for tackling optimization problems on near-term quantum computers -- we examine the Pareto plot that results from the scheme, and qualitatively analyze its ability to produce Pareto-optimal solutions. We further provide theoretical and numerical scaling analyses of the resource requirements and performance of QAOA, and identify key challenges associated with this approach. Finally, through Amazon Braket we execute small-scale implementations of our scheme on the IonQ Harmony 11-qubit quantum computer. △ Less

Submitted 16 August, 2023; originally announced August 2023.

arXiv:2308.07217 [pdf, other]

Rigidity of J-rotational rational maps and critical quasicircle maps

Authors: Willie Rush Lim

Abstract: We present a number of rigidity results concerning holomorphic dynamical systems admitting rotation quasicircles. Firstly, we show the absence of line fields on the Julia set of any rational map that is geometrically finite away from a number of rotation quasicircles with bounded type rotation number. As an application, we prove combinatorial rigidity associated to the problem of degeneration of H… ▽ More We present a number of rigidity results concerning holomorphic dynamical systems admitting rotation quasicircles. Firstly, we show the absence of line fields on the Julia set of any rational map that is geometrically finite away from a number of rotation quasicircles with bounded type rotation number. As an application, we prove combinatorial rigidity associated to the problem of degeneration of Herman rings of the simplest configuration. Secondly, we extend a result of de Faria and de Melo on the $C^{1+αあるふぁ}$ rigidity of critical circle maps with bounded type rotation number to a larger class of dynamical objects, namely critical quasicircle maps. Unlike critical circle maps, critical quasicircle maps may have imbalanced inner and outer criticalities. As a consequence, we prove dynamical universality and exponential convergence of renormalization towards a horseshoe attractor. △ Less

Submitted 23 February, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

Comments: 58 pages, 11 figures. in the new version, the abstract is rewritten and more references are added

MSC Class: 37E20; 37F10; 37F25; 37F50

arXiv:2308.02111 [pdf, other]

doi 10.1038/s41586-024-07160-2

High-fidelity operation and algorithmic initialisation of spin qubits above one kelvin

Authors: Jonathan Y. Huang, Rocky Y. Su, Wee Han Lim, MengKe Feng, Barnaby van Straaten, Brandon Severin, Will Gilbert, Nard Dumoulin Stuyck, Tuomo Tanttu, Santiago Serrano, Jesus D. Cifuentes, Ingvild Hansen, Amanda E. Seedhouse, Ensar Vahapoglu, Nikolay V. Abrosimov, Hans-Joachim Pohl, Michael L. W. Thewalt, Fay E. Hudson, Christopher C. Escott, Natalia Ares, Stephen D. Bartlett, Andrea Morello, Andre Saraiva, Arne Laucht, Andrew S. Dzurak , et al. (1 additional authors not shown)

Abstract: The encoding of qubits in semiconductor spin carriers has been recognised as a promising approach to a commercial quantum computer that can be lithographically produced and integrated at scale. However, the operation of the large number of qubits required for advantageous quantum applications will produce a thermal load exceeding the available cooling power of cryostats at millikelvin temperatures… ▽ More The encoding of qubits in semiconductor spin carriers has been recognised as a promising approach to a commercial quantum computer that can be lithographically produced and integrated at scale. However, the operation of the large number of qubits required for advantageous quantum applications will produce a thermal load exceeding the available cooling power of cryostats at millikelvin temperatures. As the scale-up accelerates, it becomes imperative to establish fault-tolerant operation above 1 kelvin, where the cooling power is orders of magnitude higher. Here, we tune up and operate spin qubits in silicon above 1 kelvin, with fidelities in the range required for fault-tolerant operation at such temperatures. We design an algorithmic initialisation protocol to prepare a pure two-qubit state even when the thermal energy is substantially above the qubit energies, and incorporate radio-frequency readout to achieve fidelities up to 99.34 per cent for both readout and initialisation. Importantly, we demonstrate a single-qubit Clifford gate fidelity of 99.85 per cent, and a two-qubit gate fidelity of 98.92 per cent. These advances overcome the fundamental limitation that the thermal energy must be well below the qubit energies for high-fidelity operation to be possible, surmounting a major obstacle in the pathway to scalable and fault-tolerant quantum computation. △ Less

Submitted 18 August, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

Journal ref: Nature 627, 772-777 (2024)

arXiv:2307.12452 [pdf, other]

Characterizing non-Markovian Quantum Process by Fast Bayesian Tomography

Authors: R. Y. Su, J. Y. Huang, N. Dumoulin. Stuyck, M. K. Feng, W. Gilbert, T. J. Evans, W. H. Lim, F. E. Hudson, K. W. Chan, W. Huang, Kohei M. Itoh, R. Harper, S. D. Bartlett, C. H. Yang, A. Laucht, A. Saraiva, T. Tanttu, A. S. Dzurak

Abstract: To push gate performance to levels beyond the thresholds for quantum error correction, it is important to characterize the error sources occurring on quantum gates. However, the characterization of non-Markovian error poses a challenge to current quantum process tomography techniques. Fast Bayesian Tomography (FBT) is a self-consistent gate set tomography protocol that can be bootstrapped from ear… ▽ More To push gate performance to levels beyond the thresholds for quantum error correction, it is important to characterize the error sources occurring on quantum gates. However, the characterization of non-Markovian error poses a challenge to current quantum process tomography techniques. Fast Bayesian Tomography (FBT) is a self-consistent gate set tomography protocol that can be bootstrapped from earlier characterization knowledge and be updated in real-time with arbitrary gate sequences. Here we demonstrate how FBT allows for the characterization of key non-Markovian error processes. We introduce two experimental protocols for FBT to diagnose the non-Markovian behavior of two-qubit systems on silicon quantum dots. To increase the efficiency and scalability of the experiment-analysis loop, we develop an online FBT software stack. To reduce experiment cost and analysis time, we also introduce a native readout method and warm boot strategy. Our results demonstrate that FBT is a useful tool for probing non-Markovian errors that can be detrimental to the ultimate realization of fault-tolerant operation on quantum computing. △ Less

Submitted 4 October, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

arXiv:2307.08186 [pdf]

All-glass 100 mm Diameter Visible Metalens for Imaging the Cosmos

Authors: Joon-Suh Park, Soon Wei Daniel Lim, Arman Amirzhan, Hyukmo Kang, Karlene Karrfalt, Daewook Kim, Joel Leger, Augustine M. Urbas, Marcus Ossiander, Zhaoyi Li, Federico Capasso

Abstract: Metasurfaces, optics made from subwavelength-scale nanostructures, have been limited to millimeter-sizes by the scaling challenge of producing vast numbers of precisely engineered elements over a large area. In this study, we demonstrate an all-glass 100 mm diameter metasurface lens (metalens) comprising 18.7 billion nanostructures that operates in the visible spectrum with a fast f-number (f/1.5,… ▽ More Metasurfaces, optics made from subwavelength-scale nanostructures, have been limited to millimeter-sizes by the scaling challenge of producing vast numbers of precisely engineered elements over a large area. In this study, we demonstrate an all-glass 100 mm diameter metasurface lens (metalens) comprising 18.7 billion nanostructures that operates in the visible spectrum with a fast f-number (f/1.5, NA=0.32) using deep-ultraviolet (DUV) projection lithography. Our work overcomes the exposure area constraints of lithography tools and demonstrates that large metasurfaces are commercially feasible. Additionally, we investigate the impact of various fabrication errors on the imaging quality of the metalens, several of which are unique to such large area metasurfaces. We demonstrate direct astronomical imaging of the Sun, the Moon, and emission nebulae at visible wavelengths and validate the robustness of such metasurfaces under extreme environmental thermal swings for space applications. △ Less

Submitted 16 July, 2023; originally announced July 2023.

arXiv:2307.08106 [pdf, other]

Polarization Multi-Image Synthesis with Birefringent Metasurfaces

Authors: Dean Hazineh, Soon Wei Daniel Lim, Qi Guo, Federico Capasso, Todd Zickler

Abstract: Optical metasurfaces composed of precisely engineered nanostructures have gained significant attention for their ability to manipulate light and implement distinct functionalities based on the properties of the incident field. Computational imaging systems have started harnessing this capability to produce sets of coded measurements that benefit certain tasks when paired with digital post-processi… ▽ More Optical metasurfaces composed of precisely engineered nanostructures have gained significant attention for their ability to manipulate light and implement distinct functionalities based on the properties of the incident field. Computational imaging systems have started harnessing this capability to produce sets of coded measurements that benefit certain tasks when paired with digital post-processing. Inspired by these works, we introduce a new system that uses a birefringent metasurface with a polarizer-mosaicked photosensor to capture four optically-coded measurements in a single exposure. We apply this system to the task of incoherent opto-electronic filtering, where digital spatial-filtering operations are replaced by simpler, per-pixel sums across the four polarization channels, independent of the spatial filter size. In contrast to previous work on incoherent opto-electronic filtering that can realize only one spatial filter, our approach can realize a continuous family of filters from a single capture, with filters being selected from the family by adjusting the post-capture digital summation weights. To find a metasurface that can realize a set of user-specified spatial filters, we introduce a form of gradient descent with a novel regularizer that encourages light efficiency and a high signal-to-noise ratio. We demonstrate several examples in simulation and with fabricated prototypes, including some with spatial filters that have prescribed variations with respect to depth and wavelength. Visit the Project Page at https://deanhazineh.github.io/publications/Multi_Image_Synthesis/MIS_Home.html △ Less

Submitted 11 August, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

Comments: Published in the Proceedings of the 2023 IEEE International Conference of Computational Photography

arXiv:2307.07724 [pdf, other]

doi 10.1103/PRXQuantum.5.010301

Improved Single-Shot Qubit Readout Using Twin RF-SET Charge Correlations

Authors: Santiago Serrano, MengKe Feng, Wee Han Lim, Amanda E. Seedhouse, Tuomo Tanttu, Will Gilbert, Christopher C. Escott, Nikolay V. Abrosimov, Hans-Joachim Pohl, Michael L. W. Thewalt, Fay E. Hudson, Andre Saraiva, Andrew S. Dzurak, Arne Laucht

Abstract: High fidelity qubit readout is critical in order to obtain the thresholds needed to implement quantum error correction protocols and achieve fault-tolerant quantum computing. Large-scale silicon qubit devices will have densely-packed arrays of quantum dots with multiple charge sensors that are, on average, farther away from the quantum dots, entailing a reduction in readout fidelities. Here, we pr… ▽ More High fidelity qubit readout is critical in order to obtain the thresholds needed to implement quantum error correction protocols and achieve fault-tolerant quantum computing. Large-scale silicon qubit devices will have densely-packed arrays of quantum dots with multiple charge sensors that are, on average, farther away from the quantum dots, entailing a reduction in readout fidelities. Here, we present a readout technique that enhances the readout fidelity in a linear SiMOS 4-dot array by amplifying correlations between a pair of single-electron transistors, known as a twin SET. By recording and subsequently correlating the twin SET traces as we modulate the dot detuning across a charge transition, we demonstrate a reduction in the charge readout infidelity by over one order of magnitude compared to traditional readout methods. We also study the spin-to-charge conversion errors introduced by the modulation technique, and conclude that faster modulation frequencies avoid relaxation-induced errors without introducing significant spin flip errors, favouring the use of the technique at short integration times. This method not only allows for faster and higher fidelity qubit measurements, but it also enhances the signal corresponding to charge transitions that take place farther away from the sensors, enabling a way to circumvent the reduction in readout fidelities in large arrays of qubits. △ Less

Submitted 15 July, 2023; originally announced July 2023.

Journal ref: PRX QUANTUM 5, 010301 (2024)

arXiv:2307.04550 [pdf, other]

Gradient Surgery for One-shot Unlearning on Generative Model

Authors: Seohui Bae, Seoyoon Kim, Hyemin Jung, Woohyung Lim

Abstract: Recent regulation on right-to-be-forgotten emerges tons of interest in unlearning pre-trained machine learning models. While approximating a straightforward yet expensive approach of retrain-from-scratch, recent machine unlearning methods unlearn a sample by updating weights to remove its influence on the weight parameters. In this paper, we introduce a simple yet effective approach to remove a da… ▽ More Recent regulation on right-to-be-forgotten emerges tons of interest in unlearning pre-trained machine learning models. While approximating a straightforward yet expensive approach of retrain-from-scratch, recent machine unlearning methods unlearn a sample by updating weights to remove its influence on the weight parameters. In this paper, we introduce a simple yet effective approach to remove a data influence on the deep generative model. Inspired by works in multi-task learning, we propose to manipulate gradients to regularize the interplay of influence among samples by projecting gradients onto the normal plane of the gradients to be retained. Our work is agnostic to statistics of the removal samples, outperforming existing baselines while providing theoretical analysis for the first time in unlearning a generative model. △ Less

Submitted 18 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

Comments: ICML 2023 Workshop on Generative AI & Law

arXiv:2306.04628 [pdf, other]

Systematic Analysis of Music Representations from BERT

Authors: Sangjun Han, Hyeongrae Ihm, Woohyung Lim

Abstract: There have been numerous attempts to represent raw data as numerical vectors that effectively capture semantic and contextual information. However, in the field of symbolic music, previous works have attempted to validate their music embeddings by observing the performance improvement of various fine-tuning tasks. In this work, we directly analyze embeddings from BERT and BERT with contrastive lea… ▽ More There have been numerous attempts to represent raw data as numerical vectors that effectively capture semantic and contextual information. However, in the field of symbolic music, previous works have attempted to validate their music embeddings by observing the performance improvement of various fine-tuning tasks. In this work, we directly analyze embeddings from BERT and BERT with contrastive learning trained on bar-level MIDI, inspecting their musical information that can be obtained from MIDI events. We observe that the embeddings exhibit distinct characteristics of information depending on the contrastive objectives and the choice of layers. Our code is available at https://github.com/sjhan91/MusicBERT. △ Less

Submitted 6 June, 2023; originally announced June 2023.

arXiv:2305.14193 [pdf, ps, other]

Cohomological $χかい$-dependence of ring structure for the moduli of one-dimensional sheaves on $\mathbb{P}^2$

Authors: Woonam Lim, Miguel Moreira, Weite Pi

Abstract: We prove that the cohomology rings of the moduli space $M_{d,χかい}$ of one-dimensional sheaves on the projective plane are not isomorphic for general different choices of the Euler characteristics. This stands in contrast to the $χかい$-independence of the Betti numbers of these moduli spaces. As a corollary, we deduce that $M_{d,χかい}$ are topologically different unless they are related by obvious symmetri… ▽ More We prove that the cohomology rings of the moduli space $M_{d,χかい}$ of one-dimensional sheaves on the projective plane are not isomorphic for general different choices of the Euler characteristics. This stands in contrast to the $χかい$-independence of the Betti numbers of these moduli spaces. As a corollary, we deduce that $M_{d,χかい}$ are topologically different unless they are related by obvious symmetries, strengthening a previous result of Woolf distinguishing them as algebraic varieties. △ Less

Submitted 4 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

Comments: Added Section 1.3 on the structure of the proof; final version to appear in Forum Math. Sigma

Showing 1–50 of 327 results for author: Lim, W