Search | arXiv e-print repository

DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement

Authors: Jia-Wei Liao, Winston Wang, Tzu-Sian Wang, Li-Xuan Peng, Ju-Hsuan Weng, Cheng-Fu Chou, Jun-Cheng Chen

Abstract: With the success of Diffusion Models for image generation, the technologies also have revolutionized the aesthetic Quick Response (QR) code generation. Despite significant improvements in visual attractiveness for the beautified codes, their scannabilities are usually sacrificed and thus hinder their practical uses in real-world scenarios. To address this issue, we propose a novel Diffusion-based QR Code generator (DiffQRCoder) to effectively craft both scannable and visually pleasing QR codes. The proposed approach introduces Scanning-Robust Perceptual Guidance (SRPG), a new diffusion guidance for Diffusion Models to guarantee the generated aesthetic codes to obey the ground-truth QR codes while maintaining their attractiveness during the denoising process. Additionally, we present another post-processing technique, Scanning Robust Manifold Projected Gradient Descent (SR-MPGD), to further enhance their scanning robustness through iterative latent space optimization. With extensive experiments, the results demonstrate that our approach not only outperforms other compared methods in Scanning Success Rate (SSR) with better or comparable CLIP aesthetic score (CLIP-aes.) but also significantly improves the SSR of the ControlNet-only approach from 60% to 99%. The subjective evaluation indicates that our approach achieves promising visual attractiveness to users as well. Finally, even with different scanning angles and the most rigorous error tolerance settings, our approach robustly achieves over 95% SSR, demonstrating its capability for real-world applications.

Submitted 10 September, 2024; originally announced September 2024.

arXiv:2408.13687 [pdf, other]

Quantum error correction below the surface code threshold

Authors: Rajeev Acharya, Laleh Aghababaie-Beni, Igor Aleiner, Trond I. Andersen, Markus Ansmann, Frank Arute, Kunal Arya, Abraham Asfaw, Nikita Astrakhantsev, Juan Atalaya, Ryan Babbush, Dave Bacon, Brian Ballard, Joseph C. Bardin, Johannes Bausch, Andreas Bengtsson, Alexander Bilmes, Sam Blackwell, Sergio Boixo, Gina Bortoli, Alexandre Bourassa, Jenna Bovaird, Leon Brill, Michael Broughton, David A. Browne, et al. (224 additional authors not shown)

Abstract: Quantum error correction provides a path to reach practical quantum computing by combining multiple physical qubits into a logical qubit, where the logical error rate is suppressed exponentially as more qubits are added. However, this exponential suppression only occurs if the physical error rate is below a critical threshold. In this work, we present two surface code memories operating below this threshold: a distance-7 code and a distance-5 code integrated with a real-time decoder. The logical error rate of our larger quantum memory is suppressed by a factor of $\Lambda$ = 2.14 $\pm$ 0.02 when increasing the code distance by two, culminating in a 101-qubit distance-7 code with 0.143% $\pm$ 0.003% error per cycle of error correction. This logical memory is also beyond break-even, exceeding its best physical qubit's lifetime by a factor of 2.4 $\pm$ 0.3. We maintain below-threshold performance when decoding in real time, achieving an average decoder latency of 63 $\mu$s at distance-5 up to a million cycles, with a cycle time of 1.1 $\mu$s. To probe the limits of our error-correction performance, we run repetition codes up to distance-29 and find that logical performance is limited by rare correlated error events occurring approximately once every hour, or 3 $\times$ 10$^9$ cycles. Our results present device performance that, if scaled, could realize the operational requirements of large scale fault-tolerant quantum algorithms.

Submitted 24 August, 2024; originally announced August 2024.

Comments: 10 pages, 4 figures, Supplementary Information
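
As a rough consistency check on the figures quoted in the abstract above, the following Python sketch converts the 3 x 10^9-cycle interval between correlated error events into wall-clock time at the stated 1.1 microsecond cycle time (about 55 minutes, in line with "approximately once every hour"). It also extrapolates the logical error per cycle under the assumption that every increase of the code distance by two divides the error by the same factor Λ. The function names and the extrapolation beyond distance 7 are illustrative assumptions, not results from the paper.

```python
# Figures quoted in the abstract.
CYCLE_TIME_S = 1.1e-6          # error-correction cycle time (1.1 microseconds)
CYCLES_BETWEEN_EVENTS = 3e9    # cycles between rare correlated error events
LAMBDA = 2.14                  # error suppression factor per distance step of two
EPS_D7 = 0.143e-2              # logical error per cycle at distance 7 (0.143%)


def seconds_between_events(cycles=CYCLES_BETWEEN_EVENTS, cycle_time=CYCLE_TIME_S):
    """Wall-clock time spanned by the given number of cycles."""
    return cycles * cycle_time


def extrapolated_error(distance, eps_ref=EPS_D7, d_ref=7, lam=LAMBDA):
    """Logical error per cycle at another code distance.

    Assumption (hypothetical extrapolation): the suppression factor stays
    constant, so eps(d) = eps_ref / lam ** ((d - d_ref) / 2).
    """
    return eps_ref / lam ** ((distance - d_ref) / 2)


if __name__ == "__main__":
    print(f"minutes between correlated events: {seconds_between_events() / 60:.1f}")
    for d in (5, 7, 9, 11):
        print(f"distance {d}: assumed error per cycle {extrapolated_error(d):.3e}")
```

The extrapolated values for distances 9 and 11 only illustrate what constant suppression would imply; the abstract reports measurements only up to distance 7 for the surface code.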

arXiv:2408.11810 [pdf, other]

Pixel Is Not A Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models

Authors: Chun-Yen Shih, Li-Xuan Peng, Jia-Wei Liao, Ernie Chu, Cheng-Fu Chou, Jun-Cheng Chen

Abstract: Diffusion Models have emerged as powerful generative models for high-quality image synthesis, with many subsequent image editing techniques based on them. However, the ease of text-based image editing introduces significant risks, such as malicious editing for scams or intellectual property infringement. Previous works have attempted to safeguard images from diffusion-based editing by adding imperceptible perturbations. These methods are costly and specifically target prevalent Latent Diffusion Models (LDMs), while Pixel-domain Diffusion Models (PDMs) remain largely unexplored and robust against such attacks. Our work addresses this gap by proposing a novel attacking framework with a feature representation attack loss that exploits vulnerabilities in denoising UNets and a latent optimization strategy to enhance the naturalness of protected images. Extensive experiments demonstrate the effectiveness of our approach in attacking dominant PDM-based editing methods (e.g., SDEdit) while maintaining reasonable protection fidelity and robustness against common defense methods. Additionally, our framework is extensible to LDMs, achieving comparable performance to existing approaches.

Submitted 21 August, 2024; originally announced August 2024.

arXiv:2407.17724 [pdf, other]

Monte Carlo studies of quantum cosmology by the generalized Lefschetz thimble method

Authors: Chien-Yu Chou, Jun Nishimura

Abstract: Quantum cosmology aims at elucidating the beginning of our Universe. Back in early 80's, Vilenkin and Hartle-Hawking put forward the "tunneling from nothing" and "no boundary" proposals. Recently there has been renewed interest in this subject from the viewpoint of defining the oscillating path integral for Lorentzian quantum gravity using the Picard-Lefschetz theory. Aiming at going beyond the mini-superspace and saddle-point approximations, we perform Monte Carlo calculations using the generalized Lefschetz thimble method to overcome the sign problem. In particular, we confirm that either Vilenkin or Hartle-Hawking saddle point becomes relevant if one uses the Robin boundary condition depending on its parameter. We also clarify some fundamental issues in quantum cosmology, such as an issue related to the integration domain of the lapse function and an issue related to reading off the real geometry from the complex geometry obtained at the saddle point.

Submitted 2 August, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

Comments: 36 pages, 8 figures (v2) references added

Report number: KEK-TH-2640

arXiv:2407.12867 [pdf, other]

Swift-BAT GUANO follow-up of gravitational-wave triggers in the third LIGO-Virgo-KAGRA observing run

Authors: Gayathri Raman, Samuele Ronchini, James Delaunay, Aaron Tohuvavohu, Jamie A. Kennea, Tyler Parsotan, Elena Ambrosi, Maria Grazia Bernardini, Sergio Campana, Giancarlo Cusumano, Antonino D'Ai, Paolo D'Avanzo, Valerio D'Elia, Massimiliano De Pasquale, Simone Dichiara, Phil Evans, Dieter Hartmann, Paul Kuin, Andrea Melandri, Paul O'Brien, Julian P. Osborne, Kim Page, David M. Palmer, Boris Sbarufatti, Gianpiero Tagliaferri, et al. (1797 additional authors not shown)

Abstract: We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wave Transient Catalogs (GWTC-3). Targeted searches were carried out on the entire GW sample using the maximum--likelihood NITRATES pipeline on the BAT data made available via the GUANO infrastructure. We do not detect any significant electromagnetic emission that is temporally and spatially coincident with any of the GW candidates. We report flux upper limits in the 15-350 keV band as a function of sky position for all the catalog candidates. For GW candidates where the Swift-BAT false alarm rate is less than 10$^{-3}$ Hz, we compute the GW--BAT joint false alarm rate. Finally, the derived Swift-BAT upper limits are used to infer constraints on the putative electromagnetic emission associated with binary black hole mergers.

Submitted 13 July, 2024; originally announced July 2024.

Comments: 50 pages, 10 figures, 4 tables

arXiv:2406.12917 [pdf, other]

doi 10.1117/12.3019835

The Black Hole Explorer: Motivation and Vision

Authors: Michael D. Johnson, Kazunori Akiyama, Rebecca Baturin, Bryan Bilyeu, Lindy Blackburn, Don Boroson, Alejandro Cardenas-Avendano, Andrew Chael, Chi-kwan Chan, Dominic Chang, Peter Cheimets, Cathy Chou, Sheperd S. Doeleman, Joseph Farah, Peter Galison, Ronald Gamble, Charles F. Gammie, Zachary Gelles, Jose L. Gomez, Samuel E. Gralla, Paul Grimes, Leonid I. Gurvits, Shahar Hadar, Kari Haworth, Kazuhiro Hada, et al. (43 additional authors not shown)

Abstract: We present the Black Hole Explorer (BHEX), a mission that will produce the sharpest images in the history of astronomy by extending submillimeter Very-Long-Baseline Interferometry (VLBI) to space. BHEX will discover and measure the bright and narrow "photon ring" that is predicted to exist in images of black holes, produced from light that has orbited the black hole before escaping. This discovery will expose universal features of a black hole's spacetime that are distinct from the complex astrophysics of the emitting plasma, allowing the first direct measurements of a supermassive black hole's spin. In addition to studying the properties of the nearby supermassive black holes M87* and Sgr A*, BHEX will measure the properties of dozens of additional supermassive black holes, providing crucial insights into the processes that drive their creation and growth. BHEX will also connect these supermassive black holes to their relativistic jets, elucidating the power source for the brightest and most efficient engines in the universe. BHEX will address fundamental open questions in the physics and astrophysics of black holes that cannot be answered without submillimeter space VLBI. The mission is enabled by recent technological breakthroughs, including the development of ultra-high-speed downlink using laser communications, and it leverages billions of dollars of existing ground infrastructure. We present the motivation for BHEX, its science goals and associated requirements, and the pathway to launch within the next decade.

Submitted 13 June, 2024; originally announced June 2024.

Comments: Proceedings for SPIE Astronomical Telescopes and Instrumentation

arXiv:2406.10272 [pdf, other]

Connected Speech-Based Cognitive Assessment in Chinese and English

Authors: Saturnino Luz, Sofia De La Fuente Garcia, Fasih Haider, Davida Fromm, Brian MacWhinney, Alyssa Lanzi, Ya-Ning Chang, Chia-Ju Chou, Yi-Chien Liu

Abstract: We present a novel benchmark dataset and prediction tasks for investigating approaches to assess cognitive function through analysis of connected speech. The dataset consists of speech samples and clinical information for speakers of Mandarin Chinese and English with different levels of cognitive impairment as well as individuals with normal cognition. These data have been carefully matched by age and sex by propensity score analysis to ensure balance and representativity in model training. The prediction tasks encompass mild cognitive impairment diagnosis and cognitive test score prediction. This framework was designed to encourage the development of approaches to speech-based cognitive assessment which generalise across languages. We illustrate it by presenting baseline prediction models that employ language-agnostic and comparable features for diagnosis and cognitive test score prediction. The models achieved an unweighted average recall of 59.2% in diagnosis, and a root mean squared error of 2.89 in score prediction.

Submitted 18 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

Comments: To appear in Proceedings of Interspeech 2024

ACM Class: J.3; I.5.4
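
The two evaluation metrics named in the abstract above can be sketched as follows. Unweighted average recall (UAR) is the mean of the per-class recalls, so each class counts equally regardless of how many samples it has, and root mean squared error (RMSE) is the square root of the mean squared difference between predicted and true scores. The function names, class labels, and toy inputs are hypothetical and only illustrate the standard definitions; they are not the paper's data or evaluation code.

```python
import math
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls; every class gets the same weight."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for truth, pred in zip(y_true, y_pred):
        totals[truth] += 1
        if truth == pred:
            hits[truth] += 1
    return sum(hits[c] / totals[c] for c in totals) / len(totals)


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length score sequences."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


if __name__ == "__main__":
    # Made-up labels and scores, for illustration only.
    labels_true = ["mci", "mci", "mci", "control"]
    labels_pred = ["mci", "mci", "control", "control"]
    print(unweighted_average_recall(labels_true, labels_pred))
    print(rmse([27.0, 24.0, 30.0], [26.0, 25.0, 28.0]))
```

UAR is a common choice when diagnostic classes are imbalanced, because plain accuracy can look good for a model that mostly predicts the majority class.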

arXiv:2405.12026 [pdf, other]

Enzymatic cycle-based receivers with high input impedance for approximate maximum a posteriori demodulation of concentration modulated signals

Authors: Chun Tung Chou

Abstract: Molecular communication is a bio-inspired communication paradigm where molecules are used as the information carrier. This paper considers a molecular communication network where the transmitter uses concentration modulated signals for communication. Our focus is to design receivers that can demodulate these signals. We impose three features on our receivers. We want the receivers to use enzymatic cycles as their building blocks, have high input impedance and can work approximately as a maximum a posteriori (MAP) demodulator. No receivers with all these three features exist in the current molecular communication literature. We consider enzymatic cycles because they are a very common class of chemical reactions that are found in living cells. Since a receiver is to be placed in the communication environment, it should ideally have a high input impedance so that it has minimal impact on the environment and on other receivers. Lastly, a MAP receiver has good statistical performance. In this paper, we show how we can use time-scale separation to make an enzymatic cycle to have high input impedance and how the parameters of the enzymatic cycles can be chosen so that the receiver can approximately implement a MAP demodulator. We use simulation to study the performance of this receiver. In particular, we consider an environment with multiple receivers and show that a receiver has little impact on the bit error ratio of a nearby receiver because they have high input impedance.

Submitted 5 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

arXiv:2405.08266 [pdf]

Double orbits of weakly almost periodic functions

Authors: Ching Chou

Abstract: A locally compact group G is called a WS-group if the double orbits of the weakly almost periodic functions on G are relatively weakly compact. It is known that Moore-groups are WS-groups. We will show that if a discrete FC-group is a WS-group then its center is of finite index in G. We will study noncompact locally compact groups with the property that if the double orbits of bounded continuous functions on G are relatively weakly compact then they are relatively norm compact. Examples of such groups include the motion group M(n) and the special linear group SL(n,R).

Submitted 13 May, 2024; originally announced May 2024.

MSC Class: 22D05; 22D15 43A30; 43A60

arXiv:2405.06851 [pdf, other]

Nonlinear classification of neural manifolds with contextual information

Authors: Francesca Mignacco, Chi-Ning Chou, SueYeon Chung

Abstract: Understanding how neural systems efficiently process information through distributed representations is a fundamental challenge at the interface of neuroscience and machine learning. Recent approaches analyze the statistical and geometrical attributes of neural representations as population-level mechanistic descriptors of task implementation. In particular, manifold capacity has emerged as a promising framework linking population geometry to the separability of neural manifolds. However, this metric has been limited to linear readouts. Here, we propose a theoretical framework that overcomes this limitation by leveraging contextual input information. We derive an exact formula for the context-dependent capacity that depends on manifold geometry and context correlations, and validate it on synthetic and real data. Our framework's increased expressivity captures representation untanglement in deep networks at early stages of the layer hierarchy, previously inaccessible to analysis. As context-dependent nonlinearity is ubiquitous in neural systems, our data-driven and theoretically grounded approach promises to elucidate context-dependent computation across scales, datasets, and models.

Submitted 10 May, 2024; originally announced May 2024.

Comments: 5 pages, 5 figures

arXiv:2405.06801 [pdf, other]

doi 10.1109/MNET.2024.3391271

LEO Satellite Network Access in the Wild: Potentials, Experiences, and Challenges

Authors: Sami Ma, Yi Ching Chou, Miao Zhang, Hao Fang, Haoyuan Zhao, Jiangchuan Liu, William I. Atlas

Abstract: In the past three years, working with the Pacific Salmon Foundation and various First Nations groups, we have established Starlink-empowered wild salmon monitoring sites in remote Northern British Columbia, Canada. We report our experiences with the network services in these challenging environments, including deep woods and deep valleys, that lack infrastructural support with some close to Starlink's service boundary at the far north. We assess the portability and mobility of the satellite dishes and the quality of existing network access in underdeveloped countries that Starlink expects to cover. Our experiences suggest that network access based on LEO satellite constellations holds promise but faces hurdles such as energy supply constraints and environmental factors like temperature, precipitation, and solar storms. The presence of wildlife and respecting local residents' culture and heritage pose further complications. We envision several technical solutions addressing the challenges and believe that further regulations will be necessary.

Submitted 10 May, 2024; originally announced May 2024.

Comments: 8 pages, 6 figures

ACM Class: C.2.1

arXiv:2405.02474 [pdf, other]

Nonlinear magnetic sensing with hybrid nitrogen-vacancy/magnon systems

Authors: Zhongqiang Hu, Zhiping He, Qiuyuan Wang, Chung-Tao Chou, Justin T. Hou, Luqiao Liu

Abstract: Magnetic sensing beyond linear regime could broaden the frequency range of detectable magnetic fields, which is crucial to various microwave and quantum applications. Recently, nonlinear interactions in diamond nitrogen-vacancy (NV) centers, one of the most extensively studied quantum magnetic sensors, are proposed to realize magnetic sensing across arbitrary frequencies. In this work, we enhance these capabilities by exploiting the nonlinear spin dynamics in hybrid systems of NV centers and ferri- or ferro-magnetic (FM) thin films. We study the frequency mixing effect in the hybrid NV/magnon systems, and demonstrate that the introduction of FM not only amplifies the intensity of nonlinear resonance signals that are intrinsic to NV spins, but also enables novel frequency mixings through parametric pumping and nonlinear magnon scattering effects. The discovery and understanding of the magnetic nonlinearities in hybrid NV/magnon systems position them as a prime candidate for magnetic sensing with a broad frequency range and high tunability, particularly meaningful for nanoscale, dynamical, and non-invasive materials characterization.

Submitted 3 May, 2024; originally announced May 2024.

arXiv:2404.15252 [pdf, other]

Source-free Domain Adaptation for Video Object Detection Under Adverse Image Conditions

Authors: Xingguang Zhang, Chih-Hsien Chou

Abstract: When deploying pre-trained video object detectors in real-world scenarios, the domain gap between training and testing data caused by adverse image conditions often leads to performance degradation. Addressing this issue becomes particularly challenging when only the pre-trained model and degraded videos are available. Although various source-free domain adaptation (SFDA) methods have been proposed for single-frame object detectors, SFDA for video object detection (VOD) remains unexplored. Moreover, most unsupervised domain adaptation works for object detection rely on two-stage detectors, while SFDA for one-stage detectors, which are more vulnerable to fine-tuning, is not well addressed in the literature. In this paper, we propose Spatial-Temporal Alternate Refinement with Mean Teacher (STAR-MT), a simple yet effective SFDA method for VOD. Specifically, we aim to improve the performance of the one-stage VOD method, YOLOV, under adverse image conditions, including noise, air turbulence, and haze. Extensive experiments on the ImageNetVOD dataset and its degraded versions demonstrate that our method consistently improves video object detection performance in challenging imaging conditions, showcasing its potential for real-world applications.

Submitted 23 April, 2024; originally announced April 2024.

Comments: accepted by the UG2+ workshop at CVPR 2024

arXiv:2404.08772 [pdf, other]

Nonlinear Wave-Spin Interactions in Nitrogen-Vacancy Centers

Authors: Zhongqiang Hu, Qiuyuan Wang, Chung-Tao Chou, Justin T. Hou, Zhiping He, Luqiao Liu

Abstract: Nonlinear phenomena represent one of the central topics in the study of wave-matter interactions and constitute the key blocks for various applications in optical communication, computing, sensing, and imaging. In this work, we show that by employing the interactions between microwave photons and electron spins of nitrogen-vacancy (NV) centers, one can realize a variety of nonlinear effects, ranging from the resonance at the sum or difference frequency of two or more waves to electromagnetically induced transparency from the interference between spin transitions. We further verify the phase coherence through two-photon Rabi-oscillation measurements. The highly sensitive, optically detected NV-center dynamics not only provides a platform for studying magnetically induced nonlinearities but also promises novel functionalities in quantum control and quantum sensing.

Submitted 12 April, 2024; originally announced April 2024.

Comments: 15 pages and 10 figures

arXiv:2404.04248 [pdf, other]

doi 10.3847/2041-8213/ad5beb

Observation of Gravitational Waves from the Coalescence of a $2.5\text{-}4.5~M_\odot$ Compact Object and a Neutron Star

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, S. Akçay, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, et al. (1771 additional authors not shown)

Abstract: We report the observation of a coalescing compact binary with component masses $2.5\text{-}4.5~M_\odot$ and $1.2\text{-}2.0~M_\odot$ (all measurements quoted at the 90% credible level). The gravitational-wave signal GW230529_181500 was observed during the fourth observing run of the LIGO-Virgo-KAGRA detector network on 2023 May 29 by the LIGO Livingston Observatory. The primary component of the source has a mass less than $5~M_\odot$ at 99% credibility. We cannot definitively determine from gravitational-wave data alone whether either component of the source is a neutron star or a black hole. However, given existing estimates of the maximum neutron star mass, we find the most probable interpretation of the source to be the coalescence of a neutron star with a black hole that has a mass between the most massive neutron stars and the least massive black holes observed in the Galaxy. We provisionally estimate a merger rate density of $55^{+127}_{-47}~\text{Gpc}^{-3}\,\text{yr}^{-1}$ for compact binary coalescences with properties similar to the source of GW230529_181500; assuming that the source is a neutron star-black hole merger, GW230529_181500-like sources constitute about 60% of the total merger rate inferred for neutron star-black hole coalescences. The discovery of this system implies an increase in the expected rate of neutron star-black hole mergers with electromagnetic counterparts and provides further evidence for compact objects existing within the purported lower mass gap.

Submitted 26 July, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

Comments: 45 pages (10 pages author list, 13 pages main text, 1 page acknowledgements, 13 pages appendices, 8 pages bibliography), 17 figures, 16 tables. Update to match version published in The Astrophysical Journal Letters. Data products available from https://zenodo.org/records/10845779

Report number: LIGO-P2300352

Journal ref: ApJL 970, L34 (2024)

arXiv:2403.16451 [pdf, other]

DeepMachining: Online Prediction of Machining Errors of Lathe Machines

Authors: Xiang-Li Lu, Hwai-Jung Hsu, Che-Wei Chou, H. T. Kung, Chen-Hsin Lee, Sheng-Mao Cheng

Abstract: We describe DeepMachining, a deep learning-based AI system for online prediction of machining errors of lathe machine operations. We have built and evaluated DeepMachining based on manufacturing data from factories. Specifically, we first pretrain a deep learning model for a given lathe machine's operations to learn the salient features of machining states. Then, we fine-tune the pretrained model to adapt to specific machining tasks. We demonstrate that DeepMachining achieves high prediction accuracy for multiple tasks that involve different workpieces and cutting tools. To the best of our knowledge, this work is one of the first factory experiments using pre-trained deep-learning models to predict machining errors of lathe machines.

Submitted 28 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

arXiv:2403.15878 [pdf, other]

Diffusion-based Aesthetic QR Code Generation via Scanning-Robust Perceptual Guidance

Authors: Jia-Wei Liao, Winston Wang, Tzu-Sian Wang, Li-Xuan Peng, Cheng-Fu Chou, Jun-Cheng Chen

Abstract: QR codes, prevalent in daily applications, lack visual appeal due to their conventional black-and-white design. Integrating aesthetics while maintaining scannability poses a challenge. In this paper, we introduce a novel diffusion-model-based aesthetic QR code generation pipeline, utilizing pre-trained ControlNet and guided iterative refinement via a novel classifier guidance (SRG) based on the proposed Scanning-Robust Loss (SRL) tailored with QR code mechanisms, which ensures both aesthetics and scannability. To further improve the scannability while preserving aesthetics, we propose a two-stage pipeline with Scanning-Robust Perceptual Guidance (SRPG). Moreover, we can further enhance the scannability of the generated QR code by post-processing it through the proposed Scanning-Robust Projected Gradient Descent (SRPGD) post-processing technique based on SRL with proven convergence. With extensive quantitative, qualitative, and subjective experiments, the results demonstrate that the proposed approach can generate diverse aesthetic QR codes with flexibility in detail. In addition, our pipelines outperform existing models in terms of Scanning Success Rate (SSR), at 86.67% (+40%), with comparable aesthetic scores. The pipeline combined with SRPGD further achieves 96.67% (+50%). Our code will be available at https://github.com/jwliao1209/DiffQRCode.

Submitted 23 March, 2024; originally announced March 2024.

arXiv:2403.12186 [pdf, ps, other]

Grothendieck polynomials of inverse fireworks permutations

Authors: Chen-An Chou, Tianyi Yu

Abstract: Pipedreams are combinatorial objects that compute Grothendieck polynomials. We introduce a new combinatorial object that naturally recasts the pipedream formula. From this, we obtain the first direct combinatorial formula for the top degree components of Grothendieck polynomials, also known as the Castelnuovo-Mumford polynomials. We also prove the inverse fireworks case of a conjecture of Mészáros, Setiabrata, and St. Dizier on the support of Grothendieck polynomials.

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2403.07225 [pdf, other]

Stereo-NEC: Enhancing Stereo Visual-Inertial SLAM Initialization with Normal Epipolar Constraints

Authors: Weihan Wang, Chieh Chou, Ganesh Sevagamoorthy, Kevin Chen, Zheng Chen, Ziyue Feng, Youjie Xia, Feiyang Cai, Yi Xu, Philippos Mordohai

Abstract: We propose an accurate and robust initialization approach for stereo visual-inertial SLAM systems. Unlike the current state-of-the-art method, which heavily relies on the accuracy of a pure visual SLAM system to estimate inertial variables without updating camera poses, potentially compromising accuracy and robustness, our approach offers a different solution. We realize the crucial impact of precise gyroscope bias estimation on rotation accuracy. This, in turn, affects trajectory accuracy due to the accumulation of translation errors. To address this, we first independently estimate the gyroscope bias and use it to formulate a maximum a posteriori problem for further refinement. After this refinement, we proceed to update the rotation estimation by performing IMU integration with gyroscope bias removed from gyroscope measurements. We then leverage robust and accurate rotation estimates to enhance translation estimation via 3-DoF bundle adjustment. Moreover, we introduce a novel approach for determining the success of the initialization by evaluating the residual of the normal epipolar constraint. Extensive evaluations on the EuRoC dataset illustrate that our method excels in accuracy and robustness. It outperforms ORB-SLAM3, the current leading stereo visual-inertial initialization method, in terms of absolute trajectory error and relative rotation error, while maintaining competitive computational speed. Notably, even with 5 keyframes for initialization, our method consistently surpasses the state-of-the-art approach using 10 keyframes in rotation accuracy.

Submitted 11 March, 2024; originally announced March 2024.

arXiv:2403.03004 [pdf, other]

Ultralight vector dark matter search using data from the KAGRA O3GK run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, et al. (1778 additional authors not shown)

Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM.

Submitted 5 March, 2024; originally announced March 2024.

Comments: 20 pages, 5 figures

Report number: LIGO-P2300250

arXiv:2402.17776 [pdf, other]

A New Architecture for Energy Efficient Fault Detection Using Energy Harvesters

Authors: Dongti Zhang, Patricio Peralta-Braz, Chun Tung Chou, Elena Atroshchenko, Mehrisadat Makki Alamdari, Mahbub Hassan

Abstract: The current battery-powered fault detection system for vibration monitoring has a rather limited lifetime. This is because the high-frequency sampling (typically tens of kilohertz) required for vibration monitoring results in high energy consumption in both the analog-to-digital converter (ADC) and wireless transmissions. This paper proposes a new fault detection architecture that can significantly reduce the energy consumption of the ADC and wireless transmission. Our inspiration for the new architecture is based on the observation that the many tens of thousands of data samples collected for fault detection are ultimately transformed into a small number of features. If we can generate these features directly without high-frequency sampling, then we can avoid the energy cost for ADC and wireless transmissions. We propose to use piezoelectric energy harvesters (which can be designed to have different frequency responses) and integrators to obtain these features in an energy-efficient manner. By using a publicly available data set for ball bearing fault detection (which was originally sampled at 51.2 kHz) and piezoelectric energy harvester models, we can produce features, which when sampled at 0.33 Hz, give a fault detection accuracy of 89% while reducing the sampling requirement by 4 orders-of-magnitude.

Submitted 21 February, 2024; originally announced February 2024.

Comments: 8 pages, 8 figures

arXiv:2401.17244 [pdf, other]

LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation

Authors: Yuan Chiang, Elvis Hsieh, Chia-Hong Chou, Janosh Riebesell

Abstract: Reducing hallucination of Large Language Models (LLMs) is imperative for use in the sciences, where reliability and reproducibility are crucial. However, LLMs inherently lack long-term memory, making it a nontrivial, ad hoc, and inevitably biased task to fine-tune them on domain-specific literature and data. Here we introduce LLaMP, a multimodal retrieval-augmented generation (RAG) framework of hierarchical reasoning-and-acting (ReAct) agents that can dynamically and recursively interact with computational and experimental data on Materials Project (MP) and run atomistic simulations via high-throughput workflow interface. Without fine-tuning, LLaMP demonstrates strong tool usage ability to comprehend and integrate various modalities of materials science concepts, fetch relevant data stores on the fly, process higher-order data (such as crystal structure and elastic tensor), and streamline complex tasks in computational materials and chemistry. We propose a simple metric combining uncertainty and confidence estimates to evaluate the self-consistency of responses by LLaMP and vanilla LLMs. Our benchmark shows that LLaMP effectively mitigates the intrinsic bias in LLMs, counteracting the errors on bulk moduli, electronic bandgaps, and formation energies that seem to derive from mixed data sources. We also demonstrate LLaMP's capability to edit crystal structures and run annealing molecular dynamics simulations using pre-trained machine-learning force fields. The framework offers an intuitive and nearly hallucination-free approach to exploring and scaling materials informatics, and establishes a pathway for knowledge distillation and fine-tuning other language models. Code and live demo are available at https://github.com/chiang-yuan/llamp

Submitted 2 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

Comments: 31 pages, 5 figures

arXiv:2401.16945 [pdf, other]

Online Resource Allocation with Non-Stationary Customers

Authors: Xiaoyue Zhang, Hanzhang Qin, Mabel C. Chou

Abstract: We propose a novel algorithm for online resource allocation with non-stationary customer arrivals and unknown click-through rates. We assume multiple types of customers arrive in a nonstationary stochastic fashion, with unknown arrival rates in each period, and that customers' click-through rates are unknown and can only be learned online. By leveraging results from the stochastic contextual bandit with knapsack and online matching with adversarial arrivals, we develop an online scheme to allocate the resources to nonstationary customers. We prove that under mild conditions, our scheme achieves a ``best-of-both-world'' result: the scheme has a sublinear regret when the customer arrivals are near-stationary, and enjoys an optimal competitive ratio under general (non-stationary) customer arrival distributions. Finally, we conduct extensive numerical experiments to show our approach generates near-optimal revenues for all different customer scenarios.

Submitted 2 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

arXiv:2401.02905 [pdf, other]

H2G2-Net: A Hierarchical Heterogeneous Graph Generative Network Framework for Discovery of Multi-Modal Physiological Responses

Authors: Haidong Gu, Nathan Gaw, Yinan Wang, Chancellor Johnstone, Christine Beauchene, Sophia Yuditskaya, Hrishikesh Rao, Chun-An Chou

Abstract: Discovering human cognitive and emotional states using multi-modal physiological signals draws attention across various research applications. Physiological responses of the human body are influenced by human cognition and commonly used to analyze cognitive states. From a network science perspective, the interactions of these heterogeneous physiological modalities in a graph structure may provide insightful information to support prediction of cognitive states. However, there is no clue to derive exact connectivity between heterogeneous modalities and there exists a hierarchical structure of sub-modalities. Existing graph neural networks are designed to learn on non-hierarchical homogeneous graphs with pre-defined graph structures; they failed to learn from hierarchical, multi-modal physiological data without a pre-defined graph structure. To this end, we propose a hierarchical heterogeneous graph generative network (H2G2-Net) that automatically learns a graph structure without domain knowledge, as well as a powerful representation on the hierarchical heterogeneous graph in an end-to-end fashion. We validate the proposed method on the CogPilot dataset that consists of multi-modal physiological signals. Extensive experiments demonstrate that our proposed method outperforms the state-of-the-art GNNs by 5%-20% in prediction accuracy.

Submitted 5 January, 2024; originally announced January 2024.

Comments: Paper accepted in Human-Centric Representation Learning workshop at AAAI 2024 (https://hcrl-workshop.github.io/2024/)

arXiv:2312.17104 [pdf, other]

doi 10.1126/science.ado1001

Quantum state tracking and control of a single molecular ion in a thermal environment

Authors: Yu Liu, Julian Schmidt, Zhimin Liu, David R. Leibrandt, Dietrich Leibfried, Chin-wen Chou

Abstract: Understanding molecular state evolution is central to many disciplines, including molecular dynamics, precision measurement, and molecule-based quantum technology. Details of the evolution are obscured when observing a statistical ensemble of molecules. Here, we reported real-time observations of thermal radiation-driven transitions between individual states ("jumps") of a single molecule. We reversed these "jumps" through microwave-driven transitions, resulting in a twentyfold improvement in the time the molecule dwells in a chosen state. The measured transition rates showed anisotropy in the thermal environment, pointing to the possibility of using single molecules as in-situ probes for the strengths of ambient fields. Our approaches for state detection and manipulation could apply to a wide range of species, facilitating their uses in fields including quantum science, molecular physics, and ion-neutral chemistry.

Submitted 1 August, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

Comments: 37 pages, 8 figures

Journal ref: Science, ado1001 (2024)

arXiv:2312.14285 [pdf, other]

Probing Biological and Artificial Neural Networks with Task-dependent Neural Manifolds

Authors: Michael Kuoch, Chi-Ning Chou, Nikhil Parthasarathy, Joel Dapello, James J. DiCarlo, Haim Sompolinsky, SueYeon Chung

Abstract: Recently, growth in our understanding of the computations performed in both biological and artificial neural networks has largely been driven by either low-level mechanistic studies or global normative approaches. However, concrete methodologies for bridging the gap between these levels of abstraction remain elusive. In this work, we investigate the internal mechanisms of neural networks through the lens of neural population geometry, aiming to provide understanding at an intermediate level of abstraction, as a way to bridge that gap. Utilizing manifold capacity theory (MCT) from statistical physics and manifold alignment analysis (MAA) from high-dimensional statistics, we probe the underlying organization of task-dependent manifolds in deep neural networks and macaque neural recordings. Specifically, we quantitatively characterize how different learning objectives lead to differences in the organizational strategies of these models and demonstrate how these geometric analyses are connected to the decodability of task-relevant information. These analyses present a strong direction for bridging mechanistic and normative theories in neural networks through neural population geometry, potentially opening up many future research avenues in both machine learning and neuroscience.

Submitted 21 December, 2023; originally announced December 2023.

Comments: To appear in the proceedings of the Conference on Parsimony and Learning (CPAL) 2024

arXiv:2312.01250 [pdf, ps, other]

Constructing maximal pipedreams of double Grothendieck polynomials

Authors: Chen-An Chou, Tianyi Yu

Abstract: Pechenik, Speyer and Weigandt defined a statistic $\mathsf{rajcode}(\cdot)$ on permutations which characterizes the leading monomial in top degree components of double Grothendieck polynomials. Their proof is combinatorial: They showed there exists a unique pipedream of a permutation $w$ with row weight $\mathsf{rajcode}(w)$ and column weight $\mathsf{rajcode}(w^{-1})$. They proposed the problem of finding a ``direct recipe'' for this pipedream. We solve this problem by providing an algorithm that constructs this pipedream via ladder moves.

Submitted 2 December, 2023; originally announced December 2023.

arXiv:2311.11265 [pdf, other]

ZZPolyCalc: An open-source code with fragment caching for determination of Zhang-Zhang polynomials of carbon nanostructures

Authors: Rafał Podeszwa, Henryk A. Witek, Chien-Pin Chou

Abstract: Determination of topological invariants of graphene flakes, nanotubes, and fullerenes constitutes a challenging task due to its time-intensive nature and exponential scaling. The invariants can be organized in a form of a combinatorial polynomial commonly known as the Zhang-Zhang (ZZ) polynomial or the Clar covering polynomial. We report here a computer program, ZZPolyCalc, specifically designed to compute ZZ polynomials of large carbon nanostructures. The curse of exponential scaling is avoided for a broad class of nanostructures by employing a sophisticated bookkeeping algorithm, in which each fragment appearing in the recursive decomposition is stored in the cache repository of molecular fragments indexed by a hash of the corresponding adjacency matrix. Although exponential scaling persists for the remaining nanostructures, the computational time is reduced by a few orders of magnitude owing to efficient use of hash-based fragment bookkeeping. The provided benchmark timings show that ZZPolyCalc allows for treating much larger carbon nanostructures than previously envisioned.

Submitted 19 November, 2023; originally announced November 2023.

Comments: 8 pages, 7 figures; submitted to "Comput. Phys. Commun."

arXiv:2311.05477 [pdf, other]

Using ResNet to Utilize 4-class T2-FLAIR Slice Classification Based on the Cholinergic Pathways Hyperintensities Scale for Pathological Aging

Authors: Wei-Chun Kevin Tsai, Yi-Chien Liu, Ming-Chun Yu, Chia-Ju Chou, Sui-Hing Yan, Yang-Teng Fan, Yan-Hsiang Huang, Yen-Ling Chiu, Yi-Fang Chuang, Ran-Zan Wang, Yao-Chia Shih

Abstract: The Cholinergic Pathways Hyperintensities Scale (CHIPS) is a visual rating scale used to assess the extent of cholinergic white matter hyperintensities in T2-FLAIR images, serving as an indicator of dementia severity. However, the manual selection of four specific slices for rating throughout the entire brain is a time-consuming process. Our goal was to develop a deep learning-based model capable of automatically identifying the four slices relevant to CHIPS. To achieve this, we trained a 4-class slice classification model (BSCA) using the ADNI T2-FLAIR dataset (N=150) with the assistance of ResNet. Subsequently, we tested the model's performance on a local dataset (N=30). The results demonstrated the efficacy of our model, with an accuracy of 99.82% and an F1-score of 99.83%. This achievement highlights the potential impact of BSCA as an automatic screening tool, streamlining the selection of four specific T2-FLAIR slices that encompass white matter landmarks along the cholinergic pathways. Clinicians can leverage this tool to assess the risk of clinical dementia development efficiently.

Submitted 11 September, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

Comments: 8 pages, 2 figures, 2 tables

arXiv:2311.03285 [pdf, other]

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Authors: Ying Sheng, Shiyi Cao, Dacheng Li, Coleman Hooper, Nicholas Lee, Shuo Yang, Christopher Chou, Banghua Zhu, Lianmin Zheng, Kurt Keutzer, Joseph E. Gonzalez, Ion Stoica

Abstract: The "pretrain-then-finetune" paradigm is commonly adopted in the deployment of large language models. Low-Rank Adaptation (LoRA), a parameter-efficient fine-tuning method, is often employed to adapt a base model to a multitude of tasks, resulting in a substantial collection of LoRA adapters derived from one base model. We observe that this paradigm presents significant opportunities for batched inference during serving. To capitalize on these opportunities, we present S-LoRA, a system designed for the scalable serving of many LoRA adapters. S-LoRA stores all adapters in the main memory and fetches the adapters used by the currently running queries to the GPU memory. To efficiently use the GPU memory and reduce fragmentation, S-LoRA proposes Unified Paging. Unified Paging uses a unified memory pool to manage dynamic adapter weights with different ranks and KV cache tensors with varying sequence lengths. Additionally, S-LoRA employs a novel tensor parallelism strategy and highly optimized custom CUDA kernels for heterogeneous batching of LoRA computation. Collectively, these features enable S-LoRA to serve thousands of LoRA adapters on a single GPU or across multiple GPUs with a small overhead. Compared to state-of-the-art libraries such as HuggingFace PEFT and vLLM (with naive support of LoRA serving), S-LoRA can improve the throughput by up to 4 times and increase the number of served adapters by several orders of magnitude. As a result, S-LoRA enables scalable serving of many task-specific fine-tuned models and offers the potential for large-scale customized fine-tuning services. The code is available at https://github.com/S-LoRA/S-LoRA

Submitted 5 June, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

arXiv:2310.20539 [pdf, other]

The Computational Lens: from Quantum Physics to Neuroscience

Authors: Chi-Ning Chou

Abstract: Two transformative waves of computing have redefined the way we approach science. The first wave came with the birth of the digital computer, which enabled scientists to numerically simulate their models and analyze massive datasets. This technological breakthrough led to the emergence of many sub-disciplines bearing the prefix "computational" in their names. Currently, we are in the midst of the second wave, marked by the remarkable advancements in artificial intelligence. From predicting protein structures to classifying galaxies, the scope of its applications is vast, and there can only be more awaiting us on the horizon. While these two waves influence scientific methodology at the instrumental level, in this dissertation, I will present the computational lens in science, aiming at the conceptual level. Specifically, the central thesis posits that computation serves as a convenient and mechanistic language for understanding and analyzing information processing systems, offering the advantages of composability and modularity. This dissertation begins with an illustration of the blueprint of the computational lens, supported by a review of relevant previous work. Subsequently, I will present my own works in quantum physics and neuroscience as concrete examples. In the concluding chapter, I will contemplate the potential of applying the computational lens across various scientific fields, in a way that can provide significant domain insights, and discuss potential future directions.

Submitted 31 October, 2023; originally announced October 2023.

Comments: PhD thesis, Harvard University, Cambridge, Massachusetts, USA. 2023. Some chapters report joint work

arXiv:2309.17445 [pdf]

The characteristics of LK-99 by Cu$_2$S removal using ammonia solution: A diamagnetic semiconductor

Authors: Zhujialei Lei, Chin-Wei Lin, I-Nan Chen, Chun-Tse Chou, Li-Min Wang

Abstract: In this study, we re-evaluated the superconducting properties of LK-99. The LK-99 samples were synthesized using the process proposed by the original Korean team. Additionally, we examined whether the results of the Korean team are related to Cu$_2$S by using ammonia solution (NH$_3$-H$_2$O) to remove Cu$_2$S. Through x-ray diffraction (XRD) analysis, a distinct Cu$_2$S phase was identified in the LK-99 samples. A subsequent treatment using an ammonia solution effectively eliminated this phase. The appearance of blue Cu$^{2+}$ ions in the solution and the elimination of the Cu$_2$S peak in XRD support the conclusion. The magnetic and electrical properties of LK-99 with and without Cu$_2$S suggest that the superconducting-like behavior in LK-99 predominantly arises from a transition in resistivity due to the influence of Cu$_2$S. As such, LK-99 is better classified as a diamagnetic semiconductor than a room-temperature superconductor. Room-temperature superconductors still require further research.

Submitted 20 August, 2023; originally announced September 2023.

arXiv:2308.07941 [pdf, other]

doi 10.1021/acs.chemmater.3c02054

Phase Stability of Lead Phosphate Apatite Pb$_{10-x}$Cu$_{x}$(PO$_{4}$)$_{6}$O, Pb$_{10-x}$Cu$_{x}$(PO$_{4}$)$_{6}$(OH)$_{2}$, and Pb$_{8}$Cu$_{2}$(PO$_{4}$)$_{6}$

Authors: Jiahong Shen, Dale Gaines II, Shima Shahabfar, Zhi Li, Dohun Kang, Sean Griesemer, Adolfo Salgado-Casanova, Tzu-chen Liu, Chang-Ti Chou, Yi Xia, Chris Wolverton

Abstract: Recently, Cu-substituted lead apatite LK-99 was reported to have room-temperature ambient-pressure superconductivity. Here we utilize density functional theory (DFT) total energy and harmonic phonon calculations to investigate the thermodynamic and dynamic stability of two lead phosphate apatites in their pure and Cu-substituted structures. Though Pb$_{10}$(PO$_4$)$_6$O and Pb$_{10}$(PO$_4$)$_6$(OH)$_2$ are found to be thermodynamically stable (i.e., on the T=0K ground state convex hull), their Cu-substituted counterparts are above the convex hull. Harmonic phonon calculations reveal dynamic instabilities in all four of these structures. Oxygen vacancy formation energies demonstrate that the addition of Cu dopant substituting for Pb increases the likelihood of the formation of oxygen vacancies on the anion site. We propose a new possible phase in this system, Pb$_8$Cu$_2$(PO$_4$)$_6$, where two monovalent Cu atoms are substituted for two Pb(1) atoms and the anion oxygen is removed. We also propose several reaction pathways for Pb$_9$Cu(PO$_4$)$_6$O and Pb$_8$Cu$_2$(PO$_4$)$_6$, and found that both of these two structures are likely to be synthesized under a 1:1 ratio of reactants Pb$_2$SO$_5$ and Cu$_3$P. Our work provides a thorough foundation for the thermodynamic and dynamic stabilities of LK-99 related compounds and we propose several possible novel synthesis reaction pathways and a new predicted structure for future studies.

Submitted 14 August, 2023; originally announced August 2023.

arXiv:2308.03822 [pdf, other]

Search for Eccentric Black Hole Coalescences during the Third Observing Run of LIGO and Virgo

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, et al. (1750 additional authors not shown)

Abstract: Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effects of eccentricity. Here, we present observational results for a waveform-independent search sensitive to eccentric black hole coalescences, covering the third observing run (O3) of the LIGO and Virgo detectors. We identified no new high-significance candidates beyond those that were already identified with searches focusing on quasi-circular binaries. We determine the sensitivity of our search to high-mass (total mass $M>70$ $M_\odot$) binaries covering eccentricities up to 0.3 at 15 Hz orbital frequency, and use this to compare model predictions to search results. Assuming all detections are indeed quasi-circular, for our fiducial population model, we place an upper limit for the merger rate density of high-mass binaries with eccentricities $0 < e \leq 0.3$ at $0.33$ Gpc$^{-3}$ yr$^{-1}$ at 90\% confidence level.

Submitted 7 August, 2023; originally announced August 2023.

Comments: 24 pages, 5 figures

Report number: LIGO-P2300080

arXiv:2306.17550 [pdf, other]

TTSWING: a Dataset for Table Tennis Swing Analysis

Authors: Che-Yu Chou, Zheng-Hao Chen, Yung-Hoh Sheu, Hung-Hsuan Chen, Sheng K. Wu

Abstract: We introduce TTSWING, a novel dataset designed for table tennis swing analysis. This dataset comprises comprehensive swing information obtained through 9-axis sensors integrated into custom-made racket grips, accompanied by anonymized demographic data of the players. We detail the data collection and annotation procedures. Furthermore, we conduct pilot studies utilizing diverse machine learning models for swing analysis. TTSWING holds tremendous potential to facilitate innovative research in table tennis analysis and is a valuable resource for the scientific community. We release the dataset and experimental codes at https://github.com/DEPhantom/TTSWING.

Submitted 30 June, 2023; originally announced June 2023.

arXiv:2306.16744 [pdf, other]

doi 10.1007/JHEP05(2024)342

Page Curve of AdS-Vaidya Model for Evaporating Black Holes

Authors: Chia-Jui Chou, Hans B. Lao, Yi Yang

Abstract: We study an evaporating black hole in the boundary conformal field theory (BCFT) model under the fully time-dependent AdS-Vaidya spacetime geometry. We introduce the time-dependent finite bath termed the effective Hawking radiation region. This is described by a nontrivial BCFT solution that acts as a time-dependent brane which we call the moving end-of-the-radiation (METR) brane that leads to a new type of Hubeny-Rangamani-Takayanagi surface. We further examine the island formulation in this particular time-dependent spacetime. The Page curve is calculated by using Holographic Entanglement Entropy (HEE) in the context of double holography.

Submitted 2 June, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

Comments: v2: 49 pages. Typos corrected and appendices were added; paper restructured and expanded discussion for more clarity. v3: Published version

Journal ref: JHEP05(2024)342

arXiv:2306.11366 [pdf, other]

Demonstration of Machine Learning-assisted real-time noise regression in gravitational wave detectors

Authors: Muhammed Saleem, Alec Gunny, Chia-Jui Chou, Li-Cheng Yang, Shu-Wei Yeh, Andy H. Y. Chen, Ryan Magee, William Benoit, Tri Nguyen, Pinchen Fan, Deep Chatterjee, Ethan Marx, Eric Moreno, Rafia Omer, Ryan Raikman, Dylan Rankin, Ritwik Sharma, Michael Coughlin, Philip Harris, Erik Katsavounidis

Abstract: Real-time noise regression algorithms are crucial for maximizing the science outcomes of the LIGO, Virgo, and KAGRA gravitational-wave detectors. This includes improvements in the detectability, source localization and pre-merger detectability of signals thereby enabling rapid multi-messenger follow-up. In this paper, we demonstrate the effectiveness of \textit{DeepClean}, a convolutional neural network architecture that uses witness sensors to estimate and subtract non-linear and non-stationary noise from gravitational-wave strain data. Our study uses LIGO data from the third observing run with injected compact binary signals. As a demonstration, we use \textit{DeepClean} to subtract the noise at 60 Hz due to the power mains and their sidebands arising from non-linear coupling with other instrumental noise sources. Our parameter estimation study on the injected signals shows that \textit{DeepClean} does not harm the underlying astrophysical signals in the data, while it can enhance the signal-to-noise ratio of potential signals. We show that \textit{DeepClean} can be used for low-latency noise regression to produce cleaned output data at latencies $\sim 1-2$\, s. We also discuss various considerations that may be made while training \textit{DeepClean} for low latency applications.

Submitted 20 June, 2023; originally announced June 2023.

arXiv:2306.09198 [pdf, other]

doi 10.1016/j.physrep.2024.03.002

A Review on Quantum Approximate Optimization Algorithm and its Variants

Authors: Kostas Blekos, Dean Brand, Andrea Ceschini, Chiao-Hui Chou, Rui-Hao Li, Komal Pandya, Alessandro Summer

Abstract: The Quantum Approximate Optimization Algorithm (QAOA) is a highly promising variational quantum algorithm that aims to solve combinatorial optimization problems that are classically intractable. This comprehensive review offers an overview of the current state of QAOA, encompassing its performance analysis in diverse scenarios, its applicability across various problem instances, and considerations of hardware-specific challenges such as error susceptibility and noise resilience. Additionally, we conduct a comparative study of selected QAOA extensions and variants, while exploring future prospects and directions for the algorithm. We aim to provide insights into key questions about the algorithm, such as whether it can outperform classical algorithms and under what circumstances it should be used. Towards this goal, we offer specific practical points in the form of a short guide. Keywords: Quantum Approximate Optimization Algorithm (QAOA), Variational Quantum Algorithms (VQAs), Quantum Optimization, Combinatorial Optimization Problems, NISQ Algorithms

Submitted 26 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

Comments: 67 pages, 9 figures, 9 tables; version 2 -- added more discussions and practical guides

arXiv:2306.08106 [pdf, other]

Applications of Deep Learning to physics workflows

Authors: Manan Agarwal, Jay Alameda, Jeroen Audenaert, Will Benoit, Damon Beveridge, Meghna Bhattacharya, Chayan Chatterjee, Deep Chatterjee, Andy Chen, Muhammed Saleem Cholayil, Chia-Jui Chou, Sunil Choudhary, Michael Coughlin, Maximilian Dax, Aman Desai, Andrea Di Luca, Javier Mauricio Duarte, Steven Farrell, Yongbin Feng, Pooyan Goodarzi, Ekaterina Govorkova, Matthew Graham, Jonathan Guiang, Alec Gunny, Weichangfeng Guo, et al. (43 additional authors not shown)

Abstract: Modern large-scale physics experiments create datasets with sizes and streaming rates that can exceed those from industry leaders such as Google Cloud and Netflix. Fully processing these datasets requires both sufficient compute power and efficient workflows. Recent advances in Machine Learning (ML) and Artificial Intelligence (AI) can either improve or replace existing domain-specific algorithms to increase workflow efficiency. Not only can these algorithms improve the physics performance of current algorithms, but they can often be executed more quickly, especially when run on coprocessors such as GPUs or FPGAs. In the winter of 2023, MIT hosted the Accelerating Physics with ML at MIT workshop, which brought together researchers from gravitational-wave physics, multi-messenger astrophysics, and particle physics to discuss and share current efforts to integrate ML tools into their workflows. The following white paper highlights examples of algorithms and computing frameworks discussed during this workshop and summarizes the expected computing needs for the immediate future of the involved fields.

Submitted 13 June, 2023; originally announced June 2023.

Comments: Whitepaper resulting from Accelerating Physics with ML@MIT workshop in Jan/Feb 2023

arXiv:2306.04879 [pdf, other]

Augmenting Hessians with Inter-Layer Dependencies for Mixed-Precision Post-Training Quantization

Authors: Clemens JS Schaefer, Navid Lambert-Shirzad, Xiaofan Zhang, Chiachen Chou, Tom Jablin, Jian Li, Elfie Guo, Caitlin Stanton, Siddharth Joshi, Yu Emma Wang

Abstract: Efficiently serving neural network models with low latency is becoming more challenging due to increasing model complexity and parameter count. Model quantization offers a solution which simultaneously reduces memory footprint and compute requirements. However, aggressive quantization may lead to an unacceptable loss in model accuracy owing to differences in sensitivity to numerical imperfection across different layers in the model. To address this challenge, we propose a mixed-precision post training quantization (PTQ) approach that assigns different numerical precisions to tensors in a network based on their specific needs, for a reduced memory footprint and improved latency while preserving model accuracy. Previous works rely on layer-wise Hessian information to determine numerical precision, but as we demonstrate, Hessian estimation is typically insufficient in determining an effective ordering of layer sensitivities. We address this by augmenting the estimated Hessian with additional information to capture inter-layer dependencies. We demonstrate that this consistently improves PTQ performance along the accuracy-latency Pareto frontier across multiple models. Our method combines second-order information and inter-layer dependencies to guide a bisection search, finding quantization configurations within a user-configurable model accuracy degradation range. We evaluate the effectiveness of our method on the ResNet50, MobileNetV2, and BERT models. Our experiments demonstrate latency reductions compared to a 16-bit baseline of $25.48\%$, $21.69\%$, and $33.28\%$ respectively, while maintaining model accuracy to within $99.99\%$ of the baseline model.

Submitted 7 June, 2023; originally announced June 2023.

arXiv:2304.13878 [pdf, other]

doi 10.1126/science.adh9932

Stable Quantum-Correlated Many Body States through Engineered Dissipation

Authors: X. Mi, A. A. Michailidis, S. Shabani, K. C. Miao, P. V. Klimov, J. Lloyd, E. Rosenberg, R. Acharya, I. Aleiner, T. I. Andersen, M. Ansmann, F. Arute, K. Arya, A. Asfaw, J. Atalaya, J. C. Bardin, A. Bengtsson, G. Bortoli, A. Bourassa, J. Bovaird, L. Brill, M. Broughton, B. B. Buckley, D. A. Buell, T. Burger, et al. (142 additional authors not shown)

Abstract: Engineered dissipative reservoirs have the potential to steer many-body quantum systems toward correlated steady states useful for quantum simulation of high-temperature superconductivity or quantum magnetism. Using up to 49 superconducting qubits, we prepared low-energy states of the transverse-field Ising model through coupling to dissipative auxiliary qubits. In one dimension, we observed long-range quantum correlations and a ground-state fidelity of 0.86 for 18 qubits at the critical point. In two dimensions, we found mutual information that extends beyond nearest neighbors. Lastly, by coupling the system to auxiliaries emulating reservoirs with different chemical potentials, we explored transport in the quantum Heisenberg model. Our results establish engineered dissipation as a scalable alternative to unitary evolution for preparing entangled many-body states on noisy quantum processors.

Submitted 5 April, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

Journal ref: Science 383, 1332-1337 (2024)

arXiv:2304.11119 [pdf, other]

Phase transition in Random Circuit Sampling

Authors: A. Morvan, B. Villalonga, X. Mi, S. Mandrà, A. Bengtsson, P. V. Klimov, Z. Chen, S. Hong, C. Erickson, I. K. Drozdov, J. Chau, G. Laun, R. Movassagh, A. Asfaw, L. T. A. N. Brandão, R. Peralta, D. Abanin, R. Acharya, R. Allen, T. I. Andersen, K. Anderson, M. Ansmann, F. Arute, K. Arya, J. Atalaya, et al. (160 additional authors not shown)

Abstract: Undesired coupling to the surrounding environment destroys long-range correlations on quantum processors and hinders the coherent evolution in the nominally available computational space. This incoherent noise is an outstanding challenge to fully leverage the computation power of near-term quantum processors. It has been shown that benchmarking Random Circuit Sampling (RCS) with Cross-Entropy Benchmarking (XEB) can provide a reliable estimate of the effective size of the Hilbert space coherently available. The extent to which the presence of noise can trivialize the outputs of a given quantum algorithm, i.e. making it spoofable by a classical computation, is an unanswered question. Here, by implementing an RCS algorithm we demonstrate experimentally that there are two phase transitions observable with XEB, which we explain theoretically with a statistical model. The first is a dynamical transition as a function of the number of cycles and is the continuation of the anti-concentration point in the noiseless case. The second is a quantum phase transition controlled by the error per cycle; to identify it analytically and experimentally, we create a weak link model which allows varying the strength of noise versus coherent evolution. Furthermore, by presenting an RCS experiment with 67 qubits at 32 cycles, we demonstrate that the computational cost of our experiment is beyond the capabilities of existing classical supercomputers, even when accounting for the inevitable presence of noise. Our experimental and theoretical work establishes the existence of transitions to a stable computationally complex phase that is reachable with current quantum processors.

Submitted 21 December, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

arXiv:2302.01382 [pdf, other]

Mixed Precision Post Training Quantization of Neural Networks with Sensitivity Guided Search

Authors: Clemens JS Schaefer, Elfie Guo, Caitlin Stanton, Xiaofan Zhang, Tom Jablin, Navid Lambert-Shirzad, Jian Li, Chiachen Chou, Siddharth Joshi, Yu Emma Wang

Abstract: Serving large-scale machine learning (ML) models efficiently and with low latency has become challenging owing to increasing model size and complexity. Quantizing models can simultaneously reduce memory and compute requirements, facilitating their widespread access. However, for large models not all layers are equally amenable to the same numerical precision and aggressive quantization can lead to unacceptable loss in model accuracy. One approach to prevent this accuracy degradation is mixed-precision quantization, which allows different tensors to be quantized to varying levels of numerical precision, leveraging the capabilities of modern hardware. Such mixed-precision quantization can more effectively allocate numerical precision to different tensors `as needed' to preserve model accuracy while reducing footprint and compute latency. In this paper, we propose a method to efficiently determine quantization configurations of different tensors in ML models using post-training mixed precision quantization. We analyze three sensitivity metrics and evaluate them for guiding configuration search of two algorithms. We evaluate our method for computer vision and natural language processing and demonstrate latency reductions of up to 27.59% and 34.31% compared to the baseline 16-bit floating point model while guaranteeing no more than 1% accuracy degradation.

Submitted 6 February, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

arXiv:2212.13697 [pdf, other]

Network Characteristics of LEO Satellite Constellations: A Starlink-Based Measurement from End Users

Authors: Sami Ma, Yi Ching Chou, Haoyuan Zhao, Long Chen, Xiaoqiang Ma, Jiangchuan Liu

Abstract: Low Earth orbit Satellite Networks (LSNs) have been advocated as a key infrastructure for truly global coverage in the forthcoming 6G. This paper presents our initial measurement results and observations on the end-to-end network characteristics of Starlink, arguably the largest LSN constellation to date. Our findings confirm that LSNs are a promising solution towards ubiquitous Internet coverage over the Earth; yet, we also find that the users of Starlink experience much more dynamics in throughput and latency than terrestrial network users, and even frequent outages. Its user experiences are heavily affected by environmental factors such as terrain, solar storms, rain, clouds, and temperature, as is the power consumption. We further analyze Starlink's current bent-pipe relay strategy and its limits, particularly for cross-ocean routes. We have also explored its mobility and portability potentials, and extended our experiments from urban cities to wild remote areas that are facing distinct practical and cultural challenges.

Submitted 27 December, 2022; originally announced December 2022.

Comments: 12 pages, 20 figures, to be published in IEEE INFOCOM 2023

arXiv:2210.10255 [pdf, other]

Non-Abelian braiding of graph vertices in a superconducting processor

Authors: Trond I. Andersen, Yuri D. Lensky, Kostyantyn Kechedzhi, Ilya Drozdov, Andreas Bengtsson, Sabrina Hong, Alexis Morvan, Xiao Mi, Alex Opremcak, Rajeev Acharya, Richard Allen, Markus Ansmann, Frank Arute, Kunal Arya, Abraham Asfaw, Juan Atalaya, Ryan Babbush, Dave Bacon, Joseph C. Bardin, Gina Bortoli, Alexandre Bourassa, Jenna Bovaird, Leon Brill, Michael Broughton, Bob B. Buckley, et al. (144 additional authors not shown)

Abstract: Indistinguishability of particles is a fundamental principle of quantum mechanics. For all elementary and quasiparticles observed to date - including fermions, bosons, and Abelian anyons - this principle guarantees that the braiding of identical particles leaves the system unchanged. However, in two spatial dimensions, an intriguing possibility exists: braiding of non-Abelian anyons causes rotations in a space of topologically degenerate wavefunctions. Hence, it can change the observables of the system without violating the principle of indistinguishability. Despite the well developed mathematical description of non-Abelian anyons and numerous theoretical proposals, the experimental observation of their exchange statistics has remained elusive for decades. Controllable many-body quantum states generated on quantum processors offer another path for exploring these fundamental phenomena. While efforts on conventional solid-state platforms typically involve Hamiltonian dynamics of quasi-particles, superconducting quantum processors allow for directly manipulating the many-body wavefunction via unitary gates. Building on predictions that stabilizer codes can host projective non-Abelian Ising anyons, we implement a generalized stabilizer code and unitary protocol to create and braid them. This allows us to experimentally verify the fusion rules of the anyons and braid them to realize their statistics. We then study the prospect of employing the anyons for quantum computation and utilize braiding to create an entangled state of anyons encoding three logical qubits. Our work provides new insights about non-Abelian braiding and - through the future inclusion of error correction to achieve topological protection - could open a path toward fault-tolerant quantum computing.

Submitted 31 May, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

arXiv:2209.08763 [pdf]

Decentralized Vehicle Coordination: The Berkeley DeepDrive Drone Dataset

Authors: Fangyu Wu, Dequan Wang, Minjune Hwang, Chenhui Hao, Jiawei Lu, Jiamu Zhang, Christopher Chou, Trevor Darrell, Alexandre Bayen

Abstract: Decentralized multiagent planning has been an important field of research in robotics. An interesting and impactful application in the field is decentralized vehicle coordination in understructured road environments. For example, in an intersection, it is useful yet difficult to deconflict multiple vehicles of intersecting paths in the absence of a central coordinator. We learn from common sense that, for a vehicle to navigate through such understructured environments, the driver must understand and conform to the implicit "social etiquette" observed by nearby drivers. To study this implicit driving protocol, we collect the Berkeley DeepDrive Drone dataset. The dataset contains 1) a set of aerial videos recording understructured driving, 2) a collection of images and annotations to train vehicle detection models, and 3) a kit of development scripts for illustrating typical usages. We believe that the dataset is of primary interest for studying decentralized multiagent planning employed by human drivers and, of secondary interest, for computer vision in remote sensing settings.

Submitted 22 September, 2022; v1 submitted 19 September, 2022; originally announced September 2022.

Comments: 6 pages, 10 figures, 1 table

arXiv:2209.06791 [pdf, other]

doi 10.1007/s00170-022-10789-w

Vibration Compensation of Delta 3D Printer with Position-varying Dynamics using Filtered B-Splines

Authors: Nosakhare Edoimioya, Cheng-Hao Chou, Chinedum E. Okwudire

Abstract: The delta robot is becoming a popular choice for the mechanical design of fused filament fabrication 3D printers because it can reach higher speeds than traditional serial-axis designs. Like serial 3D printers, delta printers suffer from undesirable vibration at high speeds which degrades the quality of fabricated parts. This undesirable vibration has been suppressed in serial printers using linear model-inversion feedforward control methods like the filtered B-splines (FBS) approach. However, techniques like the FBS approach are computationally challenging to implement on delta 3D printers because of their coupled, position-dependent dynamics. In this paper, we propose a methodology to address the computational bottlenecks by (1) parameterizing the position-dependent portions of the dynamics offline to enable efficient online model generation, (2) computing real-time models at sampled points (instead of every point) along the given trajectory, and (3) employing QR factorization to reduce the number of floating-point arithmetic operations associated with matrix inversion. In simulations, we report a computation time reduction of up to 23x using the proposed method, while maintaining high accuracy, when compared to a controller using the computationally expensive exact LPV model. In experiments, we demonstrate significant quality improvements on parts printed at various positions on the delta 3D printer using our proposed controller compared to a baseline alternative, which uses an LTI model from one position. Through acceleration measurements during printing, we also show that the print quality boost of the proposed controller is due to vibration reductions of more than 20\% when compared to the baseline controller.

Submitted 14 September, 2022; originally announced September 2022.

Comments: 12 pages, 10 figures, submitted to Int. Journal of Advanced Manufacturing Technology

arXiv:2207.10215 [pdf, other]

doi 10.1103/PhysRevLett.130.223201

Rotational spectroscopy of a single molecular ion at sub part-per-trillion resolution

Authors: Alejandra L. Collopy, Julian Schmidt, Dietrich Leibfried, David R. Leibrandt, Chin-Wen Chou

Abstract: We use quantum-logic spectroscopy (QLS) and interrogate rotational transitions of a single CaH+ ion with a highly coherent frequency comb, achieving a fractional statistical uncertainty for a transition line center of 4 x 10^-13. We also improve the resolution in measurement of the Stark effect due to the radio-frequency (rf) electric field experienced by a molecular ion in an rf Paul trap, which we characterize and model. This allows us to determine the electric dipole moment of CaH+ by systematically displacing the ion to sample different known rf electric fields and measuring the resultant shifts in transition frequency.

Submitted 20 July, 2022; originally announced July 2022.

Comments: 5 pages, 3 figures, plus supplemental material

Journal ref: Phys. Rev. Lett. 130, 223201 (2023)

arXiv:2207.02738 [pdf, other]

A Hybrid Approach for Binary Classification of Imbalanced Data

Authors: Hsin-Han Tsai, Ta-Wei Yang, Wai-Man Wong, Cheng-Fu Chou

Abstract: Binary classification with an imbalanced dataset is challenging. Models tend to consider all samples as belonging to the majority class. Although existing solutions such as sampling methods, cost-sensitive methods, and ensemble learning methods improve the poor accuracy of the minority class, these methods are limited by overfitting problems or cost parameters that are difficult to decide. We propose HADR, a hybrid approach with dimension reduction that consists of data block construction, dimensionality reduction, and ensemble learning with deep neural network classifiers. We evaluate the performance on eight imbalanced public datasets in terms of recall, G-mean, and AUC. The results show that our model outperforms state-of-the-art methods.

Submitted 7 July, 2022; v1 submitted 6 July, 2022; originally announced July 2022.

arXiv:2206.11960 [pdf, other]

A physics-guided data-driven feedforward tracking controller for systems with unmodeled dynamics -- applied to 3D printing

Authors: Cheng-Hao Chou, Molong Duan, Chinedum E. Okwudire

Abstract: A hybrid (i.e., physics-guided data-driven) feedforward tracking controller is proposed for systems with unmodeled linear or nonlinear dynamics. The controller is based on the filtered basis function (FBF) approach, hence it is called a hybrid FBF controller. It formulates the feedforward control input to a system as a linear combination of a set of basis functions whose coefficients are selected to minimize tracking errors. The basis functions are filtered using a combination of two linear models to predict and minimize the tracking errors. The first model is physics-based and remains unaltered during the execution of the controller, while the second is data-driven and is continuously updated during the execution of the controller. To ensure its practicality and safe learning, the proposed hybrid FBF controller is equipped with the ability to handle delays in data acquisition and to detect impending instability due to its inherent data-driven feedback loop. Its effectiveness is demonstrated via application to vibration compensation of a 3D printer with unmodeled linear and nonlinear dynamics. Thanks to the proposed hybrid FBF controller, the tracking accuracy of the 3D printer is significantly improved in experiments involving high-speed printing, compared to a standard FBF controller that does not incorporate a data-driven model. Furthermore, the ability of the hybrid FBF controller to detect and, hence, potentially avoid impending instability is demonstrated offline using data collected online from experiments.

Submitted 23 June, 2022; originally announced June 2022.

Comments: 10 pages, 11 figures, submitted to the IEEE for possible journal publication

Showing 1–50 of 416 results for author: Chou, C