Search | arXiv e-print repository

arXiv:2409.08140 [pdf, other]

Loading-dependent microscale measures control bulk properties in granular material: an experimental test of the Stress-Force-Fabric relation

Authors: Carmen L. Lee, Ephraim Bililign, Emilien Azéma, Karen E. Daniels

Abstract: The bulk behaviour of granular materials is tied to its mesoscale and particle-scale features: strength properties arise from the buildup of various anisotropic structures at the particle-scale induced by grain connectivity (fabric), force transmission, and frictional mobilization. More fundamentally, these anisotropic structures work collectively to define features like the bulk friction coeffici… ▽ More The bulk behaviour of granular materials is tied to its mesoscale and particle-scale features: strength properties arise from the buildup of various anisotropic structures at the particle-scale induced by grain connectivity (fabric), force transmission, and frictional mobilization. More fundamentally, these anisotropic structures work collectively to define features like the bulk friction coefficient and the stress tensor at the macroscale and can be explained by the Stress-Force-Fabric (SFF) relationship stemming from the microscale. Although the SFF relation has been extensively verified by discrete numerical simulations, a laboratory realization has remained elusive due to the challenge of measuring both normal and frictional contact forces. In this study, we analyze experiments performed on a photoelastic granular system under four different loading conditions: uniaxial compression, isotropic compression, pure shear, and annular shear. During these experiments, we record particle locations, contacts, and normal and frictional forces vectors to measure the particle-scale response to progressing strain. We track microscale measures like the packing fraction, average coordination number and average normal force along with anisotropic distributions of contacts and forces. We match the particle-scale anisotropy to the bulk using the SFF relation, which is founded on two key principles, a Stress Rule to describe the stress tensor and a Sum Rule to describe the bulk friction coefficient; we find that the Sum and Stress Rules accurately describe bulk measurements. Additionally, we test the assumption that fabric and forces transmit load equally through our granular packings and show that this assumption is sufficient at large strain values, and can be applied to areas like rock mechanics, soft colloids, or cellular tissue where force information is inaccessible. △ Less

Submitted 12 September, 2024; originally announced September 2024.

Comments: 10 pages, 5 figures, 5 pages supplemental with 6 figures

arXiv:2409.07945 [pdf, other]

Exploring Accessibility Trends and Challenges in Mobile App Development: A Study of Stack Overflow Questions

Authors: Amila Indika, Christopher Lee, Haochen Wang, Justin Lisoway, Anthony Peruma, Rick Kazman

Abstract: The proliferation of mobile applications (apps) has made it crucial to ensure their accessibility for users with disabilities. However, there is a lack of research on the real-world challenges developers face in implementing mobile accessibility features. This study presents a large-scale empirical analysis of accessibility discussions on Stack Overflow to identify the trends and challenges Androi… ▽ More The proliferation of mobile applications (apps) has made it crucial to ensure their accessibility for users with disabilities. However, there is a lack of research on the real-world challenges developers face in implementing mobile accessibility features. This study presents a large-scale empirical analysis of accessibility discussions on Stack Overflow to identify the trends and challenges Android and iOS developers face. We examine the growth patterns, characteristics, and common topics mobile developers discuss. Our results show several challenges, including integrating assistive technologies like screen readers, ensuring accessible UI design, supporting text-to-speech across languages, handling complex gestures, and conducting accessibility testing. We envision our findings driving improvements in developer practices, research directions, tool support, and educational resources. △ Less

Submitted 12 September, 2024; originally announced September 2024.

Comments: This paper was accepted for publication at the 58th Hawaii International Conference on System Sciences (HICSS) - Software Technology Track

arXiv:2409.07598 [pdf]

High Performance Three-Terminal Thyristor RAM with a P+/P/N/P/N/N+ Doping Profile on a Silicon-Photonic CMOS Platform

Authors: Changseob Lee, Ikhyeon Kwon, Anirban Samanta, Siwei Li, S. J. Ben Yoo

Abstract: 3T TRAM with doping profile (P+PNPNN+) is experimentally demonstrated on a silicon photonic platform. By using additional implant layers, this device provides excellent memory performance compared to the conventional structure (PNPN). TCAD is used to reflect the physical behavior, and the high-speed memory operations are described through the model. 3T TRAM with doping profile (P+PNPNN+) is experimentally demonstrated on a silicon photonic platform. By using additional implant layers, this device provides excellent memory performance compared to the conventional structure (PNPN). TCAD is used to reflect the physical behavior, and the high-speed memory operations are described through the model. △ Less

Submitted 11 September, 2024; originally announced September 2024.

Comments: 4 pages, 15 figures

arXiv:2409.07467 [pdf, other]

Flexible Control in Symbolic Music Generation via Musical Metadata

Authors: Sangjun Han, Jiwon Ham, Chaeeun Lee, Heejin Kim, Soojong Do, Sihyuk Yi, Jun Seo, Seoyoon Kim, Yountae Jung, Woohyung Lim

Abstract: In this work, we introduce the demonstration of symbolic music generation, focusing on providing short musical motifs that serve as the central theme of the narrative. For the generation, we adopt an autoregressive model which takes musical metadata as inputs and generates 4 bars of multitrack MIDI sequences. During training, we randomly drop tokens from the musical metadata to guarantee flexible… ▽ More In this work, we introduce the demonstration of symbolic music generation, focusing on providing short musical motifs that serve as the central theme of the narrative. For the generation, we adopt an autoregressive model which takes musical metadata as inputs and generates 4 bars of multitrack MIDI sequences. During training, we randomly drop tokens from the musical metadata to guarantee flexible control. It provides users with the freedom to select input types while maintaining generative performance, enabling greater flexibility in music composition. We validate the effectiveness of the strategy through experiments in terms of model capacity, musical fidelity, diversity, and controllability. Additionally, we scale up the model and compare it with other music generation model through a subjective test. Our results indicate its superiority in both control and music quality. We provide a URL link https://www.youtube.com/watch?v=-0drPrFJdMQ to our demonstration video. △ Less

Submitted 28 August, 2024; originally announced September 2024.

arXiv:2409.07113 [pdf, other]

A Post-Starburst Pathway to Forming Massive Galaxies and Their Black Holes at z>6

Authors: Masafusa Onoue, Xuheng Ding, John D. Silverman, Yoshiki Matsuoka, Takuma Izumi, Michael A. Strauss, Charlotte Ward, Camryn L. Phillips, Irham T. Andika, Kentaro Aoki, Junya Arita, Shunsuke Baba, Rebekka Bieri, Sarah E. I. Bosman, Anna-Christina Eilers, Seiji Fujimoto, Melanie Habouzit, Zoltan Haiman, Masatoshi Imanishi, Kohei Inayoshi, Kei Ito, Kazushi Iwasawa, Knud Jahnke, Nobunari Kashikawa, Toshihiro Kawaguchi , et al. (23 additional authors not shown)

Abstract: Understanding the rapid formation of supermassive black holes (SMBHs) in the early universe requires an understanding of how stellar mass grows in the host galaxies. Here, we perform an analysis of rest-frame optical spectra and imaging from JWST of two quasar host galaxies at z>6 which exhibit Balmer absorption lines. These features in the stellar continuum indicate a lack of young stars, similar… ▽ More Understanding the rapid formation of supermassive black holes (SMBHs) in the early universe requires an understanding of how stellar mass grows in the host galaxies. Here, we perform an analysis of rest-frame optical spectra and imaging from JWST of two quasar host galaxies at z>6 which exhibit Balmer absorption lines. These features in the stellar continuum indicate a lack of young stars, similar to low-redshift post-starburst galaxies whose star formation was recently quenched. We find that the stellar mass (log(M_* / M_sun) > 10.6) of each quasar host grew in a starburst episode at redshift 7 or 8. One of the targets exhibits little ongoing star formation, as evidenced by the photometric signature of the Balmer break and a lack of spatially resolved H-alpha emission, placing it well below the star formation main sequence at z = 6. The other galaxy is transitioning to a quiescent phase; together, the two galaxies represent the most distant massive post-starburst galaxies known. The maturity of these two galaxies is further supported by the stellar velocity dispersions of their host galaxies, placing them slightly above the upper end of the local M_BH - sigma_* relation. The properties of our two post-starburst galaxies, each hosting an active SMBH with log(M_BH / M_sun) > 9, suggests that black holes played a major role in shaping the formation of the first massive galaxies in the Universe. △ Less

Submitted 11 September, 2024; originally announced September 2024.

Comments: 24 pages, 7 figures, submitted to a Nature journal

arXiv:2409.06932 [pdf, other]

Boosting uniformity in quasirandom groups: fast and simple

Authors: Harm Derksen, Chin Ho Lee, Emanuele Viola

Abstract: We study the communication complexity of multiplying $k\times t$ elements from the group $H=\text{SL}(2,q)$ in the number-on-forehead model with $k$ parties. We prove a lower bound of $(t\log H)/c^{k}$. This is an exponential improvement over previous work, and matches the state-of-the-art in the area. Relatedly, we show that the convolution of $k^{c}$ independent copies of a 3-uniform distribut… ▽ More We study the communication complexity of multiplying $k\times t$ elements from the group $H=\text{SL}(2,q)$ in the number-on-forehead model with $k$ parties. We prove a lower bound of $(t\log H)/c^{k}$. This is an exponential improvement over previous work, and matches the state-of-the-art in the area. Relatedly, we show that the convolution of $k^{c}$ independent copies of a 3-uniform distribution over $H^{m}$ is close to a $k$-uniform distribution. This is again an exponential improvement over previous work which needed $c^{k}$ copies. The proofs are remarkably simple; the results extend to other quasirandom groups. We also show that for any group $H$, any distribution over $H^{m}$ whose weight-$k$ Fourier coefficients are small is close to a $k$-uniform distribution. This generalizes previous work in the abelian setting, and the proof is simpler. △ Less

Submitted 10 September, 2024; originally announced September 2024.

arXiv:2409.06038 [pdf, ps, other]

High-Speed Outflows and Dusty Disks during the AGB to PN Transition: The PANORAMA survey

Authors: Raghvendra Sahai, Javier Alcolea, Bruce Balick, Eric G. Blackman, Valentin Bujarrabal, Arancha Castro-Carrizo, Orsola De Marco, Joel Kastner, Hyosun Kim, Eric Lagadec, Chin-Fei Lee, Laurence Sabin, M. Santander-Garcia, Carmen Sánchez Contreras, Daniel Tafoya, Toshiya Ueta, Wouter Vlemmings, Albert Zijlstra

Abstract: As mass-losing asymptotic giant branch (AGB) stars evolve to planetary nebulae (PNe), the mass outflow geometries transform from nearly spherical to extreme aspherical. The physical mechanisms governing this transformation are widely believed to be linked to binarity and the associated production of disks and fast jets during transitional (post-AGB) evolutionary stages. We are carrying out a syste… ▽ More As mass-losing asymptotic giant branch (AGB) stars evolve to planetary nebulae (PNe), the mass outflow geometries transform from nearly spherical to extreme aspherical. The physical mechanisms governing this transformation are widely believed to be linked to binarity and the associated production of disks and fast jets during transitional (post-AGB) evolutionary stages. We are carrying out a systematic ALMA survey ($P$re-planet$A$ry $N$ebulae high-angular-res$O$lution su$R$vey with $A$L$MA$ or PANORAMA) of a representative sample of bipolar and multipolar post-AGB objects. We have obtained high angular-resolution (0".1-0".4) observations of the CO(3--2) and/or 6--5 emission in order to probe the spatio-kinematic structure of the collimated outflows and the central disk/torii. The results are remarkable, generally showing the presence of bipolar or multipolar high-velocity outflows, dense toroidal waists, and in one case, a geometrically-thin circular ring around the central bipolar nebula. A high degree of point-symmetry characterizes the morphology of the mass ejecta. In this contribution, we present these and other highlights from our survey. We aim to use 2D/3D radiative transfer modeling in order to derive accurate outflow momenta, masses and mass-loss rates for our sample, and build hydrodynamical models that can explain the observed spatio-kinematic structures. These results will then be used to distinguish between different classes of PN-shaping binary interaction models. △ Less

Submitted 9 September, 2024; originally announced September 2024.

arXiv:2409.05492 [pdf, other]

JCMT 850 $\micron$ continuum observations of density structures in the G35 molecular complex

Authors: Xianjin Shen, Hong-Li Liu, Zhiyuan Ren, Anandmayee Tej, Di Li, Hauyu Baobab Liu, Gary A. Fuller, Jinjin Xie, Sihan Jiao, Aiyuan Yang, Patrick M. Koch, Fengwei Xu, Patricio Sanhueza, Pham N. Diep, Nicolas Peretto, Ram K. Yadav, Busaba H. Kramer, Koichiro Sugiyama, Mark Rawlings, Chang Won Lee, Ken'ichi Tatematsu, Daniel Harsono, David Eden, Woojin Kwon, Chao-Wei Tsai , et al. (10 additional authors not shown)

Abstract: Filaments are believed to play a key role in high-mass star formation. We present a systematic study of the filaments and their hosting clumps in the G35 molecular complex using JCMT SCUBA-2 850 $\micron$ continuum data. We identified five clouds in the complex and 91 filaments within them, some of which form 10 hub-filament systems (HFSs), each with at least 3 hub-composing filaments. We also com… ▽ More Filaments are believed to play a key role in high-mass star formation. We present a systematic study of the filaments and their hosting clumps in the G35 molecular complex using JCMT SCUBA-2 850 $\micron$ continuum data. We identified five clouds in the complex and 91 filaments within them, some of which form 10 hub-filament systems (HFSs), each with at least 3 hub-composing filaments. We also compiled a catalogue of 350 dense clumps, 183 of which are associated with the filaments. We investigated the physical properties of the filaments and clumps, such as mass, density, and size, and their relation to star formation. We find that the global mass-length trend of the filaments is consistent with a turbulent origin, while the hub-composing filaments of high line masses ($m_{\rm l}\,>$\,230\,$\mathrm{M_{\odot}~pc^{-1}}$) in HFSs deviate from this relation, possibly due to feedback from massive star formation. We also find that the most massive and densest clumps (R\,$>$\,0.2\,pc, M\,$>35\,\mathrm{M_{\odot}}$, $\mathrmΣしぐま>\,0.05\,\mathrm{g~cm^{-2}}$) are located in the filaments and in the hubs of HFS with the latter bearing a higher probability of occurrence of high-mass star-forming signatures, highlighting the preferential sites of HFSs for high-mass star formation. We do not find significant variation in the clump mass surface density across different evolutionary environments of the clouds, which may reflect the balance between mass accretion and stellar feedback. △ Less

Submitted 9 September, 2024; originally announced September 2024.

Comments: 34 pages, 17 figures. Accepted for publication in ApJ

arXiv:2409.05196 [pdf]

AI-Driven Robotic Crystal Explorer for Rapid Polymorph Identification

Authors: Edward C Lee, Daniel Salley, Abhishek Sharma, Leroy Cronin

Abstract: Crystallisation is an important phenomenon which facilitates the purification as well as structural and bulk phase material characterisation using crystallographic methods. However, different conditions can lead to a vast set of different crystal structure polymorphs and these often exhibit different physical properties, allowing materials to be tailored to specific purposes. This means the high d… ▽ More Crystallisation is an important phenomenon which facilitates the purification as well as structural and bulk phase material characterisation using crystallographic methods. However, different conditions can lead to a vast set of different crystal structure polymorphs and these often exhibit different physical properties, allowing materials to be tailored to specific purposes. This means the high dimensionality that can result from variations in the conditions which affect crystallisation, and the interaction between them, means that exhaustive exploration is difficult, time-consuming, and costly to explore. Herein we present a robotic crystal search engine for the automated and efficient high-throughput approach to the exploration of crystallisation conditions. The system comprises a closed-loop computer crystal-vision system that uses machine learning to both identify crystals and classify their identity in a multiplexed robotic platform. By exploring the formation of a well-known polymorph, we were able to show how a robotic system could be used to efficiently search experimental space as a function of relative polymorph amount and efficiently create a high dimensionality phase diagram with minimal experimental budget and without expensive analytical techniques such as crystallography. In this way, we identify the set of polymorphs possible within a set of experimental conditions, as well as the optimal values of these conditions to grow each polymorph. △ Less

Submitted 8 September, 2024; originally announced September 2024.

Comments: 18 pages, 6 figures, 20 references

arXiv:2409.04564 [pdf, other]

Solar energetic particles injected inside and outside a magnetic cloud: The widespread solar energetic particle event on 2022 January 20

Authors: L. Rodríguez-García, R. Gómez-Herrero, N. Dresing, L. A. Balmaceda, E. Palmerio, A. Kouloumvakos, I. C. Jebaraj, F. Espinosa Lara, M. Roco, C. Palmroos, A. Warmuth, G. Nicolaou, G. M. Mason, J. Guo, T. Laitinen, I. Cernuda, T. Nieves-Chinchilla, A. Fedeli, C. O. Lee, C. M. S. Cohen, C. J. Owen, G. C. Ho, O. Malandraki, R. Vainio, J. Rodríguez-Pacheco

Abstract: Context. On 2022 January 20, the Energetic Particle Detector (EPD) on board Solar Orbiter measured a solar energetic particle (SEP) event showing unusual first arriving particles from the anti-Sun direction. Near-Earth spacecraft separated 17° in longitude to the west from Solar Orbiter measured classic antisunward-directed fluxes. STEREO-A and MAVEN, separated 18° to the east and 143° to the west… ▽ More Context. On 2022 January 20, the Energetic Particle Detector (EPD) on board Solar Orbiter measured a solar energetic particle (SEP) event showing unusual first arriving particles from the anti-Sun direction. Near-Earth spacecraft separated 17° in longitude to the west from Solar Orbiter measured classic antisunward-directed fluxes. STEREO-A and MAVEN, separated 18° to the east and 143° to the west from Solar Orbiter respectively, also observed the event, suggesting that particles spread over at least 160° in the heliosphere. Results. Solar Orbiter was embedded in a MC erupting on 16 January from the same active region as the one related to the SEP event on 20 January. The SEP event is related to a M5.5 flare and a fast CME-driven shock of 1433 km/s, which injected particles within and outside the MC. The hard SEP spectra, the presence of a Type II radio burst, and the co-temporal Type III radio bursts being observed from 80 MHz that seems to emanate from the Type II, points to the shock as the relevant accelerator of the particles. Conclusions. The detailed analysis of the SEP event strongly suggest that the energetic particles are injected mainly by a CME-driven shock into and outside of a previous MC present in the heliosphere at the time of the particle onset. The sunward propagating SEPs measured by Solar Orbiter are produced by the injection of particles along the longer (western) leg of the MC still connected to the Sun at the time of the release of the particles. The determined electron propagation path length inside the MC is around 30% longer than the estimated length of the loop leg of the MC itself (based on the graduated cylindrical shell model) consistent with a low number of field line rotations. △ Less

Submitted 6 September, 2024; originally announced September 2024.

Comments: 23 pages, 19 figures

arXiv:2409.04178 [pdf, other]

Reprojection Errors as Prompts for Efficient Scene Coordinate Regression

Authors: Ting-Ru Liu, Hsuan-Kung Yang, Jou-Min Liu, Chun-Wei Huang, Tsung-Chih Chiang, Quan Kong, Norimasa Kobori, Chun-Yi Lee

Abstract: Scene coordinate regression (SCR) methods have emerged as a promising area of research due to their potential for accurate visual localization. However, many existing SCR approaches train on samples from all image regions, including dynamic objects and texture-less areas. Utilizing these areas for optimization during training can potentially hamper the overall performance and efficiency of the mod… ▽ More Scene coordinate regression (SCR) methods have emerged as a promising area of research due to their potential for accurate visual localization. However, many existing SCR approaches train on samples from all image regions, including dynamic objects and texture-less areas. Utilizing these areas for optimization during training can potentially hamper the overall performance and efficiency of the model. In this study, we first perform an in-depth analysis to validate the adverse impacts of these areas. Drawing inspiration from our analysis, we then introduce an error-guided feature selection (EGFS) mechanism, in tandem with the use of the Segment Anything Model (SAM). This mechanism seeds low reprojection areas as prompts and expands them into error-guided masks, and then utilizes these masks to sample points and filter out problematic areas in an iterative manner. The experiments demonstrate that our method outperforms existing SCR approaches that do not rely on 3D information on the Cambridge Landmarks and Indoor6 datasets. △ Less

Submitted 6 September, 2024; originally announced September 2024.

Comments: ECCV2024

arXiv:2409.03636 [pdf, other]

DiffEVC: Any-to-Any Emotion Voice Conversion with Expressive Guidance

Authors: Hsing-Hang Chou, Yun-Shao Lin, Ching-Chin Sung, Yu Tsao, Chi-Chun Lee

Abstract: Emotional Voice Conversion (EVC) modifies speech emotion to enhance communication by amplifying positive cues and reducing negative ones. This complex task involves entangled factors like voice quality, speaker traits, and content. Traditional deep learning models like GANs and autoencoders have achieved some success in EVC by learning mappings or disentangling features but face challenges like in… ▽ More Emotional Voice Conversion (EVC) modifies speech emotion to enhance communication by amplifying positive cues and reducing negative ones. This complex task involves entangled factors like voice quality, speaker traits, and content. Traditional deep learning models like GANs and autoencoders have achieved some success in EVC by learning mappings or disentangling features but face challenges like instability and voice quality degradation. Diffusion models offer stable training and high-quality generation. We propose a diffusion-based EVC framework that disentangles emotion and speaker identity using mutual information loss and auxiliary models. An expressive guidance mechanism is introduced to improve emotion conversion while maintaining speaker traits. Experimental results demonstrate our approach's effectiveness for unseen speakers and emotions, achieving state-of-the-art performance in EVC tasks. △ Less

Submitted 5 September, 2024; originally announced September 2024.

arXiv:2409.01556 [pdf, other]

Benchmarking Cognitive Domains for LLMs: Insights from Taiwanese Hakka Culture

Authors: Chen-Chi Chang, Ching-Yuan Chen, Hung-Shin Lee, Chih-Cheng Lee

Abstract: This study introduces a comprehensive benchmark designed to evaluate the performance of large language models (LLMs) in understanding and processing cultural knowledge, with a specific focus on Hakka culture as a case study. Leveraging Bloom's Taxonomy, the study develops a multi-dimensional framework that systematically assesses LLMs across six cognitive domains: Remembering, Understanding, Apply… ▽ More This study introduces a comprehensive benchmark designed to evaluate the performance of large language models (LLMs) in understanding and processing cultural knowledge, with a specific focus on Hakka culture as a case study. Leveraging Bloom's Taxonomy, the study develops a multi-dimensional framework that systematically assesses LLMs across six cognitive domains: Remembering, Understanding, Applying, Analyzing, Evaluating, and Creating. This benchmark extends beyond traditional single-dimensional evaluations by providing a deeper analysis of LLMs' abilities to handle culturally specific content, ranging from basic recall of facts to higher-order cognitive tasks such as creative synthesis. Additionally, the study integrates Retrieval-Augmented Generation (RAG) technology to address the challenges of minority cultural knowledge representation in LLMs, demonstrating how RAG enhances the models' performance by dynamically incorporating relevant external information. The results highlight the effectiveness of RAG in improving accuracy across all cognitive domains, particularly in tasks requiring precise retrieval and application of cultural knowledge. However, the findings also reveal the limitations of RAG in creative tasks, underscoring the need for further optimization. This benchmark provides a robust tool for evaluating and comparing LLMs in culturally diverse contexts, offering valuable insights for future research and development in AI-driven cultural knowledge preservation and dissemination. △ Less

Submitted 2 September, 2024; originally announced September 2024.

Comments: Submitted to O-COCOSDA 2024

arXiv:2409.01169 [pdf, other]

KDAR neutrino scattering for $^{12}$C target via charged current and muon angular distribution

Authors: Chaeyun Lee, Kyungsik Kim, Myung-Ki Cheoun, Eunja Ha, Tatsushi Shima, Toshitaka Kajino

Abstract: We calculate muon-neutrino ($νにゅー_μみゅー$) scattering off $^{12}$C via charged current (CC) by exploiting the 236 MeV ${νにゅー_μみゅー}$ from the kaon-decay-at-rest (KDAR). In this energy region, since both inelastic scattering below the quasielastic (QE) region and the QE scattering contribute simultaneously, we combine the inelastic scattering obtained by the QRPA and the QE scattering obtained by distorted wave b… ▽ More We calculate muon-neutrino ($νにゅー_μみゅー$) scattering off $^{12}$C via charged current (CC) by exploiting the 236 MeV ${νにゅー_μみゅー}$ from the kaon-decay-at-rest (KDAR). In this energy region, since both inelastic scattering below the quasielastic (QE) region and the QE scattering contribute simultaneously, we combine the inelastic scattering obtained by the QRPA and the QE scattering obtained by distorted wave born approximation (DWBA) based on the relativistic mean field (RMF) theory. We compare the results to the data from MiniBooNE. Further, since the KDR $νにゅー_μみゅー$ CC scattering may have angle dependence of outgoing muon, we investigate the differential angular dependent cross section in the ${νにゅー_μみゅー}$-$^{12}$C scattering and compare to the results by $νにゅー_e$-$^{12}$C scattering. These results could be useful for the calibration of the forthcoming KDAR neutrino cross section experiments. △ Less

Submitted 2 September, 2024; originally announced September 2024.

Comments: 19 pages, 6 figures

arXiv:2409.00681 [pdf, other]

A Real-time Instanton Approach to Quantum Activation

Authors: Chang-Woo Lee, Paul Brookes, Kee-Su Park, Marzena H. Szymańska, Eran Ginossar

Abstract: Driven-dissipative nonlinear systems exhibit rich critical behavior, related to bifurcation, bistability and switching, which underlie key phenomena in areas ranging from physics, chemistry and biology to social sciences and economics. The importance of rare fluctuations leading to a dramatic jump between two very distinct states, such as survival and extinction in population dynamics, success and… ▽ More Driven-dissipative nonlinear systems exhibit rich critical behavior, related to bifurcation, bistability and switching, which underlie key phenomena in areas ranging from physics, chemistry and biology to social sciences and economics. The importance of rare fluctuations leading to a dramatic jump between two very distinct states, such as survival and extinction in population dynamics, success and bankruptcy in economics and the occurrence of earthquakes or of epileptic seizures, have been already established. In the quantum domain, switching is of importance in both chemical reactions and the devices used in quantum state detection and amplification. In particular, the simplest driven single oscillator model serves as an insightful starting point. Here we describe switching induced by quantum fluctuations and illustrate that an instanton approach within Keldysh field theory can provide a deep insight into such phenomena. We provide a practical recipe to compute the switching rates semi-analytically, which agrees remarkably well with exact solutions across a wide domain of drive amplitudes spanning many orders of magnitude. Being set up in the framework of Keldysh coherent states path integrals, our approach opens the possibility of studying quantum activation in many-body systems where other approaches are inapplicable. △ Less

Submitted 1 September, 2024; originally announced September 2024.

arXiv:2409.00557 [pdf, other]

Learning to Ask: When LLMs Meet Unclear Instruction

Authors: Wenxuan Wang, Juluan Shi, Chaozheng Wang, Cheryl Lee, Youliang Yuan, Jen-tse Huang, Michael R. Lyu

Abstract: Equipped with the capability to call functions, modern large language models (LLMs) can leverage external tools for addressing a range of tasks unattainable through language skills alone. However, the effective execution of these tools relies heavily not just on the advanced capabilities of LLMs but also on precise user instructions, which often cannot be ensured in the real world. To evaluate the… ▽ More Equipped with the capability to call functions, modern large language models (LLMs) can leverage external tools for addressing a range of tasks unattainable through language skills alone. However, the effective execution of these tools relies heavily not just on the advanced capabilities of LLMs but also on precise user instructions, which often cannot be ensured in the real world. To evaluate the performance of LLMs tool-use under imperfect instructions, we meticulously examine the real-world instructions queried from users, analyze the error patterns, and build a challenging tool-use benchmark called Noisy ToolBench (NoisyToolBench). We find that due to the next-token prediction training objective, LLMs tend to arbitrarily generate the missed argument, which may lead to hallucinations and risks. To address this issue, we propose a novel framework, Ask-when-Needed (AwN), which prompts LLMs to ask questions to users whenever they encounter obstacles due to unclear instructions. Moreover, to reduce the manual labor involved in user-LLM interaction and assess LLMs performance in tool utilization from both accuracy and efficiency perspectives, we design an automated evaluation tool named ToolEvaluator. Our experiments demonstrate that the AwN significantly outperforms existing frameworks for tool learning in the NoisyToolBench. We will release all related code and datasets to support future research. △ Less

Submitted 4 September, 2024; v1 submitted 31 August, 2024; originally announced September 2024.

arXiv:2409.00415 [pdf]

Crystalline Water Structure in Room-Temperature Clathrate State: Hydrogen-Bonded Pentagonal Rings

Authors: Ching-Hsiu Chen, Wei-Hao Hsu, Ryoko Oishi-Tomiyasu, Chi-Cheng Lee, Ming-Wen Chu, Ing-Shouh Hwang

Abstract: Water hydrogen bonding is extremely versatile; approximately 20 ice structures and several types of clathrate hydrate structures have been identified. These crystalline water structures form at temperatures below room temperature and/or at high pressure. We used transmission electron microscopy to study a new crystalline water structure in a clathrate state that is prepared by sandwiching gas-supe… ▽ More Water hydrogen bonding is extremely versatile; approximately 20 ice structures and several types of clathrate hydrate structures have been identified. These crystalline water structures form at temperatures below room temperature and/or at high pressure. We used transmission electron microscopy to study a new crystalline water structure in a clathrate state that is prepared by sandwiching gas-supersaturated water between graphene layers under ambient conditions. In this clathrate state, water molecules form a three-dimensional hydrogen bonding network that encloses gas-filled cages 2-4 nm in size. We derived the crystalline water structure by recording and analyzing electron diffraction patterns and performing first-principles calculations. The structure consists purely of pentagonal rings and has a topology similar to that of water ice XVII. The study proposed a mechanism for the formation of the clathrate state. The present results improve the understanding of interactions among water and small nonpolar molecules and offer novel insights into the local structures of ambient liquid water. △ Less

Submitted 31 August, 2024; originally announced September 2024.

arXiv:2409.00395 [pdf, other]

Self-supervised Fusarium Head Blight Detection with Hyperspectral Image and Feature Mining

Authors: Yu-Fan Lin, Ching-Heng Cheng, Bo-Cheng Qiu, Cheng-Jun Kang, Chia-Ming Lee, Chih-Chung Hsu

Abstract: Fusarium Head Blight (FHB) is a serious fungal disease affecting wheat (including durum), barley, oats, other small cereal grains, and corn. Effective monitoring and accurate detection of FHB are crucial to ensuring stable and reliable food security. Traditionally, trained agronomists and surveyors perform manual identification, a method that is labor-intensive, impractical, and challenging to sca… ▽ More Fusarium Head Blight (FHB) is a serious fungal disease affecting wheat (including durum), barley, oats, other small cereal grains, and corn. Effective monitoring and accurate detection of FHB are crucial to ensuring stable and reliable food security. Traditionally, trained agronomists and surveyors perform manual identification, a method that is labor-intensive, impractical, and challenging to scale. With the advancement of deep learning and Hyper-spectral Imaging (HSI) and Remote Sensing (RS) technologies, employing deep learning, particularly Convolutional Neural Networks (CNNs), has emerged as a promising solution. Notably, wheat infected with serious FHB may exhibit significant differences on the spectral compared to mild FHB one, which is particularly advantageous for hyperspectral image-based methods. In this study, we propose a self-unsupervised classification method based on HSI endmember extraction strategy and top-K bands selection, designed to analyze material signatures in HSIs to derive discriminative feature representations. This approach does not require expensive device or complicate algorithm design, making it more suitable for practical uses. Our method has been effectively validated in the Beyond Visible Spectrum: AI for Agriculture Challenge 2024. The source code is easy to reproduce and available at {https://github.com/VanLinLin/Automated-Crop-Disease-Diagnosis-from-Hyperspectral-Imagery-3rd}. △ Less

Submitted 31 August, 2024; originally announced September 2024.

Comments: Beyond Visible Spectrum: AI for Agriculture Challenge, in conjunted with ICPR 2024

arXiv:2409.00291 [pdf, ps, other]

Variable selection in the joint frailty model of recurrent and terminal events using Broken Adaptive Ridge regression

Authors: Christian Chan, Fatemeh Mahmoudi, Chel Hee Lee, Quan Long, Xuewen Lu

Abstract: We introduce a novel method to simultaneously perform variable selection and estimation in the joint frailty model of recurrent and terminal events using the Broken Adaptive Ridge Regression penalty. The BAR penalty can be summarized as an iteratively reweighted squared $L_2$-penalized regression, which approximates the $L_0$-regularization method. Our method allows for the number of covariates to… ▽ More We introduce a novel method to simultaneously perform variable selection and estimation in the joint frailty model of recurrent and terminal events using the Broken Adaptive Ridge Regression penalty. The BAR penalty can be summarized as an iteratively reweighted squared $L_2$-penalized regression, which approximates the $L_0$-regularization method. Our method allows for the number of covariates to diverge with the sample size. Under certain regularity conditions, we prove that the BAR estimator implemented under the model framework is consistent and asymptotically normally distributed, which are known as the oracle properties in the variable selection literature. In our simulation studies, we compare our proposed method to the Minimum Information Criterion (MIC) method. We apply our method on the Medical Information Mart for Intensive Care (MIMIC-III) database, with the aim of investigating which variables affect the risks of repeated ICU admissions and death during ICU stay. △ Less

Submitted 30 August, 2024; originally announced September 2024.

arXiv:2408.16362 [pdf, other]

Oscillatory dependence of tunneling magnetoresistance on barrier thickness in magnetic tunnel junctions

Authors: B. C. Lee

Abstract: The dependence of tunneling conductance and tunneling magnetoresistance (TMR) on barrier thickness in magnetic tunnel junctions is theoretically investigated. The complex band structure of the insulator is taken into account, and an analytical formula for tunneling conductance and TMR is derived. Numerical calculations using a tight-binding model validate the analytical formula. The complex nature… ▽ More The dependence of tunneling conductance and tunneling magnetoresistance (TMR) on barrier thickness in magnetic tunnel junctions is theoretically investigated. The complex band structure of the insulator is taken into account, and an analytical formula for tunneling conductance and TMR is derived. Numerical calculations using a tight-binding model validate the analytical formula. The complex nature of insulator's band structure leads to significant oscillations in tunneling conductance and TMR as functions of barrier thickness. It is demonstrated that these TMR oscillations are not caused by quantum confinement within the barrier, but are instead analogous to classical two-slit optical interference. △ Less

Submitted 29 August, 2024; originally announced August 2024.

Comments: 24 pages, 6 figures

arXiv:2408.16052 [pdf, other]

Squeezed Thermal Reservoir Engineering via Linear Interactions

Authors: Cheng-Lin Lee, Chiao-Hsuan Wang

Abstract: Quantum reservoir engineering aims to transform typically detrimental dissipations into advantageous resources. We present a versatile method for creating a squeezed thermal reservoir for quantum systems. By coupling the system to a lossy mode within a normal thermal environment, we can emulate the effect of a squeezed reservoir characterized by reduced fluctuations in one quadrature. We demonstra… ▽ More Quantum reservoir engineering aims to transform typically detrimental dissipations into advantageous resources. We present a versatile method for creating a squeezed thermal reservoir for quantum systems. By coupling the system to a lossy mode within a normal thermal environment, we can emulate the effect of a squeezed reservoir characterized by reduced fluctuations in one quadrature. We demonstrate this approach through two illustrative cases: one for two-level systems, such as qubits or atoms, and another for bosonic modes, like photons or phonons. This method leverages constant linear interactions within a normal thermal environment, ensuring experimental feasibility without requiring squeezed light inputs or time-dependent modulations. This technique holds promise for various applications, including the enhancement of dissipative squeezing, stabilization of entanglement, advancement of quantum simulations, exploration of quantum thermodynamics and phase transitions, and improvement of precision measurements. △ Less

Submitted 28 August, 2024; originally announced August 2024.

Comments: 6 pages, 3 figures

arXiv:2408.15204 [pdf, other]

Can Unconfident LLM Annotations Be Used for Confident Conclusions?

Authors: Kristina Gligorić, Tijana Zrnic, Cinoo Lee, Emmanuel J. Candès, Dan Jurafsky

Abstract: Large language models (LLMs) have shown high agreement with human raters across a variety of tasks, demonstrating potential to ease the challenges of human data collection. In computational social science (CSS), researchers are increasingly leveraging LLM annotations to complement slow and expensive human annotations. Still, guidelines for collecting and using LLM annotations, without compromising… ▽ More Large language models (LLMs) have shown high agreement with human raters across a variety of tasks, demonstrating potential to ease the challenges of human data collection. In computational social science (CSS), researchers are increasingly leveraging LLM annotations to complement slow and expensive human annotations. Still, guidelines for collecting and using LLM annotations, without compromising the validity of downstream conclusions, remain limited. We introduce Confidence-Driven Inference: a method that combines LLM annotations and LLM confidence indicators to strategically select which human annotations should be collected, with the goal of producing accurate statistical estimates and provably valid confidence intervals while reducing the number of human annotations needed. Our approach comes with safeguards against LLM annotations of poor quality, guaranteeing that the conclusions will be both valid and no less accurate than if we only relied on human annotations. We demonstrate the effectiveness of Confidence-Driven Inference over baselines in statistical estimation tasks across three CSS settings--text politeness, stance, and bias--reducing the needed number of human annotations by over 25% in each. Although we use CSS settings for demonstration, Confidence-Driven Inference can be used to estimate most standard quantities across a broad range of NLP problems. △ Less

Submitted 27 August, 2024; originally announced August 2024.

arXiv:2408.14739 [pdf, other]

VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech

Authors: Heeseung Kim, Sang-gil Lee, Jiheum Yeom, Che Hyun Lee, Sungwon Kim, Sungroh Yoon

Abstract: We propose VoiceTailor, a parameter-efficient speaker-adaptive text-to-speech (TTS) system, by equipping a pre-trained diffusion-based TTS model with a personalized adapter. VoiceTailor identifies pivotal modules that benefit from the adapter based on a weight change ratio analysis. We utilize Low-Rank Adaptation (LoRA) as a parameter-efficient adaptation method and incorporate the adapter into pi… ▽ More We propose VoiceTailor, a parameter-efficient speaker-adaptive text-to-speech (TTS) system, by equipping a pre-trained diffusion-based TTS model with a personalized adapter. VoiceTailor identifies pivotal modules that benefit from the adapter based on a weight change ratio analysis. We utilize Low-Rank Adaptation (LoRA) as a parameter-efficient adaptation method and incorporate the adapter into pivotal modules of the pre-trained diffusion decoder. To achieve powerful adaptation performance with few parameters, we explore various guidance techniques for speaker adaptation and investigate the best strategies to strengthen speaker information. VoiceTailor demonstrates comparable speaker adaptation performance to existing adaptive TTS models by fine-tuning only 0.25\% of the total parameters. VoiceTailor shows strong robustness when adapting to a wide range of real-world speakers, as shown in the demo. △ Less

Submitted 27 August, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

Comments: INTERSPEECH 2024

arXiv:2408.13642 [pdf, ps, other]

Change Point Detection in Pairwise Comparison Data with Covariates

Authors: Yi Han, Thomas C. M. Lee

Abstract: This paper introduces the novel piecewise stationary covariate-assisted ranking estimation (PS-CARE) model for analyzing time-evolving pairwise comparison data, enhancing item ranking accuracy through the integration of covariate information. By partitioning the data into distinct, stationary segments, the PS-CARE model adeptly detects temporal shifts in item rankings, known as change points, whos… ▽ More This paper introduces the novel piecewise stationary covariate-assisted ranking estimation (PS-CARE) model for analyzing time-evolving pairwise comparison data, enhancing item ranking accuracy through the integration of covariate information. By partitioning the data into distinct, stationary segments, the PS-CARE model adeptly detects temporal shifts in item rankings, known as change points, whose number and positions are initially unknown. Leveraging the minimum description length (MDL) principle, this paper establishes a statistically consistent model selection criterion to estimate these unknowns. The practical optimization of this MDL criterion is done with the pruned exact linear time (PELT) algorithm. Empirical evaluations reveal the method's promising performance in accurately locating change points across various simulated scenarios. An application to an NBA dataset yielded meaningful insights that aligned with significant historical events, highlighting the method's practical utility and the MDL criterion's effectiveness in capturing temporal ranking changes. To the best of the authors' knowledge, this research pioneers change point detection in pairwise comparison data with covariate information, representing a significant leap forward in the field of dynamic ranking analysis. △ Less

Submitted 24 August, 2024; originally announced August 2024.

arXiv:2408.13285 [pdf, other]

SIn-NeRF2NeRF: Editing 3D Scenes with Instructions through Segmentation and Inpainting

Authors: Jiseung Hong, Changmin Lee, Gyusang Yu

Abstract: TL;DR Perform 3D object editing selectively by disentangling it from the background scene. Instruct-NeRF2NeRF (in2n) is a promising method that enables editing of 3D scenes composed of Neural Radiance Field (NeRF) using text prompts. However, it is challenging to perform geometrical modifications such as shrinking, scaling, or moving on both the background and object simultaneously. In this projec… ▽ More TL;DR Perform 3D object editing selectively by disentangling it from the background scene. Instruct-NeRF2NeRF (in2n) is a promising method that enables editing of 3D scenes composed of Neural Radiance Field (NeRF) using text prompts. However, it is challenging to perform geometrical modifications such as shrinking, scaling, or moving on both the background and object simultaneously. In this project, we enable geometrical changes of objects within the 3D scene by selectively editing the object after separating it from the scene. We perform object segmentation and background inpainting respectively, and demonstrate various examples of freely resizing or moving disentangled objects within the three-dimensional space. △ Less

Submitted 22 August, 2024; originally announced August 2024.

Comments: Code is available at: https://github.com/KAISTChangmin/SIn-NeRF2NeRF

arXiv:2408.12726 [pdf, other]

Macro-Queries: An Exploration into Guided Chart Generation from High Level Prompts

Authors: Christopher J. Lee, Giorgio Tran, Roderick Tabalba, Jason Leigh, Ryan Longman

Abstract: This paper explores the intersection of data visualization and Large Language Models (LLMs). Driven by the need to make a broader range of data visualization types accessible for novice users, we present a guided LLM-based pipeline designed to transform data, guided by high-level user questions (referred to as macro-queries), into a diverse set of useful visualizations. This approach leverages var… ▽ More This paper explores the intersection of data visualization and Large Language Models (LLMs). Driven by the need to make a broader range of data visualization types accessible for novice users, we present a guided LLM-based pipeline designed to transform data, guided by high-level user questions (referred to as macro-queries), into a diverse set of useful visualizations. This approach leverages various prompting techniques, fine-tuning inspired by Abela's Chart Taxonomy, and integrated SQL tool usage. △ Less

Submitted 22 August, 2024; originally announced August 2024.

arXiv:2408.12084 [pdf, other]

Vision-Based Detection of Uncooperative Targets and Components on Small Satellites

Authors: Hannah Grauer, Elena-Sorina Lupu, Connor Lee, Soon-Jo Chung, Darren Rowen, Benjamen Bycroft, Phaedrus Leeds, John Brader

Abstract: Space debris and inactive satellites pose a threat to the safety and integrity of operational spacecraft and motivate the need for space situational awareness techniques. These uncooperative targets create a challenging tracking and detection problem due to a lack of prior knowledge of their features, trajectories, or even existence. Recent advancements in computer vision models can be used to imp… ▽ More Space debris and inactive satellites pose a threat to the safety and integrity of operational spacecraft and motivate the need for space situational awareness techniques. These uncooperative targets create a challenging tracking and detection problem due to a lack of prior knowledge of their features, trajectories, or even existence. Recent advancements in computer vision models can be used to improve upon existing methods for tracking such uncooperative targets to make them more robust and reliable to the wide-ranging nature of the target. This paper introduces an autonomous detection model designed to identify and monitor these objects using learning and computer vision. The autonomous detection method aims to identify and accurately track the uncooperative targets in varied circumstances, including different camera spectral sensitivities, lighting, and backgrounds. Our method adapts to the relative distance between the observing spacecraft and the target, and different detection strategies are adjusted based on distance. At larger distances, we utilize You Only Look Once (YOLOv8), a multitask Convolutional Neural Network (CNN), for zero-shot and domain-specific single-shot real time detection of the target. At shorter distances, we use knowledge distillation to combine visual foundation models with a lightweight fast segmentation CNN (Fast-SCNN) to segment the spacecraft components with low storage requirements and fast inference times, and to enable weight updates from earth and possible onboard training. Lastly, we test our method on a custom dataset simulating the unique conditions encountered in space, as well as a publicly-available dataset. △ Less

Submitted 21 August, 2024; originally announced August 2024.

Comments: Small Satellite 2024 Conference, 13 pages, 8 figures, 6 tables

arXiv:2408.11751 [pdf, other]

Bayesian Optimization Framework for Efficient Fleet Design in Autonomous Multi-Robot Exploration

Authors: David Molina Concha, Jiping Li, Haoran Yin, Kyeonghyeon Park, Hyun-Rok Lee, Taesik Lee, Dhruv Sirohi, Chi-Guhn Lee

Abstract: This study addresses the challenge of fleet design optimization in the context of heterogeneous multi-robot fleets, aiming to obtain feasible designs that balance performance and costs. In the domain of autonomous multi-robot exploration, reinforcement learning agents play a central role, offering adaptability to complex terrains and facilitating collaboration among robots. However, modifying the… ▽ More This study addresses the challenge of fleet design optimization in the context of heterogeneous multi-robot fleets, aiming to obtain feasible designs that balance performance and costs. In the domain of autonomous multi-robot exploration, reinforcement learning agents play a central role, offering adaptability to complex terrains and facilitating collaboration among robots. However, modifying the fleet composition results in changes in the learned behavior, and training multi-robot systems using multi-agent reinforcement learning is expensive. Therefore, an exhaustive evaluation of each potential fleet design is infeasible. To tackle these hurdles, we introduce Bayesian Optimization for Fleet Design (BOFD), a framework leveraging multi-objective Bayesian Optimization to explore fleets on the Pareto front of performance and cost while accounting for uncertainty in the design space. Moreover, we establish a sub-linear bound for cumulative regret, supporting BOFD's robustness and efficacy. Extensive benchmark experiments in synthetic and simulated environments demonstrate the superiority of our framework over state-of-the-art methods, achieving efficient fleet designs with minimal fleet evaluations. △ Less

Submitted 21 August, 2024; originally announced August 2024.

arXiv:2408.11679 [pdf, other]

Exploring Robustness of Visual State Space model against Backdoor Attacks

Authors: Cheng-Yi Lee, Cheng-Chang Tsai, Chia-Mu Yu, Chun-Shien Lu

Abstract: Visual State Space Model (VSS) has demonstrated remarkable performance in various computer vision tasks. However, in the process of development, backdoor attacks have brought severe challenges to security. Such attacks cause an infected model to predict target labels when a specific trigger is activated, while the model behaves normally on benign samples. In this paper, we conduct systematic exper… ▽ More Visual State Space Model (VSS) has demonstrated remarkable performance in various computer vision tasks. However, in the process of development, backdoor attacks have brought severe challenges to security. Such attacks cause an infected model to predict target labels when a specific trigger is activated, while the model behaves normally on benign samples. In this paper, we conduct systematic experiments to comprehend on robustness of VSS through the lens of backdoor attacks, specifically how the state space model (SSM) mechanism affects robustness. We first investigate the vulnerability of VSS to different backdoor triggers and reveal that the SSM mechanism, which captures contextual information within patches, makes the VSS model more susceptible to backdoor triggers compared to models without SSM. Furthermore, we analyze the sensitivity of the VSS model to patch processing techniques and discover that these triggers are effectively disrupted. Based on these observations, we consider an effective backdoor for the VSS model that recurs in each patch to resist patch perturbations. Extensive experiments across three datasets and various backdoor attacks reveal that the VSS model performs comparably to Transformers (ViTs) but is less robust than the Gated CNNs, which comprise only stacked Gated CNN blocks without SSM. △ Less

Submitted 22 August, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

Comments: 11 pages, 9 figures, minor revise, under review

arXiv:2408.11527 [pdf, other]

The Vizier Gaussian Process Bandit Algorithm

Authors: Xingyou Song, Qiuyi Zhang, Chansoo Lee, Emily Fertig, Tzu-Kuo Huang, Lior Belenki, Greg Kochanski, Setareh Ariafar, Srinivas Vasudevan, Sagi Perel, Daniel Golovin

Abstract: Google Vizier has performed millions of optimizations and accelerated numerous research and production systems at Google, demonstrating the success of Bayesian optimization as a large-scale service. Over multiple years, its algorithm has been improved considerably, through the collective experiences of numerous research efforts and user feedback. In this technical report, we discuss the implementa… ▽ More Google Vizier has performed millions of optimizations and accelerated numerous research and production systems at Google, demonstrating the success of Bayesian optimization as a large-scale service. Over multiple years, its algorithm has been improved considerably, through the collective experiences of numerous research efforts and user feedback. In this technical report, we discuss the implementation details and design choices of the current default algorithm provided by Open Source Vizier. Our experiments on standardized benchmarks reveal its robustness and versatility against well-established industry baselines on multiple practical modes. △ Less

Submitted 21 August, 2024; originally announced August 2024.

Comments: Google DeepMind Technical Report. Code can be found in https://github.com/google/vizier

arXiv:2408.11248 [pdf, ps, other]

Microlensing brown-dwarf companions in binaries detected during the 2022 and 2023 seasons

Authors: Cheongho Han, Ian A. Bond, Andrzej Udalski, Chung-Uk Lee, Andrew Gould, Michael D. Albrow, Sun-Ju Chung, Kyu-Ha Hwang, Youn Kil Jung, Yoon-Hyun Ryu, Yossi Shvartzvald, In-Gu Shin, Jennifer C. Yee, Hongjing Yang, Weicheng Zang, Sang-Mok Cha, Doeon Kim, Dong-Jin Kim, Seung-Lee Kim, Dong-Joo Lee, Yongseok Lee, Byeong-Gon Park, Richard W. Pogge, Fumio Abe, Ken Bando , et al. (41 additional authors not shown)

Abstract: Building on previous works to construct a homogeneous sample of brown dwarfs in binary systems, we investigate microlensing events detected by the Korea Microlensing Telescope Network (KMTNet) survey during the 2022 and 2023 seasons. Given the difficulty in distinguishing brown-dwarf events from those produced by binary lenses with nearly equal-mass components, we analyze all lensing events detect… ▽ More Building on previous works to construct a homogeneous sample of brown dwarfs in binary systems, we investigate microlensing events detected by the Korea Microlensing Telescope Network (KMTNet) survey during the 2022 and 2023 seasons. Given the difficulty in distinguishing brown-dwarf events from those produced by binary lenses with nearly equal-mass components, we analyze all lensing events detected during the seasons that exhibit anomalies characteristic of binary-lens systems. Using the same criteria consistently applied in previous studies, we identify six additional brown dwarf candidates through the analysis of lensing events KMT-2022-BLG-0412, KMT-2022-BLG-2286, KMT-2023-BLG-0201, KMT-2023-BLG-0601, KMT-2023-BLG-1684, and KMT-2023-BLG-1743. An examination of the mass posteriors shows that the median mass of the lens companions ranges from 0.02 $M_\odot$ to 0.05 $M_\odot$, indicating that these companions fall within the brown-dwarf mass range. The mass of the primary lenses ranges from 0.11 $M_\odot$ to 0.68 $M_\odot$, indicating that they are low-mass stars with substantially lower masses compared to the Sun. △ Less

Submitted 20 August, 2024; originally announced August 2024.

Comments: 13 pages, 17 figures, 12 tables

arXiv:2408.11227 [pdf]

OCTCube: A 3D foundation model for optical coherence tomography that improves cross-dataset, cross-disease, cross-device and cross-modality analysis

Authors: Zixuan Liu, Hanwen Xu, Addie Woicik, Linda G. Shapiro, Marian Blazes, Yue Wu, Cecilia S. Lee, Aaron Y. Lee, Sheng Wang

Abstract: Optical coherence tomography (OCT) has become critical for diagnosing retinal diseases as it enables 3D images of the retina and optic nerve. OCT acquisition is fast, non-invasive, affordable, and scalable. Due to its broad applicability, massive numbers of OCT images have been accumulated in routine exams, making it possible to train large-scale foundation models that can generalize to various di… ▽ More Optical coherence tomography (OCT) has become critical for diagnosing retinal diseases as it enables 3D images of the retina and optic nerve. OCT acquisition is fast, non-invasive, affordable, and scalable. Due to its broad applicability, massive numbers of OCT images have been accumulated in routine exams, making it possible to train large-scale foundation models that can generalize to various diagnostic tasks using OCT images. Nevertheless, existing foundation models for OCT only consider 2D image slices, overlooking the rich 3D structure. Here, we present OCTCube, a 3D foundation model pre-trained on 26,605 3D OCT volumes encompassing 1.62 million 2D OCT images. OCTCube is developed based on 3D masked autoencoders and exploits FlashAttention to reduce the larger GPU memory usage caused by modeling 3D volumes. OCTCube outperforms 2D models when predicting 8 retinal diseases in both inductive and cross-dataset settings, indicating that utilizing the 3D structure in the model instead of 2D data results in significant improvement. OCTCube further shows superior performance on cross-device prediction and when predicting systemic diseases, such as diabetes and hypertension, further demonstrating its strong generalizability. Finally, we propose a contrastive-self-supervised-learning-based OCT-IR pre-training framework (COIP) for cross-modality analysis on OCT and infrared retinal (IR) images, where the OCT volumes are embedded using OCTCube. We demonstrate that COIP enables accurate alignment between OCT and IR en face images. Collectively, OCTCube, a 3D OCT foundation model, demonstrates significantly better performance against 2D models on 27 out of 29 tasks and comparable performance on the other two tasks, paving the way for AI-based retinal disease diagnosis. △ Less

Submitted 20 August, 2024; originally announced August 2024.

arXiv:2408.10506 [pdf, other]

doi 10.3847/1538-4357/ad7019

Correlation between dust continuum and CN line emissions in high-mass star-forming regions

Authors: Jihye Hwang, Chang Won Lee, Jongsoo Kim, Eun Jung Chung, Kee-Tae Kim

Abstract: Measuring the strength of three dimensional (3D) magnetic field vector is challenging as it is not easy to recognize whether its line-of-sight (LOS) and plane-of-sky (POS) components are obtained from the same region. CN ($N = 1 - 0$) emission has been used to get the LOS component of a magnetic field (B$_\mathrm{LOS}$) from its Zeeman splitting lines, while dust continuum emission has been used t… ▽ More Measuring the strength of three dimensional (3D) magnetic field vector is challenging as it is not easy to recognize whether its line-of-sight (LOS) and plane-of-sky (POS) components are obtained from the same region. CN ($N = 1 - 0$) emission has been used to get the LOS component of a magnetic field (B$_\mathrm{LOS}$) from its Zeeman splitting lines, while dust continuum emission has been used to get the POS component of a magnetic field (B$_\mathrm{POS}$). We use the CN ($N = 1 - 0$) data observed with the Taeduk Radio Astronomy Observatory (TRAO) 14-m telescope and the dust continuum data from $Herschel$ archive toward six high-mass star-forming regions in order to test whether CN line and dust continuum emission can trace a similar region and thus can be used for inferring 3D magnetic field strength. Our comparison between CN and H$_2$ column densities for all targets indicates that CN line emission tends to be strong toward bright continuum regions. The positions of peak CN column densities are particularly well correlated with those of peak H$_2$ column densities at least over the H$_2$ column density of 8.0 $\times$ 10$^{22}$ cm$^{-2}$ within one or two telescope beam size in all targets, implying that CN line and dust continuum emitting regions are likely spatially coincident. This enabled us to make the reliable measurement of 3D magnetic field strengths of five targets by taking a vector sum of their B$_\mathrm{LOS}$ and B$_\mathrm{POS}$, helping to decide the magnetical criticality of the targets as supercritical or transcritical. △ Less

Submitted 19 August, 2024; originally announced August 2024.

Comments: Accepted in ApJ, 14th Aug 2024

arXiv:2408.09686 [pdf, other]

Algorithmic Contract Design with Reinforcement Learning Agents

Authors: David Molina Concha, Kyeonghyeon Park, Hyun-Rok Lee, Taesik Lee, Chi-Guhn Lee

Abstract: We introduce a novel problem setting for algorithmic contract design, named the principal-MARL contract design problem. This setting extends traditional contract design to account for dynamic and stochastic environments using Markov Games and Multi-Agent Reinforcement Learning. To tackle this problem, we propose a Multi-Objective Bayesian Optimization (MOBO) framework named Constrained Pareto Maxi… ▽ More We introduce a novel problem setting for algorithmic contract design, named the principal-MARL contract design problem. This setting extends traditional contract design to account for dynamic and stochastic environments using Markov Games and Multi-Agent Reinforcement Learning. To tackle this problem, we propose a Multi-Objective Bayesian Optimization (MOBO) framework named Constrained Pareto Maximum Entropy Search (cPMES). Our approach integrates MOBO and MARL to explore the highly constrained contract design space, identifying promising incentive and recruitment decisions. cPMES transforms the principal-MARL contract design problem into an unconstrained multi-objective problem, leveraging the probability of feasibility as part of the objectives and ensuring promising designs predicted on the feasibility border are included in the Pareto front. By focusing the entropy prediction on designs within the Pareto set, cPMES mitigates the risk of the search strategy being overwhelmed by entropy from constraints. We demonstrate the effectiveness of cPMES through extensive benchmark studies in synthetic and simulated environments, showing its ability to find feasible contract designs that maximize the principal's objectives. Additionally, we provide theoretical support with a sub-linear regret bound concerning the number of iterations. △ Less

Submitted 18 August, 2024; originally announced August 2024.

arXiv:2408.09657 [pdf, other]

Impact of Large Language Models of Code on Fault Localization

Authors: Suhwan Ji, Sanghwa Lee, Changsup Lee, Hyeonseung Im, Yo-Sub Han

Abstract: Identifying the point of error is imperative in software debugging. Traditional fault localization (FL) techniques rely on executing the program and using the code coverage matrix in tandem with test case results to calculate a suspiciousness score for each function or line. Recently, learning-based FL techniques have harnessed machine learning models to extract meaningful features from the code c… ▽ More Identifying the point of error is imperative in software debugging. Traditional fault localization (FL) techniques rely on executing the program and using the code coverage matrix in tandem with test case results to calculate a suspiciousness score for each function or line. Recently, learning-based FL techniques have harnessed machine learning models to extract meaningful features from the code coverage matrix and improve FL performance. These techniques, however, require compilable source code, existing test cases, and specialized tools for generating the code coverage matrix for each programming language of interest. In this paper, we propose, for the first time, a simple but effective sequence generation approach for fine-tuning large language models of code (LLMCs) for FL tasks. LLMCs have recently received much attention for various software engineering problems. In line with these, we leverage the innate understanding of code that LLMCs have acquired through pre-training on large code corpora. Specifically, we fine-tune representative encoder, encoder-decoder, and decoder-based 13 LLMCs for FL tasks. Unlike previous approaches, LLMCs can analyze code sequences even with syntactic errors, since they do not rely on compiled input. Still, they have a limitation on the length of the input data. Therefore, for a fair comparison with existing FL techniques, we extract methods with errors from the project-level benchmark, Defects4J, and analyze them at the line level. Experimental results show that LLMCs fine-tuned with our approach successfully pinpoint error positions in 50.6\%, 64.2\%, and 72.3\% of 1,291 methods in Defects4J for Top-1/3/5 prediction, outperforming the best learning-based state-of-the-art technique by up to 1.35, 1.12, and 1.08 times, respectively. Our findings suggest promising research directions for FL and automated program repair tasks using LLMCs. △ Less

Submitted 18 August, 2024; originally announced August 2024.

arXiv:2408.09554 [pdf, other]

Screen Them All: High-Throughput Pan-Cancer Genetic and Phenotypic Biomarker Screening from H&E Whole Slide Images

Authors: Yi Kan Wang, Ludmila Tydlitatova, Jeremy D. Kunz, Gerard Oakley, Ran A. Godrich, Matthew C. H. Lee, Chad Vanderbilt, Razik Yousfi, Thomas Fuchs, David S. Klimstra, Siqi Liu

Abstract: Many molecular alterations serve as clinically prognostic or therapy-predictive biomarkers, typically detected using single or multi-gene molecular assays. However, these assays are expensive, tissue destructive and often take weeks to complete. Using AI on routine H&E WSIs offers a fast and economical approach to screen for multiple molecular biomarkers. We present a high-throughput AI-based syst… ▽ More Many molecular alterations serve as clinically prognostic or therapy-predictive biomarkers, typically detected using single or multi-gene molecular assays. However, these assays are expensive, tissue destructive and often take weeks to complete. Using AI on routine H&E WSIs offers a fast and economical approach to screen for multiple molecular biomarkers. We present a high-throughput AI-based system leveraging Virchow2, a foundation model pre-trained on 3 million slides, to interrogate genomic features previously determined by an next-generation sequencing (NGS) assay, using 47,960 scanned hematoxylin and eosin (H&E) whole slide images (WSIs) from 38,984 cancer patients. Unlike traditional methods that train individual models for each biomarker or cancer type, our system employs a unified model to simultaneously predict a wide range of clinically relevant molecular biomarkers across cancer types. By training the network to replicate the MSK-IMPACT targeted biomarker panel of 505 genes, it identified 80 high performing biomarkers with a mean AUえーゆー-ROC of 0.89 in 15 most common cancer types. In addition, 40 biomarkers demonstrated strong associations with specific cancer histologic subtypes. Furthermore, 58 biomarkers were associated with targets frequently assayed clinically for therapy selection and response prediction. The model can also predict the activity of five canonical signaling pathways, identify defects in DNA repair mechanisms, and predict genomic instability measured by tumor mutation burden, microsatellite instability (MSI), and chromosomal instability (CIN). The proposed model can offer potential to guide therapy selection, improve treatment efficacy, accelerate patient screening for clinical trials and provoke the interrogation of new therapeutic targets. △ Less

Submitted 20 August, 2024; v1 submitted 18 August, 2024; originally announced August 2024.

arXiv:2408.09367 [pdf, other]

Improving Lung Cancer Diagnosis and Survival Prediction with Deep Learning and CT Imaging

Authors: Xiawei Wang, James Sharpnack, Thomas C. M. Lee

Abstract: Lung cancer is a major cause of cancer-related deaths, and early diagnosis and treatment are crucial for improving patients' survival outcomes. In this paper, we propose to employ convolutional neural networks to model the non-linear relationship between the risk of lung cancer and the lungs' morphology revealed in the CT images. We apply a mini-batched loss that extends the Cox proportional hazar… ▽ More Lung cancer is a major cause of cancer-related deaths, and early diagnosis and treatment are crucial for improving patients' survival outcomes. In this paper, we propose to employ convolutional neural networks to model the non-linear relationship between the risk of lung cancer and the lungs' morphology revealed in the CT images. We apply a mini-batched loss that extends the Cox proportional hazards model to handle the non-convexity induced by neural networks, which also enables the training of large data sets. Additionally, we propose to combine mini-batched loss and binary cross-entropy to predict both lung cancer occurrence and the risk of mortality. Simulation results demonstrate the effectiveness of both the mini-batched loss with and without the censoring mechanism, as well as its combination with binary cross-entropy. We evaluate our approach on the National Lung Screening Trial data set with several 3D convolutional neural network architectures, achieving high AUC and C-index scores for lung cancer classification and survival prediction. These results, obtained from simulations and real data experiments, highlight the potential of our approach to improving the diagnosis and treatment of lung cancer. △ Less

Submitted 18 August, 2024; originally announced August 2024.

arXiv:2408.07848 [pdf]

Combinatorial synthesis and characterization of thin film Al1-xRExN (RE = Pr3+, Tb3+) heterostructural alloys

Authors: Binod Paudel, John S. Mangum, Christopher L. Rom, Kingsley Egbo, Cheng-Wei Lee, Harvey Guthrey, Sean Allen, Nancy M. Haegel, Keisuke Yazawa, Geoff L. Brennecka, Rebecca W. Smaha

Abstract: The potential impact of cation-substituted AlN-based materials, such as Al1-xScxN, Al1-xGaxN, and Al1-xBxN, with exceptional electronic, electromechanical, and dielectric properties has spurred research into this broad family of materials. Rare earth (RE) cations are particularly appealing as they could additionally impart optoelectronic or magnetic functionality. However, success in incorporating… ▽ More The potential impact of cation-substituted AlN-based materials, such as Al1-xScxN, Al1-xGaxN, and Al1-xBxN, with exceptional electronic, electromechanical, and dielectric properties has spurred research into this broad family of materials. Rare earth (RE) cations are particularly appealing as they could additionally impart optoelectronic or magnetic functionality. However, success in incorporating a significant level of RE cations into AlN has been limited so far because it is thermodynamically challenging to stabilize such heterostructural alloys. Using combinatorial co-sputtering, we synthesized Al1-xRExN (RE = Pr, Tb) thin films and performed a rapid survey of the composition-structure-property relationships as a function of RE alloying. Under our growth conditions, we observe that Al1-xPrxN maintains a phase-pure wurtzite structure until transitioning to amorphous for x>0.22. Al1-xTbxN exhibits a phase-pure wurtzite structure until x<0.15, then exhibits mixed wurtzite and rocksalt phases for 0.16<x<0.28, and finally becomes amorphous beyond that. Ellipsometry measurements reveal that the absorption onset decreases with increasing rare earth incorporation and has a strong dependence on the phases present. We observe the characteristic cathodoluminescence emission of Pr3+ and Tb3+, respectively. Using this synthesis approach, we have demonstrated incorporation of Pr and Tb into the AlN wurtzite structure up to higher compositions levels than previously reported and made the first measurements of corresponding structural and optoelectronic properties. △ Less

Submitted 14 August, 2024; originally announced August 2024.

arXiv:2408.07506 [pdf, other]

Correlators for pseudo Hermitian systems

Authors: Yao Bai, Ting-Long Feng, Suro Kim, Cheng-Yang Lee, Lei-Hua Liu, Wangping Zhao, Siyi Zhou

Abstract: Pseudo-Hermitian system is a class of non-Hermitian system with Hamiltonian satisfying the condition $ηいーた^{-1}H^\daggerηいーた=H$. We develop the in-in and Schwinger Keldysh formalism to calculate cosmological correlators for pseudo-Hermitian systems. We study a model consists of massive symplectic fermions coupled to the primordial curvature perturbation. The three-point function for the primordial curva… ▽ More Pseudo-Hermitian system is a class of non-Hermitian system with Hamiltonian satisfying the condition $ηいーた^{-1}H^\daggerηいーた=H$. We develop the in-in and Schwinger Keldysh formalism to calculate cosmological correlators for pseudo-Hermitian systems. We study a model consists of massive symplectic fermions coupled to the primordial curvature perturbation. The three-point function for the primordial curvature perturbation is computed up to one-loop and compared to earlier work where the loop correction comes from a massive scalar boson. The two results differ by a minus sign. Therefore, the one loop correction to the three-point function cannot be used to distinguished scalar bosons and symplectic fermions. To conclude, we discuss possibilities where the scalar bosons and symplectic fermions may be distinguished. △ Less

Submitted 14 August, 2024; originally announced August 2024.

Comments: 19 pages, 2 figures

arXiv:2408.05123 [pdf, other]

Sportify: Question Answering with Embedded Visualizations and Personified Narratives for Sports Video

Authors: Chunggi Lee, Tica Lin, Hanspeter Pfister, Chen Zhu-Tian

Abstract: As basketball's popularity surges, fans often find themselves confused and overwhelmed by the rapid game pace and complexity. Basketball tactics, involving a complex series of actions, require substantial knowledge to be fully understood. This complexity leads to a need for additional information and explanation, which can distract fans from the game. To tackle these challenges, we present Sportif… ▽ More As basketball's popularity surges, fans often find themselves confused and overwhelmed by the rapid game pace and complexity. Basketball tactics, involving a complex series of actions, require substantial knowledge to be fully understood. This complexity leads to a need for additional information and explanation, which can distract fans from the game. To tackle these challenges, we present Sportify, a Visual Question Answering system that integrates narratives and embedded visualization for demystifying basketball tactical questions, aiding fans in understanding various game aspects. We propose three novel action visualizations (i.e., Pass, Cut, and Screen) to demonstrate critical action sequences. To explain the reasoning and logic behind players' actions, we leverage a large-language model (LLM) to generate narratives. We adopt a storytelling approach for complex scenarios from both first and third-person perspectives, integrating action visualizations. We evaluated Sportify with basketball fans to investigate its impact on understanding of tactics, and how different personal perspectives of narratives impact the understanding of complex tactic with action visualizations. Our evaluation with basketball fans demonstrates Sportify's capability to deepen tactical insights and amplify the viewing experience. Furthermore, third-person narration assists people in getting in-depth game explanations while first-person narration enhances fans' game engagement △ Less

Submitted 9 August, 2024; originally announced August 2024.

Comments: 14 pages, 8 figures, conference

arXiv:2408.05074 [pdf]

RT-Surv: Improving Mortality Prediction After Radiotherapy with Large Language Model Structuring of Large-Scale Unstructured Electronic Health Records

Authors: Sangjoon Park, Chan Woo Wee, Seo Hee Choi, Kyung Hwan Kim, Jee Suk Chang, Hong In Yoon, Ik Jae Lee, Yong Bae Kim, Jaeho Cho, Ki Chang Keum, Chang Geol Lee, Hwa Kyung Byun, Woong Sub Koom

Abstract: Accurate patient selection is critical in radiotherapy (RT) to prevent ineffective treatments. Traditional survival prediction models, relying on structured data, often lack precision. This study explores the potential of large language models (LLMs) to structure unstructured electronic health record (EHR) data, thereby improving survival prediction accuracy through comprehensive clinical informat… ▽ More Accurate patient selection is critical in radiotherapy (RT) to prevent ineffective treatments. Traditional survival prediction models, relying on structured data, often lack precision. This study explores the potential of large language models (LLMs) to structure unstructured electronic health record (EHR) data, thereby improving survival prediction accuracy through comprehensive clinical information integration. Data from 34,276 patients treated with RT at Yonsei Cancer Center between 2013 and 2023 were analyzed, encompassing both structured and unstructured data. An open-source LLM was used to structure the unstructured EHR data via single-shot learning, with its performance compared against a domain-specific medical LLM and a smaller variant. Survival prediction models were developed using statistical, machine learning, and deep learning approaches, incorporating both structured and LLM-structured data. Clinical experts evaluated the accuracy of the LLM-structured data. The open-source LLM achieved 87.5% accuracy in structuring unstructured EHR data without additional training, significantly outperforming the domain-specific medical LLM, which reached only 35.8% accuracy. Larger LLMs were more effective, particularly in extracting clinically relevant features like general condition and disease extent, which closely correlated with patient survival. Incorporating LLM-structured clinical features into survival prediction models significantly improved accuracy, with the C-index of deep learning models increasing from 0.737 to 0.820. These models also became more interpretable by emphasizing clinically significant factors. This study shows that general-domain LLMs, even without specific medical training, can effectively structure large-scale unstructured EHR data, substantially enhancing the accuracy and interpretability of clinical predictive models. △ Less

Submitted 4 September, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

Comments: 23 pages, 2 tables, 4 figures

arXiv:2408.03822 [pdf, other]

Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields

Authors: Joo Chan Lee, Daniel Rho, Xiangyu Sun, Jong Hwan Ko, Eunbyung Park

Abstract: 3D Gaussian splatting (3DGS) has recently emerged as an alternative representation that leverages a 3D Gaussian-based representation and introduces an approximated volumetric rendering, achieving very fast rendering speed and promising image quality. Furthermore, subsequent studies have successfully extended 3DGS to dynamic 3D scenes, demonstrating its wide range of applications. However, a signif… ▽ More 3D Gaussian splatting (3DGS) has recently emerged as an alternative representation that leverages a 3D Gaussian-based representation and introduces an approximated volumetric rendering, achieving very fast rendering speed and promising image quality. Furthermore, subsequent studies have successfully extended 3DGS to dynamic 3D scenes, demonstrating its wide range of applications. However, a significant drawback arises as 3DGS and its following methods entail a substantial number of Gaussians to maintain the high fidelity of the rendered images, which requires a large amount of memory and storage. To address this critical issue, we place a specific emphasis on two key objectives: reducing the number of Gaussian points without sacrificing performance and compressing the Gaussian attributes, such as view-dependent color and covariance. To this end, we propose a learnable mask strategy that significantly reduces the number of Gaussians while preserving high performance. In addition, we propose a compact but effective representation of view-dependent color by employing a grid-based neural field rather than relying on spherical harmonics. Finally, we learn codebooks to compactly represent the geometric and temporal attributes by residual vector quantization. With model compression techniques such as quantization and entropy coding, we consistently show over 25x reduced storage and enhanced rendering speed compared to 3DGS for static scenes, while maintaining the quality of the scene representation. For dynamic scenes, our approach achieves more than 12x storage efficiency and retains a high-quality reconstruction compared to the existing state-of-the-art methods. Our work provides a comprehensive framework for 3D scene representation, achieving high performance, fast training, compactness, and real-time rendering. Our project page is available at https://maincold2.github.io/c3dgs/. △ Less

Submitted 7 August, 2024; originally announced August 2024.

Comments: Project page: https://maincold2.github.io/c3dgs/

arXiv:2408.03601 [pdf, other]

DRAMA: An Efficient End-to-end Motion Planner for Autonomous Driving with Mamba

Authors: Chengran Yuan, Zhanqi Zhang, Jiawei Sun, Shuo Sun, Zefan Huang, Christina Dao Wen Lee, Dongen Li, Yuhang Han, Anthony Wong, Keng Peng Tee, Marcelo H. Ang Jr

Abstract: Motion planning is a challenging task to generate safe and feasible trajectories in highly dynamic and complex environments, forming a core capability for autonomous vehicles. In this paper, we propose DRAMA, the first Mamba-based end-to-end motion planner for autonomous vehicles. DRAMA fuses camera, LiDAR Bird's Eye View images in the feature space, as well as ego status information, to generate… ▽ More Motion planning is a challenging task to generate safe and feasible trajectories in highly dynamic and complex environments, forming a core capability for autonomous vehicles. In this paper, we propose DRAMA, the first Mamba-based end-to-end motion planner for autonomous vehicles. DRAMA fuses camera, LiDAR Bird's Eye View images in the feature space, as well as ego status information, to generate a series of future ego trajectories. Unlike traditional transformer-based methods with quadratic attention complexity for sequence length, DRAMA is able to achieve a less computationally intensive attention complexity, demonstrating potential to deal with increasingly complex scenarios. Leveraging our Mamba fusion module, DRAMA efficiently and effectively fuses the features of the camera and LiDAR modalities. In addition, we introduce a Mamba-Transformer decoder that enhances the overall planning performance. This module is universally adaptable to any Transformer-based model, especially for tasks with long sequence inputs. We further introduce a novel feature state dropout which improves the planner's robustness without increasing training and inference times. Extensive experimental results show that DRAMA achieves higher accuracy on the NAVSIM dataset compared to the baseline Transfuser, with fewer parameters and lower computational costs. △ Less

Submitted 14 August, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

arXiv:2408.02736 [pdf, other]

Non-Hermitian entanglement dip from scaling-induced exceptional criticality

Authors: Sirui Liu, Hui Jiang, Wen-Tan Xue, Qingya Li, Jiangbin Gong, Xiaogang Liu, Ching Hua Lee

Abstract: It is well established that the entanglement entropy of a critical system generally scales logarithmically with system size. Yet, in this work, we report a new class of non-Hermitian critical transitions that exhibit dramatic divergent dips in their entanglement entropy scaling, strongly violating conventional logarithmic behavior. Dubbed scaling-induced exceptional criticality (SIEC), it transcen… ▽ More It is well established that the entanglement entropy of a critical system generally scales logarithmically with system size. Yet, in this work, we report a new class of non-Hermitian critical transitions that exhibit dramatic divergent dips in their entanglement entropy scaling, strongly violating conventional logarithmic behavior. Dubbed scaling-induced exceptional criticality (SIEC), it transcends existing non-Hermitian mechanisms such as exceptional bound states and non-Hermitian skin effect (NHSE)-induced gap closures, which are nevertheless still governed by logarithmic entanglement scaling. Key to SIEC is its strongly scale-dependent spectrum, where eigenbands exhibit an exceptional crossing only at a particular system size. As such, the critical behavior is dominated by how the generalized Brillouin zone (GBZ) sweeps through the exceptional crossing with increasing system size, and not just by the gap closure per se. We provide a general approach for constructing SIEC systems based on the non-local competition between heterogeneous NHSE pumping directions, and show how a scale-dependent GBZ can be analytically derived to excellent accuracy. Beyond 1D free fermions, SIEC is expected to occur more prevalently in higher-dimensional or even interacting systems, where antagonistic NHSE channels generically proliferate. SIEC-induced entanglement dips generalize straightforwardly to kinks in other entanglement measures such as Renyi entropy, and serve as spectacular demonstrations of how algebraic and geometric singularities in complex band structures manifest in quantum information. △ Less

Submitted 5 August, 2024; originally announced August 2024.

arXiv:2408.02368 [pdf, other]

First search for dark photon dark matter with a MADMAX prototype

Authors: J. Egge, D. Leppla-Weber, S. Knirck, B. Ary dos Santos Garcia, D. Bergermann, A. Caldwell, V. Dabhi, C. Diaconu, J. Diehl, G. Dvali, M. Ekmedžić, F. Gallo, E. Garutti, S. Heyminck, F. Hubaut, A. Ivanov, J. Jochum, P. Karst, M. Kramer, D. Kreikemeyer-Lorenzo, C. Krieger, C. Lee, A. Lindner, J. P. A. Maldonado, B. Majorovits , et al. (21 additional authors not shown)

Abstract: We report the first result from a dark photon dark matter search in the mass range from ${78.62}$ to $83.95~\mathrm{μみゅーeV}/c^2$ with a dielectric haloscope prototype for MADMAX (Magnetized Disc and Mirror Axion eXperiment). Putative dark photons would convert to observable photons within a stack consisting of three sapphire disks and a mirror. The emitted power of this system is received by an anten… ▽ More We report the first result from a dark photon dark matter search in the mass range from ${78.62}$ to $83.95~\mathrm{μみゅーeV}/c^2$ with a dielectric haloscope prototype for MADMAX (Magnetized Disc and Mirror Axion eXperiment). Putative dark photons would convert to observable photons within a stack consisting of three sapphire disks and a mirror. The emitted power of this system is received by an antenna and successively digitized using a low-noise receiver. No dark photon signal has been observed. Assuming unpolarized dark photon dark matter with a local density of $ρろー_χかい=0.3~\mathrm{GeV/cm^3}$ we exclude a dark photon to photon mixing parameter $χかい> 3.0 \times 10^{-12}$ over the full mass range and $χかい> 1.2 \times 10^{-13}$ at a mass of $80.57~\mathrm{μみゅーeV}/c^2$ with a 95\% confidence level. This is the first physics result from a MADMAX prototype and exceeds previous constraints on $χかい$ in this mass range by up to almost three orders of magnitude. △ Less

Submitted 5 August, 2024; originally announced August 2024.

Comments: v1

arXiv:2408.02241 [pdf, other]

Scalable Multilevel Monte Carlo Methods Exploiting Parallel Redistribution on Coarse Levels

Authors: Hillary R. Fairbanks, Delyan Z. Kalchev, Chak Shing Lee, Panayot S. Vassilevski

Abstract: We study an element agglomeration coarsening strategy that requires data redistribution at coarse levels when the number of coarse elements becomes smaller than the used computational units (cores). The overall procedure generates coarse elements (general unstructured unions of fine grid elements) within the framework of element-based algebraic multigrid methods (or AMGe) studied previously. The A… ▽ More We study an element agglomeration coarsening strategy that requires data redistribution at coarse levels when the number of coarse elements becomes smaller than the used computational units (cores). The overall procedure generates coarse elements (general unstructured unions of fine grid elements) within the framework of element-based algebraic multigrid methods (or AMGe) studied previously. The AMGe generated coarse spaces have the ability to exhibit approximation properties of the same order as the fine-level ones since by construction they contain the piecewise polynomials of the same order as the fine level ones. These approximation properties are key for the successful use of AMGe in multilevel solvers for nonlinear partial differential equations as well as for multilevel Monte Carlo (MLMC) simulations. The ability to coarsen without being constrained by the number of available cores, as described in the present paper, allows to improve the scalability of these solvers as well as in the overall MLMC method. The paper illustrates this latter fact with detailed scalability study of MLMC simulations applied to model Darcy equations with a stochastic log-normal permeability field. △ Less

Submitted 5 August, 2024; originally announced August 2024.

MSC Class: 65F50; 65Y05; 68W10

arXiv:2408.02093 [pdf, other]

Floquet engineering of topological phase transitions in quantum spin Hall $αあるふぁ$-$T_{3}$ system

Authors: Kok Wai Lee, Mateo Jalen Andrew Calderon, Xiang-Long Yu, Ching Hua Lee, Yee Sin Ang, Pei-Hao Fu

Abstract: Floquet engineering of topological phase transitions driven by a high-frequency time-periodic field is a promising approach to realizing new topological phases of matter distinct from static states. Here, we theoretically investigate Floquet engineering topological phase transitions in the quantum spin Hall $αあるふぁ$-$T_{3}$ system driven by an off-resonant circularly polarized light. In addition to the… ▽ More Floquet engineering of topological phase transitions driven by a high-frequency time-periodic field is a promising approach to realizing new topological phases of matter distinct from static states. Here, we theoretically investigate Floquet engineering topological phase transitions in the quantum spin Hall $αあるふぁ$-$T_{3}$ system driven by an off-resonant circularly polarized light. In addition to the quantum spin (anomalous) Hall insulator phase with multiple helical (chiral) edge states, spin-polarized topological metallic phases are observed, where the bulk topological band gap of one spin sub-band overlaps with the other gapless spin sub-band. Moreover, with a staggered potential, the topological invariants of the system depend on whether the middle band is occupied because of the breaking of particle-hole symmetry. Our work highlights the significance of Floquet engineering in realizing new topological phases in $αあるふぁ$-$T_{3}$ lattices. △ Less

Submitted 27 August, 2024; v1 submitted 4 August, 2024; originally announced August 2024.

Comments: 11 pages, 6 figures

arXiv:2408.01875 [pdf, other]

Re-Invoke: Tool Invocation Rewriting for Zero-Shot Tool Retrieval

Authors: Yanfei Chen, Jinsung Yoon, Devendra Singh Sachan, Qingze Wang, Vincent Cohen-Addad, Mohammadhossein Bateni, Chen-Yu Lee, Tomas Pfister

Abstract: Recent advances in large language models (LLMs) have enabled autonomous agents with complex reasoning and task-fulfillment capabilities using a wide range of tools. However, effectively identifying the most relevant tools for a given task becomes a key bottleneck as the toolset size grows, hindering reliable tool utilization. To address this, we introduce Re-Invoke, an unsupervised tool retrieval… ▽ More Recent advances in large language models (LLMs) have enabled autonomous agents with complex reasoning and task-fulfillment capabilities using a wide range of tools. However, effectively identifying the most relevant tools for a given task becomes a key bottleneck as the toolset size grows, hindering reliable tool utilization. To address this, we introduce Re-Invoke, an unsupervised tool retrieval method designed to scale effectively to large toolsets without training. Specifically, we first generate a diverse set of synthetic queries that comprehensively cover different aspects of the query space associated with each tool document during the tool indexing phase. Second, we leverage LLM's query understanding capabilities to extract key tool-related context and underlying intents from user queries during the inference phase. Finally, we employ a novel multi-view similarity ranking strategy based on intents to pinpoint the most relevant tools for each query. Our evaluation demonstrates that Re-Invoke significantly outperforms state-of-the-art alternatives in both single-tool and multi-tool scenarios, all within a fully unsupervised setting. Notably, on the ToolE datasets, we achieve a 20% relative improvement in nDCG@5 for single-tool retrieval and a 39% improvement for multi-tool retrieval. △ Less

Submitted 3 August, 2024; originally announced August 2024.

arXiv:2407.21329 [pdf, other]

Thermodynamic relation of accelerating black holes in anti-de Sitter spacetime

Authors: Chong Oh Lee

Abstract: We consider accelerating black holes in four-dimensional anti-de Sitter spacetime and investigate the extremality relations by examining perturbative corrections to both the entropy of black holes and their extremality bounds. It is shown that the so-called Goon and Penco (univeral) relation is also valid for accelerating black holes. Furthermore, it is found that an appropriate matching condition… ▽ More We consider accelerating black holes in four-dimensional anti-de Sitter spacetime and investigate the extremality relations by examining perturbative corrections to both the entropy of black holes and their extremality bounds. It is shown that the so-called Goon and Penco (univeral) relation is also valid for accelerating black holes. Furthermore, it is found that an appropriate matching condition is necessary between the perturbation parameter of the relation with the perturbative correction to such black holes, and the parameter with information about the conical deficits on the north and south poles with the cosmic string tensions in order to satisfy the extremality relations. This result indicates that the extremality (univeral) relation of a class of accelerating black holes holds exhibit behavior similar to the Weak Gravity Conjecture. △ Less

Submitted 13 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

Comments: 5 pages, 2 figures, 1 table, added references

arXiv:2407.21234 [pdf, other]

Asteroseismology of the Nearby K-Dwarf $σしぐま$ Draconis using the Keck Planet Finder and TESS

Authors: Marc Hon, Daniel Huber, Yaguang Li, Travis S. Metcalfe, Timothy R. Bedding, Joel Ong, Ashley Chontos, Ryan Rubenzahl, Samuel Halverson, Rafael A. García, Hans Kjeldsen, Dennis Stello, Daniel R. Hey, Tiago Campante, Andrew W. Howard, Steven R. Gibson, Kodi Rider, Arpita Roy, Ashley D. Baker, Jerry Edelstein, Chris Smith, Benjamin J. Fulton, Josh Walawender, Max Brodheim, Matt Brown , et al. (54 additional authors not shown)

Abstract: Asteroseismology of dwarf stars cooler than the Sun is very challenging due to the low amplitudes and rapid timescales of oscillations. Here, we present the asteroseismic detection of solar-like oscillations at 4-minute timescales ($νにゅー_{\mathrm{max}}\sim4300μみゅー$Hz) in the nearby K-dwarf $σしぐま$ Draconis using extreme precision Doppler velocity observations from the Keck Planet Finder and 20-second cadenc… ▽ More Asteroseismology of dwarf stars cooler than the Sun is very challenging due to the low amplitudes and rapid timescales of oscillations. Here, we present the asteroseismic detection of solar-like oscillations at 4-minute timescales ($νにゅー_{\mathrm{max}}\sim4300μみゅー$Hz) in the nearby K-dwarf $σしぐま$ Draconis using extreme precision Doppler velocity observations from the Keck Planet Finder and 20-second cadence photometry from NASA's Transiting Exoplanet Survey Satellite. The star is the coolest dwarf star to date with both velocity and luminosity observations of solar-like oscillations, having amplitudes of $5.9\pm0.8\,$cm$\,\text{s}^{-1}$ and $0.8\pm0.2$ ppm, respectively. These measured values are in excellent agreement with established luminosity-velocity amplitude relations for oscillations and provide further evidence that mode amplitudes for stars with $T_{\mathrm{eff}}<\,5500\,$K diminish in scale following a $(L/M)^{1.5}$ relation. By modeling the star's oscillation frequencies from photometric data, we measure an asteroseismic age of $4.5\pm0.9\,\rm{(ran)} \pm 1.2\,\rm{(sys)}$ Gyr. The observations demonstrate the capability of next-generation spectrographs and precise space-based photometry to extend observational asteroseismology to nearby cool dwarfs, which are benchmarks for stellar astrophysics and prime targets for directly imaging planets using future space-based telescopes. △ Less

Submitted 28 August, 2024; v1 submitted 30 July, 2024; originally announced July 2024.

Comments: Accepted for publication in The Astrophysical Journal

Showing 1–50 of 4,276 results for author: Lee, C