(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 540 results for author: Ahn, J

.
  1. arXiv:2409.08125  [pdf, other

    cond-mat.str-el cond-mat.other

    Unexpected changes in the band structure within AFM1 state of CeBi

    Authors: Yevhen Kushnirenko, Brinda Kuthanazhi, Benjamin Schrunk, Evan O'Leary, Andrew Eaton, Robert-Jan Slager, Junyeong Ahn, Lin-Lin Wang, Paul C. Canfield, Adam Kaminski

    Abstract: We perform angle-resolved photoemission spectroscopy (ARPES) measurements in conjunction with density functional theory (DFT) calculations to investigate the evolution of the electronic structure of CeBi upon a series of antiferromagnetic (AFM) transitions. We find evidence for a new AFM transition in addition to two previously known from transport studies. We demonstrate the development of an add… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: 16 pages, 4 figures

  2. arXiv:2409.02846  [pdf, other

    cs.CV

    MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling

    Authors: Jihye Ahn, Hyesong Choi, Soomin Kim, Dongbo Min

    Abstract: In stereo matching, CNNs have traditionally served as the predominant architectures. Although Transformer-based stereo models have been studied recently, their performance still lags behind CNN-based stereo models due to the inherent data scarcity issue in the stereo matching task. In this paper, we propose Masked Image Modeling Distilled Stereo matching model, termed MaDis-Stereo, that enhances l… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

  3. arXiv:2409.02545  [pdf, other

    cs.CV

    UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching

    Authors: Soomin Kim, Hyesong Choi, Jihye Ahn, Dongbo Min

    Abstract: Unlike other vision tasks where Transformer-based approaches are becoming increasingly common, stereo depth estimation is still dominated by convolution-based approaches. This is mainly due to the limited availability of real-world ground truth for stereo matching, which is a limiting factor in improving the performance of Transformer-based stereo approaches. In this paper, we propose UniTT-Stereo… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

  4. arXiv:2409.01141  [pdf, other

    cs.AR cs.LG

    Duplex: A Device for Large Language Models with Mixture of Experts, Grouped Query Attention, and Continuous Batching

    Authors: Sungmin Yun, Kwanhee Kyung, Juhwan Cho, Jaewan Choi, Jongmin Kim, Byeongho Kim, Sukhan Lee, Kyomin Sohn, Jung Ho Ahn

    Abstract: Large language models (LLMs) have emerged due to their capability to generate high-quality content across diverse contexts. To reduce their explosively increasing demands for computing resources, a mixture of experts (MoE) has emerged. The MoE layer enables exploiting a huge number of parameters with less computation. Applying state-of-the-art continuous batching increases throughput; however, it… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 15 pages, 16 figures, accepted at MICRO 2024

  5. arXiv:2408.16225  [pdf, ps, other

    math.AP

    Global boundedness and blow-up in a repulsive chemotaxis-consumption system in higher dimensions

    Authors: Jaewook Ahn, Kyungkeun Kang, Dongkwang Kim

    Abstract: This paper investigates the repulsive chemotaxis-consumption model \begin{align*} \partial_t u &= \nabla \cdot (D(u) \nabla u) + \nabla \cdot (u \nabla v), \\ 0 &= Δでるたv - uv \end{align*} in an $n$-dimensional ball, $n \ge 3$, where the diffusion coefficient $D$ is an appropriate extension of the function $0\leξくしー\mapsto(1+ξくしー)^{m-1}$ for some $m>0$. Under the boundary conditions \begin{equation*} νにゅー\cdot… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    MSC Class: 35B44; 35K51; 92C17

  6. arXiv:2408.10156  [pdf, other

    cond-mat.mtrl-sci

    Stacking Polymorphism of PtSe$_{2}$: Its Implication to Layer-dependent Metal-insulator Transitions

    Authors: Jeonghwan Ahn, Iuegyun Hong, Gwangyoung Lee, Hyeondeok Shin, Anouar Benali, Yongkyung Kwon, Jaron T. Krogel

    Abstract: Using diffusion Monte Carlo (DMC) and density functional theory (DFT) calculations, we examined the structural stability and interlayer binding properties of PtSe$_2$, a representative transition metal dichalcogenide (TMD) with strong interlayer interaction. Our DMC results for the bilayer revealed that AA and AB-r stacking modes are nearly degenerate, highlighting the significant role of interlay… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 20 pages. 6 figures

  7. arXiv:2408.09696  [pdf, other

    math.OC

    Optimal Replenishment Strategy for Satellite Constellation with Dual Supply Modes

    Authors: Jaewoo Kim, Jaemyung Ahn, Taehyun Sung

    Abstract: This paper proposes a novel inventory management model for the replenishment strategy of a satellite mega-constellation incorporating dual supply modes: normal and auxiliary. The proposed framework employs an indirect channel for normal supply, wherein spare satellites are initially injected into a parking orbit before transferring to the target orbital plane via propulsion systems and orbital per… ▽ More

    Submitted 8 September, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: 41 pages, 10 figures

  8. arXiv:2408.04208  [pdf, other

    astro-ph.SR astro-ph.EP math.NA physics.space-ph

    Visibility Analysis of the Sun as Viewed from Multiple Spacecraft at the Sun-Earth Lagrange Points

    Authors: Jinsung Lee, Sung-Hong Park, Arik Posner, Kyung-Suk Cho, Jaemyung Ahn

    Abstract: Beyond the Sun-Earth line, spacecraft equipped with various solar telescopes are intended to be deployed at several different vantage points in the heliosphere to carry out coordinated, multi-view observations of the Sun and its dynamic activities. In this context, we investigate solar visibility by imaging instruments onboard the spacecraft orbiting the Sun-Earth Lagrange points L1, L4 and L5, re… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  9. arXiv:2407.18505  [pdf, other

    eess.AS

    VoxSim: A perceptual voice similarity dataset

    Authors: Junseok Ahn, Youkyum Kim, Yeunju Choi, Doyeop Kwak, Ji-Hoon Kim, Seongkyu Mun, Joon Son Chung

    Abstract: This paper introduces VoxSim, a dataset of perceptual voice similarity ratings. Recent efforts to automate the assessment of speech synthesis technologies have primarily focused on predicting mean opinion score of naturalness, leaving speaker voice similarity relatively unexplored due to a lack of extensive training data. To address this, we generate about 41k utterance pairs from the VoxCeleb dat… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: INTERSPEECH 2024. The dataset is available from https://mm.kaist.ac.kr/projects/voxsim/

  10. arXiv:2407.16141  [pdf, other

    econ.GN

    Stock-driven Household Attention

    Authors: Hie Joo Ahn, Shihan Xie

    Abstract: We investigate the effects of stockholding on households' attention to the macroeconomy. Households' attentiveness is measured by their accuracy of inflation expectations and perceptions. Relative to non-stockholders, stockholders produce more accurate inflation forecasts and backcasts, disagree less about future inflation, and adjust their outlook more responsively to news, suggesting that stock-… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  11. arXiv:2407.13055  [pdf, other

    cs.CR cs.PF

    Cheddar: A Swift Fully Homomorphic Encryption Library for CUDA GPUs

    Authors: Jongmin Kim, Wonseok Choi, Jung Ho Ahn

    Abstract: Fully homomorphic encryption (FHE) is a cryptographic technology capable of resolving security and privacy problems in cloud computing by encrypting data in use. However, FHE introduces tremendous computational overhead for processing encrypted data, causing FHE workloads to become 2-6 orders of magnitude slower than their unencrypted counterparts. To mitigate the overhead, we propose Cheddar, an… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 12 pages, 5 figures

  12. arXiv:2407.11368  [pdf

    cs.CL

    Ancient Korean Archive Translation: Comparison Analysis on Statistical phrase alignment, LLM in-context learning, and inter-methodological approach

    Authors: Sojung Lucia Kim, Taehong Jang, Joonmo Ahn

    Abstract: This study aims to compare three methods for translating ancient texts with sparse corpora: (1) the traditional statistical translation method of phrase alignment, (2) in-context LLM learning, and (3) proposed inter methodological approach - statistical machine translation method using sentence piece tokens derived from unified set of source-target corpus. The performance of the proposed approach… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: ACL2024 submitted

  13. arXiv:2407.11017  [pdf, other

    cs.CL cs.AI cs.LG

    Direct-Inverse Prompting: Analyzing LLMs' Discriminative Capacity in Self-Improving Generation

    Authors: Jihyun Janice Ahn, Ryo Kamoi, Lu Cheng, Rui Zhang, Wenpeng Yin

    Abstract: Mainstream LLM research has primarily focused on enhancing their generative capabilities. However, even the most advanced LLMs experience uncertainty in their outputs, often producing varied results on different runs or when faced with minor changes in input, despite no substantial change in content. Given multiple responses from the same LLM to the same input, we advocate leveraging the LLMs' dis… ▽ More

    Submitted 26 June, 2024; originally announced July 2024.

    Comments: 4 pages, 3 tables

  14. arXiv:2407.10558  [pdf, other

    cs.CV cs.LG

    ConTEXTure: Consistent Multiview Images to Texture

    Authors: Jaehoon Ahn, Sumin Cho, Harim Jung, Kibeom Hong, Seonghoon Ban, Moon-Ryul Jung

    Abstract: We introduce ConTEXTure, a generative network designed to create a texture map/atlas for a given 3D mesh using images from multiple viewpoints. The process begins with generating a front-view image from a text prompt, such as 'Napoleon, front view', describing the 3D mesh. Additional images from different viewpoints are derived from this front-view image and camera poses relative to it. ConTEXTure… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 11 pages, 7 figures

  15. arXiv:2407.05883  [pdf, other

    math.CO

    A coarse Erdős-Pósa theorem

    Authors: Jungho Ahn, J. Pascal Gollin, Tony Huynh, O-joung Kwon

    Abstract: An \emph{induced packing} of cycles in a graph is a set of vertex-disjoint cycles with no edges between them. We generalise the classic Erdős-Pósa theorem to induced packings of cycles. More specifically, we show that there exists a function ${f(k) = \mathcal{O}(k \log k)}$ such that for every positive integer ${k}$, every graph $G$ contains either an induced packing of $k$ cycles or a set $X$ of… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 27 pages, 3 figures

    MSC Class: 05C38; 05C70; 05C85

  16. arXiv:2407.02026  [pdf, other

    quant-ph physics.atom-ph

    Programming higher-order interactions of Rydberg atoms

    Authors: Andrew Byun, Seokho Jeong, Jaewook Ahn

    Abstract: Higher-order interactions in spin-based Hamiltonians are crucial in addressing numerous fundamentally significant physical problems. In this work, Rydberg-atom graph gadgets are introduced to effectively program $K$-th order interactions within a Rydberg atom system. This approach facilitates the determination of the ground states of an Ising-type Hamiltonian, encoded to solve higher-order unconst… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 10 pages, 5 figures

  17. arXiv:2407.00965  [pdf, other

    hep-ex

    Measurement of the integrated luminosity of data samples collected during 2019-2022 by the Belle II experiment

    Authors: The Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, J. K. Ahn, H. Aihara, N. Akopov, A. Aloisio, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, M. Barrett, J. Baudot, A. Baur, A. Beaubien , et al. (382 additional authors not shown)

    Abstract: A series of data samples was collected with the Belle II detector at the SuperKEKB collider from March 2019 to June 2022. We determine the integrated luminosities of these data samples using three distinct methodologies involving Bhabha ($e^+e^- \to e^+e^-(nγがんま)$), digamma ($e^+e^- \to γがんまγがんま(nγがんま)$), and dimuon ($e^+e^- \to μみゅー^+ μみゅー^- (nγがんま)$) events. The total integrated luminosity obtained with Bhabha, diga… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 12 pages, 3 figures

    Report number: Belle II Preprint 2024-019; KEK Preprint 2024-16

  18. arXiv:2406.18750  [pdf, ps, other

    math.AP

    Stationary states of a chemotaxis consumption system with singular sensitivity and inhomogeneous boundary conditions

    Authors: Jaewook Ahn, Johannes Lankeit

    Abstract: For given total mass $m>0$ we show unique solvability of the stationary chemotaxis-consumption model \[ \begin{cases} 0= Δでるたu - χかい\nabla \cdot (\frac{u}{v} \nabla v) \\ 0= Δでるたv - uv \\ \int_Ωおめがu = m \end{cases} \] under no-flux-Dirichlet boundary conditions in bounded smooth domains $Ωおめが\subset \mathbb{R}^2$ and $Ωおめが=B_R(0)\subset \mathbb{R}^d$, $d\ge 3$.

    Submitted 26 June, 2024; originally announced June 2024.

    MSC Class: 35J25; 92C17; 35Q92

  19. Search for charmed baryons in the $Λらむだ_c^+ηいーた$ system and measurement of the branching fractions of $Λらむだ_c(2880)^+$ and $Λらむだ_c(2940)^+$ decaying to $Λらむだ_c^+ηいーた$ and $pD^0$ relative to $Σしぐま_c(2455)πぱい$

    Authors: Belle Collaboration, S. X. Li, C. P. Shen, I. Adachi, J. K. Ahn, H. Aihara, D. M. Asner, H. Atmacan, T. Aushev, R. Ayad, Sw. Banerjee, K. Belous, J. Bennett, M. Bessner, T. Bilka, D. Biswas, D. Bodrov, A. Bozek, M. Bračko, P. Branchini, T. E. Browder, A. Budano, M. Campajola, M. -C. Chang, B. G. Cheon , et al. (103 additional authors not shown)

    Abstract: We search for excited charmed baryons in the $Λらむだ_c^+ηいーた$ system using a data sample corresponding to an integrated luminosity of 980 $\rm fb^{-1}$. The data were collected by the Belle detector at the KEKB $e^{+}$$e^{-}$ asymmetric-energy collider. No significant signals are found in the $Λらむだ_c^+ηいーた$ mass spectrum, including the known $Λらむだ_c(2880)^+$ and $Λらむだ_c(2940)^+$. Clear $Λらむだ_c(2880)^+$ and… ▽ More

    Submitted 28 July, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 figures, accepted for publication as a Regular Article in Physical Review D

    Report number: Belle Preprint: 2024-06;KEK Preprint: 2024-15

    Journal ref: Phys. Rev. D 110, 032021 (2024)

  20. arXiv:2406.15709  [pdf, other

    cs.CR

    I Experienced More than 10 DeFi Scams: On DeFi Users' Perception of Security Breaches and Countermeasures

    Authors: Mingyi Liu, Jun Ho Huh, HyungSeok Han, Jaehyuk Lee, Jihae Ahn, Frank Li, Hyoungshick Kim, Taesoo Kim

    Abstract: Decentralized Finance (DeFi) offers a whole new investment experience and has quickly emerged as an enticing alternative to Centralized Finance (CeFi). Rapidly growing market size and active users, however, have also made DeFi a lucrative target for scams and hacks, with 1.95 billion USD lost in 2023. Unfortunately, no prior research thoroughly investigates DeFi users' security risk awareness leve… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: In Proceedings of the 33rd USENIX Security Symposium, Philadelphia, PA, USA, Aug. 2024

  21. arXiv:2406.12233  [pdf, other

    cs.AI cs.CL cs.CV

    SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization

    Authors: Young Jin Ahn, Jungwoo Park, Sangha Park, Jonghyun Choi, Kee-Eung Kim

    Abstract: Visual Speech Recognition (VSR) stands at the intersection of computer vision and speech recognition, aiming to interpret spoken content from visual cues. A prominent challenge in VSR is the presence of homophenes-visually similar lip gestures that represent different phonemes. Prior approaches have sought to distinguish fine-grained visemes by aligning visual and auditory semantics, but often fel… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  22. arXiv:2406.05963  [pdf, other

    cs.CV cs.AI

    Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024

    Authors: Jinwoo Ahn, Junhyeok Park, Min-Jun Kim, Kang-Hyeon Kim, So-Yeong Sohn, Yun-Ji Lee, Du-Seong Chang, Yu-Jung Heo, Eun-Sol Kim

    Abstract: In this paper, the solution of HYU MLLAB KT Team to the Multimodal Algorithmic Reasoning Task: SMART-101 CVPR 2024 Challenge is presented. Beyond conventional visual question-answering problems, the SMART-101 challenge aims to achieve human-level multimodal understanding by tackling complex visio-linguistic puzzles designed for children in the 6-8 age group. To solve this problem, we suggest two m… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  23. arXiv:2406.05602  [pdf, other

    cs.CV cs.CL

    Can Prompt Modifiers Control Bias? A Comparative Analysis of Text-to-Image Generative Models

    Authors: Philip Wootaek Shin, Jihyun Janice Ahn, Wenpeng Yin, Jack Sampson, Vijaykrishnan Narayanan

    Abstract: It has been shown that many generative models inherit and amplify societal biases. To date, there is no uniform/systematic agreed standard to control/adjust for these biases. This study examines the presence and manipulation of societal biases in leading text-to-image models: Stable Diffusion, DALL-E 3, and Adobe Firefly. Through a comprehensive analysis combining base prompts with modifiers and t… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  24. arXiv:2406.02753  [pdf, other

    cond-mat.mtrl-sci

    Towards improved property prediction of two-dimensional (2D) materials using many-body Quantum Monte Carlo methods

    Authors: Daniel Wines, Jeonghwan Ahn, Anouar Benali, Paul R. C. Kent, Jaron T. Krogel, Yongkyung Kwon, Lubos Mitas, Fernando A. Reboredo, Brenda Rubenstein, Kayahan Saritas, Hyeondeok Shin, Ivan Štich, Can Ataca

    Abstract: The field of two-dimensional (2D) materials has grown dramatically in the last two decades. 2D materials can be utilized for a variety of next-generation optoelectronic, spintronic, clean energy, and quantum computation applications. These 2D structures, which are often exfoliated from layered van der Waals (vdW) materials, possess highly inhomogeneous electron densities and can possess short- and… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  25. arXiv:2406.01963  [pdf

    cond-mat.mes-hall physics.app-ph

    Diamond molecular balance: Revolutionizing high-resolution mass spectrometry from MDa to TDa at room temperature

    Authors: Donggeun Lee, Seung-Woo Jeon, Chang-Hwan Yi, Yang-Hee Kim, Yeeun Choi, Sang-Hun Lee, Jinwoong Cha, Seung-Bo Shim, Junho Suh, Il-Young Kim, Dongyeon Daniel Kang, Hojoong Jung, Cherlhyun Jeong, Jae-pyoung Ahn, Hee Chul Park, Sang-Wook Han, Chulki Kim

    Abstract: The significance of mass spectrometry lies in its unparalleled ability to accurately identify and quantify molecules in complex samples, providing invaluable insights into molecular structures and interactions. Here, we leverage diamond nanostructures as highly sensitive mass sensors by utilizing a self-excitation mechanism under an electron beam in a conventional scanning electron microscope (SEM… ▽ More

    Submitted 25 July, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 16 pages, 4 figures

  26. arXiv:2405.18027  [pdf, other

    cs.CL

    TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models

    Authors: Jaewoo Ahn, Taehyun Lee, Junyoung Lim, Jin-Hwa Kim, Sangdoo Yun, Hwaran Lee, Gunhee Kim

    Abstract: While Large Language Models (LLMs) can serve as agents to simulate human behaviors (i.e., role-playing agents), we emphasize the importance of point-in-time role-playing. This situates characters at specific moments in the narrative progression for three main reasons: (i) enhancing users' narrative immersion, (ii) avoiding spoilers, and (iii) fostering engagement in fandom role-playing. To accurat… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: ACL 2024 Findings. Code and dataset are released at https://ahnjaewoo.github.io/timechara

  27. arXiv:2405.16303  [pdf, other

    quant-ph cond-mat.mtrl-sci

    Extended spin relaxation times of optically addressed telecom defects in silicon carbide

    Authors: Jonghoon Ahn, Christina Wicker, Nolan Bitner, Michael T. Solomon, Benedikt Tissot, Guido Burkard, Alan M. Dibos, Jiefei Zhang, F. Joseph Heremans, David D. Awschalom

    Abstract: Optically interfaced solid-state defects are promising candidates for quantum communication technologies. The ideal defect system would feature bright telecom emission, long-lived spin states, and a scalable material platform, simultaneously. Here, we employ one such system, vanadium (V4+) in silicon carbide (SiC), to establish a potential telecom spin-photon interface within a mature semiconducto… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 11 pages, 6 figures

  28. arXiv:2405.11390  [pdf, other

    hep-ex

    Search for Two-Body $B$ Meson Decays to $Λらむだ^{0}$ and $Ωおめが^{(*)0}_{c}$

    Authors: Belle Collaboration, V. Savinov, I. Adachi, J. K. Ahn, H. Aihara, D. M. Asner, H. Atmacan, R. Ayad, Sw. Banerjee, J. Bennett, M. Bessner, V. Bhardwaj, D. Biswas, A. Bobrov, D. Bodrov, J. Borah, M. Bračko, P. Branchini, T. E. Browder, A. Budano, D. Červenkov, M. -C. Chang, P. Chang, B. G. Cheon, K. Cho , et al. (124 additional authors not shown)

    Abstract: We report the results of the first search for Standard Model and baryon-number-violating two-body decays of the neutral $B$ mesons to $Λらむだ^{0}$ and $Ωおめが^{(*)0}_c$ using 711~${\rm fb^{-1}}$ of data collected at the $Υうぷしろん(4S)$ resonance with the Belle detector at the KEKB asymmetric-energy $e^+ e^-$ collider. We observe no evidence of signal from any such decays and set 95\% confidence-level upper limits o… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: 6 pages, 2 figures, submitted to PRD(L)

    Report number: Belle Preprint 2024-04, KEK Preprint 2024-5

  29. arXiv:2405.10272  [pdf, other

    cs.CV cs.AI cs.SD eess.AS eess.IV

    Faces that Speak: Jointly Synthesising Talking Face and Speech from Text

    Authors: Youngjoon Jang, Ji-Hoon Kim, Junseok Ahn, Doyeop Kwak, Hong-Sun Yang, Yoon-Cheol Ju, Il-Hwan Kim, Byeong-Yeol Kim, Joon Son Chung

    Abstract: The goal of this work is to simultaneously generate natural talking faces and speech outputs from text. We achieve this by integrating Talking Face Generation (TFG) and Text-to-Speech (TTS) systems into a unified framework. We address the main challenges of each task: (1) generating a range of head poses representative of real-world scenarios, and (2) ensuring voice consistency despite variations… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: CVPR 2024

  30. arXiv:2405.02499  [pdf, other

    cs.CR cs.AR

    DRAMScope: Uncovering DRAM Microarchitecture and Characteristics by Issuing Memory Commands

    Authors: Hwayong Nam, Seungmin Baek, Minbok Wi, Michael Jaemin Kim, Jaehyun Park, Chihun Song, Nam Sung Kim, Jung Ho Ahn

    Abstract: The demand for precise information on DRAM microarchitectures and error characteristics has surged, driven by the need to explore processing in memory, enhance reliability, and mitigate security vulnerability. Nonetheless, DRAM manufacturers have disclosed only a limited amount of information, making it difficult to find specific information on their DRAM microarchitectures. This paper addresses t… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: To appear at the 51st IEEE/ACM International Symposium on Computer Architecture (ISCA)

  31. arXiv:2404.14687  [pdf, other

    cs.MM cs.AI cs.CL cs.CV

    Pegasus-v1 Technical Report

    Authors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, Jin-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon , et al. (19 additional authors not shown)

    Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's archi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  32. arXiv:2404.12461  [pdf, other

    cond-mat.mtrl-sci

    Exploring interlayer coupling in the twisted bilayer PtTe$_{2}$

    Authors: Jeonghwan Ahn, Seoung-Hun Kang, Mina Yoon, Jaron T. Krogel

    Abstract: We have investigated interlayer interactions in the bilayer PtTe$_{2}$ system, which influence the electronic energy bands near the Fermi levels. Our diffusion Monte Carlo (DMC) calculations for the high-symmetry bilayer stackings (AA, AB, AC) manifest distinct interlayer binding characteristics among the stacking modes by revealing significantly different interlayer separations depending on the s… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  33. arXiv:2404.03602  [pdf, other

    cs.CL

    Evaluating LLMs at Detecting Errors in LLM Responses

    Authors: Ryo Kamoi, Sarkar Snigdha Sarathi Das, Renze Lou, Jihyun Janice Ahn, Yilun Zhao, Xiaoxin Lu, Nan Zhang, Yusen Zhang, Ranran Haoran Zhang, Sujeeth Reddy Vummanthala, Salika Dave, Shaobo Qin, Arman Cohan, Wenpeng Yin, Rui Zhang

    Abstract: With Large Language Models (LLMs) being widely used across various tasks, detecting errors in their responses is increasingly crucial. However, little research has been conducted on error detection of LLM responses. Collecting error annotations on LLM responses is challenging due to the subjective nature of many NLP tasks, and thus previous research focuses on tasks of little practical value (e.g.… ▽ More

    Submitted 27 July, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: COLM 2024, 46 pages, Benchmark and code: https://github.com/psunlpgroup/ReaLMistake

  34. arXiv:2404.02155  [pdf, other

    cs.CV

    Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields

    Authors: Joshua Ahn, Haochen Wang, Raymond A. Yeh, Greg Shakhnarovich

    Abstract: Scale-ambiguity in 3D scene dimensions leads to magnitude-ambiguity of volumetric densities in neural radiance fields, i.e., the densities double when scene size is halved, and vice versa. We call this property alpha invariance. For NeRFs to better maintain alpha invariance, we recommend 1) parameterizing both distance and volume densities in log space, and 2) a discretization-agnostic initializat… ▽ More

    Submitted 16 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: CVPR 2024. project page https://pals.ttic.edu/p/alpha-invariance

  35. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  36. arXiv:2403.20109  [pdf, ps, other

    cs.LG cs.AI q-bio.BM

    Mol-AIR: Molecular Reinforcement Learning with Adaptive Intrinsic Rewards for Goal-directed Molecular Generation

    Authors: Jinyeong Park, Jaegyoon Ahn, Jonghwan Choi, Jibum Kim

    Abstract: Optimizing techniques for discovering molecular structures with desired properties is crucial in artificial intelligence(AI)-based drug discovery. Combining deep generative models with reinforcement learning has emerged as an effective strategy for generating molecules with specific properties. Despite its potential, this approach is ineffective in exploring the vast chemical space and optimizing… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  37. arXiv:2403.14963  [pdf, other

    cs.CR

    Enabling Physical Localization of Uncooperative Cellular Devices

    Authors: Taekkyung Oh, Sangwook Bae, Junho Ahn, Yonghwa Lee, Dinh-Tuan Hoang, Min Suk Kang, Nils Ole Tippenhauer, Yongdae Kim

    Abstract: In cellular networks, it can become necessary for authorities to physically locate user devices for tracking criminals or illegal devices. While cellular operators can provide authorities with cell information the device is camping on, fine-grained localization is still required. Therefore, the authorized agents trace the device by monitoring its uplink signals. However, tracking the uplink signal… ▽ More

    Submitted 25 March, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  38. arXiv:2403.05591  [pdf, other

    cs.HC cs.LG

    Data-Driven Ergonomic Risk Assessment of Complex Hand-intensive Manufacturing Processes

    Authors: Anand Krishnan, Xingjian Yang, Utsav Seth, Jonathan M. Jeyachandran, Jonathan Y. Ahn, Richard Gardner, Samuel F. Pedigo, Adriana, Blom-Schieber, Ashis G. Banerjee, Krithika Manohar

    Abstract: Hand-intensive manufacturing processes, such as composite layup and textile draping, require significant human dexterity to accommodate task complexity. These strenuous hand motions often lead to musculoskeletal disorders and rehabilitation surgeries. We develop a data-driven ergonomic risk assessment system with a special focus on hand and finger activity to better identify and address ergonomic… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 26 pages, 7 figures

  39. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  40. arXiv:2403.04340  [pdf, other

    hep-ex

    Search for a pentaquark state decaying into $pJ/ψぷさい$ in $Υうぷしろん(1,2S)$ inclusive decays at Belle

    Authors: Belle Collaboration, X. Dong, H. Y. Zhang, X. L. Wang, I. Adachi, J. K. Ahn, H. Aihara, S. Al Said, D. M. Asner, H. Atmacan, R. Ayad, S. Bahinipati, Sw. Banerjee, M. Bessner, V. Bhardwaj, D. Biswas, D. Bodrov, A. Bozek, M. Bračko, P. Branchini, T. E. Browder, A. Budano, M. Campajola, D. Červenkov, M. -C. Chang , et al. (139 additional authors not shown)

    Abstract: Using the data samples of 102 million $Υうぷしろん(1S)$ and 158 million $Υうぷしろん(2S)$ events collected by the Belle detector, we search for a pentaquark state in the $pJ/ψぷさい$ final state from $Υうぷしろん(1,2S)$ inclusive decays. Here, the charge-conjugate $\bar{p}J/ψぷさい$ is included. We observe clear $pJ/ψぷさい$ production in $Υうぷしろん(1,2S)$ decays and measure the branching fractions to be… ▽ More

    Submitted 11 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Report number: Belle Preprint 2024-02, KEK Preprint 2023-54

  41. arXiv:2402.02648  [pdf, other

    cs.CL cs.AI

    Recursive Chain-of-Feedback Prevents Performance Degradation from Redundant Prompting

    Authors: Jinwoo Ahn, Kyuseung Shin

    Abstract: Large Language Models (LLMs) frequently struggle with complex reasoning tasks, failing to construct logically sound steps towards the solution. In response to this behavior, users often try prompting the LLMs repeatedly in hopes of reaching a better response. This paper studies such repetitive behavior and its effect by defining a novel setting, Chain-of-Feedback (CoF). The setting takes questions… ▽ More

    Submitted 1 March, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Still Ongoing Work; 8 Pages; 2 Figures

  42. arXiv:2402.02447  [pdf, other

    cs.LG cs.CL

    Breaking MLPerf Training: A Case Study on Optimizing BERT

    Authors: Yongdeok Kim, Jaehyung Ahn, Myeongwoo Kim, Changin Choi, Heejae Kim, Narankhuu Tuvshinjargal, Seungwon Lee, Yanzi Zhang, Yuan Pei, Xiongzhan Linghu, Jingkun Ma, Lin Chen, Yuehua Dai, Sungjoo Yoo

    Abstract: Speeding up the large-scale distributed training is challenging in that it requires improving various components of training including load balancing, communication, optimizers, etc. We present novel approaches for fast large-scale training of BERT model which individually ameliorates each component thereby leading to a new level of BERT training performance. Load balancing is imperative in distri… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Total 15 pages (Appendix 3 pages)

  43. arXiv:2402.00157  [pdf, other

    cs.CL

    Large Language Models for Mathematical Reasoning: Progresses and Challenges

    Authors: Janice Ahn, Rishu Verma, Renze Lou, Di Liu, Rui Zhang, Wenpeng Yin

    Abstract: Mathematical reasoning serves as a cornerstone for assessing the fundamental cognitive capabilities of human intelligence. In recent times, there has been a notable surge in the development of Large Language Models (LLMs) geared towards the automated resolution of mathematical problems. However, the landscape of mathematical problem types is vast and varied, with LLM-oriented techniques undergoing… ▽ More

    Submitted 5 April, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

    Comments: EACL 2024 Student Research Workshop, 8 pages

  44. arXiv:2401.02639  [pdf, ps, other

    math.CO

    Spectral integral variation of signed graphs

    Authors: Jungho Ahn, Cheolwon Heo, Sunyo Moon

    Abstract: We characterize when the spectral variation of the signed Laplacian matrices is integral after a new edge is added to a signed graph. As an application, for every fixed signed complete graph, we fully characterize the class of signed graphs to which one can recursively add new edges keeping spectral integral variation to make the signed complete graph.

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 18 pages, 0 figure

    MSC Class: 05C22; 05C50; 15A18

  45. arXiv:2312.15288  [pdf, other

    cs.CV stat.ML

    Understanding normalization in contrastive representation learning and out-of-distribution detection

    Authors: Tai Le-Gia, Jaehyun Ahn

    Abstract: Contrastive representation learning has emerged as an outstanding approach for anomaly detection. In this work, we explore the $\ell_2$-norm of contrastive features and its applications in out-of-distribution detection. We propose a simple method based on contrastive learning, which incorporates out-of-distribution data by discriminating against normal samples in the contrastive layer space. Our a… ▽ More

    Submitted 8 April, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

  46. arXiv:2312.12488  [pdf, other

    cs.LG cs.CR cs.CV

    Foreseeing Reconstruction Quality of Gradient Inversion: An Optimization Perspective

    Authors: HyeongGwon Hong, Yooshin Cho, Hanbyel Cho, Jaesung Ahn, Junmo Kim

    Abstract: Gradient inversion attacks can leak data privacy when clients share weight updates with the server in federated learning (FL). Existing studies mainly use L2 or cosine distance as the loss function for gradient matching in the attack. Our empirical investigation shows that the vulnerability ranking varies with the loss function used. Gradient norm, which is commonly used as a vulnerability proxy f… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: To appear in AAAI 2024

  47. arXiv:2312.11881  [pdf, other

    cs.CL cs.AI

    Punctuation restoration Model and Spacing Model for Korean Ancient Document

    Authors: Taehong Jang, Joonmo Ahn, Sojung Lucia Kim

    Abstract: In Korean ancient documents, there is no spacing or punctuation, and they are written in classical Chinese characters. This makes it challenging for modern individuals and translation models to accurately interpret and translate them. While China has models predicting punctuation and spacing, applying them directly to Korean texts is problematic due to data differences. Therefore, we developed the… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 5 Pages, 2 Figures

  48. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  49. arXiv:2312.08703  [pdf, other

    quant-ph physics.atom-ph

    A Rydberg-atom approach to the integer factorization problem

    Authors: Juyoung Park, Seokho Jeong, Minhyuk Kim, Kangheun Kim, Andrew Byun, Louis Vignoli, Louis-Paul Henry, Loïc Henriet, Jaewook Ahn

    Abstract: The task of factoring integers poses a significant challenge in modern cryptography, and quantum computing holds the potential to efficiently address this problem compared to classical algorithms. Thus, it is crucial to develop quantum computing algorithms to address this problem. This study introduces a quantum approach that utilizes Rydberg atoms to tackle the factorization problem. Experimental… ▽ More

    Submitted 31 January, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 12 pages, 5 figures

  50. arXiv:2312.08476  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall

    Enhanced Magnetization by Defect-Assisted Exciton Recombination in Atomically Thin CrCl$_3$

    Authors: Xin-Yue Zhang, Thomas K. M. Graham, Hyeonhu Bae, Yu-Xuan Wang, Nazar Delegan, Jonghoon Ahn, Zhi-Cheng Wang, Jakub Regner, Kenji Watanabe, Takashi Taniguchi, Minkyung Jung, Zdeněk Sofer, Fazel Tafti, David D. Awschalom, F. Joseph Heremans, Binghai Yan, Brian B. Zhou

    Abstract: Two dimensional (2D) semiconductors present unique opportunities to intertwine optical and magnetic functionalities and to tune these performances through defects and dopants. Here, we integrate exciton pumping into a quantum sensing protocol on nitrogen-vacancy centers in diamond to image the optically-induced transient stray fields in few-layer, antiferromagnetic CrCl$_3$. We discover that excit… ▽ More

    Submitted 26 August, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 11 pages, 8 figures