(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–15 of 15 results for author: Jeon, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.19795  [pdf, other

    cs.CL cs.AI

    SLM as Guardian: Pioneering AI Safety with Small Language Models

    Authors: Ohjoon Kwon, Donghyeon Jeon, Nayoung Choi, Gyu-Hwung Cho, Changbong Kim, Hyunwoo Lee, Inho Kang, Sun Kim, Taiwoo Park

    Abstract: Most prior safety research of large language models (LLMs) has focused on enhancing the alignment of LLMs to better suit the safety requirements of humans. However, internalizing such safeguard features into larger models brought challenges of higher training cost and unintended degradation of helpfulness. To overcome such challenges, a modular approach employing a smaller LLM to detect harmful us… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2405.17878  [pdf, other

    cs.LG cs.AI

    An Information Theoretic Metric for Evaluating Unlearning Models

    Authors: Dongjae Jeon, Wonje Jeung, Taeheon Kim, Albert No, Jonghyun Choi

    Abstract: Machine unlearning (MU) addresses privacy concerns by removing information of `forgetting data' samples from trained models. Typically, evaluating MU methods involves comparing unlearned models to those retrained from scratch without forgetting data, using metrics such as membership inference attacks (MIA) and accuracy measurements. These evaluations implicitly assume that if the output logits of… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2404.08672  [pdf, other

    cs.IR cs.AI cs.CL cs.CY cs.LG

    Taxonomy and Analysis of Sensitive User Queries in Generative AI Search

    Authors: Hwiyeol Jo, Taiwoo Park, Nayoung Choi, Changbong Kim, Ohjoon Kwon, Donghyeon Jeon, Hyunwoo Lee, Eui-Hyeon Lee, Kyoungho Shin, Sun Suk Lim, Kyungmi Kim, Jihye Lee, Sun Kim

    Abstract: Although there has been a growing interest among industries to integrate generative LLMs into their services, limited experiences and scarcity of resources acts as a barrier in launching and servicing large-scale LLM-based conversational services. In this paper, we share our experiences in developing and operating generative AI models within a national-scale search engine, with a specific focus on… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  4. arXiv:2402.17812  [pdf, other

    cs.LG cs.CL

    DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation

    Authors: Sunghyeon Woo, Baeseong Park, Byeongwook Kim, Minjung Jo, Sejung Kwon, Dongsuk Jeon, Dongsoo Lee

    Abstract: Training deep neural networks typically involves substantial computational costs during both forward and backward propagation. The conventional layer dropping techniques drop certain layers during training for reducing the computations burden. However, dropping layers during forward propagation adversely affects the training process by degrading accuracy. In this paper, we propose Dropping Backwar… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  5. arXiv:2308.11199  [pdf, other

    cs.CV cs.AI cs.LG

    ConcatPlexer: Additional Dim1 Batching for Faster ViTs

    Authors: Donghoon Han, Seunghyeon Seo, Donghyeon Jeon, Jiho Jang, Chaerin Kong, Nojun Kwak

    Abstract: Transformers have demonstrated tremendous success not only in the natural language processing (NLP) domain but also the field of computer vision, igniting various creative approaches and applications. Yet, the superior performance and modeling flexibility of transformers came with a severe increase in computation costs, and hence several works have proposed methods to reduce this burden. Inspired… ▽ More

    Submitted 31 January, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

  6. arXiv:2306.17618  [pdf, other

    cs.CV

    Polarimetric iToF: Measuring High-Fidelity Depth through Scattering Media

    Authors: Daniel S. Jeon, Andreas Meuleman, Seung-Hwan Baek, Min H. Kim

    Abstract: Indirect time-of-flight (iToF) imaging allows us to capture dense depth information at a low cost. However, iToF imaging often suffers from multipath interference (MPI) artifacts in the presence of scattering media, resulting in severe depth-accuracy degradation. For instance, iToF cameras cannot measure depth accurately through fog because ToF active illumination scatters back to the sensor befor… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 12353-12362

  7. arXiv:2305.04001  [pdf, other

    cs.CV cs.AI

    AADiff: Audio-Aligned Video Synthesis with Text-to-Image Diffusion

    Authors: Seungwoo Lee, Chaerin Kong, Donghyeon Jeon, Nojun Kwak

    Abstract: Recent advances in diffusion models have showcased promising results in the text-to-video (T2V) synthesis task. However, as these T2V models solely employ text as the guidance, they tend to struggle in modeling detailed temporal dynamics. In this paper, we introduce a novel T2V framework that additionally employ audio signals to control the temporal dynamics, empowering an off-the-shelf T2I diffus… ▽ More

    Submitted 23 May, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: CVPR2023 Workshop on AI for Content Creation. Project Page: https://lifrary.github.io/AADiff/

  8. arXiv:2211.11153  [pdf, other

    cs.LG cs.CL cs.CV

    Unifying Vision-Language Representation Space with Single-tower Transformer

    Authors: Jiho Jang, Chaerin Kong, Donghyeon Jeon, Seonhoon Kim, Nojun Kwak

    Abstract: Contrastive learning is a form of distance learning that aims to learn invariant features from two related representations. In this paper, we explore the bold hypothesis that an image and its caption can be simply regarded as two different views of the underlying mutual information, and train a model to learn a unified vision-language representation space that encodes both modalities at once in a… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: AAAI 2023, 11 pages

  9. arXiv:2210.05872  [pdf, other

    cs.CV

    Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation

    Authors: Chaerin Kong, DongHyeon Jeon, Ohjoon Kwon, Nojun Kwak

    Abstract: Fashion attribute editing is a task that aims to convert the semantic attributes of a given fashion image while preserving the irrelevant regions. Previous works typically employ conditional GANs where the generator explicitly learns the target attributes and directly execute the conversion. These approaches, however, are neither scalable nor generic as they operate only with few limited attribute… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted to WACV 2023

  10. Sparse Ellipsometry: Portable Acquisition of Polarimetric SVBRDF and Shape with Unstructured Flash Photography

    Authors: Inseung Hwang, Daniel S. Jeon, Adolfo Muñoz, Diego Gutierrez, Xin Tong, Min H. Kim

    Abstract: Ellipsometry techniques allow to measure polarization information of materials, requiring precise rotations of optical components with different configurations of lights and sensors. This results in cumbersome capture devices, carefully calibrated in lab conditions, and in very long acquisition times, usually in the order of a few days per object. Recent techniques allow to capture polarimetric sp… ▽ More

    Submitted 8 February, 2023; v1 submitted 9 July, 2022; originally announced July 2022.

    Journal ref: ACM Transactions on Graphics 41, 4, Article 133 (July 2022)

  11. arXiv:2205.05300  [pdf, other

    cs.CL

    User Guide for KOTE: Korean Online Comments Emotions Dataset

    Authors: Duyoung Jeon, Junho Lee, Cheongtag Kim

    Abstract: Sentiment analysis that classifies data into positive or negative has been dominantly used to recognize emotional aspects of texts, despite the deficit of thorough examination of emotional meanings. Recently, corpora labeled with more than just valence are built to exceed this limit. However, most Korean emotion corpora are small in the number of instances and cover a limited range of emotions. We… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: 16 pages, 4 figures

  12. arXiv:2109.04650  [pdf, other

    cs.CL

    What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers

    Authors: Boseop Kim, HyoungSeok Kim, Sang-Woo Lee, Gichang Lee, Donghyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-Hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jinseong Park , et al. (12 additional authors not shown)

    Abstract: GPT-3 shows remarkable in-context learning ability of large-scale language models (LMs) trained on hundreds of billion scale data. Here we address some remaining issues less reported by the GPT-3 paper, such as a non-English LM, the performances of different sized models, and the effect of recently introduced prompt optimization on in-context learning. To achieve this, we introduce HyperCLOVA, a K… ▽ More

    Submitted 28 November, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP2021 as a long paper. Fixed some typos

  13. arXiv:2102.03207  [pdf, other

    cs.SD cs.AI eess.AS

    Real-time Denoising and Dereverberation with Tiny Recurrent U-Net

    Authors: Hyeong-Seok Choi, Sungjin Park, Jie Hwan Lee, Hoon Heo, Dongsuk Jeon, Kyogu Lee

    Abstract: Modern deep learning-based models have seen outstanding performance improvement with speech enhancement tasks. The number of parameters of state-of-the-art models, however, is often too large to be deployed on devices for real-world applications. To this end, we propose Tiny Recurrent U-Net (TRU-Net), a lightweight online inference model that matches the performance of current state-of-the-art mod… ▽ More

    Submitted 22 June, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: 5 pages, 2 figures, 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). arXiv admin note: text overlap with arXiv:2006.00687

  14. arXiv:2009.00463  [pdf, other

    eess.IV cs.CV

    Single-shot Hyperspectral-Depth Imaging with Learned Diffractive Optics

    Authors: Seung-Hwan Baek, Hayato Ikoma, Daniel S. Jeon, Yuqi Li, Wolfgang Heidrich, Gordon Wetzstein, Min H. Kim

    Abstract: Imaging depth and spectrum have been extensively studied in isolation from each other for decades. Recently, hyperspectral-depth (HS-D) imaging emerges to capture both information simultaneously by combining two different imaging systems; one for depth, the other for spectrum. While being accurate, this combinational approach induces increased form factor, cost, capture time, and alignment/registr… ▽ More

    Submitted 15 August, 2021; v1 submitted 1 September, 2020; originally announced September 2020.

    ACM Class: I.2.10; I.4.1; I.5

    Journal ref: International Conference on Computer Vision (ICCV) 2021

  15. arXiv:2006.14317  [pdf

    cs.AR

    A Fast Finite Field Multiplier for SIKE

    Authors: Yeonsoo Jeon, Dongsuk Jeon

    Abstract: Various post-quantum cryptography algorithms have been recently proposed. Supersingluar isogeny Diffie-Hellman key exchange (SIKE) is one of the most promising candidates due to its small key size. However, the SIKE scheme requires numerous finite field multiplications for its isogeny computation, and hence suffers from slow encryption and decryption process. In this paper, we propose a fast finit… ▽ More

    Submitted 26 November, 2020; v1 submitted 25 June, 2020; originally announced June 2020.