Search | arXiv e-print repository

Showing 1–50 of 56 results for author: Luong, M

  1. arXiv:2405.10084  [pdf, other]

    eess.AS cs.AI cs.SD

    Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation

    Authors: Manh Luong, Khai Nguyen, Nhat Ho, Reza Haf, Dinh Phung, Lizhen Qu

    Abstract: The Learning-to-match (LTM) framework proves to be an effective inverse optimal transport approach for learning the underlying ground metric between two sources of data, facilitating subsequent matching. However, the conventional LTM framework faces scalability challenges, necessitating the use of the entire dataset each time the parameters of the ground metric are updated. In adapting LTM to the…

    Submitted 16 May, 2024; originally announced May 2024.

  2. arXiv:2403.13494  [pdf, other]

    hep-ph

    Novel imprint of a dark photon from the 3-3-1-1 model

    Authors: Doan Minh Luong, Phung Van Dong, Nguyen Huy Thao

    Abstract: We investigate a dark photon that arises from the UV model based upon $SU(3)_C\otimes SU(3)_L\otimes U(1)_X \otimes U(1)_G$ (3-3-1-1) gauge symmetry, where the last three factors enlarge the electroweak symmetry encompassing electric charge $Q=T_3 - 1/ \sqrt{3}T_8 +X$ and dark charge $D = -2/\sqrt{3} T_8 +G$. It is well-established that this model addresses the questions of family number, neutrino…

    Submitted 23 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: 16 pages, 3 figures, 2 tables; A scalar triplet relabelled for clarity

  3. arXiv:2309.07370  [pdf]

    cond-mat.mtrl-sci physics.optics

    Highly-Sensitive Resonance-Enhanced Organic Photodetectors for Shortwave Infrared Sensing

    Authors: Hoang Mai Luong, Chokchai Kaiyasuan, Ahra Yi, Sangmin Chae, Brian Minki Kim, Patchareepond Panoy, Hyo Jung Kim, Vinich Promarak, Yasuo Miyata, Hidenori Nakayama, Thuc-Quyen Nguyen

    Abstract: Shortwave infrared (SWIR) has various applications, including night vision, remote sensing, and medical imaging. SWIR organic photodetectors (OPDs) offer advantages such as flexibility, cost-effectiveness, and tunable properties, however, lower sensitivity and limited spectral coverage compared to inorganic counterparts are major drawbacks. Here, we propose a simple yet effective and widely applic…

    Submitted 13 September, 2023; originally announced September 2023.

  4. arXiv:2309.01076  [pdf, other]

    cs.LG cs.SD eess.AS

    Federated Few-shot Learning for Cough Classification with Edge Devices

    Authors: Ngan Dao Hoang, Dat Tran-Anh, Manh Luong, Cong Tran, Cuong Pham

    Abstract: Automatically classifying cough sounds is one of the most critical tasks for the diagnosis and treatment of respiratory diseases. However, collecting a huge amount of labeled cough dataset is challenging mainly due to high laborious expenses, data scarcity, and privacy concerns. In this work, our aim is to develop a framework that can effectively perform cough classification even in situations whe…

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: 21 pages, 5 figures

  5. arXiv:2210.05610  [pdf, other]

    cs.CL cs.AI

    MTet: Multi-domain Translation for English and Vietnamese

    Authors: Chinh Ngo, Trieu H. Trinh, Long Phan, Hieu Tran, Tai Dang, Hieu Nguyen, Minh Nguyen, Minh-Thang Luong

    Abstract: We introduce MTet, the largest publicly available parallel corpus for English-Vietnamese translation. MTet consists of 4.2M high-quality training sentence pairs and a multi-domain test set refined by the Vietnamese research community. Combining with previous works on English-Vietnamese translation, we grow the existing parallel dataset to 6.2M sentence pairs. We also release the first pretrained m…

    Submitted 19 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

  6. arXiv:2210.05598  [pdf, other]

    cs.CL cs.AI

    Enriching Biomedical Knowledge for Low-resource Language Through Large-Scale Translation

    Authors: Long Phan, Tai Dang, Hieu Tran, Trieu H. Trinh, Vy Phan, Lam D. Chau, Minh-Thang Luong

    Abstract: Biomedical data and benchmarks are highly valuable yet very limited in low-resource languages other than English such as Vietnamese. In this paper, we make use of a state-of-the-art translation model in English-Vietnamese to translate and produce both pretrained as well as supervised data in the biomedical domains. Thanks to such large-scale translation, we introduce ViPubmedT5, a pretrained Encod…

    Submitted 29 January, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  7. arXiv:2208.04243  [pdf, other]

    cs.CL

    A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation

    Authors: Linh The Nguyen, Nguyen Luong Tran, Long Doan, Manh Luong, Dat Quoc Nguyen

    Abstract: In this paper, we introduce a high-quality and large-scale benchmark dataset for English-Vietnamese speech translation with 508 audio hours, consisting of 331K triplets of (sentence-lengthed audio, English source transcript sentence, Vietnamese target subtitle sentence). We also conduct empirical experiments using strong baselines and find that the traditional "Cascaded" approach still outperforms…

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: In Proceedings of INTERSPEECH 2022, to appear. The first three authors contributed equally to this work

  8. arXiv:2201.10954  [pdf]

    physics.ao-ph math.NA

    Enhanced Simulation of the Indian Summer Monsoon Rainfall Using Regional Climate Modeling and Continuous Data Assimilation

    Authors: Srinivas Desamsetti, Hari Prasad Dasari, Sabique Langodan, Yesubabu Viswanadhapalli, Raju Attada, Thang M. Luong, Omar Knio, Edriss S. Titi, Ibrahim Hoteit

    Abstract: This study assesses a Continuous Data Assimilation (CDA) dynamical-downscaling algorithm for enhancing the simulation of the Indian summer monsoon (ISM) system. CDA is a mathematically rigorous technique that has been recently introduced to constrain the large-scale features of high-resolution atmospheric models with coarse spatial scale data. It is similar to spectral nudging but does not require…

    Submitted 26 January, 2022; originally announced January 2022.

    Comments: Research Article

  9. arXiv:2111.10050  [pdf, other]

    cs.LG cs.CL cs.CV

    Combined Scaling for Zero-shot Transfer Learning

    Authors: Hieu Pham, Zihang Dai, Golnaz Ghiasi, Kenji Kawaguchi, Hanxiao Liu, Adams Wei Yu, Jiahui Yu, Yi-Ting Chen, Minh-Thang Luong, Yonghui Wu, Mingxing Tan, Quoc V. Le

    Abstract: We present a combined scaling method - named BASIC - that achieves 85.7% top-1 accuracy on the ImageNet ILSVRC-2012 validation set without learning from any labeled ImageNet example. This accuracy surpasses best published similar models - CLIP and ALIGN - by 9.3%. Our BASIC model also shows significant improvements in robustness benchmarks. For instance, on 5 test sets with natural distribution sh…

    Submitted 12 April, 2023; v1 submitted 19 November, 2021; originally announced November 2021.

  10. arXiv:2110.03742  [pdf, other]

    cs.CL cs.LG

    Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference

    Authors: Sneha Kudugunta, Yanping Huang, Ankur Bapna, Maxim Krikun, Dmitry Lepikhin, Minh-Thang Luong, Orhan Firat

    Abstract: Sparse Mixture-of-Experts (MoE) has been a successful approach for scaling multilingual translation models to billions of parameters without a proportional increase in training computation. However, MoE models are prohibitively large and practitioners often resort to methods such as distillation for serving. In this work, we investigate routing strategies at different granularity (token, sentence,…

    Submitted 24 September, 2021; originally announced October 2021.

    Comments: EMNLP Findings 2021

  11. arXiv:2109.13675  [pdf, other]

    cs.SD cs.LG eess.AS

    FlowVocoder: A small Footprint Neural Vocoder based Normalizing flow for Speech Synthesis

    Authors: Manh Luong, Viet Anh Tran

    Abstract: Recently, autoregressive neural vocoders have provided remarkable performance in generating high-fidelity speech and have been able to produce synthetic speech in real-time. However, autoregressive neural vocoders such as WaveFlow are capable of modeling waveform signals from mel-spectrogram, its number of parameters is significant to deploy on edge devices. Though NanoFlow, which has a small numb…

    Submitted 25 March, 2022; v1 submitted 27 September, 2021; originally announced September 2021.

  12. arXiv:2109.06270  [pdf, other]

    cs.CL

    STraTA: Self-Training with Task Augmentation for Better Few-shot Learning

    Authors: Tu Vu, Minh-Thang Luong, Quoc V. Le, Grady Simon, Mohit Iyyer

    Abstract: Despite their recent successes in tackling many NLP tasks, large-scale pre-trained language models do not perform as well in few-shot settings where only a handful of training examples are available. To address this shortcoming, we propose STraTA, which stands for Self-Training with Task Augmentation, an approach that builds on two key ideas for effective leverage of unlabeled data. First, STraTA…

    Submitted 12 April, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Accepted as a main conference paper at EMNLP 2021, 17 pages, 3 figures, 11 tables

  13. arXiv:2107.06642  [pdf, other]

    eess.AS cs.LG cs.SD

    Many-to-Many Voice Conversion based Feature Disentanglement using Variational Autoencoder

    Authors: Manh Luong, Viet Anh Tran

    Abstract: Voice conversion is a challenging task which transforms the voice characteristics of a source speaker to a target speaker without changing linguistic content. Recently, there have been many works on many-to-many Voice Conversion (VC) based on Variational Autoencoder (VAEs) achieving good results, however, these methods lack the ability to disentangle speaker identity and linguistic content to achi…

    Submitted 11 July, 2021; originally announced July 2021.

    Journal ref: INTERSPEECH 2021

  14. arXiv:2103.01011  [pdf]

    cond-mat.mtrl-sci

    Sub-second and ppm-level Optical Sensing of Hydrogen Using Templated Control of Nano-hydride Geometry and Composition

    Authors: Hoang Mai Luong, Minh Thien Pham, Tyler Guin, Richa Pokharel Madhogaria, Manh-Huong Phan, George K. Larsen, Tho Duc Nguyen

    Abstract: The use of hydrogen as a clean and renewable alternative to fossil fuels requires a suite of flammability mitigating technologies, particularly robust sensors for hydrogen leak detection and concentration monitoring. To this end, we have developed a class of lightweight optical hydrogen sensors based on a metasurface of Pd nano-patchy particle arrays, which fulfills the increasing requirements of…

    Submitted 1 March, 2021; originally announced March 2021.

  15. arXiv:2012.08561  [pdf, other]

    cs.CL

    Pre-Training Transformers as Energy-Based Cloze Models

    Authors: Kevin Clark, Minh-Thang Luong, Quoc V. Le, Christopher D. Manning

    Abstract: We introduce Electric, an energy-based cloze model for representation learning over text. Like BERT, it is a conditional generative model of tokens given their contexts. However, Electric does not use masking or output a full distribution over tokens that could occur in a context. Instead, it assigns a scalar energy score to each input token indicating how likely it is given its context. We train…

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: EMNLP 2020

  16. arXiv:2011.04419  [pdf, other]

    cs.LG cs.AI stat.ML

    Towards Domain-Agnostic Contrastive Learning

    Authors: Vikas Verma, Minh-Thang Luong, Kenji Kawaguchi, Hieu Pham, Quoc V. Le

    Abstract: Despite recent success, most contrastive self-supervised learning methods are domain-specific, relying heavily on data augmentation techniques that require knowledge about a particular domain, such as image cropping and rotation. To overcome such limitation, we propose a novel domain-agnostic approach to contrastive learning, named DACL, that is applicable to domains where invariances, and thus, d…

    Submitted 19 July, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: Published in ICML 2021

  17. arXiv:2010.00198  [pdf, other]

    cs.CL

    Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models

    Authors: Thai Binh Nguyen, Quang Minh Nguyen, Thi Thu Hien Nguyen, Quoc Truong Do, Chi Mai Luong

    Abstract: Studies on the Named Entity Recognition (NER) task have shown outstanding results that reach human parity on input texts with correct text formattings, such as with proper punctuation and capitalization. However, such conditions are not available in applications where the input is speech, because the text is generated from a speech recognition system (ASR), and that the system does not consider th…

    Submitted 1 October, 2020; originally announced October 2020.

    Comments: Accepted in Interspeech 2020

  18. arXiv:2008.11938  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Highly transparent contacts to the 1D hole gas in ultra-scaled Ge/Si core/shell nanowires

    Authors: Masiar Sistani, Jovian Delaforce, Roman Kramer, Nicolas Roch, Minh Anh Luong, M. den Hertog, Eric Robin, Jürgen Smoliner, Jun Yao, Charles Lieber, Cécile Naud, Alois Lugstein, Olivier Buisson

    Abstract: Semiconductor-superconductor hybrid systems have outstanding potential for emerging high-performance nanoelectronics and quantum devices. However, critical to their successful application is the fabrication of high-quality and reproducible semiconductor-superconductor interfaces. Here, we realize and measure axial Al-Ge-Al nanowire heterostructures with atomically precise interfaces, enwrapped by…

    Submitted 27 August, 2020; originally announced August 2020.

    Journal ref: ACS Nano, American Chemical Society, 2019, 13 (12), pp.14145-14151

  19. arXiv:2006.08385  [pdf]

    physics.app-ph cond-mat.mes-hall

    Plasmon-Driven Hot Electron Transfer at Atomically Sharp Metal-Semiconductor Nanojunctions

    Authors: Masiar Sistani, Maximilian G. Bartmann, Nicholas A. Güsken, Rupert F. Oulton, Hamid Keshmiri, Minh Anh Luong, Zahra Sadre-Momtaz, Martien I. den Hertog, Alois Lugstein

    Abstract: Recent advances in guiding and localizing light at the nanoscale exposed the enormous potential of ultra-scaled plasmonic devices. In this context, the decay of surface plasmons to hot carriers triggers a variety of applications in boosting the efficiency of energy-harvesting, photo-catalysis and photo-detection. However, a detailed understanding of plasmonic hot carrier generation and particularl…

    Submitted 15 June, 2020; originally announced June 2020.

  20. arXiv:2003.10580  [pdf, other]

    cs.LG stat.ML

    Meta Pseudo Labels

    Authors: Hieu Pham, Zihang Dai, Qizhe Xie, Minh-Thang Luong, Quoc V. Le

    Abstract: We present Meta Pseudo Labels, a semi-supervised learning method that achieves a new state-of-the-art top-1 accuracy of 90.2% on ImageNet, which is 1.6% better than the existing state-of-the-art. Like Pseudo Labels, Meta Pseudo Labels has a teacher network to generate pseudo labels on unlabeled data to teach a student network. However, unlike Pseudo Labels where the teacher is fixed, the teacher i…

    Submitted 1 March, 2021; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: Preprint

  21. arXiv:2003.10555  [pdf, other]

    cs.CL

    ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

    Authors: Kevin Clark, Minh-Thang Luong, Quoc V. Le, Christopher D. Manning

    Abstract: Masked language modeling (MLM) pre-training methods such as BERT corrupt the input by replacing some tokens with [MASK] and then train a model to reconstruct the original tokens. While they produce good results when transferred to downstream NLP tasks, they generally require large amounts of compute to be effective. As an alternative, we propose a more sample-efficient pre-training task called rep…

    Submitted 23 March, 2020; originally announced March 2020.

    Comments: ICLR 2020

  22. arXiv:2003.02597   

    cs.CV cs.LG eess.IV

    AI outperformed every dermatologist: Improved dermoscopic melanoma diagnosis through customizing batch logic and loss function in an optimized Deep CNN architecture

    Authors: Cong Tri Pham, Mai Chi Luong, Dung Van Hoang, Antoine Doucet

    Abstract: Melanoma, one of the most dangerous types of skin cancer, results in a very high mortality rate. Early detection and resection are two key points for a successful cure. Recent research has used artificial intelligence to classify melanoma and nevus and to compare the assessment of these algorithms to that of dermatologists. However, an imbalance of sensitivity and specificity measures affected the pe…

    Submitted 28 August, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: We are submitting the article in the journal and waiting for the review result, so we want to temporarily delete the article. When the article is officially accepted, it will be resubmitted

  23. arXiv:2002.02373  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    Reversible Al Propagation in Si$_x$Ge$_{1-x}$ Nanowires

    Authors: Minh Anh Luong, Robin Eric, Pauc Nicolas, Gentile Pascal, Baron Thierry, Salem Bassem, Sistani Masiar, Lugstein Alois, Spies Maria, Fernandez Bruno, M. den Hertog

    Abstract: While reversibility is a fundamental concept in thermodynamics, most reactions are not readily reversible, especially in solid state physics. For example, thermal diffusion is a widely known concept, used among others to inject dopant atoms into the substitutional positions in the matrix and improve the device properties. Typically, such a diffusion process will create a concentration gradient ext…

    Submitted 6 February, 2020; originally announced February 2020.

  24. arXiv:2001.09977  [pdf, other]

    cs.CL cs.LG cs.NE stat.ML

    Towards a Human-like Open-Domain Chatbot

    Authors: Daniel Adiwardana, Minh-Thang Luong, David R. So, Jamie Hall, Noah Fiedel, Romal Thoppilan, Zi Yang, Apoorv Kulshreshtha, Gaurav Nemade, Yifeng Lu, Quoc V. Le

    Abstract: We present Meena, a multi-turn open-domain chatbot trained end-to-end on data mined and filtered from public domain social media conversations. This 2.6B parameter neural network is simply trained to minimize perplexity of the next token. We also propose a human evaluation metric called Sensibleness and Specificity Average (SSA), which captures key elements of a human-like multi-turn conversation.…

    Submitted 27 February, 2020; v1 submitted 27 January, 2020; originally announced January 2020.

    Comments: 38 pages, 12 figures

  25. arXiv:2001.09179  [pdf, other]

    physics.app-ph cond-mat.mtrl-sci

    Correlated and in-situ electrical transmission electron microscopy studies and related membrane fabrication

    Authors: Maria Spies, Zahra Sadre-Momtaz, Jonas Lähnemann, Minh Anh Luong, Bruno Fernandez, Thierry Fournier, Eva Monroy, Martien I. den Hertog

    Abstract: Understanding the interplay between the structure, composition and opto-electronic properties of semiconductor nano-objects requires combining transmission electron microscopy (TEM) based techniques with electrical and optical measurements on the very same specimen. Recent developments in TEM technologies allow not only the identification and in-situ electrical characterization of a particular obj…

    Submitted 2 December, 2021; v1 submitted 24 January, 2020; originally announced January 2020.

    Comments: This is an author-created, un-copyedited version of a topical review published in Nanotechnology. IOP Publishing Ltd. is not responsible for any errors or omissions in this version of the manuscript or any version derived from it. The Version of Record is available online at https://doi.org/10.1088/1361-6528/ab99f0

    Journal ref: Nanotechnology 31, 472001 (2020)

  26. arXiv:2001.09026  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    In-situ high resolution TEM observation of Aluminum solid-state diffusion in Germanium nanowires: fabricating sub-10 nm Ge quantum dots

    Authors: M. Luong, E. Robin, N. Pauc, P. Gentile, M. Sistani, A. Lugstein, M Spies, B Fernandez, M. den Hertog

    Abstract: Aluminum-germanium nanowires (NWs) thermal activated solid state reaction is a promising system as very sharp and well defined one dimensional contacts can be created between a metal and a semiconductor, that can become a quantum dot if the size becomes sufficiently small. In the search for high performance devices without variability, it is of high interest to allow deterministic fabrication of n…

    Submitted 24 January, 2020; originally announced January 2020.

  27. arXiv:1911.08117  [pdf, ps, other]

    cs.CL cs.LG

    A Hybrid Morpheme-Word Representation for Machine Translation of Morphologically Rich Languages

    Authors: Minh-Thang Luong, Preslav Nakov, Min-Yen Kan

    Abstract: We propose a language-independent approach for improving statistical machine translation for morphologically rich languages using a hybrid morpheme-word representation where the basic unit of translation is the morpheme, but word boundaries are respected at all stages of the translation process. Our model extends the classic phrase-based model by means of (1) word boundary-aware morpheme-level phr…

    Submitted 19 November, 2019; originally announced November 2019.

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: EMNLP-2010

  28. arXiv:1911.04252  [pdf, other]

    cs.LG cs.CV stat.ML

    Self-training with Noisy Student improves ImageNet classification

    Authors: Qizhe Xie, Minh-Thang Luong, Eduard Hovy, Quoc V. Le

    Abstract: We present Noisy Student Training, a semi-supervised learning approach that works well even when labeled data is abundant. Noisy Student Training achieves 88.4% top-1 accuracy on ImageNet, which is 2.0% better than the state-of-the-art model that requires 3.5B weakly labeled Instagram images. On robustness test sets, it improves ImageNet-A top-1 accuracy from 61.0% to 83.7%, reduces ImageNet-C mea…

    Submitted 19 June, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Comments: CVPR 2020

  29. arXiv:1910.13299  [pdf, other]

    cs.CL

    Findings of the Third Workshop on Neural Generation and Translation

    Authors: Hiroaki Hayashi, Yusuke Oda, Alexandra Birch, Ioannis Konstas, Andrew Finch, Minh-Thang Luong, Graham Neubig, Katsuhito Sudoh

    Abstract: This document describes the findings of the Third Workshop on Neural Generation and Translation, held in concert with the annual conference of the Empirical Methods in Natural Language Processing (EMNLP 2019). First, we summarize the research trends of papers presented in the proceedings. Second, we describe the results of the two shared tasks 1) efficient neural machine translation (NMT) where pa…

    Submitted 29 October, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Fixed the metadata (author list)

  30. arXiv:1907.04829  [pdf, other]

    cs.CL

    BAM! Born-Again Multi-Task Networks for Natural Language Understanding

    Authors: Kevin Clark, Minh-Thang Luong, Urvashi Khandelwal, Christopher D. Manning, Quoc V. Le

    Abstract: It can be challenging to train multi-task neural networks that outperform or even match their single-task counterparts. To help address this, we propose using knowledge distillation where single-task models teach a multi-task model. We enhance this training with teacher annealing, a novel method that gradually transitions the model from distillation to supervised learning, helping the multi-task m…

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: ACL 2019

  31. arXiv:1906.02940  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    Selfie: Self-supervised Pretraining for Image Embedding

    Authors: Trieu H. Trinh, Minh-Thang Luong, Quoc V. Le

    Abstract: We introduce a pretraining technique called Selfie, which stands for SELFie supervised Image Embedding. Selfie generalizes the concept of masked language modeling of BERT (Devlin et al., 2019) to continuous data, such as images, by making use of the Contrastive Predictive Coding loss (Oord et al., 2018). Given masked-out patches in an input image, our method learns to select the correct patch, amo…

    Submitted 27 July, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

  32. arXiv:1904.12848  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Unsupervised Data Augmentation for Consistency Training

    Authors: Qizhe Xie, Zihang Dai, Eduard Hovy, Minh-Thang Luong, Quoc V. Le

    Abstract: Semi-supervised learning lately has shown much promise in improving deep learning models when labeled data is scarce. Common among recent approaches is the use of consistency training on a large amount of unlabeled data to constrain model predictions to be invariant to input noise. In this work, we present a new perspective on how to effectively noise unlabeled examples and argue that the quality…

    Submitted 5 November, 2020; v1 submitted 29 April, 2019; originally announced April 2019.

    Comments: NeurIPS 2020

  33. arXiv:1902.06625  [pdf]

    physics.app-ph

    Magnetically Tunable Organic Semiconductors with Superparamagnetic Nanoparticles

    Authors: Rugang Geng, Hoang Mai Luong, Raja Das, Kristen Stojak, Minh Thien Pham, Joshua Robles-Garcia, Tuan Anh Duong, Huy Thanh Pham, Thi Huong Au, Ngoc Diep Lai, George K. Larsen, Manh-Huong Phan, Tho Duc Nguyen

    Abstract: Magnetic nanoparticles (MNPs) exhibiting superparamagnetic properties might generate large magnetic dipole-dipole interaction with electron spins in organic semiconductors (OSECs). This concept could be considered analogous to the effect of hyperfine interaction (HFI). In order to investigate this model, Fe3O4 MNPs are used as a dopant for generating random hyperfine-like magnetic fields in a HFI-…

    Submitted 11 June, 2019; v1 submitted 18 February, 2019; originally announced February 2019.

  34. arXiv:1809.08370  [pdf, other]

    cs.CL

    Semi-Supervised Sequence Modeling with Cross-View Training

    Authors: Kevin Clark, Minh-Thang Luong, Christopher D. Manning, Quoc V. Le

    Abstract: Unsupervised representation learning algorithms such as word2vec and ELMo improve the accuracy of many supervised NLP models, mainly because they can take advantage of large amounts of unlabeled text. However, the supervised models only learn from task-specific labeled data during the main training phase. We therefore propose Cross-View Training (CVT), a semi-supervised learning algorithm that imp…

    Submitted 21 September, 2018; originally announced September 2018.

    Comments: EMNLP 2018

  35. arXiv:1809.07070  [pdf, other]

    cs.CL

    Latent Topic Conversational Models

    Authors: Tsung-Hsien Wen, Minh-Thang Luong

    Abstract: Latent variable models have been a preferred choice in conversational modeling compared to sequence-to-sequence (seq2seq) models which tend to generate generic and repetitive responses. Despite so, training latent variable models remains to be difficult. In this paper, we propose Latent Topic Conversational Model (LTCM) which augments seq2seq with a neural latent topic component to better guide re…

    Submitted 19 September, 2018; originally announced September 2018.

  36. arXiv:1806.02940  [pdf, other]

    cs.CL

    Findings of the Second Workshop on Neural Machine Translation and Generation

    Authors: Alexandra Birch, Andrew Finch, Minh-Thang Luong, Graham Neubig, Yusuke Oda

    Abstract: This document describes the findings of the Second Workshop on Neural Machine Translation and Generation, held in concert with the annual conference of the Association for Computational Linguistics (ACL 2018). First, we summarize the research trends of papers presented in the proceedings, and note that there is particular interest in linguistic structure, domain adaptation, data augmentation, hand…

    Submitted 18 June, 2018; v1 submitted 7 June, 2018; originally announced June 2018.

    Comments: WNMT 2018

  37. arXiv:1804.09541  [pdf, other]

    cs.CL cs.AI cs.LG

    QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension

    Authors: Adams Wei Yu, David Dohan, Minh-Thang Luong, Rui Zhao, Kai Chen, Mohammad Norouzi, Quoc V. Le

    Abstract: Current end-to-end machine reading and question answering (Q&A) models are primarily based on recurrent neural networks (RNNs) with attention. Despite their success, these models are often slow for both training and inference due to the sequential nature of RNNs. We propose a new Q&A architecture called QANet, which does not require recurrent networks: Its encoder consists exclusively of convolu…

    Submitted 23 April, 2018; originally announced April 2018.

    Comments: Published as full paper in ICLR 2018

  38. arXiv:1803.00144  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Learning Longer-term Dependencies in RNNs with Auxiliary Losses

    Authors: Trieu H. Trinh, Andrew M. Dai, Minh-Thang Luong, Quoc V. Le

    Abstract: Despite recent advances in training recurrent neural networks (RNNs), capturing long-term dependencies in sequences remains a fundamental challenge. Most approaches use backpropagation through time (BPTT), which is difficult to scale to very long sequences. This paper proposes a simple method that improves the ability to capture long term dependencies in RNNs by adding an unsupervised auxiliary lo…

    Submitted 13 June, 2018; v1 submitted 28 February, 2018; originally announced March 2018.

    Comments: ICML 2018

  39. arXiv:1711.11373  [pdf, ps, other]

    physics.app-ph

    Demonstration of a 2x2 programmable phase plate for electrons

    Authors: Jo Verbeeck, Armand Béché, Knut Müller-Caspary, Giulio Guzzinati, Minh Anh Luong, Martien Den Hertog

    Abstract: First results on the experimental realisation of a 2x2 programmable phase plate for electrons are presented. The design consists of an array of electrostatic einzel lenses that influence the phase of electron waves passing through 4 separately controllable aperture holes. This functionality is demonstrated in a conventional transmission electron microscope operating at 300 kV and results are in ve…

    Submitted 30 November, 2017; originally announced November 2017.

  40. arXiv:1710.02076  [pdf, other]

    cs.CL

    On the Effective Use of Pretraining for Natural Language Inference

    Authors: Ignacio Cases, Minh-Thang Luong, Christopher Potts

    Abstract: Neural networks have excelled at many NLP tasks, but there remain open questions about the performance of pretrained distributed word representations and their interaction with weight initialization and other hyperparameters. We address these questions empirically using attention-based sequence-to-sequence models for natural language inference (NLI). Specifically, we compare three types of embeddi…

    Submitted 5 October, 2017; originally announced October 2017.

    Comments: This manuscript dates from late Winter 2016

  41. arXiv:1707.00110  [pdf, other]

    cs.CL

    Efficient Attention using a Fixed-Size Memory Representation

    Authors: Denny Britz, Melody Y. Guan, Minh-Thang Luong

    Abstract: The standard content-based attention mechanism typically used in sequence-to-sequence models is computationally expensive as it requires the comparison of large encoder and decoder states at each time step. In this work, we propose an alternative attention mechanism based on a fixed-size memory representation that is more efficient. Our technique predicts a compact set of K attention contexts duri…

    Submitted 1 July, 2017; originally announced July 2017.

    Comments: EMNLP 2017

  42. arXiv:1704.00784  [pdf, other

    cs.LG cs.CL

    Online and Linear-Time Attention by Enforcing Monotonic Alignments

    Authors: Colin Raffel, Minh-Thang Luong, Peter J. Liu, Ron J. Weiss, Douglas Eck

    Abstract: Recurrent neural network models with an attention mechanism have proven to be extremely effective on a wide variety of sequence-to-sequence problems. However, the fact that soft attention mechanisms perform a pass over the entire input sequence when producing each element in the output sequence precludes their use in online settings and results in a quadratic time complexity. Based on the insight…

    Submitted 29 June, 2017; v1 submitted 3 April, 2017; originally announced April 2017.

    Comments: ICML camera-ready version; 10 pages + 9 page appendix

  43. arXiv:1703.03906  [pdf, other

    cs.CL

    Massive Exploration of Neural Machine Translation Architectures

    Authors: Denny Britz, Anna Goldie, Minh-Thang Luong, Quoc Le

    Abstract: Neural Machine Translation (NMT) has shown remarkable progress over the past few years with production systems now being deployed to end-users. One major drawback of current architectures is that they are expensive to train, typically requiring days to weeks of GPU time to converge. This makes exhaustive hyperparameter search, as is commonly done with other neural network architectures, prohibitiv…

    Submitted 21 March, 2017; v1 submitted 10 March, 2017; originally announced March 2017.

    Comments: 9 pages, 2 figures, 8 tables, submitted to ACL 2017, open source code at https://github.com/google/seq2seq/

  44. arXiv:1606.09274  [pdf, other

    cs.AI cs.CL cs.NE

    Compression of Neural Machine Translation Models via Pruning

    Authors: Abigail See, Minh-Thang Luong, Christopher D. Manning

    Abstract: Neural Machine Translation (NMT), like many other deep learning domains, typically suffers from over-parameterization, resulting in large storage sizes. This paper examines three simple magnitude-based pruning schemes to compress NMT models, namely class-blind, class-uniform, and class-distribution, which differ in terms of how pruning thresholds are computed for the different classes of weights i…

    Submitted 29 June, 2016; originally announced June 2016.

    Comments: Accepted to CoNLL 2016. 9 pages plus references

  45. arXiv:1604.00788  [pdf, ps, other

    cs.CL cs.LG

    Achieving Open Vocabulary Neural Machine Translation with Hybrid Word-Character Models

    Authors: Minh-Thang Luong, Christopher D. Manning

    Abstract: Nearly all previous work on neural machine translation (NMT) has used quite restricted vocabularies, perhaps with a subsequent method to patch in unknown words. This paper presents a novel word-character solution to achieving open vocabulary NMT. We build hybrid systems that translate mostly at the word level and consult the character components for rare words. Our character-level recurrent neural…

    Submitted 22 June, 2016; v1 submitted 4 April, 2016; originally announced April 2016.

    Comments: 11 pages, 4 figures. ACL 2016 camera-ready version. SOTA WMT'15 English-Czech 20.7 BLEU (+2.1-11.4 points)

  46. arXiv:1511.06114  [pdf, ps, other

    cs.LG cs.CL stat.ML

    Multi-task Sequence to Sequence Learning

    Authors: Minh-Thang Luong, Quoc V. Le, Ilya Sutskever, Oriol Vinyals, Lukasz Kaiser

    Abstract: Sequence to sequence learning has recently emerged as a new paradigm in supervised learning. To date, most of its applications focused on only one task and not much work explored this framework for multiple tasks. This paper examines three multi-task learning (MTL) settings for sequence to sequence models: (a) the one-to-many setting - where the encoder is shared between several tasks such as machi…

    Submitted 1 March, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: 10 pages, 4 figures, ICLR 2016 camera-ready, added parsing SOTA results

  47. arXiv:1508.04025  [pdf, ps, other

    cs.CL

    Effective Approaches to Attention-based Neural Machine Translation

    Authors: Minh-Thang Luong, Hieu Pham, Christopher D. Manning

    Abstract: An attentional mechanism has lately been used to improve neural machine translation (NMT) by selectively focusing on parts of the source sentence during translation. However, there has been little work exploring useful architectures for attention-based NMT. This paper examines two simple and effective classes of attentional mechanism: a global approach which always attends to all source words and…

    Submitted 20 September, 2015; v1 submitted 17 August, 2015; originally announced August 2015.

    Comments: 11 pages, 7 figures, EMNLP 2015 camera-ready version, more training details

  48. arXiv:1507.01398  [pdf

    physics.chem-ph

    Nuclear spin noise in NMR revisited

    Authors: Guillaume Ferrand, Gaspard Huber, Michel Luong, Hervé Desvaux

    Abstract: The theoretical shapes of nuclear spin-noise spectra in NMR are derived by considering a receiver circuit with finite preamplifier input impedance and a transmission line between the preamplifier and the probe. Using this model, it becomes possible to reproduce all observed experimental features: variation of the NMR resonance linewidth as a function of the transmission line phase, nuclear spin-n…

    Submitted 22 August, 2015; v1 submitted 6 July, 2015; originally announced July 2015.

  49. arXiv:1506.01057  [pdf, other

    cs.CL

    A Hierarchical Neural Autoencoder for Paragraphs and Documents

    Authors: Jiwei Li, Minh-Thang Luong, Dan Jurafsky

    Abstract: Natural language generation of coherent long texts like paragraphs or longer documents is a challenging problem for recurrent network models. In this paper, we explore an important step toward this generation task: training an LSTM (long short-term memory) auto-encoder to preserve and reconstruct multi-sentence paragraphs. We introduce an LSTM model that hierarchically builds an embedding for a p…

    Submitted 5 June, 2015; v1 submitted 2 June, 2015; originally announced June 2015.

  50. arXiv:1503.00185  [pdf, other

    cs.AI cs.CL

    When Are Tree Structures Necessary for Deep Learning of Representations?

    Authors: Jiwei Li, Minh-Thang Luong, Dan Jurafsky, Eduard Hovy

    Abstract: Recursive neural models, which use syntactic parse trees to recursively generate representations bottom-up, are a popular architecture. But there have not been rigorous evaluations showing for exactly which tasks this syntax-based method is appropriate. In this paper we benchmark recursive neural models against sequential recurrent neural models (simple recurrent and LSTM models), enfo…

    Submitted 18 August, 2015; v1 submitted 28 February, 2015; originally announced March 2015.