(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 78 results for author: Nguyen, L T

.
  1. arXiv:2406.02555  [pdf, ps, other

    eess.AS cs.CL

    PhoWhisper: Automatic Speech Recognition for Vietnamese

    Authors: Thanh-Thien Le, Linh The Nguyen, Dat Quoc Nguyen

    Abstract: We introduce PhoWhisper in five versions for Vietnamese automatic speech recognition. PhoWhisper's robustness is achieved through fine-tuning the Whisper model on an 844-hour dataset that encompasses diverse Vietnamese accents. Our experimental study demonstrates state-of-the-art performances of PhoWhisper on benchmark Vietnamese ASR datasets. We have open-sourced PhoWhisper at: https://github.com… ▽ More

    Submitted 27 March, 2024; originally announced June 2024.

    Comments: Accepted to ICLR 2024 Tiny Papers Track

  2. arXiv:2405.14141  [pdf, other

    cs.CL

    ViHateT5: Enhancing Hate Speech Detection in Vietnamese With A Unified Text-to-Text Transformer Model

    Authors: Luan Thanh Nguyen

    Abstract: Recent advancements in hate speech detection (HSD) in Vietnamese have made significant progress, primarily attributed to the emergence of transformer-based pre-trained language models, particularly those built on the BERT architecture. However, the necessity for specialized fine-tuned models has resulted in the complexity and fragmentation of developing a multitasking HSD system. Moreover, most cu… ▽ More

    Submitted 4 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted at ACL'2024 (Findings)

  3. arXiv:2404.05276  [pdf, ps, other

    cs.LO

    On the complexity of normalization for the planar $λらむだ$-calculus

    Authors: Anupam Das, Damiano Mazza, Lê Thành Dũng Nguyên, Noam Zeilberger

    Abstract: We sketch a tentative proof of P-completeness for the $βべーた$-convertibility problem on untyped planar (a.k.a. ordered or non-commutative) $λらむだ$-terms.

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Abstract for the Trends in Linear Logic and Applications 2023 workshop, meant to be expanded into a proper paper in the future

  4. arXiv:2404.05265  [pdf, other

    cs.LO cs.FL math.LO

    Function spaces for orbit-finite sets

    Authors: Mikołaj Bojańczyk, Lê Thành Dũng Nguyên, Rafał Stefański

    Abstract: Orbit-finite sets are a generalisation of finite sets, and as such support many operations allowed for finite sets, such as pairing, quotienting, or taking subsets. However, they do not support function spaces, i.e. if X and Y are orbit-finite sets, then the space of finitely supported functions from X to Y is not orbit-finite. In this paper we propose two solutions to this problem: one is obtaine… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  5. arXiv:2403.14918  [pdf, other

    cs.LG

    Deep learning-based method for weather forecasting: A case study in Itoshima

    Authors: Yuzhong Cheng, Linh Thi Hoai Nguyen, Akinori Ozaki, Ton Viet Ta

    Abstract: Accurate weather forecasting is of paramount importance for a wide range of practical applications, drawing substantial scientific and societal interest. However, the intricacies of weather systems pose substantial challenges to accurate predictions. This research introduces a multilayer perceptron model tailored for weather forecasting in Itoshima, Kyushu, Japan. Our meticulously designed archite… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  6. arXiv:2402.05854  [pdf, other

    cs.FL cs.LO

    (Almost) Affine Higher-Order Tree Transducers

    Authors: Lê Thành Dũng Tito Nguyên, Gabriele Vanoni

    Abstract: We investigate the tree-to-tree functions computed by \enquote{affine$λらむだ$-transducers}: tree automata whose memory consists of an affine $λらむだ$-term instead of a finite state. They can be seen as variations on Gallot, Lemay and Salvati's Linear High-Order Deterministic Tree Transducers. When the memory is almost purely affine (\textit{à la} Kanazawa), we show that these machines can be translated to t… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  7. arXiv:2402.01198  [pdf, other

    cs.IT eess.SP

    Physical Layer Location Privacy in SIMO Communication Using Fake Paths Injection

    Authors: Trong Duy Tran, Maxime Ferreira Da Costa, Linh Trung Nguyen

    Abstract: Fake path injection is an emerging paradigm for inducing privacy over wireless networks. In this paper, fake paths are injected by the transmitter into a SIMO multipath communication channel to preserve her physical location from an eavesdropper. A novel statistical privacy metric is defined as the ratio between the largest (resp. smallest) eigenvalues of Bob's (resp. Eve's) Cramér-Rao lower bound… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  8. arXiv:2311.11001  [pdf, other

    cs.CL

    Gendec: A Machine Learning-based Framework for Gender Detection from Japanese Names

    Authors: Duong Tien Pham, Luan Thanh Nguyen

    Abstract: Every human has their own name, a fundamental aspect of their identity and cultural heritage. The name often conveys a wealth of information, including details about an individual's background, ethnicity, and, especially, their gender. By detecting gender through the analysis of names, researchers can unlock valuable insights into linguistic patterns and cultural norms, which can be applied to pra… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: This paper is accepted for presentation at ISDA'23

  9. arXiv:2311.02945  [pdf, ps, other

    cs.CL

    PhoGPT: Generative Pre-training for Vietnamese

    Authors: Dat Quoc Nguyen, Linh The Nguyen, Chi Tran, Dung Ngoc Nguyen, Dinh Phung, Hung Bui

    Abstract: We open-source a state-of-the-art 4B-parameter generative model series for Vietnamese, which includes the base pre-trained monolingual model PhoGPT-4B and its chat variant, PhoGPT-4B-Chat. The base model, PhoGPT-4B, with exactly 3.7B parameters, is pre-trained from scratch on a Vietnamese corpus of 102B tokens, with an 8192 context length, employing a vocabulary of 20480 token types. The chat vari… ▽ More

    Submitted 22 March, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: PhoGPT-4B Technical Report - 5 pages

  10. arXiv:2310.18046  [pdf, other

    cs.CL cs.CV

    ViCLEVR: A Visual Reasoning Dataset and Hybrid Multimodal Fusion Model for Visual Question Answering in Vietnamese

    Authors: Khiem Vinh Tran, Hao Phu Phan, Kiet Van Nguyen, Ngan Luu Thuy Nguyen

    Abstract: In recent years, Visual Question Answering (VQA) has gained significant attention for its diverse applications, including intelligent car assistance, aiding visually impaired individuals, and document image information retrieval using natural language queries. VQA requires effective integration of information from questions and images to generate accurate answers. Neural models for VQA have made r… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: A pre-print version and submitted to journal

  11. arXiv:2310.09811  [pdf, other

    math-ph math.SP quant-ph

    Spacing distribution for quantum Rabi models

    Authors: Daniel Braak, Linh Thi Hoai Nguyen, Cid Reyes-Bustos, Masato Wakayama

    Abstract: The asymmetric quantum Rabi model (AQRM) is a fundamental model in quantum optics describing the interaction of light and matter. Besides its immediate physical interest, the AQRM possesses an intriguing mathematical structure which is far from being completely understood. In this paper, we focus on the distribution of the level spacing, the difference between consecutive eigenvalues of the AQRM i… ▽ More

    Submitted 9 February, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: 28 pages. 15 figures. The conjecture in Section 4.4 (Theorem 4.5 in the current version) was proved using results published after the previous version. The rest of the manuscript was modified slightly according to this change

    MSC Class: 47B06 (Primary) 81V73; 81R40 (Secondary)

  12. Syntactically and semantically regular languages of lambda-terms coincide through logical relations

    Authors: Vincent Moreau, Lê Thành Dũng Nguyên

    Abstract: A fundamental theme in automata theory is regular languages of words and trees, and their many equivalent definitions. Salvati has proposed a generalization to regular languages of simply typed $λらむだ$-terms, defined using denotational semantics in finite sets. We provide here some evidence for its robustness. First, we give an equivalent syntactic characterization that naturally extends the seminal… ▽ More

    Submitted 8 February, 2024; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: The proofs on "finitely pointable" CCCs in versions 1 and 2 were wrong; we now make slightly weaker claims on well-pointed locally finite CCCs. New in this version: added reference [3] and official DOI (proceedings of CSL 2024)

  13. arXiv:2307.15335  [pdf, other

    cs.CL cs.CV

    BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering

    Authors: Khiem Vinh Tran, Kiet Van Nguyen, Ngan Luu Thuy Nguyen

    Abstract: Visual Question Answering (VQA) is an intricate and demanding task that integrates natural language processing (NLP) and computer vision (CV), capturing the interest of researchers. The English language, renowned for its wealth of resources, has witnessed notable advancements in both datasets and models designed for VQA. However, there is a lack of models that target specific countries such as Vie… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  14. arXiv:2307.11057  [pdf, other

    cs.FL

    Two-way automata and transducers with planar behaviours are aperiodic

    Authors: Lê Thành Dũng Nguyên, Camille Noûs, Cécilia Pradic

    Abstract: We consider a notion of planarity for two-way finite automata and transducers, inspired by Temperley-Lieb monoids of planar diagrams. We show that this restriction captures star-free languages and first-order transductions.

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: 18 pages, DMTCS submission

  15. arXiv:2306.08798  [pdf, other

    cs.CL stat.ML

    MPSA-DenseNet: A novel deep learning model for English accent classification

    Authors: Tianyu Song, Linh Thi Hoai Nguyen, Ton Viet Ta

    Abstract: This paper presents three innovative deep learning models for English accent classification: Multi-DenseNet, PSA-DenseNet, and MPSE-DenseNet, that combine multi-task learning and the PSA module attention mechanism with DenseNet. We applied these models to data collected from six dialects of English across native English speaking regions (Britain, the United States, Scotland) and nonnative English… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  16. arXiv:2305.19709  [pdf, other

    cs.CL cs.SD eess.AS

    XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech

    Authors: Linh The Nguyen, Thinh Pham, Dat Quoc Nguyen

    Abstract: We present XPhoneBERT, the first multilingual model pre-trained to learn phoneme representations for the downstream text-to-speech (TTS) task. Our XPhoneBERT has the same model architecture as BERT-base, trained using the RoBERTa pre-training approach on 330M phoneme-level sentences from nearly 100 languages and locales. Experimental results show that employing XPhoneBERT as an input phoneme encod… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: In Proceedings of INTERSPEECH 2023 (to appear)

  17. arXiv:2305.12601  [pdf, other

    cs.LO cs.PL

    Simply typed convertibility is TOWER-complete even for safe lambda-terms

    Authors: Lê Thành Dũng Nguyên

    Abstract: We consider the following decision problem: given two simply typed $λらむだ$-terms, are they $βべーた$-convertible? Equivalently, do they have the same normal form? It is famously non-elementary, but the precise complexity - namely TOWER-complete - is lesser known. One goal of this short paper is to popularize this fact. Our original contribution is to show that the problem stays TOWER-complete when the two… ▽ More

    Submitted 12 July, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: final revision after acceptance to Logical Methods in Computer Science

  18. arXiv:2303.06546  [pdf, other

    cs.CR cs.AI cs.DC

    Blockchain-Empowered Trustworthy Data Sharing: Fundamentals, Applications, and Challenges

    Authors: Linh T. Nguyen, Lam Duc Nguyen, Thong Hoang, Dilum Bandara, Qin Wang, Qinghua Lu, Xiwei Xu, Liming Zhu, Petar Popovski, Shiping Chen

    Abstract: Various data-sharing platforms have emerged with the growing public demand for open data and legislation mandating certain data to remain open. Most of these platforms remain opaque, leading to many questions about data accuracy, provenance and lineage, privacy implications, consent management, and the lack of fair incentives for data providers. With their transparency, immutability, non-repudiati… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: 40 pages, 15 figures, and 8 tables

  19. arXiv:2303.05692  [pdf, ps, other

    cs.CV

    Semantic-Preserving Augmentation for Robust Image-Text Retrieval

    Authors: Sunwoo Kim, Kyuhong Shim, Luong Trung Nguyen, Byonghyo Shim

    Abstract: Image text retrieval is a task to search for the proper textual descriptions of the visual world and vice versa. One challenge of this task is the vulnerability to input image and text corruptions. Such corruptions are often unobserved during the training, and degrade the retrieval model decision quality substantially. In this paper, we propose a novel image text retrieval technique, referred to a… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted to ICASSP 2023

  20. arXiv:2303.01706  [pdf, other

    q-bio.PE math.DS

    A Geometrical Structure for Predator-Avoidance Fish Schooling

    Authors: Aditya Dewanto Hartono, Ton Viet Ta, Linh Thi Hoai Nguyen

    Abstract: This paper conducts a numerical study of a geometrical structure called $εいぷしろん$-school for predator-avoidance fish schools, based on our previous mathematical model. Our results show that during a predator attack, the number of $εいぷしろん$-school increases from one to a certain value. After the attack, the number of $εいぷしろん$-school decreases in the first two predator-avoidance patterns, but continues to increase i… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  21. arXiv:2302.11789  [pdf, other

    math.OC

    Interval optimization problems on Hadamard manifolds:Solvability and Duality

    Authors: Le Tram Nguyen, Yu-Lin Chang, Chu-Chin Hu, Jein-Shan Chen

    Abstract: In this paper, we will study about the solvability and duality of interval optimization problems on Hadamard manifolds. It includes the KKT conditions, and Wofle dual problem with weak duality and strong duality. These results are the complement for the solvability of interval optimization problems on Hadamard manifolds.

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2205.11793

  22. arXiv:2301.09234  [pdf, other

    cs.FL cs.LO

    Refutations of pebble minimization via output languages

    Authors: Sandra Kiefer, Lê Thành Dũng Nguyên, Cécilia Pradic

    Abstract: Polyregular functions are the class of string-to-string functions definable by pebble transducers, an extension of finite-state automata with outputs and multiple two-way reading heads (pebbles) with a stack discipline. If a polyregular function can be computed with $k$ pebbles, then its output length is bounded by a polynomial of degree $k$ in the input length. But Bojańczyk has shown that the co… ▽ More

    Submitted 20 June, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

    Comments: 20 pages, for submission to Fundamenta Informaticae; this version excludes some of the material in the v1, which may appear in other subsequent papers

  23. arXiv:2210.03989  [pdf, other

    math.DS

    A Stochastic Differential Equation Model for Predator-Avoidance Fish Schooling

    Authors: Aditya Dewanto Hartono, Linh Thi Hoai Nguyen, Ton Viet Ta

    Abstract: This paper presents a system of stochastic differential equations (SDEs) as mathematical model to describe the spatial-temporal dynamics of predator-prey system in an artificial aquatic environment with schooling behavior imposed upon the associated prey. The proposed model follows the particle-like approach where interactions among the associated units are manifested through combination of attrac… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    MSC Class: 92-10; 60H10; 68W10

  24. arXiv:2209.10482  [pdf, other

    cs.CL

    SMTCE: A Social Media Text Classification Evaluation Benchmark and BERTology Models for Vietnamese

    Authors: Luan Thanh Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Text classification is a typical natural language processing or computational linguistics task with various interesting applications. As the number of users on social media platforms increases, data acceleration promotes emerging studies on Social Media Text Classification (SMTC) or social media text mining on these valuable resources. In contrast to English, Vietnamese, one of the low-resource la… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: Accepted at The 36th annual Meeting of Pacific Asia Conference on Language, Information and Computation (PACLIC 36)

  25. A System of Interaction and Structure III: The Complexity of BV and Pomset Logic

    Authors: Lê Thành Dũng Nguyên, Lutz Straßburger

    Abstract: Pomset logic and BV are both logics that extend multiplicative linear logic (with Mix) with a third connective that is self-dual and non-commutative. Whereas pomset logic originates from the study of coherence spaces and proof nets, BV originates from the study of series-parallel orders, cographs, and proof systems. Both logics enjoy a cut-admissibility result, but for neither logic can this be do… ▽ More

    Submitted 15 December, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Journal ref: Logical Methods in Computer Science, Volume 19, Issue 4 (December 18, 2023) lmcs:10057

  26. vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM

    Authors: Thanh Tin Nguyen, Long H. Nguyen, Nhat Truong Pham, Liu Tai Nguyen, Van Huong Do, Hai Nguyen, Ngoc Duy Nguyen

    Abstract: This study presents our approach on the automatic Vietnamese image captioning for healthcare domain in text processing tasks of Vietnamese Language and Speech Processing (VLSP) Challenge 2021, as shown in Figure 1. In recent years, image captioning often employs a convolutional neural network-based architecture as an encoder and a long short-term memory (LSTM) as a decoder to generate sentences. T… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: Accepted for publication in the VNU Journal of Science: Computer Science and Communication Engineering

    Journal ref: VNU Journal of Science: Computer Science and Communication Engineering, 38(2), 2022

  27. arXiv:2208.04243  [pdf, other

    cs.CL

    A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation

    Authors: Linh The Nguyen, Nguyen Luong Tran, Long Doan, Manh Luong, Dat Quoc Nguyen

    Abstract: In this paper, we introduce a high-quality and large-scale benchmark dataset for English-Vietnamese speech translation with 508 audio hours, consisting of 331K triplets of (sentence-lengthed audio, English source transcript sentence, Vietnamese target subtitle sentence). We also conduct empirical experiments using strong baselines and find that the traditional "Cascaded" approach still outperforms… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: In Proceedings of INTERSPEECH 2022, to appear. The first three authors contributed equally to this work

  28. arXiv:2206.09600  [pdf, other

    cs.CL

    SPBERTQA: A Two-Stage Question Answering System Based on Sentence Transformers for Medical Texts

    Authors: Nhung Thi-Hong Nguyen, Phuong Phan-Dieu Ha, Luan Thanh Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Question answering (QA) systems have gained explosive attention in recent years. However, QA tasks in Vietnamese do not have many datasets. Significantly, there is mostly no dataset in the medical domain. Therefore, we built a Vietnamese Healthcare Question Answering dataset (ViHealthQA), including 10,015 question-answer passage pairs for this task, in which questions from health-interested users… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  29. arXiv:2205.11793  [pdf, other

    math.OC

    Interval Optimization Problems on Hadamard manifolds

    Authors: L. T. Nguyen, Y. L Chang, C. C Hu, J. S Chen

    Abstract: In this article, we introduce the interval optimization problems (IOPs) on Hadamard manifolds as well as study the relationship between them and the interval variational inequalities. To achieve the theoretical results, we build up some new concepts about $gH$-directional derivative and $gH$-Gâteaux differentiability of interval valued functions and their properties on the Hadamard manifolds. Th… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: submitted

  30. VLSP 2021 - ViMRC Challenge: Vietnamese Machine Reading Comprehension

    Authors: Kiet Van Nguyen, Son Quoc Tran, Luan Thanh Nguyen, Tin Van Huynh, Son T. Luu, Ngan Luu-Thuy Nguyen

    Abstract: One of the emerging research trends in natural language understanding is machine reading comprehension (MRC) which is the task to find answers to human questions based on textual data. Existing Vietnamese datasets for MRC research concentrate solely on answerable questions. However, in reality, questions can be unanswerable for which the correct answer is not stated in the given textual data. To a… ▽ More

    Submitted 4 April, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: The 8th International Workshop on Vietnamese Language and Speech Processing (VLSP 2021)

  31. arXiv:2110.12199  [pdf, other

    cs.CL

    PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation

    Authors: Long Doan, Linh The Nguyen, Nguyen Luong Tran, Thai Hoang, Dat Quoc Nguyen

    Abstract: We introduce a high-quality and large-scale Vietnamese-English parallel dataset of 3.02M sentence pairs, which is 2.9M pairs larger than the benchmark Vietnamese-English machine translation corpus IWSLT15. We conduct experiments comparing strong neural baselines and well-known automatic translation engines on our dataset and find that in both automatic and human evaluations: the best performance i… ▽ More

    Submitted 23 October, 2021; originally announced October 2021.

    Comments: To appear in Proceedings of EMNLP 2021 (main conference). The first three authors contribute equally to this work

  32. Gradual Federated Learning with Simulated Annealing

    Authors: Luong Trung Nguyen, Junhan Kim, Byonghyo Shim

    Abstract: Federated averaging (FedAvg) is a popular federated learning (FL) technique that updates the global model by averaging local models and then transmits the updated global model to devices for their local model update. One main limitation of FedAvg is that the average-based global model is not necessarily better than local models in the early stage of the training process so that FedAvg might diverg… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  33. arXiv:2109.03219  [pdf, other

    cs.SD cs.LG cs.NE eess.AS

    Fruit-CoV: An Efficient Vision-based Framework for Speedy Detection and Diagnosis of SARS-CoV-2 Infections Through Recorded Cough Sounds

    Authors: Long H. Nguyen, Nhat Truong Pham, Van Huong Do, Liu Tai Nguyen, Thanh Tin Nguyen, Van Dung Do, Hai Nguyen, Ngoc Duy Nguyen

    Abstract: SARS-CoV-2 is colloquially known as COVID-19 that had an initial outbreak in December 2019. The deadly virus has spread across the world, taking part in the global pandemic disease since March 2020. In addition, a recent variant of SARS-CoV-2 named Delta is intractably contagious and responsible for more than four million deaths over the world. Therefore, it is vital to possess a self-testing serv… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: 4 pages

  34. Singular angular magnetoresistance and sharp resonant features in a high-mobility metal with open orbits, ReO3

    Authors: Nicholas P. Quirk, Loi T. Nguyen, Jiayi Hu, R. J. Cava, N. P. Ong

    Abstract: We report high-resolution angular magnetoresistance (AMR) experiments performed on crystals of ReO$_3$ with high mobility (90,000 cm$^2$/Vs at 2 K) and extremely low residual resistivity (5-8 n$Ωおめが$cm). The Fermi surface, comprised of intersecting cylinders, supports open orbits. The resistivity $ρろー_{xx}$ in a magnetic field $B$ = 9 T displays a singular pattern of behavior. With… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 12 pages, 7 figures

    Journal ref: Phys. Rev. M 5, 105004 (2021)

  35. arXiv:2105.15079  [pdf, other

    cs.CL

    SA2SL: From Aspect-Based Sentiment Analysis to Social Listening System for Business Intelligence

    Authors: Luong Luc Phan, Phuc Huynh Pham, Kim Thi-Thanh Nguyen, Tham Thi Nguyen, Sieu Khai Huynh, Luan Thanh Nguyen, Tin Van Huynh, Kiet Van Nguyen

    Abstract: In this paper, we present a process of building a social listening system based on aspect-based sentiment analysis in Vietnamese from creating a dataset to building a real application. Firstly, we create UIT-ViSFD, a Vietnamese Smartphone Feedback Dataset as a new benchmark corpus built based on a strict annotation schemes for evaluating aspect-based sentiment analysis, consisting of 11,122 human-… ▽ More

    Submitted 10 June, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

  36. arXiv:2105.08358  [pdf, other

    cs.FL

    Comparison-free polyregular functions

    Authors: Lê Thành Dũng Tito Nguyên, Camille Noûs, Cécilia Pradic

    Abstract: This paper introduces a new automata-theoretic class of string-to-string functions with polynomial growth. Several equivalent definitions are provided: a machine model which is a restricted variant of pebble transducers, and a few inductive definitions that close the class of regular functions under certain operations. Our motivation for studying this class comes from another characterization, whi… ▽ More

    Submitted 22 February, 2023; v1 submitted 18 May, 2021; originally announced May 2021.

    Journal ref: International Colloquium on Automata, Languages and Programming 2021, Jul 2021, Glasgow, United Kingdom

  37. arXiv:2105.04722  [pdf, other

    physics.class-ph

    On the Electrostatic Interaction between Point Charges due to Dielectrical Shielding

    Authors: Long T. Nguyen, Kim Tuan Do, Duy V. Nguyen, Trung Phan

    Abstract: How will the electrostatic interaction between two point charges change if they are shielded from the other by a dielectrical slab? While the physical setting of this electromagnetic problem is relatively simple, it is easy to be wronged and the correct solution is surprisingly complicated. Here we will show a general answer using the method of images, in which the electrical field are not found b… ▽ More

    Submitted 31 October, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

    Journal ref: Progress In Electromagnetics Research Letters, Vol. 107, 111-118, 2022

  38. arXiv:2104.11969  [pdf, ps, other

    cs.CL

    Vietnamese Complaint Detection on E-Commerce Websites

    Authors: Nhung Thi-Hong Nguyen, Phuong Phan-Dieu Ha, Luan Thanh Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Customer product reviews play a role in improving the quality of products and services for business organizations or their brands. Complaining is an attitude that expresses dissatisfaction with an event or a product not meeting customer expectations. In this paper, we build a Open-domain Complaint Detection dataset (UIT-ViOCD), including 5,485 human-annotated reviews on four categories about produ… ▽ More

    Submitted 5 July, 2021; v1 submitted 24 April, 2021; originally announced April 2021.

  39. arXiv:2104.07376  [pdf, other

    cs.CL

    UIT-E10dot3 at SemEval-2021 Task 5: Toxic Spans Detection with Named Entity Recognition and Question-Answering Approaches

    Authors: Phu Gia Hoang, Luan Thanh Nguyen, Kiet Van Nguyen

    Abstract: The increment of toxic comments on online space is causing tremendous effects on other vulnerable users. For this reason, considerable efforts are made to deal with this, and SemEval-2021 Task 5: Toxic Spans Detection is one of those. This task asks competitors to extract spans that have toxicity from the given texts, and we have done several analyses to understand its structure before doing exper… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted at SemEval-2021 Task 5: Toxic Spans Detection, ACL-IJCNLP 2021

  40. Black hole mass measurement using ALMA observations of [CI] and CO emissions in the Seyfert 1 galaxy NGC7469

    Authors: Dieu D. Nguyen, Takuma Izumi, Sabine Thater, Masatoshi Imanishi, Taiki Kawamuro, Shunsuke Baba, Suzuka Nakano, Jean L. Turner, Kotaro Kohno, Satoki Matsushita, Sergio Martin, David S. Meier, Phuong M. Nguyen, Lam T. Nguyen

    Abstract: We present a supermassive black hole (SMBH) mass measurement in the Seyfert 1 galaxy NGC7469 using Atacama Large Millimeter/submillimeter Array (ALMA) observations of the atomic-${\rm [CI]}$(1-0) and molecular-$^{12}$CO(1-0) emission lines at the spatial resolution of $\approx0.3$" (or $\approx$ 100 pc). These emissions reveal that NGC7469 hosts a circumnuclear gas disc (CND) with a ring-like stru… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: 22 pages, 16 figures, 7 tables. Accepted for publication on MNRAS

  41. arXiv:2104.02983  [pdf, other

    math.NA

    Optimal fire allocation in a combat model of mixed NCW type

    Authors: My A. Vu, Nam H. Nguyen, Hanh Le T. Nguyen, Anh N. Ta, Mong H. Nguyen

    Abstract: In this work, we introduce a nonlinear Lanchester model of NCW-type and study a problem of finding the optimal fire allocation for this model. A Blue party $B$ will fight against a Red party consisting of $A$ and $R$, where $A$ is an independent force and $R$ fights with supports from a supply unit $N$. A battle may consist of several stages but we consider the problem of finding optimal fire allo… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

  42. Constructive and Toxic Speech Detection for Open-domain Social Media Comments in Vietnamese

    Authors: Luan Thanh Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: The rise of social media has led to the increasing of comments on online forums. However, there still exists invalid comments which are not informative for users. Moreover, those comments are also quite toxic and harmful to people. In this paper, we create a dataset for constructive and toxic speech detection, named UIT-ViCTSD (Vietnamese Constructive and Toxic Speech Detection dataset) with 10,00… ▽ More

    Submitted 6 September, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: IEA/AIE 2021: Advances and Trends in Artificial Intelligence. Artificial Intelligence Practices pp 572-583

  43. arXiv:2101.01476  [pdf, other

    cs.CL

    PhoNLP: A joint multi-task learning model for Vietnamese part-of-speech tagging, named entity recognition and dependency parsing

    Authors: Linh The Nguyen, Dat Quoc Nguyen

    Abstract: We present the first multi-task learning model -- named PhoNLP -- for joint Vietnamese part-of-speech (POS) tagging, named entity recognition (NER) and dependency parsing. Experiments on Vietnamese benchmark datasets show that PhoNLP produces state-of-the-art results, outperforming a single-task learning approach that fine-tunes the pre-trained Vietnamese language model PhoBERT (Nguyen and Nguyen,… ▽ More

    Submitted 8 April, 2021; v1 submitted 5 January, 2021; originally announced January 2021.

    Comments: To appear in Proceedings of NAACL 2021: Demonstrations

  44. arXiv:2101.00862  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci

    NbIr$_2$B$_2$ and TaIr$_2$B$_2$ -- new low symmetry noncentrosymmetric superconductors with strong spin orbit coupling

    Authors: Karolina Górnicka, Xin Gui, Bartlomiej Wiendlocha, Loi T. Nguyen, Weiwei Xie, Robert J. Cava, Tomasz Klimczuk

    Abstract: Superconductivity was first observed more than a century ago, but the search for new superconducting materials remains a challenge. The Cooper pairs in superconductors are ideal embodiments of quantum entanglement. Thus, novel superconductors can be critical for both learning about electronic systems in condensed matter and for possible application in future quantum technologies. Here two previous… ▽ More

    Submitted 4 January, 2021; originally announced January 2021.

    Comments: 36 pages, 11 figures

    Journal ref: Adv. Funct. Mater. 2020, 2007960

  45. arXiv:2012.14969  [pdf

    cond-mat.mes-hall cond-mat.supr-con

    Van der Waals Heterostructure Magnetic Josephson Junction

    Authors: H. Idzuchi, F. Pientka, K. -F. Huang, K. Harada, Ö. Gül, Y. J. Shin, L. T. Nguyen, N. H. Jo, D. Shindo, R. J. Cava, P. C. Canfield, P. Kim

    Abstract: When two superconductors are connected across a ferromagnet, the spin configuration of the transferred Cooper pairs can be modulated due to magnetic exchange interaction. The resulting supercurrent can reverse its sign across the Josephson junction (JJ) [1-4]. Here we demonstrate Josephson phase modulation in van der Waals heterostructures when Cooper pairs from superconducting NbSe$_2$ tunnel thr… ▽ More

    Submitted 29 December, 2020; originally announced December 2020.

  46. arXiv:2011.00103  [pdf

    cond-mat.mtrl-sci

    Low Temperature Structural Phase Transition in the Perovskite Ba2CaMoO6

    Authors: Loi T. Nguyen, Robert J. Cava, Allyson M. Fry-Petit

    Abstract: Ba2CaMoO6 was synthesized by solid state method. The crystal structure adopts cubic Fm-3m space group at room temperature with lattice parameters of 8.378231(5) {angstroms}. Upon cooling, Ba2CaMoO6 was determined to have a structural phase transition to tetragonal I4/m (a=5.905763(6) {angstroms} and c=8.38817(1) {angstroms}) around 200 K. The phase transition was probed structurally by synchrotron… ▽ More

    Submitted 30 October, 2020; originally announced November 2020.

    Comments: 5 figures, 3 tables

    Journal ref: Journal of Solid State Chemistry (2019)

  47. arXiv:2010.08232  [pdf, other

    cs.CL

    WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets

    Authors: Dat Quoc Nguyen, Thanh Vu, Afshin Rahimi, Mai Hoang Dao, Linh The Nguyen, Long Doan

    Abstract: In this paper, we provide an overview of the WNUT-2020 shared task on the identification of informative COVID-19 English Tweets. We describe how we construct a corpus of 10K Tweets and organize the development and evaluation phases for this task. In addition, we also present a brief summary of results obtained from the final system evaluation submissions of 55 teams, finding that (i) many systems… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: In Proceedings of the 6th Workshop on Noisy User-generated Text

  48. arXiv:2009.11215  [pdf

    cond-mat.str-el

    Structure, Magnetism and First Principles Modeling of the Na0.5La0.5RuO3 Perovskite

    Authors: Loi T. Nguyen, Matthieu Saubanère, Robert J. Cava

    Abstract: High purity polycrystalline Na0.5La0.5RuO3 was synthesized by a solid state method, and its properties were studied by magnetic susceptibility, heat capacity and resistivity measurements. We find it to be a tetragonal perovskite, in contrast to an earlier report, with random La/Na mixing. With a Curie-Weiss temperature of -231 K and effective moment of 2.74 uB/mol-Ru, there is no magnetic ordering… ▽ More

    Submitted 23 September, 2020; originally announced September 2020.

    Comments: 7 figures, 2 tables

  49. Widely Spaced Planes of Magnetic Dimers in the Ba6Y2Rh2Ti2O17-δでるた Hexagonal Perovskite

    Authors: Loi T. Nguyen, Daniel B. Straus, Q. Zhang, R. J. Cava

    Abstract: We report the synthesis and initial characterization of Ba6Y2Rh2Ti2O17-δでるた, a previously unreported material with a hexagonal symmetry structure. Face-sharing RhO6 octahedra form triangular planes of Rh2O9 dimers that are widely separated in the perpendicular direction. The material displays a small effective magnetic moment, due to the Rh ions present, and a negative Curie-Weiss temperature. The ch… ▽ More

    Submitted 19 December, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: 5 figures, 2 tables

    Journal ref: Phys. Rev. Materials 5, 034419 (2021)

  50. BANANA at WNUT-2020 Task 2: Identifying COVID-19 Information on Twitter by Combining Deep Learning and Transfer Learning Models

    Authors: Tin Van Huynh, Luan Thanh Nguyen, Son T. Luu

    Abstract: The outbreak COVID-19 virus caused a significant impact on the health of people all over the world. Therefore, it is essential to have a piece of constant and accurate information about the disease with everyone. This paper describes our prediction system for WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets. The dataset for this task contains size 10,000 tweets in English la… ▽ More

    Submitted 1 April, 2021; v1 submitted 6 September, 2020; originally announced September 2020.

    Comments: Submitted to 2020 The 6th Workshop on Noisy User-generated Text (W-NUT)