Search | arXiv e-print repository

Showing 1–50 of 123 results for author: Dinh, T

Searching in archive cs.
  1. arXiv:2406.10421  [pdf, other]

    cs.CL

    SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading

    Authors: Tu Anh Dinh, Carlos Mullov, Leonard Bärmann, Zhaolin Li, Danni Liu, Simon Reiß, Jueun Lee, Nathan Lerzer, Fabian Ternava, Jianfeng Gao, Tobias Röddiger, Alexander Waibel, Tamim Asfour, Michael Beigl, Rainer Stiefelhagen, Carsten Dachsbacher, Klemens Böhm, Jan Niehues

    Abstract: With the rapid development of Large Language Models (LLMs), it is crucial to have benchmarks which can evaluate the ability of LLMs on different domains. One common use of LLMs is performing tasks on scientific topics, such as writing algorithms, querying databases or giving mathematical proofs. Inspired by the way university students are evaluated on such tasks, in this paper, we propose SciEx -… ▽ More

    Submitted 12 July, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    ACM Class: I.2.7

  2. arXiv:2405.07180  [pdf, other]

    cs.IT

    Repairing Reed-Solomon Codes with Side Information

    Authors: Thi Xinh Dinh, Ba Thong Le, Son Hoang Dau, Serdar Boztas, Stanislav Kruglik, Han Mao Kiah, Emanuele Viterbo, Tuvi Etzion, Yeow Meng Chee

    Abstract: We generalize the problem of recovering a lost/erased symbol in a Reed-Solomon code to the scenario in which some side information about the lost symbol is known. The side information is represented as a set $S$ of linearly independent combinations of the sub-symbols of the lost symbol. When $S = \varnothing$, this reduces to the standard problem of repairing a single codeword symbol. When $S$ is… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    MSC Class: 94B05; 94B60 ACM Class: E.4

  3. arXiv:2404.18031  [pdf, other]

    cs.CL

    Quality Estimation with $k$-nearest Neighbors and Automatic Evaluation for Model-specific Quality Estimation

    Authors: Tu Anh Dinh, Tobias Palzer, Jan Niehues

    Abstract: Providing quality scores along with Machine Translation (MT) output, so-called reference-free Quality Estimation (QE), is crucial to inform users about the reliability of the translation. We propose a model-specific, unsupervised QE approach, termed $k$NN-QE, that extracts information from the MT model's training data using $k$-nearest neighbors. Measuring the performance of model-specific QE is n… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Accepted to EAMT 2024

    ACM Class: I.2.7

  4. arXiv:2403.15509  [pdf, other]

    cs.CR cs.AI cs.LG

    Twin Auto-Encoder Model for Learning Separable Representation in Cyberattack Detection

    Authors: Phai Vu Dinh, Quang Uy Nguyen, Thai Hoang Dinh, Diep N. Nguyen, Bao Son Pham, Eryk Dutkiewicz

    Abstract: Representation Learning (RL) plays a pivotal role in the success of many problems including cyberattack detection. Most of the RL methods for cyberattack detection are based on the latent vector of Auto-Encoder (AE) models. An AE transforms raw data into a new latent representation that better exposes the underlying characteristics of the input data. Thus, it is very useful for identifying cyberat… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  5. arXiv:2401.10447  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE cs.SD eess.AS

    Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition

    Authors: Yu Yu, Chao-Han Huck Yang, Tuan Dinh, Sungho Ryu, Jari Kolehmainen, Roger Ren, Denis Filimonov, Prashanth G. Shivakumar, Ankur Gandhe, Ariya Rastow, Jia Xu, Ivan Bulyko, Andreas Stolcke

    Abstract: The use of low-rank adaptation (LoRA) with frozen pretrained language models (PLMs) has become increasingly popular as a mainstream, resource-efficient modeling approach for memory-constrained hardware. In this study, we first explore how to enhance model performance by introducing various LoRA training strategies, achieving relative word error rate reductions of 3.50\% on the public Librispeech dat… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  6. arXiv:2311.07289  [pdf, other]

    cs.LG

    A probabilistic forecast methodology for volatile electricity prices in the Australian National Electricity Market

    Authors: Cameron Cornell, Nam Trong Dinh, S. Ali Pourmousavi

    Abstract: The South Australia region of the Australian National Electricity Market (NEM) displays some of the highest levels of price volatility observed in modern electricity markets. This paper outlines an approach to probabilistic forecasting under these extreme conditions, including spike filtration and several post-processing steps. We propose using quantile regression as an ensemble tool for probabili… ▽ More

    Submitted 12 December, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: This manuscript has been accepted for publication in International Journal of Forecasting

  7. arXiv:2310.02494  [pdf, other]

    eess.SY cs.IT

    On the Financial Consequences of Simplified Battery Sizing Models without Considering Operational Details

    Authors: Nam Trong Dinh, Sahand Karimi-Arpanahi, S. Ali Pourmousavi, Mingyu Guo, Julian Lemos-Vinasco, Jon A. R. Liisberg

    Abstract: Optimal battery sizing studies tend to overly simplify the practical aspects of battery operation within the battery sizing framework. Such assumptions may lead to a suboptimal battery capacity, resulting in significant financial losses for a battery project that could last more than a decade. In this paper, we compare the most common existing sizing methods in the literature with a battery sizing… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: This manuscript has been submitted to PSCC 2024 for possible publication

  8. arXiv:2309.15223  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE cs.SD eess.AS

    Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

    Authors: Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth G. Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastow, Ivan Bulyko

    Abstract: We propose a neural language modeling system based on low-rank adaptation (LoRA) for speech recognition output rescoring. Although pretrained language models (LMs) like BERT have shown superior performance in second-pass rescoring, the high computational cost of scaling up the pretraining stage and adapting the pretrained models to specific domains limit their practical use in rescoring. Here we p… ▽ More

    Submitted 10 October, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE ASRU 2023. Internal Review Approved. Revised 2nd version with Andreas and Huck. The first version is in Sep 29th. 8 pages

    Journal ref: Proc. IEEE ASRU Workshop, Dec. 2023

  9. arXiv:2309.09012  [pdf, other]

    eess.SY cs.IT

    Modelling Irrational Behaviour of Residential End Users using Non-Stationary Gaussian Processes

    Authors: Nam Trong Dinh, Sahand Karimi-Arpanahi, Rui Yuan, S. Ali Pourmousavi, Mingyu Guo, Jon A. R. Liisberg, Julian Lemos-Vinasco

    Abstract: Demand response (DR) plays a critical role in ensuring efficient electricity consumption and optimal use of network assets. Yet, existing DR models often overlook a crucial element, the irrational behaviour of electricity end users. In this work, we propose a price-responsive model that incorporates key aspects of end-user irrationality, specifically loss aversion, time inconsistency, and bounded… ▽ More

    Submitted 26 March, 2024; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: This manuscript has been accepted for publication in IEEE Transactions on Smart Grid

  10. arXiv:2308.03415  [pdf, other]

    cs.CL cs.AI

    End-to-End Evaluation for Low-Latency Simultaneous Speech Translation

    Authors: Christian Huber, Tu Anh Dinh, Carlos Mullov, Ngoc Quan Pham, Thai Binh Nguyen, Fabian Retkowski, Stefan Constantin, Enes Yavuz Ugan, Danni Liu, Zhaolin Li, Sai Koneru, Jan Niehues, Alexander Waibel

    Abstract: The challenge of low-latency speech translation has recently drawn significant interest in the research community as shown by several publications and shared tasks. Therefore, it is essential to evaluate these different approaches in realistic scenarios. However, currently only specific aspects of the systems are evaluated and often it is not possible to compare different approaches. In this work… ▽ More

    Submitted 17 July, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: Demo paper at EMNLP 2023

  11. arXiv:2306.15860  [pdf, other]

    cs.NI

    Federated Deep Reinforcement Learning-based Bitrate Adaptation for Dynamic Adaptive Streaming over HTTP

    Authors: Phuong L. Vo, Nghia T. Nguyen, Long Luu, Canh T. Dinh, Nguyen H. Tran, Tuan-Anh Le

    Abstract: In video streaming over HTTP, the bitrate adaptation selects the quality of video chunks depending on the current network condition. Some previous works have applied deep reinforcement learning (DRL) algorithms to determine the chunk's bitrate from the observed states to maximize the quality-of-experience (QoE). However, to build an intelligent model that can predict in various environments, such… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 13 pages, 1 column

  12. arXiv:2306.09754  [pdf, other]

    cs.CR

    CroCoDai: A Stablecoin for Cross-Chain Commerce

    Authors: Daniël Reijsbergen, Bretislav Hajek, Tien Tuan Anh Dinh, Jussi Keppo, Hank Korth, Anwitaman Datta

    Abstract: Decentralized Finance (DeFi), in which digital assets are exchanged without trusted intermediaries, has grown rapidly in value in recent years. The global DeFi ecosystem is fragmented into multiple blockchains, fueling the demand for cross-chain commerce. Existing approaches for cross-chain transactions, e.g., bridges and cross-chain deals, achieve atomicity by locking assets in escrow. However, l… ▽ More

    Submitted 20 June, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

  13. arXiv:2306.09735  [pdf, other]

    cs.CR

    PIEChain -- A Practical Blockchain Interoperability Framework

    Authors: Daniël Reijsbergen, Aung Maw, Jingchi Zhang, Tien Tuan Anh Dinh, Anwitaman Datta

    Abstract: A plethora of different blockchain platforms have emerged in recent years, but many of them operate in silos. As such, there is a need for reliable cross-chain communication to enable blockchain interoperability. Blockchain interoperability is challenging because transactions can typically not be reverted - as such, if one transaction is committed then the protocol must ensure that all related tra… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  14. arXiv:2306.05320  [pdf, other]

    cs.CL cs.SD

    KIT's Multilingual Speech Translation System for IWSLT 2023

    Authors: Danni Liu, Thai Binh Nguyen, Sai Koneru, Enes Yavuz Ugan, Ngoc-Quan Pham, Tuan-Nam Nguyen, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues

    Abstract: Many existing speech translation benchmarks focus on native-English speech in high-quality recording conditions, which often do not match the conditions in real-life use-cases. In this paper, we describe our speech translation system for the multilingual track of IWSLT 2023, which evaluates translation quality on scientific conference talks. The test condition features accented input speech and te… ▽ More

    Submitted 12 July, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: IWSLT 2023

  15. arXiv:2306.03438  [pdf, other]

    cs.LG cs.AI cs.CL cs.SE

    Large Language Models of Code Fail at Completing Code with Potential Bugs

    Authors: Tuan Dinh, Jinman Zhao, Samson Tan, Renato Negrinho, Leonard Lausen, Sheng Zha, George Karypis

    Abstract: Large language models of code (Code-LLMs) have recently brought tremendous advances to code completion, a fundamental feature of programming assistance and code intelligence. However, most existing works ignore the possible presence of bugs in the code context for generation, which are inevitable in software development. Therefore, we introduce and study the buggy-code completion problem, inspired… ▽ More

    Submitted 30 November, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 27 pages, accepted to NeurIPS 2023

  16. arXiv:2305.07457  [pdf, other]

    cs.CL

    Perturbation-based QE: An Explainable, Unsupervised Word-level Quality Estimation Method for Blackbox Machine Translation

    Authors: Tu Anh Dinh, Jan Niehues

    Abstract: Quality Estimation (QE) is the task of predicting the quality of Machine Translation (MT) system output, without using any gold-standard translation references. State-of-the-art QE models are supervised: they require human-labeled quality of some MT system output on some datasets for training, making them domain-dependent and MT-system-dependent. There has been research on unsupervised QE, which r… ▽ More

    Submitted 13 July, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: Accepted to MT Summit 2023

    ACM Class: I.2.7

  17. arXiv:2305.06600  [pdf, other]

    cs.IT cs.DM

    Designing Compact Repair Groups for Reed-Solomon Codes

    Authors: Thi Xinh Dinh, Serdar Boztas, Son Hoang Dau, Emanuele Viterbo

    Abstract: Motivated by the application of Reed-Solomon codes to recently emerging decentralized storage systems such as Storj and Filebase/Sia, we study the problem of designing compact repair groups for recovering multiple failures in a decentralized manner. Here, compactness means that the corresponding trace repair schemes of these groups of helpers can be generated from a single or a few seed repair sch… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

  18. Development of a Vision System to Enhance the Reliability of the Pick-and-Place Robot for Autonomous Testing of Camera Module used in Smartphones

    Authors: Hoang-Anh Phan, Duy Nam Bui, Tuan Nguyen Dinh, Bao-Anh Hoang, An Nguyen Ngoc, Dong Tran Huu Quoc, Ha Tran Thi Thuy, Tung Thanh Bui, Van Nguyen Thi Thanh

    Abstract: Pick-and-place robots are commonly used in modern industrial manufacturing. For complex devices/parts like camera modules used in smartphones, which contain optical parts, electrical components and interfacing connectors, the placement operation may not be absolutely accurate, which may cause damage in the device under test during the mechanical movement to make good contact for electrical functions… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Published to 2021 International Conference on Engineering and Emerging Technologies (ICEET 2021). 6 pages

  19. arXiv:2304.13671  [pdf, other]

    math.OC cs.AI

    Multiobjective Logistics Optimization for Automated ATM Cash Replenishment Process

    Authors: Bui Tien Thanh, Dinh Van Tuan, Tuan Anh Chi, Nguyen Van Dai, Nguyen Tai Quang Dinh, Nguyen Thu Thuy, Nguyen Thi Xuan Hoa

    Abstract: In the digital transformation era, integrating digital technology into every aspect of banking operations improves process automation, cost efficiency, and service level improvement. Although logistics for ATM cash is a crucial task that impacts operating costs and consumer satisfaction, there has been little effort to enhance it. Specifically, in Vietnam, with a market of more than 20,000 ATMs na… ▽ More

    Submitted 22 July, 2023; v1 submitted 23 April, 2023; originally announced April 2023.

  20. arXiv:2304.03433  [pdf, other]

    cs.IT eess.SP

    Multi-User Cooperation for Covert Communication Under Quasi-Static Fading

    Authors: Jinyoung Lee, Duc Trung Dinh, Hyeonsik Yeom, Si-Hyeon Lee, Jeongseok Ha

    Abstract: This work studies a covert communication scheme for an uplink multi-user scenario in which some users are opportunistically selected to help a covert user. In particular, the selected users emit interfering signals via an orthogonal resource dedicated to the covert user together with signals for their own communications using orthogonal resources allocated to the selected users, which helps the co… ▽ More

    Submitted 10 April, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: 13 pages, 8 figures, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  21. arXiv:2302.11426  [pdf, other]

    cs.DB cs.IR

    Mining compact high utility sequential patterns

    Authors: Tai Dinh, Philippe Fournier-Viger, Huynh Van Hong

    Abstract: High utility sequential pattern mining (HUSPM) aims to mine all patterns that yield a high utility (profit) in a sequence dataset. HUSPM is useful for several applications such as market basket analysis, marketing, and website clickstream analysis. In these applications, users may also consider high utility patterns frequently appearing in the dataset to obtain more fruitful information. However,… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: Nippon (Japan) Applied Informatics Society Journal

  22. arXiv:2302.05512  [pdf]

    cs.CR cs.DL

    Composable Ledgers for Distributed Synchronic Web Archiving

    Authors: Thien-Nam Dinh, Nicholas Pattengale

    Abstract: The Synchronic Web is a highly scalable notary infrastructure that provides tamper-evident data provenance for historical web data. In this document, we describe the applicability of this infrastructure for web archiving across three envisioned stages of adoption. We codify the core mechanism enabling the value proposition: a procedure for splitting and merging cryptographic information fluidly ac… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

  23. arXiv:2301.10733  [pdf]

    cs.CR

    The Synchronic Web

    Authors: Thien-Nam Dinh, Nicholas Pattengale, Steven Elliott

    Abstract: The Synchronic Web is a distributed network for securing data provenance on the World Wide Web. By enabling clients around the world to freely commit digital information into a single shared view of history, it provides a foundational basis of truth on which to build decentralized and scalable trust across the Internet. Its core cryptographical capability allows mutually distrusting parties to cre… ▽ More

    Submitted 10 June, 2024; v1 submitted 25 January, 2023; originally announced January 2023.

  24. arXiv:2212.10723  [pdf, other]

    cs.AI

    Comparison and Evaluation of Methods for a Predict+Optimize Problem in Renewable Energy

    Authors: Christoph Bergmeir, Frits de Nijs, Abishek Sriramulu, Mahdi Abolghasemi, Richard Bean, John Betts, Quang Bui, Nam Trong Dinh, Nils Einecke, Rasul Esmaeilbeigi, Scott Ferraro, Priya Galketiya, Evgenii Genov, Robert Glasgow, Rakshitha Godahewa, Yanfei Kang, Steffen Limmer, Luis Magdalena, Pablo Montero-Manso, Daniel Peralta, Yogesh Pipada Sunil Kumar, Alejandro Rosales-Pérez, Julian Ruddick, Akylas Stratigakos, Peter Stuckey, et al. (3 additional authors not shown)

    Abstract: Algorithms that involve both forecasting and optimization are at the core of solutions to many difficult real-world problems, such as in supply chains (inventory optimization), traffic, and in the transition towards carbon-free energy generation in battery/load/production scheduling in sustainable energy systems. Typically, in these scenarios we want to solve an optimization problem that depends o… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  25. arXiv:2212.00981  [pdf, other]

    cs.CV cs.AI

    QC-StyleGAN -- Quality Controllable Image Generation and Manipulation

    Authors: Dat Viet Thanh Nguyen, Phong Tran The, Tan M. Dinh, Cuong Pham, Anh Tuan Tran

    Abstract: The introduction of high-quality image generation models, particularly the StyleGAN family, provides a powerful tool to synthesize and manipulate images. However, existing models are built upon high-quality (HQ) data as desired outputs, making them unfit for in-the-wild low-quality (LQ) images, which are common inputs for manipulation. In this work, we bridge this gap by proposing a novel GAN stru… ▽ More

    Submitted 7 December, 2022; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Accepted to NeurIPS 2022; The code is available at https://github.com/VinAIResearch/QC-StyleGAN

  26. arXiv:2211.11001  [pdf]

    cs.CV

    F2SD: A dataset for end-to-end group detection algorithms

    Authors: Giang Hoang, Tuan Nguyen Dinh, Tung Cao Hoang, Son Le Duy, Keisuke Hihara, Yumeka Utada, Akihiko Torii, Naoki Izumi, Long Tran Quoc

    Abstract: The lack of large-scale datasets has been impeding the advance of deep learning approaches to the problem of F-formation detection. Moreover, most research works on this problem rely on input sensor signals of object location and orientation rather than image signals. To address this, we develop a new, large-scale dataset of simulated images for F-formation detection, called F-formation Simulation… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: Accepted at ICMV 2022

  27. arXiv:2210.12990  [pdf, other]

    cs.LG math.OC

    Optimal activity and battery scheduling algorithm using load and solar generation forecasts

    Authors: Yogesh Pipada Sunil Kumar, Rui Yuan, Nam Trong Dinh, S. Ali Pourmousavi

    Abstract: Energy usage optimal scheduling has attracted great attention in the power system community, where various methodologies have been proposed. However, in real-world applications, the optimal scheduling problems require reliable energy forecasting, which is scarcely discussed as a joint solution to the scheduling problem. The 5\textsuperscript{th} IEEE Computational Intelligence Society (IEEE-CIS) c… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 6 pages, 4 figures, 3 tables. Accepted for IEEE proceedings as a conference paper for AUPEC 2022

  28. arXiv:2210.11702  [pdf, other]

    cs.CR

    TAP: Transparent and Privacy-Preserving Data Services

    Authors: Daniel Reijsbergen, Aung Maw, Zheng Yang, Tien Tuan Anh Dinh, Jianying Zhou

    Abstract: Users today expect more security from services that handle their data. In addition to traditional data privacy and integrity requirements, they expect transparency, i.e., that the service's processing of the data is verifiable by users and trusted auditors. Our goal is to build a multi-user system that provides data privacy, integrity, and transparency for a large number of operations, while achie… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: Accepted for USENIX Security 2023

  29. arXiv:2210.06042  [pdf, ps, other]

    cs.IT

    Efficient Hamiltonian Reduction for Quantum Annealing on SatCom Beam Placement Problem

    Authors: Thinh Q. Dinh, Son Hoang Dau, Eva Lagunas, Symeon Chatzinotas

    Abstract: Beam Placement (BP) is a well-known problem in Low-Earth Orbit (LEO) satellite communication (SatCom) systems, which can be modelled as an NP-hard clique cover problem. Recently, quantum computing has emerged as a novel technology which revolutionizes how to solve challenging optimization problems by formulating Quadratic Unconstrained Binary Optimization (QUBO), then preparing Hamiltonians as inp… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  30. arXiv:2208.04613  [pdf]

    eess.IV cs.CV cs.LG

    Res-Dense Net for 3D Covid Chest CT-scan classification

    Authors: Quoc-Huy Trinh, Minh-Van Nguyen, Thien-Phuc Nguyen Dinh

    Abstract: One of the most contentious areas of research in Medical Image Preprocessing is 3D CT-scan. With the rapid spread of COVID-19, the function of CT-scan in properly and swiftly diagnosing the disease has become critical. It has a positive impact on infection prevention. There are many tasks to diagnose the illness through CT-scan images, including COVID-19. In this paper, we propose a method that usin… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: text overlap with arXiv:2106.07524 by other authors

  31. arXiv:2208.04609  [pdf, other]

    cs.LG cs.SI

    E2EG: End-to-End Node Classification Using Graph Topology and Text-based Node Attributes

    Authors: Tu Anh Dinh, Jeroen den Boef, Joran Cornelisse, Paul Groth

    Abstract: Node classification utilizing text-based node attributes has many real-world applications, ranging from prediction of paper topics in academic citation graphs to classification of user characteristics in social media networks. State-of-the-art node classification frameworks, such as GIANT, use a two-stage pipeline: first embedding the text attributes of graph nodes then feeding the resulting embed… ▽ More

    Submitted 26 September, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: Accepted to MLoG - IEEE International Conference on Data Mining Workshops ICDMW 2023

  32. arXiv:2207.00944  [pdf, other]

    cs.DB

    GlassDB: An Efficient Verifiable Ledger Database System Through Transparency

    Authors: Cong Yue, Tien Tuan Anh Dinh, Zhongle Xie, Meihui Zhang, Gang Chen, Beng Chin Ooi, Xiaokui Xiao

    Abstract: Verifiable ledger databases protect data history against malicious tampering. Existing systems, such as blockchains and certificate transparency, are based on transparency logs -- a simple abstraction allowing users to verify that a log maintained by an untrusted server is append-only. They expose a simple key-value interface. Building a practical database from transparency logs, on the other hand… ▽ More

    Submitted 19 February, 2023; v1 submitted 2 July, 2022; originally announced July 2022.

  33. arXiv:2206.06565  [pdf, other]

    cs.LG cs.CL

    LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks

    Authors: Tuan Dinh, Yuchen Zeng, Ruisu Zhang, Ziqian Lin, Michael Gira, Shashank Rajput, Jy-yong Sohn, Dimitris Papailiopoulos, Kangwook Lee

    Abstract: Fine-tuning pretrained language models (LMs) without making any architectural changes has become a norm for learning various language downstream tasks. However, for non-language downstream tasks, a common practice is to employ task-specific designs for input, output layers, and loss functions. For instance, it is possible to fine-tune an LM into an MNIST classifier by replacing the word embedding… ▽ More

    Submitted 30 October, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: Accepted at NeurIPS 2022

  34. arXiv:2206.01432  [pdf, other]

    cs.LG cs.DC

    On the Generalization of Wasserstein Robust Federated Learning

    Authors: Tung-Anh Nguyen, Tuan Dung Nguyen, Long Tan Le, Canh T. Dinh, Nguyen H. Tran

    Abstract: In federated learning, participating clients typically possess non-i.i.d. data, posing a significant challenge to generalization to unseen distributions. To address this, we propose a Wasserstein distributionally robust optimization scheme called WAFL. Leveraging its duality, we frame WAFL as an empirical surrogate risk minimization problem, and solve it using a local SGD-based algorithm with conv… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  35. arXiv:2205.11616  [pdf, other]

    cs.CL cs.LG

    Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment

    Authors: Tuan Dinh, Jy-yong Sohn, Shashank Rajput, Timothy Ossowski, Yifei Ming, Junjie Hu, Dimitris Papailiopoulos, Kangwook Lee

    Abstract: Word translation without parallel corpora has become feasible, rivaling the performance of supervised methods. Recent findings have shown that the accuracy and robustness of unsupervised word translation (UWT) can be improved by making use of visual observations, which are universal representations across languages. In this work, we investigate the potential of using not only visual observations b… ▽ More

    Submitted 7 November, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP Findings)

  36. arXiv:2205.11469  [pdf]

    cs.LG eess.SY

    Advanced Transient Diagnostic with Ensemble Digital Twin Modeling

    Authors: Edward Chen, Linyu Lin, Nam T. Dinh

    Abstract: The use of machine learning (ML) models as digital twins for reduced-order modeling (ROM) in lieu of system codes has gained traction over the past few years. However, due to the complex and non-linear nature of nuclear reactor transients as well as the large range of tasks required, it is infeasible for a single ML model to generalize across all tasks. In this paper, we incorporate issue specific d… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: 9 pages, 4 figures, 3 tables, presented in the American Nuclear Society Mathematics and Computation 2021 Annual Conference

    MSC Class: 93C57

  37. arXiv:2205.11015  [pdf, other]

    cs.IT

    Practical Considerations in Repairing Reed-Solomon Codes

    Authors: Thi Xinh Dinh, Luu Y Nhi Nguyen, Lakshmi J. Mohan, Serdar Boztas, Tran Thi Luong, Son Hoang Dau

    Abstract: The issue of repairing Reed-Solomon codes currently employed in industry has been sporadically discussed in the literature. In this work we carry out a systematic study of these codes and investigate important aspects of repairing them under the trace repair framework, including which evaluation points to select and how to implement a trace repair scheme efficiently. In particular, we employ diffe… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

    Comments: 6 pages, accepted to the IEEE International Symposium on Information Theory

    MSC Class: 94B05; 94B60 ACM Class: E.4

  38. arXiv:2205.06941  [pdf, ps, other]

    cs.DC cs.DB cs.PF

    Blockchain Goes Green? Part II: Characterizing the Performance and Cost of Blockchains on the Cloud and at the Edge

    Authors: Dumitrel Loghin, Tien Tuan Anh Dinh, Aung Maw, Chen Gang, Yong Meng Teo, Beng Chin Ooi

    Abstract: While state-of-the-art permissioned blockchains can achieve thousands of transactions per second on commodity hardware with x86/64 architecture, their performance when running on different architectures is not clear. The goal of this work is to characterize the performance and cost of permissioned blockchains on different hardware systems, which is important as diverse application domains are adop… ▽ More

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: 13 pages, 10 figures, 3 tables

  39. arXiv:2205.05611  [pdf, other]

    cs.CR cs.LG

    Blockchain-based Secure Client Selection in Federated Learning

    Authors: Truc Nguyen, Phuc Thai, Tre' R. Jeter, Thang N. Dinh, My T. Thai

    Abstract: Despite the great potential of Federated Learning (FL) in large-scale distributed learning, the current system is still subject to several privacy issues due to the fact that local models trained by clients are exposed to the central server. Consequently, secure aggregation protocols for FL have been developed to conceal the local models from the server. However, we show that, by manipulating the… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: IEEE ICBC 2022

  40. arXiv:2205.05577  [pdf, ps, other]

    cs.IT eess.SP

    Channel Estimation in RIS-assisted Downlink Massive MIMO: A Learning-Based Approach

    Authors: Tung T. Vu, Trinh Van Chien, Canh T. Dinh, Hien Quoc Ngo, Michail Matthaiou

    Abstract: For downlink massive multiple-input multiple-output (MIMO) operating in time-division duplex protocol, users can decode the signals effectively by only utilizing the channel statistics as long as channel hardening holds. However, in a reconfigurable intelligent surface (RIS)-assisted massive MIMO system, the propagation channels may be less hardened due to the extra random fluctuations of the effe…

    Submitted 15 May, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

    Comments: accepted to appear in IEEE SPAWC'22, Oulu, Finland

  41. arXiv:2205.05004  [pdf, other]

    quant-ph cs.DS

    FastHare: Fast Hamiltonian Reduction for Large-scale Quantum Annealing

    Authors: Phuc Thai, My T. Thai, Tam Vu, Thang N. Dinh

    Abstract: Quantum annealing (QA) that encodes optimization problems into Hamiltonians remains the only near-term quantum computing paradigm that provides sufficient many qubits for real-world applications. To fit larger optimization instances on existing quantum annealers, reducing Hamiltonians into smaller equivalent Hamiltonians provides a promising approach. Unfortunately, existing reduction techniques a…

    Submitted 10 May, 2022; originally announced May 2022.

  42. arXiv:2205.00185  [pdf, other]

    cs.CR

    Protecting the Integrity of IoT Sensor Data and Firmware With A Feather-Light Blockchain Infrastructure

    Authors: Daniel Reijsbergen, Aung Maw, Sarad Venugopalan, Dianshi Yang, Tien Tuan Anh Dinh, Jianying Zhou

    Abstract: Smart cities deploy large numbers of sensors and collect a tremendous amount of data from them. For example, Advanced Metering Infrastructures (AMIs), which consist of physical meters that collect usage data about public utilities such as power and water, are an important building block in a smart city. In a typical sensor network, the measurement devices are connected through a computer network,…

    Submitted 30 April, 2022; originally announced May 2022.

  43. arXiv:2203.01746  [pdf, other]

    cs.SI

    SaPHyRa: A Learning Theory Approach to Ranking Nodes in Large Networks

    Authors: Phuc Thai, My T. Thai, Tam Vu, Thang N. Dinh

    Abstract: Ranking nodes based on their centrality stands a fundamental, yet, challenging problem in large-scale networks. Approximate methods can quickly estimate nodes' centrality and identify the most central nodes, but the ranking for the majority of remaining nodes may be meaningless. For example, ranking for less-known websites in search queries is known to be noisy and unstable. To this end, we invest…

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: To appear in IEEE ICDE'22

  44. arXiv:2202.04345  [pdf, other]

    cs.CR

    Securing Smart Grids Through an Incentive Mechanism for Blockchain-Based Data Sharing

    Authors: Daniel Reijsbergen, Aung Maw, Tien Tuan Anh Dinh, Wen-Tai Li, Chau Yuen

    Abstract: Smart grids leverage the data collected from smart meters to make important operational decisions. However, they are vulnerable to False Data Injection (FDI) attacks in which an attacker manipulates meter data to disrupt the grid operations. Existing works on FDI are based on a simple threat model in which a single grid operator has access to all the data, and only some meters can be compromised.…

    Submitted 9 February, 2022; originally announced February 2022.

  45. Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques

    Authors: Tu Anh Dinh, Danni Liu, Jan Niehues

    Abstract: Recently, end-to-end speech translation (ST) has gained significant attention as it avoids error propagation. However, the approach suffers from data scarcity. It heavily depends on direct ST data and is less efficient in making use of speech transcription and text translation data, which is often more easily available. In the related field of multilingual text translation, several techniques have…

    Submitted 26 January, 2022; originally announced January 2022.

    Comments: 6 pages, 5 figures, accepted to IEEE ICASSP 2022. arXiv admin note: text overlap with arXiv:2107.06010

    ACM Class: I.2.7

    Journal ref: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 6222-6226

  46. arXiv:2201.02692  [pdf, other]

    cs.LG cs.AI

    Improved Input Reprogramming for GAN Conditioning

    Authors: Tuan Dinh, Daewon Seo, Zhixu Du, Liang Shang, Kangwook Lee

    Abstract: We study the GAN conditioning problem, whose goal is to convert a pretrained unconditional GAN into a conditional GAN using labeled data. We first identify and analyze three approaches to this problem -- conditional GAN training from scratch, fine-tuning, and input reprogramming. Our analysis reveals that when the amount of labeled data is small, input reprogramming performs the best. Motivated by…

    Submitted 7 February, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

    Comments: 24 pages, 7 figures

  47. arXiv:2112.01398  [pdf, other]

    cs.CV

    TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation

    Authors: Tan M. Dinh, Rang Nguyen, Binh-Son Hua

    Abstract: In this paper, we conduct a study on the state-of-the-art methods for text-to-image synthesis and propose a framework to evaluate these methods. We consider syntheses where an image contains a single or multiple objects. Our study outlines several issues in the current evaluation pipeline: (i) for image quality assessment, a commonly used metric, e.g., Inception Score (IS), is often either miscali…

    Submitted 19 July, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: Accepted to ECCV 2022; TISE toolbox is available at https://github.com/VinAIResearch/tise-toolbox

  48. arXiv:2112.00719  [pdf, other]

    cs.CV

    HyperInverter: Improving StyleGAN Inversion via Hypernetwork

    Authors: Tan M. Dinh, Anh Tuan Tran, Rang Nguyen, Binh-Son Hua

    Abstract: Real-world image manipulation has achieved fantastic progress in recent years as a result of the exploration and utilization of GAN latent spaces. GAN inversion is the first step in this pipeline, which aims to map the real image to the latent code faithfully. Unfortunately, the majority of existing GAN inversion methods fail to meet at least one of the three requirements listed below: high recons…

    Submitted 4 April, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: Accepted to CVPR 2022; Project page is located at https://di-mi-ta.github.io/HyperInverter/

  49. arXiv:2111.11604  [pdf, other]

    cs.CV

    Simultaneous face detection and 360 degree headpose estimation

    Authors: Hoang Nguyen Viet, Linh Nguyen Viet, Tuan Nguyen Dinh, Duc Tran Minh, Long Tran Quoc

    Abstract: With many practical applications in human life, including manufacturing surveillance cameras, analyzing and processing customer behavior, many researchers are noticing face detection and head pose estimation on digital images. A large number of proposed deep learning models have state-of-the-art accuracy such as YOLO, SSD, MTCNN, solving the problem of face detection or HopeNet, FSA-Net, RankPose…

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: Accepted at The 13th International Conference on Knowledge and Systems Engineering (KSE 2021), 7 pages, 2 figures, 3 tables

  50. arXiv:2111.07039  [pdf, other]

    cs.CV cs.AI cs.HC

    UET-Headpose: A sensor-based top-view head pose dataset

    Authors: Linh Nguyen Viet, Tuan Nguyen Dinh, Hoang Nguyen Viet, Duc Tran Minh, Long Tran Quoc

    Abstract: Head pose estimation is a challenging task that aims to solve problems related to predicting three dimensions vector, that serves for many applications in human-robot interaction or customer behavior. Previous researches have proposed some precise methods for collecting head pose data. But those methods require either expensive devices like depth cameras or complex laboratory environment setup. In…

    Submitted 12 November, 2021; originally announced November 2021.