Search | arXiv e-print repository

Showing 1–19 of 19 results for author: Faisal, F

  1. arXiv:2404.08092  [pdf, ps, other]

    cs.CL cs.AI

    Data-Augmentation-Based Dialectal Adaptation for LLMs

    Authors: Fahim Faisal, Antonios Anastasopoulos

    Abstract: This report presents GMUNLP's participation in the Dialect-Copa shared task at VarDial 2024, which focuses on evaluating the commonsense reasoning capabilities of large language models (LLMs) on South Slavic micro-dialects. The task aims to assess how well LLMs can handle non-standard dialectal varieties, as their performance on standard languages is already well-established. We propose an approac…

    Submitted 11 April, 2024; originally announced April 2024.

  2. arXiv:2403.20088  [pdf, other]

    cs.CL

    An Efficient Approach for Studying Cross-Lingual Transfer in Multilingual Language Models

    Authors: Fahim Faisal, Antonios Anastasopoulos

    Abstract: The capacity and effectiveness of pre-trained multilingual models (MLMs) for zero-shot cross-lingual transfer is well established. However, phenomena of positive or negative transfer, and the effect of language choice still need to be fully understood, especially in the complex setting of massively multilingual LMs. We propose an efficient method to study transfer language influence in ze…

    Submitted 29 March, 2024; originally announced March 2024.

  3. arXiv:2403.11009  [pdf, other]

    cs.CL cs.AI

    DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

    Authors: Fahim Faisal, Orevaoghene Ahia, Aarohi Srivastava, Kabir Ahuja, David Chiang, Yulia Tsvetkov, Antonios Anastasopoulos

    Abstract: Language technologies should be judged on their usefulness in real-world use cases. An often overlooked aspect in natural language processing (NLP) research and evaluation is language variation in the form of non-standard dialects or language varieties (hereafter, varieties). Most NLP benchmarks are limited to standard language varieties. To fill this gap, we propose DIALECTBENCH, the first-ever l…

    Submitted 7 July, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: Equal contribution: Fahim Faisal, Orevaoghene Ahia

  4. arXiv:2310.08078  [pdf, other]

    cs.CL cs.LG

    To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer

    Authors: Md Mushfiqur Rahman, Fardin Ahsan Sakib, Fahim Faisal, Antonios Anastasopoulos

    Abstract: Choosing an appropriate tokenization scheme is often a bottleneck in low-resource cross-lingual transfer. To understand the downstream implications of text representation choices, we perform a comparative analysis on language models having diverse text representation modalities including 2 segmentation-based models (BERT, mBERT), 1 image-based model (PIXEL), and 1 charac…

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted at 3RD MULTILINGUAL REPRESENTATION LEARNING (MRL) WORKSHOP, 2023

  5. arXiv:2309.00949  [pdf, ps, other]

    cs.CL

    Multilingual Text Representation

    Authors: Fahim Faisal

    Abstract: Modern NLP breakthroughs include large multilingual models capable of performing tasks across more than 100 languages. State-of-the-art language models came a long way, starting from the simple one-hot representation of words capable of performing tasks like natural language understanding, common-sense reasoning, or question-answering, thus capturing both the syntax and semantics of texts. At the…

    Submitted 2 September, 2023; originally announced September 2023.

    Comments: PhD Comprehensive exam report

  6. arXiv:2308.01932  [pdf, other]

    cond-mat.supr-con cs.LG

    Investigation on Machine Learning Based Approaches for Estimating the Critical Temperature of Superconductors

    Authors: Fatin Abrar Shams, Rashed Hasan Ratul, Ahnaf Islam Naf, Syed Shaek Hossain Samir, Mirza Muntasir Nishat, Fahim Faisal, Md. Ashraful Hoque

    Abstract: Superconductors have been among the most fascinating substances, as the fundamental concept of superconductivity as well as the correlation of critical temperature and superconductive materials have been the focus of extensive investigation since their discovery. However, superconductors at normal temperatures have yet to be identified. Additionally, there are still many unknown factors and gaps o…

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: Accepted for publication on IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, ROBOTICS, SIGNAL, AND IMAGE PROCESSING (AIRoSIP 2023)

  7. arXiv:2305.14716  [pdf, other]

    cs.CL

    GlobalBench: A Benchmark for Global Progress in Natural Language Processing

    Authors: Yueqi Song, Catherine Cui, Simran Khanuja, Pengfei Liu, Fahim Faisal, Alissa Ostapenko, Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Yulia Tsvetkov, Antonios Anastasopoulos, Graham Neubig

    Abstract: Despite the major advances in NLP, significant disparities in NLP system performance across languages still exist. Arguably, these are due to uneven resource allocation and sub-optimal incentives to work on less resourced languages. To track and further incentivize the global development of equitable language technology, we introduce GlobalBench. Prior multilingual benchmarks are static and have f…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Preprint, 9 pages

  8. arXiv:2304.12979  [pdf, other]

    cs.CL cs.LG

    GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters

    Authors: Md Mahfuz Ibn Alam, Ruoyu Xie, Fahim Faisal, Antonios Anastasopoulos

    Abstract: This report describes GMU's sentiment analysis system for the SemEval-2023 shared task AfriSenti-SemEval. We participated in all three sub-tasks: Monolingual, Multilingual, and Zero-Shot. Our approach uses models initialized with AfroXLMR-large, a pre-trained multilingual language model trained on African languages and fine-tuned correspondingly. We also introduce augmented training data along wit…

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted at SemEval Workshop at ACL 2023

  9. arXiv:2212.10408  [pdf, other]

    cs.CL

    Geographic and Geopolitical Biases of Language Models

    Authors: Fahim Faisal, Antonios Anastasopoulos

    Abstract: Pretrained language models (PLMs) often fail to fairly represent target users from certain world regions because of the under-representation of those regions in training datasets. With recent PLMs trained on enormous data sources, quantifying their potential biases is difficult, due to their black-box nature and the sheer scale of the data sources. In this work, we devise an approach to study the…

    Submitted 20 December, 2022; originally announced December 2022.

  10. arXiv:2205.09634  [pdf, other]

    cs.CL

    Phylogeny-Inspired Adaptation of Multilingual Models to New Languages

    Authors: Fahim Faisal, Antonios Anastasopoulos

    Abstract: Large pretrained multilingual models, trained on dozens of languages, have delivered promising results due to cross-lingual learning capabilities on a variety of language tasks. Further adapting these models to specific languages, especially ones unseen during pre-training, is an important goal towards expanding the coverage of language technologies. In this study, we show how we can use language ph…

    Submitted 22 November, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: accepted in AACL 2022 Main Conference

  11. Survival Prediction of Children Undergoing Hematopoietic Stem Cell Transplantation Using Different Machine Learning Classifiers by Performing Chi-squared Test and Hyper-parameter Optimization: A Retrospective Analysis

    Authors: Ishrak Jahan Ratul, Ummay Habiba Wani, Mirza Muntasir Nishat, Abdullah Al-Monsur, Abrar Mohammad Ar-Rafi, Fahim Faisal, Mohammad Ridwan Kabir

    Abstract: Bone Marrow Transplant, a gradational rescue for a wide range of disorders emanating from the bone marrow, is an efficacious surgical treatment. Several risk factors, such as post-transplant illnesses, new malignancies, and even organ damage, can impair long-term survival. Therefore, technologies like Machine Learning are deployed for investigating the survival prediction of BMT receivers along wi…

    Submitted 22 January, 2022; originally announced January 2022.

    Comments: 25 pages, 14 figures, 38 tables

    Report number: 9391136; ACM Class: J.3

  12. arXiv:2112.03497  [pdf, other]

    cs.CL

    Dataset Geography: Mapping Language Data to Language Users

    Authors: Fahim Faisal, Yinkai Wang, Antonios Anastasopoulos

    Abstract: As language technologies become more ubiquitous, there are increasing efforts towards expanding the language diversity and coverage of natural language processing (NLP) systems. Arguably, the most important factor influencing the quality of modern NLP systems is data availability. In this work, we study the geographical representativeness of NLP datasets, aiming to quantify if and by how much do N…

    Submitted 23 March, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: ACL 2022

  13. arXiv:2110.08480  [pdf, other]

    cs.AI

    Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network

    Authors: Rafid Ameer Mahmud, Fahim Faisal, Saaduddin Mahmud, Md. Mosaddek Khan

    Abstract: Multi-agent Markov Decision Process (MMDP) has been an effective way of modelling sequential decision making algorithms for multi-agent cooperative environments. A number of algorithms based on centralized and decentralized planning have been developed in this domain. However, dynamically changing environment, coupled with exponential size of the state and joint action space, make it difficult for…

    Submitted 16 October, 2021; originally announced October 2021.

  14. arXiv:2109.12072  [pdf]

    cs.CL

    SD-QA: Spoken Dialectal Question Answering for the Real World

    Authors: Fahim Faisal, Sharlina Keshava, Md Mahfuz ibn Alam, Antonios Anastasopoulos

    Abstract: Question answering (QA) systems are now available through numerous commercial applications for a wide variety of domains, serving millions of users that interact with them via speech interfaces. However, current benchmarks in QA research do not account for the errors that speech recognition models might introduce, nor do they consider the language variations (dialects) of the users. To address thi…

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 Findings

  15. arXiv:2109.12028  [pdf]

    cs.CL

    Investigating Post-pretraining Representation Alignment for Cross-Lingual Question Answering

    Authors: Fahim Faisal, Antonios Anastasopoulos

    Abstract: Human knowledge is collectively encoded in the roughly 6500 languages spoken around the world, but it is not distributed equally across languages. Hence, for information-seeking question answering (QA) systems to adequately serve speakers of all languages, they need to operate cross-lingually. In this work we investigate the capabilities of multilingually pre-trained language models on cross-lingu…

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Accepted at MRQA Workshop 2021

  16. arXiv:2106.08415  [pdf, other]

    cs.SE cs.AI cs.CL cs.LG

    Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors

    Authors: Junayed Mahmud, Fahim Faisal, Raihan Islam Arnob, Antonios Anastasopoulos, Kevin Moran

    Abstract: Automated source code summarization is a popular software engineering research topic wherein machine translation models are employed to "translate" code snippets into relevant natural language descriptions. Most evaluations of such models are conducted using automatic reference-based metrics. However, given the relatively large semantic gap between programming languages and natural language, we ar…

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted to the 2021 NLP4Prog Workshop co-located with The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)

  17. arXiv:1907.09395  [pdf, other]

    cs.SI cs.LG

    Mining Temporal Evolution of Knowledge Graph and Genealogical Features for Literature-based Discovery Prediction

    Authors: Nazim Choudhury, Fahim Faisal, Matloob Khushi

    Abstract: Literature-based knowledge discovery process identifies the important but implicit relations among information embedded in published literature. Existing techniques from Information Retrieval and Natural Language Processing attempt to identify the hidden or unpublished connections between information concepts within published literature, however, these techniques undermine the concept of predictin…

    Submitted 10 November, 2019; v1 submitted 22 July, 2019; originally announced July 2019.

  18. arXiv:1905.01987   

    cs.IR cs.CL

    Disease Identification From Unstructured User Input

    Authors: Fahim Faisal, Shafkat Ahmed Bhuiyan, Abu Raihan Mostofa Kamal

    Abstract: A method to identify probable diseases from the unstructured textual input (e.g., health forum posts) by incorporating a lexicographic and semantic feature based two-phase text classification module and a symptom-disease correlation-based similarity measurement module. One notable aspect of my approach was to develop a competent algorithm to extract all inherent features from the data source to make…

    Submitted 10 May, 2019; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: This was an undergraduate research project. The hypothesis it proposes is based on a small number of samples and thus cannot be declared significant. To declare it significant, testing on a large number of samples is needed. After that, it can be put through

  19. arXiv:1307.3388  [pdf, ps, other]

    cs.CE q-bio.MN

    Dynamic networks reveal key players in aging

    Authors: Fazle Elahi Faisal, Tijana Milenkovic

    Abstract: Motivation: Since susceptibility to diseases increases with age, studying aging gains importance. Analyses of gene expression or sequence data, which have been indispensable for investigating aging, have been limited to studying genes and their protein products in isolation, ignoring their connectivities. However, proteins function by interacting with other proteins, and this is exactly what biolo…

    Submitted 12 July, 2013; originally announced July 2013.