BLARK for multi-dialect languages: towards the Kurdish BLARK

H Hassani - Language Resources and Evaluation, 2018 - Springer
In this paper we introduce the Kurdish BLARK (Basic Language Resource Kit). The original
BLARK has not considered multi-dialect characteristics and generally has targeted …

[PDF][PDF] Sorani Kurdish versus Kurmanji Kurdish: an empirical comparison

KS Esmaili, S Salavati - Proceedings of the 51st Annual Meeting of …, 2013 - aclanthology.org
Resource scarcity along with diversity–both in dialect and script–are the two primary
challenges in Kurdish language processing. In this paper we aim at addressing these two …

[PDF][PDF] Tharwa: A Large Scale Dialectal Arabic-Standard Arabic-English Lexicon.

MT Diab, M Al-Badrashiny, M Aminian, M Attia… - LREC, 2014 - academia.edu
We introduce an electronic three-way lexicon, Tharwa, comprising Dialectal Arabic, Modern
Standard Arabic and English correspondents. The paper focuses on Egyptian Arabic as the …

KLPT–Kurdish language processing toolkit

S Ahmadi - Proceedings of second workshop for NLP open …, 2020 - aclanthology.org
Despite the recent advances in applying language-independent approaches to various
natural language processing tasks thanks to artificial intelligence, some language-specific …

[PDF][PDF] Developing language technology tools and resources for a resource-poor language: Sindhi

R Motlani - Proceedings of the NAACL Student Research …, 2016 - aclanthology.org
Abstract Sindhi, an Indo-Aryan language with more than 75 million native speakers1 is a
resourcepoor language in terms of the availability of language technology tools and …

[PDF][PDF] The BLARK Matrix and its relation to the language resources situation for the Celtic languages

D Prys - Strategies for developing machine translation for …, 2006 - mt-archive.net
Abstract BLARK (Basic Language Resource Kit) was originally developed as a concept to
specify the minimum corpora, tools, and skills needed to engage in pre-competitive research …

[PDF][PDF] The madar arabic dialect corpus and lexicon

H Bouamor, N Habash, M Salameh… - Proceedings of the …, 2018 - aclanthology.org
In this paper, we present two resources that were created as part of the Multi Arabic Dialect
Applications and Resources (MADAR) project. The first is a large parallel corpus of 25 …

[PDF][PDF] Towards electronic lexicography for the Kurdish language

S Ahmadi, H Hassani… - Proceedings of the sixth …, 2019 - aran.library.nuigalway.ie
This paper describes the development of lexicographic resources for Kurdish and provides a
lexical model for this language. Kurdish is considered a less-resourced language, and …

A syntactically annotated corpus of Tibetan

A Wagner, B Zeisler - … of the Fourth International Conference on …, 2004 - aclanthology.org
This paper describes the creation of a syntactically annotated Tibetan corpus. This corpus
forms a part of the TUSNELDA collection of corpora and databases for linguistic research. It …

TULAP-An Accessible and Sustainable Platform for Turkish Natural Language Processing Resources

S Uskudarli, M Şen, F Akkurt, M Gürbüz… - Proceedings of the …, 2023 - aclanthology.org
Access to natural language processing resources is essential for their continuous
improvement. This can be especially challenging in educational institutions where the …