BLARK for multi-dialect languages: towards the Kurdish BLARK
H Hassani - Language Resources and Evaluation, 2018 - Springer
In this paper we introduce the Kurdish BLARK (Basic Language Resource Kit). The original
BLARK has not considered multi-dialect characteristics and generally has targeted …
Sorani Kurdish versus Kurmanji Kurdish: an empirical comparison
KS Esmaili, S Salavati - Proceedings of the 51st Annual Meeting of …, 2013 - aclanthology.org
Resource scarcity along with diversity–both in dialect and script–are the two primary
challenges in Kurdish language processing. In this paper we aim at addressing these two …
Tharwa: A Large Scale Dialectal Arabic-Standard Arabic-English Lexicon.
We introduce an electronic three-way lexicon, Tharwa, comprising Dialectal Arabic, Modern
Standard Arabic and English correspondents. The paper focuses on Egyptian Arabic as the …
KLPT–Kurdish language processing toolkit
S Ahmadi - Proceedings of second workshop for NLP open …, 2020 - aclanthology.org
Despite the recent advances in applying language-independent approaches to various
natural language processing tasks thanks to artificial intelligence, some language-specific …
Developing language technology tools and resources for a resource-poor language: Sindhi
R Motlani - Proceedings of the NAACL Student Research …, 2016 - aclanthology.org
Sindhi, an Indo-Aryan language with more than 75 million native speakers, is a
resource-poor language in terms of the availability of language technology tools and …
The BLARK Matrix and its relation to the language resources situation for the Celtic languages
D Prys - Strategies for developing machine translation for …, 2006 - mt-archive.net
BLARK (Basic Language Resource Kit) was originally developed as a concept to
specify the minimum corpora, tools, and skills needed to engage in pre-competitive research …
The MADAR Arabic dialect corpus and lexicon
In this paper, we present two resources that were created as part of the Multi Arabic Dialect
Applications and Resources (MADAR) project. The first is a large parallel corpus of 25 …
Towards electronic lexicography for the Kurdish language
This paper describes the development of lexicographic resources for Kurdish and provides a
lexical model for this language. Kurdish is considered a less-resourced language, and …
A syntactically annotated corpus of Tibetan
A Wagner, B Zeisler - … of the Fourth International Conference on …, 2004 - aclanthology.org
This paper describes the creation of a syntactically annotated Tibetan corpus. This corpus
forms a part of the TUSNELDA collection of corpora and databases for linguistic research. It …
TULAP-An Accessible and Sustainable Platform for Turkish Natural Language Processing Resources
S Uskudarli, M Şen, F Akkurt, M Gürbüz… - Proceedings of the …, 2023 - aclanthology.org
Access to natural language processing resources is essential for their continuous
improvement. This can be especially challenging in educational institutions where the …