A curated list of pretrained sentence and word embedding models
-
Updated
Apr 23, 2021 - Python
A curated list of pretrained sentence and word embedding models
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Multilingual Voice Understanding Model
“
Text2Text: Crosslingual NLP/G toolkit
AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
Cross-Lingual Machine Reading Comprehension (EMNLP 2019)
Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework
This repo contains the code for the paper Neural Factor Graph Models for Cross-lingual Morphological Tagging.
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask prompting.
Deep-learning system proposed by HFL for SemEval-2022 Task 8: Multilingual News Similarity
A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering
Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".
mSimCSE: Multilingual SimCSE
EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization
Attention-Informed Mixed-Language Training for Zero-shot Cross-lingual Task-oriented Dialogue Systems (AAAI-2020)
PyTorch implementation of ACL paper https://arxiv.org/abs/1906.02656
Zero-shot Cross-lingual Task-Oriented Dialogue Systems (EMNLP 2019)
Add a description, image, and links to the cross-lingual topic page so that developers can more easily learn about it.
To associate your repository with the cross-lingual topic, visit your repo's landing page and select "manage topics."