Stars
Dataset for the Emerging & Novel Entity NER task (WNUT '17)
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
Tesseract Open Source OCR Engine (main repository)
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o.
An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
Firefly:
A high-throughput and memory-efficient inference and serving engine for LLMs
microsoft / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LMOngoing research training transformer language models at scale, including: BERT & GPT-2
[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct
An incremental parsing system for programming tools
MNBVC(Massive Never-ending BT Vast Chinese corpus)