(Translated by https://www.hiragana.jp/)
jzhang38 (Zhang Peiyuan) / Starred · GitHub
Skip to content
View jzhang38's full-sized avatar

Highlights

  • Pro

Block or report jzhang38

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The best OSS video generation models

Python 1,627 159 Updated Nov 1, 2024

[Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models

Python 2 Updated Oct 2, 2024

reckoning, judgement, apocalypse

Verilog 2 1 Updated Dec 11, 2020

An open-source parameterizable NPU generator with full-stack multi-target compilation stack for intelligent workloads.

Python 26 2 Updated Mar 28, 2024

An evaluation suite for Retrieval-Augmented Generation (RAG).

Python 11 2 Updated Oct 14, 2024

Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".

Python 127 11 Updated Oct 31, 2024

Kolors Team

Python 3,806 261 Updated Sep 4, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 26,000 5,355 Updated Nov 3, 2024

🔥🔥MLVU: Multi-task Long Video Understanding Benchmark

Python 153 Updated Nov 3, 2024
Jupyter Notebook 41 3 Updated Jun 13, 2024

An open-source implementation for training LLaVA-NeXT.

Python 340 17 Updated Oct 23, 2024

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

Python 211 15 Updated Apr 22, 2024

Core ROS packages

Python 2,821 779 Updated Feb 20, 2024

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 346 22 Updated Nov 1, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,858 4,541 Updated Nov 2, 2024

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 669 44 Updated Oct 24, 2024
Python 18 1 Updated May 30, 2024

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 640 46 Updated Sep 27, 2024

[ICML 2024] CLLMs: Consistency Large Language Models

Python 348 17 Updated Aug 1, 2024

Training LLMs with QLoRA + FSDP

Jupyter Notebook 1,418 188 Updated Nov 2, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,831 142 Updated Nov 2, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 29,540 4,437 Updated Nov 3, 2024

Some preliminary explorations of Mamba's context scaling.

Python 190 10 Updated Feb 8, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,092 450 Updated Oct 10, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,872 475 Updated Nov 3, 2024

[ACL 2024] Progressive LLaMA with Block Expansion.

Python 479 35 Updated May 20, 2024

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Python 350 24 Updated Oct 14, 2024

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Python 201 9 Updated Oct 7, 2024

Robust recipes to align language models with human and AI preferences

Python 4,643 405 Updated Oct 7, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,821 460 Updated May 3, 2024
Next