Universal LLM Deployment Engine with ML Compilation
-
Updated
Aug 15, 2024 - Python
Universal LLM Deployment Engine with ML Compilation
High-performance In-browser LLM Inference Engine
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
FlashInfer: Kernel Library for LLM Serving
TVM Documentation in Chinese Simplified / TVM
AutoKernel
yolort is a runtime stack for yolov5 on specialized accelerators such as tensorrt, libtorch, onnxruntime, tvm and ncnn.
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
TON Foundation invites talent to imagine and realize projects that have the potential to integrate with the daily lives of users.
Open, Modular, Deep Learning Accelerator
Optimizing Mobile Deep Learning on ARM GPU with TVM
Solidity compiler for TVM
A home for the final text of all TVM RFCs.
Add a description, image, and links to the tvm topic page so that developers can more easily learn about it.
To associate your repository with the tvm topic, visit your repo's landing page and select "manage topics."