Top Repositories
A Toolkit to Help Optimize ONNX Models
Large Language Model ONNX Inference Framework
Caffe model to ONNX
Caffe to TensorRT
👑 Easy-to-use and powerful NLP library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis and 🖼 Diffusion AIGC system etc.
Rust implementation for a WebNN-oriented graph DSL
Wan: Open and Advanced Large-Scale Video Generative Models
SGLang is a fast serving framework for large language models and vision language models.
Visualizer for neural network, deep learning and machine learning models
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
JAX backend for SGL
Everything in Torch Fx
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Deep Learning tools and applications for NVIDIA AGX platforms.
KataGo benchmark
Machine Learning, Facial Rigger
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Efficient in-memory representation for ONNX, in Python
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
A framework for few-shot evaluation of language models.
A high-throughput and memory-efficient inference and serving engine for LLMs
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android app: [MNN-LLM-Android](./project/android/apps/MnnLlmApp/README.md)
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
ONNX Runtime using PyTorch backend
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!