6,924 results for “topic:transformers”
21 Lessons, Get Started Building with Generative AI
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
CVPR 2026 论文和开源项目合集
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
Machine Learning Engineering Open Book
Ongoing research training transformer models at scale
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
This repository contains demos I made with the Transformers library by HuggingFace.
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
A PyTorch-based Speech Toolkit
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Open source real-time translation app for Android that runs locally
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
BertViz: Visualize Attention in Transformer Models
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.