GitHunt — Discover GitHub Repositories

6,924 results for “topic:transformers”

microsoft/generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI

Jupyter Notebook107.7k57.7kUpdated just now

aiazurechatgptdall-egenerative-ai+9

rasbt/LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook87.4k13.3kUpdated just now

aiartificial-intelligencechatbotchatgptdeep-learning+11

hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python68.0k8.3kUpdated just now

agentaideepseekfine-tuninggemma+15

labmlai/annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Python65.9k6.6kUpdated just now

attentiondeep-learningdeep-learning-tutorialganliterate-programming+8

deepset-ai/haystack

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

MDX24.4k2.6kUpdated 2 hours ago

agentagentsaigeminigenerative-ai+15

amusi/CVPR2026-Papers-with-Code

CVPR 2026 论文和开源项目合集

22.0k2.8kUpdated just now

computer-visioncvprcvpr2020cvpr2021cvpr2022+15

huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python20.7k2.2kUpdated 5 hours ago

adapterdiffusionfine-tuningllmlora+5

arc53/DocsGPT

Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.

Python17.7k2.0kUpdated 12 hours ago

agent-builderagentsaichatgptdocsgpt+14

stas00/ml-engineering

Machine Learning Engineering Open Book

Python17.3k1.1kUpdated just now

aidebugginggpusinferencelarge-language-models+11

NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

Python15.5k3.7kUpdated just now

large-language-modelsmodel-paratransformers

huggingface/transformers.js

State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!

JavaScript15.5k1.1kUpdated 9 hours ago

browserjavascripttransformerswebml

BlinkDL/RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

Python14.4k993Updated 4 hours ago

attention-mechanismchatgptdeep-learninggptgpt-2+9

PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

Python12.9k3.1kUpdated 16 hours ago

bertcompressiondistributed-trainingdocument-intelligenceembedding+14

neuml/txtai

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

Python12.3k787Updated 6 hours ago

agentsaiai-agentsembeddingsinformation-retrieval+15

NielsRogge/Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook11.5k1.7kUpdated 3 hours ago

bertgpt-2layoutlmpytorchtransformers+1

qubvel-org/segmentation_models.pytorch

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python11.4k1.8kUpdated 3 hours ago

computer-visiondeeplab-v3-plusdeeplabv3dptfpn+15

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

Python11.3k1.7kUpdated 11 hours ago

asraudioaudio-processingdeep-learninghuggingface+15

huggingface/tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust10.5k1.0kUpdated 7 hours ago

bertgptlanguage-modelnatural-language-processingnatural-language-understanding+2

openvinotoolkit/openvino

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

C++9.8k3.1kUpdated 6 hours ago

aicomputer-visiondeep-learningdeploy-aidiffusion-models+14

niedev/RTranslator

Open source real-time translation app for Android that runs locally

C++9.7k874Updated 2 hours ago

androidandroid-appbluetooth-lemobile-appnllb+9

OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python9.1k887Updated just now

large-language-modelsopenai-o1proximal-policy-optimizationraylibreinforcement-learning+3

intel/ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.

Python8.7k1.4kUpdated 1 hour ago

gpullmpytorchtransformers

FoundationVision/VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Jupyter Notebook8.6k561Updated 1 hour ago

auto-regressive-modelautoregressive-modelsdiffusion-modelsgenerative-aigenerative-model+7

EleutherAI/gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Python8.3k963Updated 4 days ago

gptgpt-2gpt-3language-modeltransformers

jessevig/bertviz

BertViz: Visualize Attention in Transformer Models

Python7.9k869Updated 1 day ago

bertgpt2machine-learningnatural-language-processingneural-network+6

MaartenGr/BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python7.4k881Updated 10 hours ago

bertldavismachine-learningnlpsentence-embeddings+5

EleutherAI/gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python7.4k1.1kUpdated just now

deepspeed-librarygpt-3language-modeltransformers

microsoft/presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Python7.1k954Updated 3 hours ago

anonymizationdata-anonymizationdata-maskingdata-obfuscationdata-privacy+15

SkalskiP/courses

This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

Python6.4k589Updated 1 day ago

computer-visiondeep-learningdeep-neural-networksgenerative-modelmachine-learning+7

Blaizzy/mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Python6.2k476Updated just now

apple-siliconaudio-processingmlxmultimodalspeech-recognition+4

Page 1 of 34