106 results for “topic:gpt-oss”
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
A high-throughput and memory-efficient inference and serving engine for LLMs
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
SGLang is a high-performance serving framework for large language models and multimodal models.
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Supporting OpenAI GPT-OSS, IBM Granite-4, Qwen-3-VL, Gemma-3n, Ministral-3, and more.
A Next-Generation Training Engine Built for Ultra-Large MoE Models
A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. Powered by llama.cpp.
A curated list of resources dedicated to open source GitHub repositories related to ChatGPT, OpenAI API, and Codex
A powerful Zotero AI and MCP plugin with ChatGPT, Gemini 3.1, Claude, Grok, DeepSeek, OpenRouter, Kimi 2.5, GLM 5, SiliconFlow, GPT-oss, Gemma 3, Qwen 3.5
Agent Workstation for Codex CLI + Claude Code — with task scheduler, git worktree & remote control, Tauri
Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
open-source healthcare ai
Visual Builder for AI Workflows and Agents
implement GPT-OSS 20B & 120B C++ inference from scratch on AMD GPUs
Deploy open-source LLMs on AWS in minutes — with OpenAI-compatible APIs and a powerful CLI/SDK toolkit.
A curated list of awesome resources, tools, and tutorials for OpenAI Codex CLI
What does gpt-oss tell us about OpenAI's training data?
GGUF Loader with its Agentic Mode, and floating button, ai Models | Open Source & Offline. Mistral, Deepseek, llama, gemma, qwen
ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture
💡AI finder/explorer. One click @files via a visual filetree and save insights in a notepad. build with Tauri
Discover the Best AI Models for Your PC
A curated list of awesome GPT-OSS resources, tools, tutorials, and projects
A Chrome extension hosts an Ollama UI web server on localhost and other servers, helping you manage models and chat with any open-source model. 🚀💻✨
PHP port for openai/tiktoken (most)
agentsculptor is an experimental AI-powered development agent designed to analyze, refactor, and extend Python projects automatically. It uses an OpenAI-like planner–executor loop on top of a vLLM backend, combining project context analysis, structured tool calls, and iterative refinement. It has only been tested with gpt-oss-120b via vLLM.
Connect Codex-CLI to third-party LLM servers like LM Studio and Ollama
MCP server that connects Claude to local Ollama models, delegating simple tasks to save tokens for complex reasoning
Sample application generated using Opencode and Ollama