2,524 results for “topic:gradio”
Stable Diffusion web UI
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
⚡️HivisionIDPhotos: a lightweight and efficient AI tool and algorithm for making ID photos.
Generate audiobooks from e-books, voice cloning & 1158+ languages!
stable diffusion webui colab
Easy Docker setup for Stable Diffusion with user-friendly UI
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Open-source, accurate, and easy-to-use video speech recognition and clipping tool with integrated LLM-based AI clipping.
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
A Web UI for easy subtitle generation using the Whisper model.
A modular Agentic RAG built with LangGraph — learn Retrieval-Augmented Generation Agents in minutes.
[EMNLP-2024] Build multimodal language agents for fast prototyping and production
A colab gradio web UI for running Large Language Models
A sketch extractor for anime/illustration.
Fast stable diffusion on CPU and AI PC
🌊 Images to 3D Parallax effect video
Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
🤗 image matching webui
a self-hosted webui for 30+ generative ai
Ship RAG based LLM web apps in seconds.
Templating language written for Stable Diffusion workflows. Available as an extension for the Automatic1111 WebUI.
Chat with multiple PDFs locally
Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826 https://x.com/githubprojects/status/1891370506537910724 https://www.threads.net/@githubprojects/post/DGKdoE4zdUX
Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!
A physics-based multi-material FDM color system. Converts images to full-color 3D prints via calibrated light-transmission mixing. Supports 2-8 color systems with slicer-ready 3MF export (BambuStudio/OrcaSlicer).
Gradio WebUI for AdvancedLivePortrait