"topic:gradio" — Search

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

Python6.4k689Updated 1 hour ago

audiobookfaster-whispergradiokaraokepodcastsspeech-recognitionspeech-synthesisspeech-to-textsubtitlestext-to-speechtranscriptiontranslatorttsvoice-cloningvoice-conversionwebuiwhisperwhisperxyt-dlp

modelscope/FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

Python5.4k654Updated 3 hours ago

gradiogradio-python-llmllmspeech-recognitionspeech-to-textsubtitles-generatorvideo-clipvideo-subtitles

ant-research/MagicQuill

[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python3.7k382Updated 11 hours ago

aigcgradioimage-editingmllm

OpenGVLab/Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python3.3k268Updated 20 hours ago

big-modelcaptioning-videoschatchatgptfoundation-modelsgradiolangchainlarge-language-modelslarge-modelstablelmvideovideo-question-answeringvideo-understanding

OpenGVLab/InternGPT

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)

Python3.2k231Updated 4 days ago

chatgptclickdragganfoundation-modelgptgpt-4gradiohuskyimage-captioningimagebindinternimagelangchainllamallmmultimodalsamsegment-anythingvicunavideo-generationvqa

rsxdalv/TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

TypeScript3.0k305Updated 9 hours ago

ace-stepaiaudio-generationcosyvoicegenerative-aigeneratorgradiomusicmusicgenopenai-apiopenvoicervcstyletts2text-to-speechtortoise-ttsttsvocos

jhj0517/Whisper-WebUI

A Web UI for easy subtitle using whisper model.

Python2.7k396Updated 12 hours ago

aigradioopen-sourcepythonpytorchweb-uiwhisper

GiovanniPasq/agentic-rag-for-dummies

A modular Agentic RAG built with LangGraph — learn Retrieval-Augmented Generation Agents in minutes.

Jupyter Notebook2.7k373Updated 1 hour ago

agentagentic-aiagentic-ragagentsai-agentsbm25generative-aigradiolangchainlanggraphllmollamaqdrantragrag-agentsrag-chatbotrag-pipelineretrieval-augmented-generationretrieval-augmented-generation-rag

om-ai-lab/OmAgent

[EMNLP-2024] Build multimodal language agents for fast prototype and production

Python2.6k286Updated 19 hours ago

agentchatbotgeminigptgpt4gradiolanguage-agentlarge-language-modelsllamallavallmmultimodalmultimodal-agentopenaipythonragsmart-hardwarevision-and-languagevlmworkflow

camenduru/text-generation-webui-colab

A colab gradio web UI for running Large Language Models

Jupyter Notebook2.1k362Updated 1 week ago

alpacacolabcolab-notebookcolaboratorygradiokoalalamallamallamasllmvicuna

Mukosame/Anime2Sketch

A sketch extractor for anime/illustration.

Python2.1k175Updated 17 hours ago

animecomiccomputer-visiondeep-learninggangansgenerative-adversarial-networkgradioimage-generationmangapytorchsketchwacv

rupeshs/fastsdcpu

Fast stable diffusion on CPU and AI PC

Python2.0k193Updated 2 hours ago

aipcapiclicpudesktopguidiffusersdiffusionfastsdcpufluxgradiolatentconsistencymodelslcmdiffusionopenvinoqtsdupcalesdxlturbosdxsstablediffusiontorchwebui

BrokenSource/DepthFlow

🌊 Images to 3D Parallax effect video

Python1.3k102Updated 7 hours ago

computer-visiondepth-mapdepth-mapsdepth-predictiondepthflowdepthygradioimage-parallaximage-to-videoimmersityimmersityaileiapixmonocular-depthmonocular-depth-estimationparallaxparallax-effectshaderflowshadertoy

kabachuha/sd-webui-text2video

Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies

Python1.3k115Updated 4 days ago

automatic1111extensiongradiomodelscopestable-diffusiontext2videovideocrafterwebui

Uminosachi/sd-webui-inpaint-anything

Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

Python1.3k119Updated 17 hours ago

ai-artanythingdiffusersdiffusionextensiongenerative-artgradiohuggingfacehuggingface-diffusersimage-generationimage2imageimg2imginpaintinpaint-anythinginpaintinglatent-diffusionsegmentsegment-anythingsegmentationstable-diffusion

Vincentqyw/image-matching-webui

🤗 image matching webui

Python1.2k110Updated 21 hours ago

aspanformerdeep-learningfeature-matchinggradioimage-matchingkeypoint-matchingkornialightglueloftrpose-estimationsiftsupergluesuperpointtopicfmvisual-localization

Woolverine94/biniou

a self-hosted webui for 30+ generative ai

Python1.1k130Updated 8 hours ago

animatediffaudiocraftbarkcontrolnetdiffusersfluxgenerative-aigfpgangradiohuggingfaceinsightfaceip-adapterkandinskyllama-cpp-pythonphotomakerreal-esrganstable-diffusionstable-diffusion-3-5webuiwhisper

viddexa/autollmArchived

Ship RAG based LLM web apps in seconds.

Python1.0k98Updated 5 days ago

anthropicbedrockcoherefastapigradiolangchainlarge-language-modelsllama-indexllama2llmopenaipalmpypipythonretrieval-augmented-generationvector-databasevertex-ai

ThereforeGames/unprompted

Templating language written for Stable Diffusion workflows. Available as an extension for the Automatic1111 WebUI.

Python80871Updated 1 week ago

a1111-stable-diffusion-webuiai-artdeep-learninggptgradioimg2imgpythonshortcodestable-diffusiontemplate-enginetext2imagetxt2imgwildcards

datvodinh/rag-chatbot

Chat with multiple PDFs locally

Python636103Updated 1 week ago

chatbotchatbot-uichatbotsgradiollama-indexllama3llmmistralollamaquestion-answeringrag

SanshruthR/CCTV_YOLO

Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826 https://x.com/githubprojects/status/1891370506537910724 https://www.threads.net/@githubprojects/post/DGKdoE4zdUX

Python58458Updated 3 weeks ago

cctvcctv-surveillancegradiorealtimeyoloyolov5

OpenGVLab/Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

Python55839Updated 2 weeks ago

chatchatbotchatgptgradiolarge-language-modelsllmsmulti-modalityvision-language-modelvqa

MOVIBALE/Lumina-Layers

A physics-based multi-material FDM color system. Converts images to full-color 3D prints via calibrated light-transmission mixing. Supports 2-8 color systems with slicer-ready 3MF export (BambuStudio/OrcaSlicer).

Python530102Updated 14 hours ago

3d-printing3mfbambulabcalibrationcolor-mixingfdmgradiolutmulti-materialorcaslicerpythonslicer

jhj0517/AdvancedLivePortrait-WebUI

gradio WebUI for AdvancedLivePortrait

Python53053Updated 11 hours ago

advancedliveportraitaideeplearningfacial-recognitiongradioliveportraitopen-sourcepythontorchwebui

Page 1 of 34