Sakib Ahamed

zsxkib

Born too late to explore the earth. Born too early to explore the universe. Born just in time for the AI uprising.

Replicate

Edinburgh

Languages

Python 93%, Rust 3%, TypeScript 3%

Repos

167

Stars

422

Forks

131

Top Language

Python

Top Repositories

AICoverGen

A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.

47 stars · Python

cog-flux-dev-inpainting

🎨 Fill in masked parts of images with FLUX.1-dev 🖌️

44 stars · Python

InstantID

Replicate Repo for InstantID : Instant Faceswap AI Avatars in Seconds 🔥

42 stars · Python

playground-v2-1024px-aesthetic

Playground v2 is a diffusion-based text-to-image generative model. The model was trained from scratch by the research team at Playground.

39 stars · Python

voice-cloning-create-dataset

Create your own RVC v2 dataset from a youtube video

31 stars · Python

cog-MeiGenAI-InfiniteTalk

🎭Cogified version of MeiGen-AI/InfiniteTalk Unlimited-length talking video generation that supports image-to-video and video-to-video generation🗣️

21 stars · Python

Repositories

167

zsxkib/cog-comfyui-hunyuan-video

No description provided.

Python187Updated 1 year ago

zsxkib/cog-Framepack

🖼️Cogified implementation of FramePack: video diffusion, but feels like image diffusion

Python · 4 stars · 1 fork · Updated 10 months ago

zsxkib/cog-MeiGenAI-InfiniteTalk

🎭Cogified version of MeiGen-AI/InfiniteTalk Unlimited-length talking video generation that supports image-to-video and video-to-video generation🗣️

Python · 21 stars · 7 forks · Updated 6 months ago

zsxkib/cog-nvidia-canary-qwen-2.5b

🙊Cogified speech-to-text model nvidia/canary-qwen-2.5b (best ASR model according to hf-audio/open_asr_leaderboard as of 18/Jul/2025)🎙️

Python194Updated 7 months ago

zsxkib/cog-flux-dev-inpainting

🎨 Fill in masked parts of images with FLUX.1-dev 🖌️

Python · 44 stars · 6 forks · Updated 1 year ago

zsxkib/cog-comfyui-wan-lora-trainer

A ComfyUI based Wan (video generation) LoRa Trainer

Python · 9 stars · 1 fork · Updated 1 year ago

zsxkib/cog-comfyui-wan-with-lora (Fork)

No description provided.

Python · 2 stars · 0 forks · Updated 1 year ago

zsxkib/cog-create-video-dataset

Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning

Python143Updated 11 months ago

zsxkib/cog-comfyui (Fork)

Run ComfyUI with an API

Python · 3 stars · 4 forks · Updated 8 months ago

zsxkib/cog-mmaudio

Replicate Cog'ified MMAudio

Python182Updated 11 months ago

zsxkib/qwen-image-macos

🎨 Native AI image generation for Apple Silicon with Qwen-Image. Lightning LoRA acceleration for fast 4–8 step runs. Zero Docker, just works.

Python161Updated 6 months ago

ai, apple-silicon, cli-tool, deep-learning, diffusion, image-generation, lightning-lora, m1, m2, m3, machine-learning, macos, mps, python, pytorch, qwen

zsxkib/codex (Fork)

Lightweight coding agent that runs in your terminal

Rust · 0 stars · 0 forks · Updated 3 weeks ago

zsxkib/slack-mcp-server (Fork)

Read-only MCP server for Slack workspace data

TypeScript · 0 stars · 0 forks · Updated 3 weeks ago

zsxkib/AICoverGen (Fork)

A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.

Python · 47 stars · 18 forks · Updated 2 years ago

zsxkib/voice-cloning-create-dataset

Create your own RVC v2 dataset from a youtube video

Python · 31 stars · 19 forks · Updated 2 years ago

zsxkib/cog-aura-sr-v2

AuraSR v2: Second-gen GAN-based Super-Resolution for real-world applications

Python123Updated 1 year ago

zsxkib/voice-cloning-training (Fork)

Voice data <= 10 mins can also be used to train a good VC model!

Python1213Updated 2 years ago

zsxkib/cog-qwen-edit-2509-lora

No description provided.

Python · 2 stars · 1 fork · Updated 4 months ago

zsxkib/cog-google-embeddinggemma-300m

🚀 Google's compact 300M parameter embedding model for production-ready semantic search and text similarity tasks 🎯

Python · 4 stars · 1 fork · Updated 6 months ago

zsxkib/cog-ResembleAI-Chatterbox-Multilingual-TTS

🗣️Generate high-quality multilingual speech from text with reference audio styling, supporting 23 languages

Python112Updated 6 months ago

zsxkib/cog-nvidia-audio-flamingo-3

🎼Cog'd Advancing Audio Intelligence with Fully Open Large Audio-Language Models🎶

Python · 4 stars · 2 forks · Updated 8 months ago

zsxkib/samurai (Fork)

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python · 3 stars · 1 fork · Updated 1 year ago

zsxkib/hololive-style-bert-vits2

🎙️Hololive text-to-speech and voice-to-voice (Japanese🇯🇵 + English🇬🇧)

Python · 5 stars · 1 fork · Updated 1 year ago

zsxkib/playground-v2-1024px-aesthetic

Playground v2 is a diffusion-based text-to-image generative model. The model was trained from scratch by the research team at Playground.

Python · 39 stars · 7 forks · Updated 2 years ago

zsxkib/cog-MultiTalk

🗣️MultiTalk all wrapped in Cog🎙️

Python204Updated 8 months ago

zsxkib/trocr-base-handwritten

🖋️➡️📱Converts handwritten text images into digital text

Python · 4 stars · 0 forks · Updated 2 years ago

ocr

zsxkib/InstantID (Fork)

Replicate Repo for InstantID : Instant Faceswap AI Avatars in Seconds 🔥

Python · 42 stars · 20 forks · Updated 1 year ago

deepfake, instant, zeroshot

zsxkib/TTDS-G35-CW3

TTDS Group Project: Video Games Search Engine. Sakib Ahamed, Dan Buxton, Kenza Amira, Wini Lau, Mansoor Ahmad

Python · 5 stars · 1 fork · Updated 2 years ago

corpora, data-science, neural-ranking-models, pagerank, query, search-engine, technologies, text, text-analysis, text-classification, ttds, web-search

zsxkib/cog-ByteDance-Seed-SeedVR2

Cog wrapper for SeedVR2 (3B/7B) video & image restoration with optional color fix

Python · 2 stars · 2 forks · Updated 4 months ago

bytedance, cog, diffusion-model, image-restoration, pytorch, replicate, seedvr2, video-restoration

zsxkib/cog-Hunyuan-Avatar

🤪Cogified version of Tencent (Hunyuan)'s Open-Source Lip-Sync Model HunyuanVideo-Avatar🫦

Python110Updated 9 months ago
