Repositories (30)
Dolphin
(Public) The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL 2025.
deer-flow
(Public) An open-source SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skills, and subagents, it handles tasks of varying complexity that can take minutes to hours.
UI-TARS
(Public) Pioneering Automated GUI Interaction with Native Agents
trae-agent
(Public) Trae Agent is an LLM-based agent for general-purpose software engineering tasks.
primus
(Public, Archived)
comfyui-lumi-batcher
(Public) ComfyUI Lumi Batcher is a batch processing extension plugin for ComfyUI that aims to improve workflow debugging efficiency. Traditional debugging requires adjusting parameters one at a time, whereas this tool runs parameter variations in batches.
UI-TARS-desktop
(Public) The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
LatentSync
(Public) Taming Stable Diffusion for Lip Sync!
bhook
(Public) 🔥 ByteHook is an Android PLT hook library that supports armeabi-v7a, arm64-v8a, x86, and x86_64.
bolt
(Public)
DreamID-V
(Public) DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
MegaTTS3
(Public)
PatchEval
(Public) PatchEval: A New Benchmark for Evaluating LLMs on Patching Real-World Vulnerabilities
ABQ-LLM
(Public) An acceleration library that supports arbitrary bit-width combinatorial quantization operations
InfiniStore
(Public) KV cache store for distributed LLM inference
FullStackBench
(Public) Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"
web-bench
(Public) Web-Bench is a benchmark designed to evaluate the performance of LLMs in actual Web development.
Valley
(Public) Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.
OneReward
(Public)
IconPark
(Public, Archived) 🍎 Transform an SVG icon into multiple themes, and generate React icons, Vue icons, and SVG icons
sonic
(Public) A blazingly fast JSON serializing & deserializing library
effective_transformer
(Public, Archived) Running BERT without Padding
fc-clip
(Public, Archived) [NeurIPS 2023] This repo contains the code for our paper "Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP"
flowgram.ai
(Public) FlowGram is an extensible workflow development framework with built-in canvas, form, variable, and materials that helps developers build AI workflow platforms faster and more simply.
pasa
(Public) PaSa is an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant references, to ultimately obtain comprehensive and accurate results for complex scholarly queries.
mockey
(Public) A simple and easy-to-use Go mocking library derived from ByteDance's internal best practices
tarsier
(Public) Tarsier is a family of large-scale video-language models designed to generate high-quality video descriptions, with good general video understanding capability.
monolith
(Public, Archived) A Lightweight Recommendation System
Protenix
(Public) Toward High-Accuracy Open-Source Biomolecular Structure Prediction.
ByteTransformer
(Public) Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052