18 results for “topic:token-compression”
🦞 LLM Token Compression & Reduction Tool — cuts AI agent token costs by up to 97% with six-layer deterministic context compression for AI agent workspaces; no LLM required. Prompt compression, context-window optimization, and cost reduction for any LLM pipeline.
[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198
📚 Collection of token-level model compression resources.
The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs
Token-Oriented Object Notation - A compact data format for reducing token consumption when sending structured data to LLMs (PHP implementation)
Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"
[CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding
[ICLR 2026 Oral] FlashVID: Efficient Video Large Language Models via Training-free Tree-based Spatiotemporal Token Merging
[ICLR 2026] Official code repository for "⚡️VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration"
[ICLR 2026] MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding
😎 Awesome papers on token redundancy reduction
This repo integrates DyCoke's token compression method with VLMs such as Gemma3 and InternVL3
[ICLR 2026] Official code of PPE: Positional Preservation Embedding for Token Compression in Multimodal Large Language Models.
Official implementation of TCSVT 2025 paper: DiViCo: Disentangled Visual Token Compression For Efficient Large Vision-Language Model
Hardened Docker container & Compose setup for openclaw
🛠️ A PHP implementation of TOON for efficient serialization of JSON-like data, optimized for parsing by Large Language Models while maintaining clarity and structure.
Compress React/Next.js files by ~40% for AI assistants. MCP server + encoder.
Maximum meaning, minimum tokens. Rust-based markdown compression for LLM workflows.