6,330 results for “topic:benchmark”
A command-line benchmarking tool
A MNIST-like fashion product database. Benchmark :point_down:
Powerful .NET library for benchmarking
:metal: awesome-semantic-segmentation
Ohayou(おはよう), HTTP load generator, inspired by rakyll/hey with tui animation.
A microbenchmark support library
Source for the TechEmpower Framework Benchmarks project
OpenMMLab Pose Estimation Toolbox and Benchmark.
Which is the fastest web framework?
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Scriptable database and system performance benchmark
VPS 融合怪服务器测评项目 更推荐使用无环境依赖的Go版本 VPS Fusion Monster Server Test Script – More recommended to use the Go version with no environment dependencies: https://github.com/oneclickvirt/ecs
YABS - a simple bash script to estimate Linux server performance using fio, iperf3, & Geekbench
Benchmarks of approximate nearest neighbor libraries in Python
dperf: High-Performance Network Load Testing Tool Based on DPDK
Statistics-driven benchmarking library for Rust
Across the Great Wall we can reach every corner in the world
Kodezi Chronos is a debugging-first language model that achieves state-of-the-art results on SWE-bench Lite (80.33%) and 67% real-world fix accuracy, over six times better than GPT-4. Built with Adaptive Graph-Guided Retrieval and Persistent Debug Memory. Model available Q1 2026 via Kodezi OS.
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
SWE-bench: Can Language Models Resolve Real-world Github Issues?
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
A tiny boost library in C++11.
Python package for the evaluation of odometry and SLAM
A series of large language models developed by Baichuan Intelligent Technology
Visual Tracking Paper List
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
HTTP(S) benchmark tools, testing/debugging, & restAPI (RESTful)
XcodeBenchmark measures the compilation time of a large codebase on iMac, MacBook, and Mac Pro
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)