OpenEvolve
The most advanced open-source evolutionary coding agent
Turn your LLMs into autonomous code optimizers that discover breakthrough algorithms
Quick Start • Examples • Discussions
From random search to state-of-the-art: Watch your code evolve in real-time
Why OpenEvolve?
| Autonomous Discovery | Proven Results | Research Grade |
|---|---|---|
| LLMs don't just optimize; they discover entirely new algorithms. No human guidance needed. | 2-3x speedups on real hardware. State-of-the-art circle packing. Breakthrough optimizations. | Full reproducibility, extensive evaluation pipelines, and scientific rigor built-in. |
OpenEvolve vs Manual Optimization:
| Aspect | Manual Optimization | OpenEvolve |
|---|---|---|
| Time to Solution | Days to weeks | Hours |
| Exploration Breadth | Limited by human creativity | Unlimited LLM creativity |
| Reproducibility | Hard to replicate | Fully deterministic |
| Multi-objective | Complex tradeoffs | Automatic Pareto optimization |
| Scaling | Doesn't scale | Parallel evolution across islands |
Proven Achievements
| Domain | Achievement | Example |
|---|---|---|
| GPU Optimization | 2-3x speedup on Apple Silicon | MLX Metal Kernels |
| Mathematical | State-of-the-art circle packing (n=26) | Circle Packing |
| Algorithm Design | Adaptive sorting algorithms | Rust Adaptive Sort |
| Scientific Computing | Automated filter design | Signal Processing |
| Multi-Language | Python, Rust, R, Metal shaders | All Examples |
Quick Start
Get from zero to evolving code in 30 seconds:
```bash
# Install OpenEvolve
pip install openevolve
# Set your LLM API key (works with any OpenAI-compatible provider)
export OPENAI_API_KEY="your-api-key"
# Run your first evolution!
python -c "
from openevolve import run_evolution
result = run_evolution(
'examples/function_minimization/initial_program.py',
'examples/function_minimization/evaluator.py'
)
print(f'Best score: {result.best_score:.4f}')
"๐ Library Usage
OpenEvolve can be used as a library without any external files:
```python
from openevolve import run_evolution, evolve_function

# Evolution with inline code (no files needed!)
result = run_evolution(
    initial_program='''
def fibonacci(n):
    if n <= 1: return n
    return fibonacci(n-1) + fibonacci(n-2)
''',
    evaluator=lambda path: {"score": benchmark_fib(path)},
    iterations=100
)

# Evolve Python functions directly
def bubble_sort(arr):
    for i in range(len(arr)):
        for j in range(len(arr)-1):
            if arr[j] > arr[j+1]:
                arr[j], arr[j+1] = arr[j+1], arr[j]
    return arr

result = evolve_function(
    bubble_sort,
    test_cases=[([3,1,2], [1,2,3]), ([5,2,8], [2,5,8])],
    iterations=50
)
print(f"Evolved sorting algorithm: {result.best_code}")
```

Want more control? Use the full CLI:
```bash
python openevolve-run.py examples/function_minimization/initial_program.py \
  examples/function_minimization/evaluator.py \
  --config examples/function_minimization/config.yaml \
  --iterations 1000
```

Prefer Docker?
```bash
docker run --rm -v $(pwd):/app ghcr.io/codelion/openevolve:latest \
  examples/function_minimization/initial_program.py \
  examples/function_minimization/evaluator.py --iterations 100
```

See It In Action
Circle Packing: From Random to State-of-the-Art
Watch OpenEvolve discover optimal circle packing in real-time:
| Generation 1 | Generation 190 | Generation 460 (Final) |
|---|---|---|
| Random placement | Learning structure | State-of-the-art result |
Result: Matches published benchmarks for the n=26 circle packing problem.
GPU Kernel Evolution
Before (Baseline):
```metal
// Standard attention implementation
kernel void attention_baseline(/* ... */) {
    // Generic matrix multiplication
    float sum = 0.0;
    for (int i = 0; i < seq_len; i++) {
        sum += query[tid] * key[i];
    }
}
```

After Evolution (2.8x faster):
```metal
// OpenEvolve discovered optimization
kernel void attention_evolved(/* ... */) {
    // Hardware-aware tiling + unified memory optimization
    threadgroup float shared_mem[256];
    // ... evolved algorithm exploiting Apple Silicon architecture
}
```

Performance Impact: 2.8x speedup on Apple M1 Pro, maintaining numerical accuracy.
How OpenEvolve Works
OpenEvolve implements a sophisticated evolutionary coding pipeline that goes far beyond simple optimization:
Core Innovation: MAP-Elites + LLMs
- Quality-Diversity Evolution: Maintains diverse populations across feature dimensions
- Island-Based Architecture: Multiple populations prevent premature convergence
- LLM Ensemble: Multiple models with intelligent fallback strategies
- Artifact Side-Channel: Error feedback improves subsequent generations (see the config sketch below)
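Each of these mechanisms is driven by ordinary config keys. A minimal sketch, using the same keys that appear in the full configuration example further down (the values here are illustrative, not defaults):

```yaml
# Minimal quality-diversity setup (illustrative values)
llm:
  models:                          # weighted ensemble with fallback
    - name: "gemini-2.5-pro"
      weight: 0.6
    - name: "gemini-2.5-flash"
      weight: 0.4

database:
  num_islands: 5                   # parallel populations to avoid premature convergence
  migration_interval: 20           # how often islands exchange programs
  feature_dimensions: ["complexity", "diversity"]   # MAP-Elites grid axes

evaluator:
  enable_artifacts: true           # route execution feedback into later prompts
```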
Advanced Features
Scientific Reproducibility
- Comprehensive Seeding: Every component (LLM, database, evaluation) is seeded
- Default Seed=42: Immediate reproducible results out of the box
- Deterministic Evolution: Exact reproduction of runs across machines
- Component Isolation: Hash-based isolation prevents cross-contamination
Advanced LLM Integration
- Test-Time Compute: Integration with OptiLLM for MoA and enhanced reasoning
- Universal API: Works with OpenAI, Google, local models
- Plugin Ecosystem: Support for OptiLLM plugins (readurls, executecode, z3_solver)
- Intelligent Ensembles: Weighted combinations with sophisticated fallback
Evolution Algorithm Innovations
- Double Selection: Different programs for performance vs inspiration
- Adaptive Feature Dimensions: Custom quality-diversity metrics
- Migration Patterns: Ring topology with controlled gene flow
- Multi-Strategy Sampling: Elite, diverse, and exploratory selection
Perfect For
| Use Case | Why OpenEvolve Excels |
|---|---|
| Performance Optimization | Discovers hardware-specific optimizations humans miss |
| Algorithm Discovery | Finds novel approaches to classic problems |
| Scientific Computing | Automates tedious manual tuning processes |
| Competitive Programming | Generates multiple solution strategies |
| Multi-Objective Problems | Pareto-optimal solutions across dimensions |
Installation & Setup
Requirements
- Python: 3.10+
- LLM Access: Any OpenAI-compatible API
- Optional: Docker for containerized runs
Installation Options
PyPI (Recommended)
```bash
pip install openevolve
```

Development Install
```bash
git clone https://github.com/codelion/openevolve.git
cd openevolve
pip install -e ".[dev]"
```

Docker
```bash
docker pull ghcr.io/codelion/openevolve:latest
```

LLM Provider Setup
OpenEvolve works with any OpenAI-compatible API:
OpenAI (Direct)
```bash
export OPENAI_API_KEY="sk-..."
# Uses OpenAI endpoints by default
```

Google Gemini
```yaml
# config.yaml
llm:
  api_base: "https://generativelanguage.googleapis.com/v1beta/openai/"
  model: "gemini-2.5-pro"
```

```bash
export OPENAI_API_KEY="your-gemini-api-key"
```

Local Models (Ollama/vLLM)
```yaml
# config.yaml
llm:
  api_base: "http://localhost:11434/v1"  # Ollama
  model: "codellama:7b"
```

OptiLLM (Advanced)
For maximum flexibility with rate limiting, model routing, and test-time compute:
```bash
# Install OptiLLM
pip install optillm

# Start OptiLLM proxy
optillm --port 8000

# Point OpenEvolve to OptiLLM
export OPENAI_API_KEY="your-actual-key"
```

```yaml
llm:
  api_base: "http://localhost:8000/v1"
  model: "moa&readurls-o3"  # Test-time compute + web access
```

Examples Gallery
Showcase Projects
| Project | Domain | Achievement | Demo |
|---|---|---|---|
| Function Minimization | Optimization | Random → Simulated Annealing | View Results |
| MLX GPU Kernels | Hardware | 2-3x Apple Silicon speedup | Benchmarks |
| Rust Adaptive Sort | Algorithms | Data-aware sorting | Code Evolution |
| Symbolic Regression | Science | Automated equation discovery | LLM-SRBench |
| Web Scraper + OptiLLM | AI Integration | Test-time compute optimization | Smart Scraping |
Quick Example: Function Minimization
Watch OpenEvolve evolve from random search to sophisticated optimization:
```python
# Initial Program (Random Search)
def minimize_function(func, bounds, max_evals=1000):
    best_x, best_val = None, float('inf')
    for _ in range(max_evals):
        x = random_point_in_bounds(bounds)
        val = func(x)
        if val < best_val:
            best_x, best_val = x, val
    return best_x, best_val
```

↓ Evolution Process ↓
```python
# Evolved Program (Simulated Annealing + Adaptive Cooling)
def minimize_function(func, bounds, max_evals=1000):
    x = random_point_in_bounds(bounds)
    temp = adaptive_initial_temperature(func, bounds)
    for i in range(max_evals):
        neighbor = generate_neighbor(x, temp, bounds)
        delta = func(neighbor) - func(x)
        if delta < 0 or random.random() < exp(-delta/temp):
            x = neighbor
        temp *= adaptive_cooling_rate(i, max_evals)  # Dynamic cooling
    return x, func(x)
```

Performance: 100x improvement in convergence speed!
Advanced Examples
Prompt Evolution
Evolve prompts instead of code for better LLM performance:
```text
# Example: HotpotQA dataset
Initial Prompt: "Answer the question based on the context."

Evolved Prompt: "As an expert analyst, carefully examine the provided context.
Break down complex multi-hop reasoning into clear steps. Cross-reference
information from multiple sources to ensure accuracy. Answer: [question]"

Result: +23% accuracy improvement on HotpotQA benchmark
```

Competitive Programming
Automatic solution generation for programming contests:
```text
# Problem: Find maximum subarray sum
# OpenEvolve discovers multiple approaches:

# Evolution Path 1: Brute Force → Kadane's Algorithm
# Evolution Path 2: Divide & Conquer → Optimized Kadane's
# Evolution Path 3: Dynamic Programming → Space-Optimized DP
```

Configuration
OpenEvolve offers extensive configuration for advanced users:
```yaml
# Advanced Configuration Example
max_iterations: 1000
random_seed: 42                   # Full reproducibility

llm:
  # Ensemble with test-time compute
  models:
    - name: "gemini-2.5-pro"
      weight: 0.6
    - name: "moa&readurls-o3"     # OptiLLM features
      weight: 0.4
  temperature: 0.7

database:
  # MAP-Elites quality-diversity
  population_size: 500
  num_islands: 5                  # Parallel evolution
  migration_interval: 20
  feature_dimensions: ["complexity", "diversity", "performance"]

evaluator:
  enable_artifacts: true          # Error feedback to LLM
  cascade_evaluation: true        # Multi-stage testing
  use_llm_feedback: true          # AI code quality assessment

prompt:
  # Sophisticated inspiration system
  num_top_programs: 3             # Best performers
  num_diverse_programs: 2         # Creative exploration
  include_artifacts: true         # Execution feedback

  # Custom templates
  template_dir: "custom_prompts/"
  use_template_stochasticity: true  # Randomized prompts
```

Feature Engineering
Control how programs are organized in the quality-diversity grid:
```yaml
database:
  feature_dimensions:
    - "complexity"       # Built-in: code length
    - "diversity"        # Built-in: structural diversity
    - "performance"      # Custom: from your evaluator
    - "memory_usage"     # Custom: from your evaluator
  feature_bins:
    complexity: 10       # 10 complexity levels
    performance: 20      # 20 performance buckets
    memory_usage: 15     # 15 memory usage categories
```

Important: Return raw values from your evaluator; OpenEvolve handles binning automatically.
Custom Prompt Templates
Advanced prompt engineering with custom templates:
```yaml
prompt:
  template_dir: "custom_templates/"
  use_template_stochasticity: true
  template_variations:
    greeting:
      - "Let's enhance this code:"
      - "Time to optimize:"
      - "Improving the algorithm:"
```

See prompt examples for complete template customization.
Artifacts & Debugging
The artifacts side-channel provides rich feedback to accelerate evolution:
```python
# Evaluator can return execution context alongside its metrics
from openevolve.evaluation_result import EvaluationResult

return EvaluationResult(
    metrics={"performance": 0.85, "correctness": 1.0},
    artifacts={
        "stderr": "Warning: suboptimal memory access pattern",
        "profiling_data": {...},
        "llm_feedback": "Code is correct but could use better variable names",
        "build_warnings": ["unused variable x"]
    }
)
```

The next generation's prompt automatically includes:
```markdown
## Previous Execution Feedback
Warning: suboptimal memory access pattern
LLM Feedback: Code is correct but could use better variable names
Build Warnings: unused variable x
```

This creates a feedback loop where each generation learns from previous mistakes!
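The loop can be toggled from config; a minimal sketch using the `enable_artifacts` and `include_artifacts` keys shown in the configuration section above:

```yaml
# Minimal sketch: wire evaluator artifacts into the next generation's prompt
evaluator:
  enable_artifacts: true    # evaluator may attach stderr, warnings, profiling data
prompt:
  include_artifacts: true   # surface that feedback in the next mutation prompt
```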
Visualization
Real-time evolution tracking with interactive web interface:
```bash
# Install visualization dependencies
pip install -r scripts/requirements.txt

# Launch interactive visualizer
python scripts/visualizer.py

# Or visualize specific checkpoint
python scripts/visualizer.py --path examples/function_minimization/openevolve_output/checkpoints/checkpoint_100/
```

Features:
- Evolution tree with parent-child relationships
- Performance tracking across generations
- Code diff viewer showing mutations
- MAP-Elites grid visualization
- Multi-metric analysis with custom dimensions
Roadmap
Upcoming Features
- Multi-Modal Evolution: Images, audio, and text simultaneously
- Federated Learning: Distributed evolution across multiple machines
- AutoML Integration: Hyperparameter and architecture evolution
- Benchmark Suite: Standardized evaluation across domains
Research Directions
- Self-Modifying Prompts: Evolution modifies its own prompting strategy
- Cross-Language Evolution: Python → Rust → C++ optimization chains
- Neurosymbolic Reasoning: Combine neural and symbolic approaches
- Human-AI Collaboration: Interactive evolution with human feedback
Want to contribute? Check out our roadmap discussions!
FAQ
How much does it cost to run?
Cost depends on your LLM provider and iterations:
- o3: ~$0.15-0.60 per iteration (depending on code size)
- o3-mini: ~$0.03-0.12 per iteration (more cost-effective)
- Gemini-2.5-Pro: ~$0.08-0.30 per iteration
- Gemini-2.5-Flash: ~$0.01-0.05 per iteration (fastest and cheapest)
- Local models: Nearly free after setup
- OptiLLM: Use cheaper models with test-time compute for better results
Cost-saving tips:
- Start with fewer iterations (100-200)
- Use o3-mini, Gemini-2.5-Flash or local models for exploration
- Use cascade evaluation to filter bad programs early
- Configure smaller population sizes initially (see the sketch below)
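As a rough sketch, those tips translate into config like this (the model choice and sizes are illustrative, not recommended defaults):

```yaml
# Illustrative low-budget run
max_iterations: 150                # start small, scale up once the evaluator is trusted
llm:
  models:
    - name: "gemini-2.5-flash"     # or a local model via api_base
      weight: 1.0
database:
  population_size: 100             # smaller than the 500 used in the example config above
evaluator:
  cascade_evaluation: true         # filter out weak programs before expensive tests
```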
How does this compare to manual optimization?
| Aspect | Manual | OpenEvolve |
|---|---|---|
| Initial Learning | Weeks to understand domain | Minutes to start |
| Solution Quality | Depends on expertise | Consistently explores novel approaches |
| Time Investment | Days-weeks per optimization | Hours for complete evolution |
| Reproducibility | Hard to replicate exact process | Perfect reproduction with seeds |
| Scaling | Doesn't scale beyond human capacity | Parallel evolution across islands |
OpenEvolve shines when you need to explore large solution spaces or optimize for multiple objectives simultaneously.
Can I use my own LLM?
Yes! OpenEvolve supports any OpenAI-compatible API:
- Commercial: OpenAI, Google, Cohere
- Local: Ollama, vLLM, LM Studio, text-generation-webui
- Advanced: OptiLLM for routing and test-time compute
Just set the `api_base` in your config to point to your endpoint.
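For example, a self-hosted OpenAI-compatible endpoint might be configured like this (the URL and model name are placeholders for your own server):

```yaml
llm:
  api_base: "http://localhost:8000/v1"   # e.g. a local vLLM server
  model: "my-local-model"                # whatever name your server exposes
```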
What if evolution gets stuck?
Built-in mechanisms prevent stagnation:
- Island migration: Fresh genes from other populations
- Temperature control: Exploration vs exploitation balance
- Diversity maintenance: MAP-Elites prevents convergence
- Artifact feedback: Error messages guide improvements
- Template stochasticity: Randomized prompts break patterns
Manual interventions:
- Increase `num_diverse_programs` for more exploration
- Add custom feature dimensions to diversify the search
- Use template variations to randomize prompts
- Adjust migration intervals for more cross-pollination (see the config sketch below)
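As a sketch, those interventions map onto config keys like the following (values are illustrative; the keys are the same ones used in the configuration section above):

```yaml
# Illustrative anti-stagnation settings
prompt:
  num_diverse_programs: 4            # sample more unusual programs as inspiration
  use_template_stochasticity: true   # randomize prompt wording between generations
database:
  migration_interval: 10             # more frequent cross-island migration
  feature_dimensions: ["complexity", "diversity", "performance"]  # widen the QD grid
```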
How do I measure success?
Multiple success metrics:
- Primary Metric: Your evaluator's `combined_score` or metric average
- Convergence: Best score improvement over time
- Diversity: MAP-Elites grid coverage
- Efficiency: Iterations to reach target performance
- Robustness: Performance across different test cases
Use the visualizer to track all metrics in real-time and identify when evolution has converged.
Contributors
Thanks to all our amazing contributors who make OpenEvolve possible!
Contributing
We welcome contributions! Here's how to get started:
- Fork the repository
- Create your feature branch: `git checkout -b feat-amazing-feature`
- Add your changes and tests
- Test everything: `python -m unittest discover tests`
- Commit with a clear message
- Push and create a Pull Request
New to open source? Check out our Contributing Guide and look for good-first-issue labels!
Academic & Research
Articles & Blog Posts About OpenEvolve:
- Towards Open Evolutionary Agents - Evolution of coding agents and the open-source movement
- OpenEvolve: GPU Kernel Discovery - Automated discovery of optimized GPU kernels with 2-3x speedups
- OpenEvolve: Evolutionary Coding with LLMs - Introduction to evolutionary algorithm discovery using large language models
Citation
If you use OpenEvolve in your research, please cite:
```bibtex
@software{openevolve,
  title = {OpenEvolve: an open-source evolutionary coding agent},
  author = {Asankhaya Sharma},
  year = {2025},
  publisher = {GitHub},
  url = {https://github.com/codelion/openevolve}
}
```

Ready to evolve your code?
Made with love by the OpenEvolve community
Star this repository if OpenEvolve helps you discover breakthrough algorithms!




