GitHunt
AL

aliyanz85/crisis-sim

๐Ÿšจ AI-powered emergency response simulation combining Mesa ABM with advanced LLM reasoning strategies (ReAct, Reflexion, Plan-Execute) for optimized crisis management and multi-agent coordination

๐Ÿšจ CrisisSim: AI-Powered Emergency Response Simulation

Advanced multi-agent simulation platform combining Mesa ABM with LLM-driven reasoning strategies for optimized crisis response planning

Python
Mesa
License

๐ŸŽฏ Overview

CrisisSim is a comprehensive emergency response simulation that models crisis scenarios with intelligent AI agents. The platform integrates multiple Large Language Model (LLM) reasoning strategies including ReAct, Reflexion, Plan-Execute, Chain-of-Thought, and Tree-of-Thought to optimize rescue operations, resource allocation, and emergency response coordination.

โœจ Key Features

๐Ÿค– Multi-Strategy AI Planning

  • ReAct: Reasoning and Acting in iterative cycles
  • Reflexion: Self-reflection and memory-driven improvements
  • Plan-Execute: Hierarchical planning with tactical execution
  • Chain-of-Thought (CoT): Sequential reasoning chains
  • Tree-of-Thought (ToT): Branched reasoning exploration

๐ŸŒ Realistic Crisis Environment

  • Dynamic Fire Spread: Realistic fire propagation mechanics
  • Aftershock Events: Earthquake aftermath simulations
  • Resource Constraints: Battery, water, and tool limitations
  • Hospital Triage: FIFO and priority-based patient management
  • Multi-Agent Coordination: Drones, medics, and trucks working together

๐Ÿ“Š Comprehensive Evaluation Framework

  • Performance Metrics: Rescue efficiency, response time, resource utilization
  • Batch Evaluation: Multi-seed statistical analysis
  • Visualization: Real-time web UI and detailed performance plots
  • Comparative Analysis: Strategy performance across different scenarios

๐Ÿš€ Quick Start

Installation

# Clone the repository
git clone https://github.com/aliyanz85/crisis-sim.git
cd crisis-sim

# Setup environment
python -m venv .venv && source .venv/bin/activate  # On Windows: .venv\Scripts\activate
pip install -r requirements.txt

Basic Usage

๐ŸŽฎ Interactive Web Interface

python server.py
# Open http://127.0.0.1:8522 in your browser

๐Ÿ”„ Command Line Simulation

# Quick demo with mock LLM (no API keys required)
python main.py --map configs/map_small.yaml --provider mock --strategy react --seed 42 --ticks 150

# Advanced run with Groq API
export LLM_PROVIDER=groq
export GROQ_API_KEY=your_api_key_here
python main.py --map configs/map_medium.yaml --provider groq --strategy plan_execute --ticks 200

๐Ÿ“ˆ Batch Evaluation & Analysis

# Run comprehensive evaluation
python eval/harness.py --n_seeds 5 --maps configs/map_small.yaml configs/map_medium.yaml configs/map_hard.yaml --strategies react reflexion plan_execute --ticks 200

# Generate performance plots
python eval/plots.py --summary results/agg/summary.csv --out results/plots

๐Ÿ—๏ธ Architecture

CrisisSim/
โ”œโ”€โ”€ ๐Ÿง  reasoning/          # LLM strategy implementations
โ”‚   โ”œโ”€โ”€ react.py          # ReAct reasoning loops
โ”‚   โ”œโ”€โ”€ reflexion.py      # Memory-driven self-improvement
โ”‚   โ”œโ”€โ”€ plan_execute.py   # Hierarchical planning
โ”‚   โ”œโ”€โ”€ cot.py           # Chain-of-thought reasoning
โ”‚   โ””โ”€โ”€ tot.py           # Tree-of-thought exploration
โ”œโ”€โ”€ ๐ŸŒ env/               # Simulation environment
โ”‚   โ”œโ”€โ”€ world.py         # Mesa model & crisis dynamics
โ”‚   โ”œโ”€โ”€ agents.py        # Agent behaviors (drones, medics, trucks)
โ”‚   โ”œโ”€โ”€ dynamics.py      # Fire spread, aftershocks
โ”‚   โ””โ”€โ”€ sensors.py       # State observation system
โ”œโ”€โ”€ ๐Ÿ› ๏ธ tools/            # Agent capabilities
โ”‚   โ”œโ”€โ”€ hospital.py      # Medical facility management
โ”‚   โ”œโ”€โ”€ resources.py     # Resource tracking & constraints
โ”‚   โ””โ”€โ”€ routing.py       # Pathfinding & navigation
โ”œโ”€โ”€ ๐Ÿ“Š eval/              # Performance evaluation
โ”‚   โ”œโ”€โ”€ harness.py       # Batch experiment runner
โ”‚   โ””โ”€โ”€ plots.py         # Visualization generation
โ””โ”€โ”€ ๐Ÿ“‹ configs/           # Scenario configurations
    โ”œโ”€โ”€ map_small.yaml   # Training scenarios
    โ”œโ”€โ”€ map_medium.yaml  # Standard benchmarks
    โ””โ”€โ”€ map_hard.yaml    # Challenge scenarios

๐ŸŽฏ Supported Scenarios

Scenario Size Complexity Survivors Key Challenges
Small 20ร—20 Beginner 15 Basic coordination
Medium 25ร—25 Intermediate 25 Resource management
Hard 30ร—30 Advanced 40 Multi-crisis events

๐Ÿ“Š Performance Metrics

  • ๐Ÿฅ Rescue Efficiency: Survivors saved vs. casualties
  • โฑ๏ธ Response Time: Average rescue completion time
  • ๐Ÿ”‹ Resource Utilization: Energy and tool consumption
  • ๐Ÿš’ Crisis Mitigation: Fires extinguished, roads cleared
  • ๐Ÿฅ Hospital Management: Triage efficiency, overflow events
  • ๐Ÿค– AI Performance: JSON validity, replanning frequency

๐Ÿ”Œ LLM Provider Support

Provider Models Setup
Groq Llama 3.3 70B export GROQ_API_KEY=your_key
Google Gemini Gemini 1.5 Flash export GEMINI_API_KEY=your_key
Mock Heuristic Fallback No setup required

๐ŸŽจ Visualization Features

Real-time Web Interface

  • ๐Ÿ—บ๏ธ Interactive crisis map visualization
  • ๐Ÿ“Š Live performance dashboards
  • ๐Ÿ“ˆ Real-time metrics tracking
  • ๐ŸŽฎ Manual control override capabilities

Performance Analytics

  • ๐Ÿ“Š Strategy comparison charts
  • ๐Ÿ“ˆ Rescue efficiency trends
  • ๐ŸŽฏ Resource utilization heatmaps
  • ๐Ÿ“‰ Statistical significance testing

๐Ÿ›ก๏ธ Safety & Ethics

This simulation is designed for:

  • ๐Ÿ“š Research: Emergency response optimization
  • ๐ŸŽ“ Education: Crisis management training
  • ๐Ÿข Planning: Resource allocation strategies
  • ๐Ÿงช Development: AI reasoning system testing

๐Ÿค Contributing

We welcome contributions! Areas of focus:

  • ๐Ÿง  New LLM reasoning strategies
  • ๐ŸŒ Additional crisis scenarios
  • ๐Ÿ“Š Advanced evaluation metrics
  • ๐ŸŽจ Visualization improvements

๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

๐Ÿ™ Acknowledgments

  • Mesa Project: Agent-based modeling framework
  • OpenAI: LLM reasoning methodologies
  • Crisis Response Community: Domain expertise and validation

Built with โค๏ธ for emergency response optimization and AI reasoning research

aliyanz85/crisis-sim | GitHunt