# RL-KG-Agent
A Reinforcement Learning agent for Knowledge Graph reasoning that combines static RDF knowledge graphs with learned internal knowledge representations and LLM capabilities.
## Features
- Multi-modal Knowledge Access: Integrates static RDF/TTL knowledge graphs with dynamic internal knowledge graph memory
- Reinforcement Learning: Uses PPO (Proximal Policy Optimization) for learning optimal question-answering strategies
- Rich Action Space: 5 core actions including SPARQL queries, LLM responses, knowledge storage, clarifying questions, and planning
- Semantic Reward System: Uses sentence transformers for semantic similarity-based rewards
- Long-term Memory: Builds and maintains an internal knowledge graph of learned information
- HuggingFace Integration: Supports training on popular QA datasets (SQuAD, Natural Questions, MS MARCO)
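The reward feature above combines several weighted components. As a minimal sketch (assuming the weights shown in the Configuration section; the real system embeds text with sentence transformers, while this toy version takes pre-computed embedding vectors, and the function names are hypothetical):

```python
import math

# Reward weights as listed in the Configuration section of this README
REWARD_WEIGHTS = {
    "semantic_similarity": 0.4,
    "action_success": 0.25,
    "knowledge_gain": 0.15,
    "efficiency": 0.1,
    "user_satisfaction": 0.1,
}

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def combined_reward(components):
    """Weighted sum of per-component scores, each in [0, 1]."""
    return sum(REWARD_WEIGHTS[name] * score for name, score in components.items())

# Toy example: identical embeddings give similarity 1.0
sim = cosine_similarity([0.1, 0.3, 0.6], [0.1, 0.3, 0.6])
reward = combined_reward({
    "semantic_similarity": sim,
    "action_success": 1.0,
    "knowledge_gain": 0.0,
    "efficiency": 0.5,
    "user_satisfaction": 0.5,
})
```

In practice the semantic-similarity score would come from embedding the agent's answer and the expected answer with a sentence-transformer model and comparing the two vectors as above.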
## Installation

### Using UV (Recommended)
```bash
# Install uv if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh

# Clone the repository
git clone <repository-url>
cd rl-kg-agent

# Install dependencies
uv sync

# For development with all dev dependencies
uv sync --dev
```

### Quick Development Setup
```bash
# Complete development setup using Makefile
make dev-setup

# Or manually with uv
uv sync --dev
```

### Using pip (Alternative)
```bash
# Clone the repository
git clone <repository-url>
cd rl-kg-agent

# Install dependencies
pip install -e .

# For development
pip install -e .[dev]
```

## Quick Start
### 1. Interactive Mode
Start an interactive session with your knowledge graph:
```bash
# Using uv
uv run rl-kg-agent interactive --ttl-file path/to/your/knowledge_graph.ttl

# Or activate the environment first
uv shell
rl-kg-agent interactive --ttl-file path/to/your/knowledge_graph.ttl
```

### 2. Training
Train the RL agent on a QA dataset:
```bash
uv run rl-kg-agent train --ttl-file path/to/your/knowledge_graph.ttl --dataset squad --episodes 1000 --output-model trained_model
```

### 3. Evaluation
Evaluate a trained model:
```bash
uv run rl-kg-agent evaluate --ttl-file path/to/your/knowledge_graph.ttl --model-path trained_model --test-file test_questions.json
```

### Using Makefile (Alternative)
```bash
# Interactive mode
make run-interactive TTL_FILE=path/to/your/knowledge_graph.ttl

# Training
make run-train TTL_FILE=path/to/your/knowledge_graph.ttl

# Evaluation
make run-eval TTL_FILE=path/to/your/knowledge_graph.ttl MODEL_PATH=trained_model
```

## Architecture
### Core Components
- **Knowledge Graph Loader** (`kg_loader.py`)
  - Loads RDF/TTL files using rdflib
  - Executes SPARQL queries
  - Provides schema introspection
- **Internal Knowledge Graph** (`internal_kg.py`)
  - Dynamic knowledge storage with nodes and edges
  - Query context memory
  - Importance scoring and pruning
- **Action Space** (`action_space.py`)
  - 5 discrete actions for the RL agent
  - Each action has applicability conditions and execution logic
- **PPO Agent** (`ppo_agent.py`)
  - Custom Gymnasium environment
  - PPO implementation using stable-baselines3
  - Multi-input observation space
- **Reward Calculator** (`reward_calculator.py`)
  - Sentence transformer-based semantic similarity
  - Multi-component reward system
  - Configurable reward weights
### Action Space
- `RESPOND_WITH_LLM`: Generate a response using the language model
- `QUERY_STATIC_KG`: Write and execute SPARQL queries
- `STORE_TO_INTERNAL_KG`: Store learned information in memory
- `ASK_REFINING_QUESTION`: Ask clarifying questions
- `LLM_PLANNING_STAGE`: Engage in step-by-step reasoning
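The five actions above form a discrete RL action space, and each action carries an applicability condition. A minimal sketch (the `Action` enum values and the `applicable_actions` state keys here are illustrative assumptions, not the project's actual API):

```python
from enum import IntEnum

# The five discrete actions, indexed as a Gymnasium-style discrete space
class Action(IntEnum):
    RESPOND_WITH_LLM = 0
    QUERY_STATIC_KG = 1
    STORE_TO_INTERNAL_KG = 2
    ASK_REFINING_QUESTION = 3
    LLM_PLANNING_STAGE = 4

def applicable_actions(state: dict) -> list[Action]:
    """Hypothetical applicability check: not every action is valid in every state."""
    actions = [Action.RESPOND_WITH_LLM, Action.LLM_PLANNING_STAGE]
    if state.get("kg_loaded"):
        actions.append(Action.QUERY_STATIC_KG)
    if state.get("new_facts"):
        actions.append(Action.STORE_TO_INTERNAL_KG)
    if state.get("question_ambiguous"):
        actions.append(Action.ASK_REFINING_QUESTION)
    return actions

acts = applicable_actions({"kg_loaded": True, "question_ambiguous": False})
```

During training, the PPO policy would pick among the applicable actions; masking out inapplicable ones keeps the agent from wasting steps on, say, a SPARQL query before a graph is loaded.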
## Configuration
Create a configuration file to customize behavior:
```json
{
  "reward_weights": {
    "semantic_similarity": 0.4,
    "action_success": 0.25,
    "knowledge_gain": 0.15,
    "efficiency": 0.1,
    "user_satisfaction": 0.1
  },
  "ppo_config": {
    "learning_rate": 3e-4,
    "n_steps": 2048,
    "batch_size": 64
  }
}
```

## Dataset Support
The system supports several QA datasets:
- SQuAD 1.1/2.0: Stanford Question Answering Dataset
- Natural Questions: Google's natural language questions
- MS MARCO: Microsoft's machine reading comprehension
- Custom datasets: JSON/CSV format support
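A custom JSON dataset can follow the same shape as the Test Questions example later in this README: a list of objects with `question` and `expected_answer` fields. A hypothetical loader for that format (not the project's actual `data/` module):

```python
import json

def load_custom_dataset(text: str) -> list[tuple[str, str]]:
    """Parse a JSON list of {"question", "expected_answer"} records
    into (question, answer) pairs for training or evaluation."""
    records = json.loads(text)
    return [(r["question"], r["expected_answer"]) for r in records]

raw = '''[
  {"question": "What is the capital of France?", "expected_answer": "Paris"}
]'''
pairs = load_custom_dataset(raw)
```

A CSV variant would read the same two columns with `csv.DictReader`.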
## Development

### Project Structure
```
src/rl_kg_agent/
├── agents/        # PPO agent implementation
├── actions/       # Action space and management
├── knowledge/     # KG loading and internal storage
├── utils/         # Reward calculation and utilities
├── data/          # Dataset loading and processing
└── cli.py         # Command line interface
```
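The command line interface in `cli.py` exposes the `interactive`, `train`, and `evaluate` subcommands shown in Quick Start. A sketch of how those subcommands could be wired with `argparse` (an assumption for illustration; the real project may use a different CLI framework, but the command and flag names follow this README):

```python
import argparse

def build_parser() -> argparse.ArgumentParser:
    """Wire up the three subcommands from the Quick Start section."""
    parser = argparse.ArgumentParser(prog="rl-kg-agent")
    sub = parser.add_subparsers(dest="command", required=True)

    interactive = sub.add_parser("interactive")
    interactive.add_argument("--ttl-file", required=True)

    train = sub.add_parser("train")
    train.add_argument("--ttl-file", required=True)
    train.add_argument("--dataset", default="squad")
    train.add_argument("--episodes", type=int, default=1000)
    train.add_argument("--output-model", default="trained_model")

    evaluate = sub.add_parser("evaluate")
    evaluate.add_argument("--ttl-file", required=True)
    evaluate.add_argument("--model-path", required=True)
    evaluate.add_argument("--test-file", required=True)
    return parser

# Example: parse the training invocation from Quick Start
args = build_parser().parse_args(
    ["train", "--ttl-file", "kg.ttl", "--episodes", "500"]
)
```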
### Running Tests
```bash
# Using uv
uv run pytest tests/

# Or with coverage
uv run pytest tests/ --cov=src/rl_kg_agent
```

### Code Quality
```bash
# Using individual commands
uv run ruff format src/   # Format code
uv run ruff check src/    # Lint code
uv run mypy src/          # Type checking
uv run pytest tests/      # Run tests

# Using Makefile (recommended for development)
make format       # Format code
make lint         # Lint code
make lint-fix     # Lint with automatic fixes
make type-check   # Type checking
make test         # Run tests
make test-cov     # Run tests with coverage
make quality      # Run all quality checks
make ci           # Full CI pipeline

# See all available commands
make help
```

## Examples
### Custom Knowledge Graph
Create a simple TTL file:
```turtle
@prefix ex: <http://example.org/> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .

ex:Paris rdfs:label "Paris" .
ex:Paris ex:isCapitalOf ex:France .
ex:France rdfs:label "France" .
```

### Test Questions
Create a JSON file with test questions:
```json
[
  {
    "question": "What is the capital of France?",
    "expected_answer": "Paris"
  },
  {
    "question": "Which city is the capital of France?",
    "expected_answer": "Paris"
  }
]
```

## License
Apache 2.0 - see LICENSE file for details.
## Contributing
1. Fork the repository
2. Create a feature branch
3. Make changes with tests
4. Submit a pull request
## Roadmap
- Advanced entity recognition integration
- Multi-hop reasoning capabilities
- Distributed training support
- Web interface
- Integration with more LLM APIs
- Advanced SPARQL query generation
Created September 15, 2025
Updated September 16, 2025