GitHunt
BE

beita6969/ScienceClaw

๐Ÿ”ฌ๐Ÿฆž A self-evolving AI research colleague for scientists. 285 skills, zero hallucination, persistent memory.

ScienceClaw โ€” AI Research Gateway

A self-evolving AI research colleague for scientists.

Stars 285 Skills 28+ Disciplines Zero Hallucination License


Why ScienceClaw?

General-purpose AI assistants are built for everyone. ScienceClaw is built for researchers.

The core idea is simple: an AI that does real scientific work โ€” searching literature, querying databases, running analyses โ€” and gets better at it the more you use it. It remembers your research context across sessions, adapts its skills to your field, and never fabricates a citation.

ScienceClaw is built on the OpenClaw engine, but redesigned from the ground up for academic research.

ScienceClaw vs Standard AI


๐Ÿงฌ Core 1: Self-Evolving Skills

This is ScienceClaw's most important feature.

Most AI tools ship with a fixed set of capabilities. ScienceClaw's skills evolve with you. Every time you complete a research task, the system learns:

Skill Self-Evolution Cycle

What this means in practice:

  • Week 1: You study immunology. ScienceClaw learns that PubMed + Semantic Scholar works best for your queries, that you prefer forest plots over tables, and that you always need PMID + DOI in citations.
  • Week 4: The system has created specialized skills for your subfield โ€” optimized search templates, preferred statistical methods, database priority chains tuned to immunology literature.
  • Month 3: ScienceClaw handles your domain like a trained research assistant. It knows which databases to hit first, which journals matter, and how you like your output formatted.

Compared to standard OpenClaw: OpenClaw ships with ~54 general-purpose skills that don't change. ScienceClaw starts with 285 skills and grows from there โ€” the agent writes new SKILL.md files at runtime without any redeployment.


๐Ÿง  Core 2: Research Memory That Persists

Standard AI assistants forget everything when the conversation ends. ScienceClaw doesn't.

Four-Layer Research Memory

What this enables:

  • "Continue the literature review we started last Tuesday" โ€” it remembers where you left off
  • "Use the same search strategy that worked for the BRCA2 project" โ€” it retrieves past patterns
  • Cross-session knowledge accumulation โ€” findings from project A can inform project B
  • Smart context pruning โ€” when the context window fills up, it preserves statistical results, effect sizes, and key citations while compacting intermediate steps

Compared to standard OpenClaw: OpenClaw has a basic memory plugin. ScienceClaw adds temporal decay weighting, LanceDB vector storage, and cross-session research pattern retrieval โ€” specifically designed for long-running academic work.


โฑ๏ธ Core 3: Built for Long-Duration Research

A real literature review takes hours, not seconds. Most AI tools time out after a few minutes. ScienceClaw is engineered for extended research sessions:

Capability Standard OpenClaw ScienceClaw
Agent timeout 600s (10 min) 3600s (1 hour+)
Session persistence Ends with conversation Heartbeat keeps sessions alive across interruptions
Research depth Single-pass response Multi-phase protocol with mandatory depth thresholds
Minimum effort No guarantee Quick=5, Survey=30, Review=60, Systematic=100+ tool calls
Early stopping Common Anti-premature-conclusion checklist blocks shallow answers
Context management Basic truncation Smart compaction preserves key findings when context fills up

The persistence protocol enforces real research depth. Before ScienceClaw concludes any task, it must verify:

  • โœ… Searched at least 3 different databases/sources
  • โœ… Retrieved full metadata (not just titles)
  • โœ… Cross-referenced findings across sources
  • โœ… Checked for contradictory evidence
  • โœ… Verified key statistics against primary sources
  • โœ… Organized results into a structured output file
  • โœ… Met the minimum tool-call threshold for the task type

If any box is unchecked, it keeps working instead of giving you a half-baked answer.

Compared to standard OpenClaw: OpenClaw's default 10-minute timeout is fine for sending messages and setting reminders. ScienceClaw's 1-hour sessions with heartbeat monitoring and mandatory depth enforcement are built for real academic research.


๐Ÿšซ Core 4: Zero Hallucination

This is the highest-priority rule in the entire system. It's non-negotiable.

The problem: General AI assistants routinely fabricate citations โ€” inventing DOIs, making up author names, citing papers that don't exist. In scientific work, this is catastrophic.

ScienceClaw's approach:

EVERY citation must come from a tool result in the CURRENT conversation.

If a database didn't return it โ†’ you can't cite it.
If you're not sure โ†’ say "not verified" explicitly.
If you can't find evidence โ†’ say so. Don't guess.

No "I think." No "probably." No hallucinated PMIDs.

This is enforced at the protocol level in SCIENCE.md โ€” the 629-line research protocol that governs all agent behavior. It's not a suggestion. It's a hard rule that applies before any other instruction.

Compared to standard OpenClaw: OpenClaw has no special hallucination controls. ScienceClaw's SCIENCE.md protocol treats every factual claim as requiring evidence โ€” the same standard you'd apply to a manuscript under peer review.


๐ŸŒ Core 5: All of Science, Not Just Biomedicine

ScienceClaw covers natural sciences AND social sciences across dozens of disciplines:

Scientific Discipline Coverage

๐Ÿ“‹ Full discipline & database list

Natural Sciences

Domain Key Skills & Databases
Biomedicine PubMed, UniProt, KEGG, PDB, ClinicalTrials, gnomAD, scanpy, biopython
Chemistry PubChem, ChEMBL, RDKit, drug-discovery, molecular-dynamics
Genomics NCBI Entrez, Ensembl, ClinVar, GEO, phylogenetics
Materials Science Materials Project, pymatgen, materials-screening
Physics astropy, quantum-computing, physics-solver, simulation
Environmental Science Copernicus climate data, geospatial analysis, GIS tools
Food Science Specialized analysis pipelines

Social Sciences

Domain Key Skills & Databases
Economics World Bank, SSRN, census data, econometrics
Political Science Policy analysis, legislative data
Psychology Experimental design, statistical testing, meta-analysis
Linguistics spaCy, NLTK, NLP analysis
Education Research methodology, assessment analysis
Sociology Network analysis, survey methods

Cross-Disciplinary Tools

Category Capabilities
Statistics SciPy, statsmodels, scikit-learn, effect sizes, confidence intervals, multiple comparison corrections
Visualization matplotlib, plotly, seaborn, publication-quality figures
Writing LaTeX papers, systematic reviews (PRISMA), grant proposals, patent drafting
Mathematics SymPy symbolic computation, numerical methods, optimization

285 skills total โ€” and growing, because the self-evolution system creates new ones as you work.

Compared to standard OpenClaw: OpenClaw has no scientific database integrations. No PubMed, no UniProt, no arXiv, no World Bank. ScienceClaw connects to 25+ academic databases with structured API query skills across all major scientific disciplines.


Quick Start

# Clone
git clone https://github.com/beita6969/ScienceClaw.git
cd ScienceClaw

# One-click setup (installs everything: Node, Python, MCP servers, skills)
chmod +x setup.sh && ./setup.sh

# Or manual install
pnpm install && npx openclaw onboard

Enable Research Features

The setup.sh script automatically configures everything. For manual setup, edit ~/.openclaw/openclaw.json:

{
  "gateway": { "mode": "local" },
  "plugins": {
    "slots": { "memory": "memory-core" },
    "entries": {
      "memory-core": { "enabled": true },
      "memory-lancedb": { "enabled": true }
    }
  },
  "agents": {
    "defaults": {
      "heartbeat": { "interval": 1800 }
    }
  }
}

Project Structure

ScienceClaw/
โ”œโ”€โ”€ setup.sh                # ๐Ÿฆž One-click setup (run this first!)
โ”œโ”€โ”€ SCIENCE.md              # 629-line research protocol (the brain)
โ”œโ”€โ”€ skills/                 # 285 skill definitions (and growing)
โ”‚   โ”œโ”€โ”€ skill-evolution/    # Self-improving skill system
โ”‚   โ”œโ”€โ”€ research-reflection/# Post-task learning & evaluation
โ”‚   โ”œโ”€โ”€ skill-creator/      # Runtime skill generation
โ”‚   โ””โ”€โ”€ ...
โ”œโ”€โ”€ src/                    # Core engine
โ”‚   โ”œโ”€โ”€ memory/             # 4-layer memory (temporal decay, LanceDB)
โ”‚   โ”œโ”€โ”€ agents/             # Agent orchestration & persistence
โ”‚   โ””โ”€โ”€ skills/             # Skill loading & execution
โ”œโ”€โ”€ ui/                     # Web-based research gateway UI
โ”œโ”€โ”€ extensions/             # Plugin system
โ”œโ”€โ”€ deploy/                 # Docker, Fly.io, Podman configs
โ”œโ”€โ”€ config/                 # Vitest, build, lint configs
โ””โ”€โ”€ docs/                   # Documentation

Contact Us

๐Ÿ“ง mingdazhang@ieee.org

License

MIT โ€” see LICENSE.