d-v-b/datajoint-python

Relational data pipelines for the science lab

Welcome to DataJoint for Python!

Badges: PyPI release and downloads, conda-forge release and downloads, commits since last release, test status, release status, doc status, coverage
Developer Chat: DataJoint Slack
License: LGPL-2.1
Citation: bioRxiv, Zenodo

DataJoint for Python is a framework for scientific workflow management based on
relational principles. DataJoint is built on the foundation of the relational data
model and prescribes a consistent method for organizing, populating, computing, and
querying data.
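
To make the four activities above concrete, here is a minimal sketch of a relational pipeline with an organize, populate, compute, and query step. It deliberately uses Python's standard-library `sqlite3` module rather than the DataJoint API, so it runs without a database server. The table names (`session`, `recording`, `session_mean`) and the helper functions are hypothetical and chosen only for illustration; they are not taken from DataJoint.

```python
import sqlite3


def build_pipeline(conn):
    """Organize: declare tables whose dependencies are foreign keys."""
    conn.execute("PRAGMA foreign_keys = ON")
    conn.execute(
        """CREATE TABLE session (
               session_id INTEGER PRIMARY KEY,
               subject    TEXT NOT NULL)"""
    )
    conn.execute(
        """CREATE TABLE recording (
               session_id INTEGER NOT NULL REFERENCES session(session_id),
               sample_idx INTEGER NOT NULL,
               value      REAL NOT NULL,
               PRIMARY KEY (session_id, sample_idx))"""
    )
    # Hypothetical computed table: one result row per session.
    conn.execute(
        """CREATE TABLE session_mean (
               session_id INTEGER PRIMARY KEY REFERENCES session(session_id),
               mean_value REAL NOT NULL)"""
    )


def populate_session_mean(conn):
    """Compute: fill in results only for sessions that lack one."""
    conn.execute(
        """INSERT INTO session_mean (session_id, mean_value)
           SELECT r.session_id, AVG(r.value)
           FROM recording AS r
           WHERE r.session_id NOT IN (SELECT session_id FROM session_mean)
           GROUP BY r.session_id"""
    )
    conn.commit()


def run_demo():
    conn = sqlite3.connect(":memory:")
    build_pipeline(conn)
    # Populate: enter the manually collected data.
    conn.executemany(
        "INSERT INTO session VALUES (?, ?)",
        [(1, "mouse_a"), (2, "mouse_b")],
    )
    conn.executemany(
        "INSERT INTO recording VALUES (?, ?, ?)",
        [(1, 0, 1.0), (1, 1, 3.0), (2, 0, 10.0), (2, 1, 20.0)],
    )
    populate_session_mean(conn)
    populate_session_mean(conn)  # second call finds nothing left to compute
    # Query: join the computed results back to their upstream table.
    rows = conn.execute(
        """SELECT s.subject, m.mean_value
           FROM session AS s JOIN session_mean AS m USING (session_id)
           ORDER BY s.session_id"""
    ).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    print(run_demo())
```

The design point in this sketch is that the computed table depends on its upstream tables through foreign keys, so the compute step can be rerun safely and only processes entries that are still missing.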

DataJoint was initially developed in 2009 by Dimitri Yatsenko in Andreas Tolias' Lab at
Baylor College of Medicine for the distributed processing and management of large
volumes of data streaming from regular experiments. Since 2011, DataJoint has been
available as an open-source project, adopted by other labs and improved through
contributions from several developers. Presently, the primary developer of DataJoint
open-source software is the company DataJoint (https://datajoint.com).

Data Pipeline Example

[Figure: pipeline]

Yatsenko et al., bioRxiv 2021

Getting Started

GNU Lesser General Public License v2.1
Created August 28, 2025
Updated September 26, 2025