53 results for “topic:data-infrastructure”
Postgres operator creates and manages PostgreSQL clusters running in Kubernetes
Production PostgreSQL for Kubernetes, from high availability Postgres clusters to full-scale database-as-a-service.
Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide, DuckDB, Pandas, and Plotly, Matplotlib, etc. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document.
Highly available elephant herd: HA PostgreSQL cluster using Docker
TensorBase is a new big data warehousing with modern efforts.
A distributed event bus that implements a RESTful API abstraction on top of Kafka-like queues
Semantica 🧠 — A framework for building semantic layers, context graphs, and decision intelligence systems with explainability and provenance.
A battle-tested, flexible & comprehensive monitoring solution for your PostgreSQL databases
The Data Engineering Book - หนังสือวิศวกรรมข้อมูล ของคนไทย เพื่อคนไทย
JSON schema parser for Apache Spark
Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.
Kanadi is a Nakadi client for Scala
OpenSnowcat Enricher (Apache 2.0 License)
No description provided.
Data dependency manager
Python function to stream unzip all the files in a ZIP archive on the fly
Python function to construct a ZIP archive on the fly
A generic data pipeline which will map Elasticsearch documents to Bigquery table rows
Information relating to topics on Data Engineering, Data Infrastructure, Data Storing, Data Warehouses and Business Analysis. For those interested in both conceptual theory and use case examples for database design and development.
Service for sharing user consent to cookies across multiple domains
Collections of POC/dev data infrastructure. | #SE
TP d'architecture décisionnel à destination des étudiants de l'EPSI et DC Paris. Le but est de déployer une architecture data dès la récupération de la donnée vers la restitution sous la forme de dataviz en passant par un Datalake, Data Warehouse et d'un Data Mart
Bring Infrastructure as Code best practices to your data workflows with Kestra and Terraform
A fake GOV.UK homepage and start pages for SDE prototype services
Python package to parse Companies House accounts data in a streaming way
Processing code for Scientific Data Descriptor paper
Open-source API to securely share data with customers.
Professional GitHub profile README for Jacob P. Evans—data infrastructure specialist and Splunk/Cribl consultant with expertise in security ops and SIEM architecture. Showcases skills across cloud platforms (AWS, GCP, Azure), programming (Python, Go, Bash), DevOps tools. Highlights homelab automation, AI/LLM development, prompt engineering.
BlockDB - lineage-verified DeFi datasets and real-time on-chain data APIs for quant funds, AI teams, and Web3 developers. Clean, schema-stable Ethereum and EVM data with reorg awareness and traceable records.
🌐 oGraph — Open-source identity layer solving identity fragmentation across the digital ecosystem