Top Repositories
Apache Doris is an easy-to-use, high performance and unified analytics database.
Apache Parquet Format
A high-performance observability data pipeline.
SigNoz is an open-source observability platform native to OpenTelemetry with logs, traces and metrics in a single application. An open-source alternative to DataDog, NewRelic, etc. Open source Application Performance Monitoring (APM) & Observability tool
Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, and PyTorch with more integrations coming.
Apache Iceberg
Repositories
A high-performance observability data pipeline.
SigNoz is an open-source observability platform native to OpenTelemetry with logs, traces and metrics in a single application. An open-source alternative to DataDog, NewRelic, etc. Open source Application Performance Monitoring (APM) & Observability tool
Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, and PyTorch with more integrations coming.
Apache Iceberg
Apache Flink
Apache Paimon (incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
Apache Doris is an easy-to-use, high performance and unified analytics database.
No description provided.
No description provided.
No description provided.
Flink Agents is an Agentic AI framework based on Apache Flink
Proxy that captures and visualizes in-flight Claude Code requests and conversations.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands.
Apache Spark - A unified analytics engine for large-scale data processing
Apache Pinot - A realtime distributed OLAP datastore
Apache Lucene open-source search software
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Open, Multi-modal Catalog for Data & AI
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
Apache Parquet Format
The official home of the Presto distributed SQL query engine for big data
A compute manifest and composable tools for ML, built on Ibis, DataFusion, and Arrow Flight.
A single-node analytical database engine with geospatial as a first-class citizen
No description provided.
#1 OpenCode Plugin - Battery included. ASYNC SUBAGENTS (YES LIKE CLAUDE CODE) · Curated agents with proper models · Crafted tools like LSP/AST included · Curated MCPs · Claude Code Compatible Layer - Steroids for your OpenCode. The Best LLM Agent Experience is Here.
Incremental view maintenance & query rewriting for materialized views in DataFusion
Apache Calcite
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.