Top Repositories
Open Control Plane for Tables in Data Lakehouse
Apache Iceberg
Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, and PyTorch with more integrations coming.
An extensible, state-of-the-art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LF AI & Data, part of the Linux Foundation.
A temporary home for LinkedIn's changes to Apache Iceberg (incubating)
Repositories (25)
Open Control Plane for Tables in Data Lakehouse
Apache Iceberg
Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, and PyTorch with more integrations coming.
An extensible, state-of-the-art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LF AI & Data, part of the Linux Foundation.
A temporary home for LinkedIn's changes to Apache Iceberg (incubating)
DevoxxGenie is a plugin for IntelliJ IDEA that uses local LLMs (Ollama, LMStudio, GPT4All, Llama.cpp and Exo) and cloud-based LLMs to help review, test, and explain your project code.
Apache Spark - A unified analytics engine for large-scale data processing
No description provided.
No description provided.
No description provided.
No description provided.
Mirror of Apache Kafka
No description provided.
Ultra fast JSON decoder and encoder written in C with Python bindings
Apache Airflow (Incubating)
Solutions for Project Euler problems
No description provided.
A scalable, distributed Time Series Database.
For the latest version of boto, see https://github.com/boto/boto3 -- Python interface to Amazon Web Services
A highly scalable real-time graphing system
No description provided.
No description provided.
Algorithms library
No description provided.
No description provided.