Sumedh Sakdeo
sumedhsakdeo
https://www.linkedin.com/in/sumedhsakdeo/
Repositories (25)
OpenHouse - A Control Plane for Tables
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
PyIceberg
Apache Iceberg
A temporary home for LinkedIn's changes to Apache Iceberg (incubating)
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
An Open Standard for lineage metadata collection
Apache Flink
LOTUS: A semantic query engine for fast and easy LLM-powered data processing
Inference code for Llama models
LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Open, Multi-modal Catalog for Data & AI
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
DuckDB is an in-process SQL OLAP Database Management System
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
Apache Hadoop
Apache Spark - A unified analytics engine for large-scale data processing
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
SQLAlchemy dialect for BigQuery
Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application
No description provided.
No description provided.
No description provided.
Final Project for cs133
No description provided.