GitHunt

Sumedh Sakdeo

sumedhsakdeo

https://www.linkedin.com/in/sumedhsakdeo/

LinkedIn
Bay Area

Languages

Java43%C++29%Python14%JavaScript14%

Top Repositories

Repositories

25
SU
sumedhsakdeo/openhouseFork

OpenHouse - A Control Plane for Tables

Java00Updated 5 hours ago
SU
sumedhsakdeo/arrowFork

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

00Updated 3 weeks ago
SU
sumedhsakdeo/iceberg-pythonFork

PyIceberg

00Updated 3 weeks ago
SU
sumedhsakdeo/icebergFork

Apache Iceberg

Java00Updated 1 month ago
SU
sumedhsakdeo/li_icebergFork

A temporary home for LinkedIn's changes to Apache Iceberg (incubating)

Java00Updated 2 months ago
SU
sumedhsakdeo/coralFork

Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.

00Updated 4 months ago
SU
sumedhsakdeo/OpenLineageFork

An Open Standard for lineage metadata collection

00Updated 5 months ago
SU
sumedhsakdeo/flinkFork

Apache Flink

00Updated 5 months ago
SU
sumedhsakdeo/lotusFork

LOTUS: A semantic query engine for fast and easy LLM-powered data processing

00Updated 6 months ago
SU
sumedhsakdeo/llamaFork

Inference code for Llama models

00Updated 1 year ago
SU
sumedhsakdeo/haystackFork

:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

00Updated 1 year ago
SU
sumedhsakdeo/unitycatalogFork

Open, Multi-modal Catalog for Data & AI

00Updated 1 year ago
SU
sumedhsakdeo/trinoFork

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

00Updated 1 year ago
SU
sumedhsakdeo/duckdbFork

DuckDB is an in-process SQL OLAP Database Management System

00Updated 1 year ago
SU
sumedhsakdeo/orcFork

Apache ORC - the smallest, fastest columnar storage for Hadoop workloads

00Updated 4 years ago
SU
sumedhsakdeo/hadoopFork

Apache Hadoop

00Updated 4 years ago
SU
sumedhsakdeo/sparkFork

Apache Spark - A unified analytics engine for large-scale data processing

00Updated 4 years ago
SU
sumedhsakdeo/gobblinFork

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

00Updated 4 years ago
SU
sumedhsakdeo/pybigqueryFork

SQLAlchemy dialect for BigQuery

Python10Updated 4 years ago
SU
sumedhsakdeo/incubator-supersetFork

Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application

JavaScript00Updated 7 years ago
SU
sumedhsakdeo/TheOneProject

No description provided.

00Updated 11 years ago
SU
sumedhsakdeo/TheOne

No description provided.

00Updated 11 years ago
SU
sumedhsakdeo/zynx

No description provided.

C++10Updated 12 years ago
SU
sumedhsakdeo/cs133

Final Project for cs133

C++41Updated 12 years ago
SU
sumedhsakdeo/one_1.4.1

No description provided.

00Updated 13 years ago

Gists

Recent Activity

Sumedh Sakdeo (sumedhsakdeo) | GitHunt