Vara Bonthu
vara-bonthu
Principal OSS Specialist SA | Data + AI & Kubernetes @aws
Languages
Top Repositories
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
MCP Server for Apache Spark History Server. The bridge between Agnetic AI and Apache Spark.
DoEKS is a tool to build, deploy and scale Data Platforms on Amazon EKS
AI on EKS - Tested AI/ML for Amazon Elastic Kubernetes Service
Terraform Module: Deploy Data/ML Addons Helm Charts on EKS 🚀
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Repositories
41Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
MCP Server for Apache Spark History Server. The bridge between Agnetic AI and Apache Spark.
DoEKS is a tool to build, deploy and scale Data Platforms on Amazon EKS
Open-source advocate, cloud architect, and AI/ML enthusiast helping teams build scalable data platforms on Kubernetes
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
No description provided.
AI on EKS - Tested AI/ML for Amazon Elastic Kubernetes Service
Terraform Module: Deploy Data/ML Addons Helm Charts on EKS 🚀
Repository used to main group ACLs used by Kubeflow developers
Information about the Kubeflow community including proposals and governance information.
Kubeflow blog based on fastpages
Kubeflow Website
No description provided.
Packer configuration for building a custom EKS AMI
No description provided.
Accelerating Data processing workloads on GPUs with Spark-RAPIDS
Apache Druid: a high performance real-time analytics database.
A high-throughput and memory-efficient inference and serving engine for LLMs
Utilities intended for use with Llama models.
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Spin up a complete internal developer platform with only Docker required as a dependency.
This is the reference implementation of CNOE and its toolings on AWS
Distributed ML Training and Fine-Tuning on Kubernetes
Experimental data plane controller to copy data from a Kubernetes cluster to cloud object stores
No description provided.
Terraform AWS provider
Crossplane AWS Provider
Networking plugin repository for pod networking in Kubernetes using Elastic Network Interfaces on AWS
Terraform module which creates AWS EMR resources
A Cloud Native Batch System (Project under CNCF)