Hemil Desai
hemildesai
Building training frameworks and tools for MoEs @NVIDIA-NeMo
Languages
Repos
37
Stars
47
Forks
16
Top Language
Python
Loading contributions...
Top Repositories
An online multiplayer board game similar to Catan
Deepspeed integration with mmdetection3d
A toolkit for benchmarking Generative Models
OpenMMLab's next-generation platform for general 3D object detection.
Experiments with Cifar10
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
Repositories
37An online multiplayer board game similar to Catan
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
verl: Volcano Engine Reinforcement Learning for LLMs
DeepEP: an efficient expert-parallel communication library
A project to improve skills of large language models
A tool to configure, launch and manage your machine learning experiments.
Deepspeed integration with mmdetection3d
A toolkit for benchmarking Generative Models
OpenMMLab's next-generation platform for general 3D object detection.
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
My personal website
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
No description provided.
This is a go library for twitter v2 API integration.
No description provided.
No description provided.
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Experiments with Cifar10
🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools
No description provided.
Version control for machine learning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
Bringing full-stack to the Jamstack.
GitHub README
Event-based dependency manager for Kubernetes.
OAuth 2.0 Bearer JWT Authorizer for AWS API Gateway
No description provided.
No description provided.
Numpy main repository
A web application to store and organize online resources shared within an organization