Zhe Zhang
zhe-thoughts
Current: Distinguished Engineer @NVIDIA DGX Cloud. @Apache Member. Former Head of @ray-project + Head of Field Engineering @anyscale.
Repos: 31
Stars: 5
Forks: 3
Top Language: Python
Top Repositories
A novel by Caroline and Preston Zhang, combining their loves for Harry Potter and Star Wars. Enjoy! (The format of the site is forked from the excellent example https://github.com/hankquinlan/hankquinlan.github.io)
Zhe's blog
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, and PyTorch with more integrations coming.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Repositories
31
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, and PyTorch with more integrations coming.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.
SGLang is a fast serving framework for large language models and vision language models.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
No description provided.
Roblox Foundation Model for 3D Intelligence
Numbers every programmer should know (GPU edition)
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
A native PyTorch Library for large model training
The simplest, fastest repository for training/finetuning medium-sized GPTs.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
nanoGPT-style version of Llama 3.1
A high-throughput and memory-efficient inference and serving engine for LLMs
TensorFlow on YARN (TonY) is a framework to natively run TensorFlow on Apache Hadoop.
The official home of the Presto distributed SQL query engine for big data
All Algorithms implemented in Python
Bug Life - GitHub Data Challenge 2014
No description provided.
Build and manage real-life data science projects with ease.
A novel by Caroline and Preston Zhang, combining their loves for Harry Potter and Star Wars. Enjoy! (The format of the site is forked from the excellent example https://github.com/hankquinlan/hankquinlan.github.io)
This is a demo GitHub Pages and Jekyll site. See README for more info.
Secure HDFS Access from Kubernetes
Zhe's blog
TensorFlowOnSpark brings TensorFlow programs onto Apache Spark clusters
Mirror of Apache Arrow
A Reliable Memory Centric Distributed Storage System
No description provided.