GitHunt

Ming-Xu Huang

mingxu1067

NVIDIA
Taiwan

Languages

Python50%C++25%Roff13%Java13%

Repos

17

Stars

10

Forks

0

Top Language

Python

Loading contributions...

Top Repositories

Repositories

17
MI
mingxu1067/jaxFork

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

00Updated 3 months ago
MI
mingxu1067/xlaFork

A machine learning compiler for GPUs, CPUs, and ML accelerators

00Updated 3 weeks ago
MI
mingxu1067/GPU-Perf-Analyzer

A tool to classify and statistic GPU kernel information.

Python100Updated 1 year ago
MI
mingxu1067/maxtextFork

A simple, performant and scalable Jax LLM!

00Updated 6 months ago
MI
mingxu1067/oai_triton_perf_regression

No description provided.

Python00Updated 1 year ago
MI
mingxu1067/TransformerEngineFork

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.

Python00Updated 3 months ago
MI
mingxu1067/JAX-ToolboxFork

JAX-Toolbox

00Updated 2 years ago
MI
mingxu1067/cudnn-frontendFork

cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it

00Updated 3 years ago
MI
mingxu1067/docsFork

Documentations for PaddlePaddle

00Updated 3 years ago
MI
mingxu1067/paddle_allreduce_issues_reproduce

paddle_allreduce_issues_reproduce

Roff00Updated 4 years ago
MI
mingxu1067/PaddleFork

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

Python00Updated 2 years ago
MI
mingxu1067/PaddleNLPFork

An NLP library with Awesome pre-trained Transformer models and easy-to-use interface, supporting wide-range of NLP tasks from research to industrial applications.

00Updated 4 years ago
MI
mingxu1067/modelsFork

Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)

00Updated 4 years ago
MI
mingxu1067/apexFork

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

00Updated 5 years ago
MI
mingxu1067/Apply_TD-learning_to_2048

No description provided.

C++00Updated 9 years ago
MI
mingxu1067/2048_Framework

No description provided.

C++00Updated 9 years ago
MI
mingxu1067/MIST-3D_printer-Trend-Analysis

No description provided.

Java00Updated 10 years ago

Gists

Recent Activity

Ming-Xu Huang (mingxu1067) | GitHunt