GitHunt

Mark Saroufim

msaroufim

CUDA uninЇsțållåțîön fāīłüřęđ. Płēȃšę čøñțàçț șūppørt før åššīštåñćē

@PyTorch and @gpu-mode
Bay Area

Organizations

Languages

Python78%TypeScript11%Rust6%CSS6%

Loading contributions...

Top Repositories

Repositories

209
MS
msaroufim/Triton-PuzzlesFork

Puzzles for learning Triton

30Updated 1 year ago
MS
msaroufim/leaderboard-pythonFork

Leaderboards backed by Redis in Python

Python00Updated 6 years ago
MS
msaroufim/ml-design-patterns

Software Architecture for ML engineers

41932Updated 3 years ago
deep-learningdesign-patternspythonpytorchsystems
MS
msaroufim/awesome-profiling

Awesome utilities for performance profiling

20110Updated 6 days ago
MS
msaroufim/mynotes

No description provided.

Python195Updated 2 weeks ago
MS
msaroufim/nanochatFork

The best ChatGPT that $100 can buy.

00Updated 1 week ago
MS
msaroufim/nanoGPTFork

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python01Updated 1 week ago
MS
msaroufim/thumbnail-montage

GPU MODE 100 lectures thumbnail montage video generator

Python20Updated 1 week ago
MS
msaroufim/intermediate-python

An intro for people that want to ship not just read code

13Updated 4 years ago
MS
msaroufim/pretty-rocm-smi

A prettier rocm-smi output with color-coded GPU stats

Rust100Updated 3 weeks ago
MS
msaroufim/steam-games

Fetch and sort Steam library by playtime

Python00Updated 3 weeks ago
MS
msaroufim/pytorch_build_times

No description provided.

10Updated 3 weeks ago
MS
msaroufim/newblog

new blog, who dis?

CSS00Updated 3 weeks ago
MS
msaroufim/helionFork

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python00Updated 3 weeks ago
MS
msaroufim/C-compiler-optimizations

Description of commonly done compiler optimizations in C

4710Updated 3 years ago
MS
msaroufim/gemm-eval

No description provided.

Python00Updated 1 month ago
MS
msaroufim/Discord-PDFPreview

Preview PDFs locally within the Discord UI!

Python213Updated 2 years ago
MS
msaroufim/pytorch-load-inline-highlighter

VS Code extension for syntax highlighting C++/CUDA/HIP code in PyTorch load_inline() strings

Python90Updated 7 months ago
MS
msaroufim/Data-Science-From-Scratch

Code Companion to Joel Grus' book

Python2911Updated 7 years ago
MS
msaroufim/dvcFork

⚡️Data & models versioning for ML projects, make them shareable and reproducible

Python00Updated 6 years ago
MS
msaroufim/llm_coder

Help Claude know about your library by giving it the main APIs in a prompt and integrate it into VS Code

TypeScript10Updated 1 year ago
MS
msaroufim/hqqFork

Official implementation of Half-Quadratic Quantization (HQQ)

00Updated 1 year ago
MS
msaroufim/cuda-pythonFork

CUDA Python: Performance meets Productivity

00Updated 10 months ago
MS
msaroufim/decent

No description provided.

00Updated 1 year ago
MS
msaroufim/torchftFork

PyTorch per step fault tolerance (actively under development)

Python00Updated 10 months ago
MS
msaroufim/gpumode-site

The world's best GPU community

TypeScript10Updated 1 year ago
MS
msaroufim/ruffFork

An extremely fast Python linter and code formatter, written in Rust.

00Updated 1 year ago
MS
msaroufim/ThunderKittensFork

Tile primitives for speedy kernels

00Updated 1 year ago
MS
msaroufim/chess

No description provided.

Python00Updated 1 year ago
MS
msaroufim/aoFork

PyTorch native quantization and sparsity for training and inference

Python00Updated 1 year ago

Gists

Recent Activity