Mark Saroufim

msaroufim

CUDA uninЇsțållåțîön fāīłüřęđ. Płēȃšę čøñțàçț șūppørt før åššīštåñćē

@PyTorch and @gpu-mode

Bay Area

https://marksaroufim.com

Organizations

Languages

Python78%TypeScript11%Rust6%CSS6%

Loading contributions...

Top Repositories

ml-design-patterns

Software Architecture for ML engineers

419

awesome-profiling

Awesome utilities for performance profiling

201

C-compiler-optimizations

Description of commonly done compiler optimizations in C

Data-Science-From-Scratch

Code Companion to Joel Grus' book

29Python

Discord-PDFPreview

Preview PDFs locally within the Discord UI!

21Python

mynotes

19Python

Repositories

209

msaroufim/Triton-PuzzlesFork

Puzzles for learning Triton

30Updated 1 year ago

msaroufim/leaderboard-pythonFork

Leaderboards backed by Redis in Python

Python00Updated 6 years ago

msaroufim/ml-design-patterns

Software Architecture for ML engineers

41932Updated 3 years ago

deep-learningdesign-patternspythonpytorchsystems

msaroufim/awesome-profiling

Awesome utilities for performance profiling

20110Updated 6 days ago

msaroufim/mynotes

No description provided.

Python195Updated 2 weeks ago

msaroufim/nanochatFork

The best ChatGPT that $100 can buy.

00Updated 1 week ago

msaroufim/nanoGPTFork

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python01Updated 1 week ago

msaroufim/thumbnail-montage

GPU MODE 100 lectures thumbnail montage video generator

Python20Updated 1 week ago

msaroufim/intermediate-python

An intro for people that want to ship not just read code

13Updated 4 years ago

msaroufim/pretty-rocm-smi

A prettier rocm-smi output with color-coded GPU stats

Rust100Updated 3 weeks ago

msaroufim/steam-games

Fetch and sort Steam library by playtime

Python00Updated 3 weeks ago

msaroufim/pytorch_build_times

No description provided.

10Updated 3 weeks ago

msaroufim/newblog

new blog, who dis?

CSS00Updated 3 weeks ago

msaroufim/helionFork

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python00Updated 3 weeks ago

msaroufim/C-compiler-optimizations

Description of commonly done compiler optimizations in C

4710Updated 3 years ago

msaroufim/gemm-eval

No description provided.

Python00Updated 1 month ago

msaroufim/Discord-PDFPreview

Preview PDFs locally within the Discord UI!

Python213Updated 2 years ago

msaroufim/pytorch-load-inline-highlighter

VS Code extension for syntax highlighting C++/CUDA/HIP code in PyTorch load_inline() strings

Python90Updated 7 months ago

msaroufim/Data-Science-From-Scratch

Code Companion to Joel Grus' book

Python2911Updated 7 years ago

msaroufim/dvcFork

⚡️Data & models versioning for ML projects, make them shareable and reproducible

Python00Updated 6 years ago

msaroufim/llm_coder

Help Claude know about your library by giving it the main APIs in a prompt and integrate it into VS Code

TypeScript10Updated 1 year ago

msaroufim/hqqFork

Official implementation of Half-Quadratic Quantization (HQQ)

00Updated 1 year ago

msaroufim/cuda-pythonFork

CUDA Python: Performance meets Productivity

00Updated 10 months ago

msaroufim/decent

No description provided.

00Updated 1 year ago

msaroufim/torchftFork

PyTorch per step fault tolerance (actively under development)

Python00Updated 10 months ago

msaroufim/gpumode-site

The world's best GPU community

TypeScript10Updated 1 year ago

msaroufim/ruffFork

An extremely fast Python linter and code formatter, written in Rust.

00Updated 1 year ago

msaroufim/ThunderKittensFork

Tile primitives for speedy kernels

00Updated 1 year ago

msaroufim/chess

No description provided.

Python00Updated 1 year ago

msaroufim/aoFork

PyTorch native quantization and sparsity for training and inference

Python00Updated 1 year ago

Mark Saroufim

Organizations

Languages

Loading contributions...

Top Repositories

Repositories

Gists

Recent Activity