Driss Guessous
drisspg
@pytorch core
Top Repositories
A place to store reusable transformer components of my own creation or found on the interwebs
Cuda extensions for PyTorch
Learnings + Exercises from the PMPP book!
Lean 4 formalizations of proofs from Stephen Abbott's Understanding Analysis textbook
PyTorch Job Queue — dispatch agents to fix PyTorch issues
Repositories
Rust-based GPU kernel library with PyTorch bindings via cutile-rs + PyO3
PyTorch Job Queue — dispatch agents to fix PyTorch issues
A place to store reusable transformer components of my own creation or found on the interwebs
Fast and memory-efficient exact attention
A tool for working with stacked PRs on GitHub.
Learnings + Exercises from the PMPP book!
Open Source Developer Cloud
Lean 4 formalizations of proofs from Stephen Abbott's Understanding Analysis textbook
A Quirky Assortment of CuTe Kernels
Claude code skills for PT2 triage
Tensors and Dynamic neural networks in Python with strong GPU acceleration
A high-throughput and memory-efficient inference and serving engine for LLMs
Cuda extensions for PyTorch
TORCH_LOGS parser for PT2
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Helpful tools and examples for working with flex-attention
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
CUDA Templates for Linear Algebra Subroutines
SGLang is a fast serving framework for large language models and vision language models.
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Development repository for the Triton language and compiler
The simplest, fastest repository for training/finetuning medium-sized GPTs.