GitHunt

Driss Guessous

drisspg

@pytorch core

@facebook
Los Angeles

Languages

Python57%C++14%HTML5%Lean5%Cuda5%TypeScript5%Rust5%CMake5%

Loading contributions...

Top Repositories

Repositories

62
DR
drisspg/torcho3

Rust-based GPU kernel library with PyTorch bindings via cutile-rs + PyO3

Python00Updated 21 hours ago
DR
drisspg/torch-custom-ops-cookiecutter

No description provided.

Python00Updated 21 hours ago
DR
drisspg/drisspg

No description provided.

10Updated 22 hours ago
DR
drisspg/vizz

No description provided.

Python00Updated 22 hours ago
DR
drisspg/pt_job_queue

PyTorch Job Queue — dispatch agents to fix PyTorch issues

Python52Updated 2 days ago
DR
drisspg/transformer_nuggets

A place to store reusable transformer components of my own creation or found on the interwebs

Python7512Updated 5 days ago
DR
drisspg/flash-attentionFork

Fast and memory-efficient exact attention

Python00Updated 3 days ago
DR
drisspg/stack-prFork

A tool for working with stacked PRs on github.

00Updated 1 week ago
DR
drisspg/simple_cuda

Learnings + Exercises from the PMPP book!

C++110Updated 2 weeks ago
DR
drisspg/flex-flash-blog

No description provided.

HTML70Updated 1 week ago
DR
drisspg/osdcFork

Open Source Developer Cloud

00Updated 1 week ago
DR
drisspg/lean_ua

Lean 4 formalizations of proofs from Stephen Abbott's Understanding Analysis textbook

Lean80Updated 9 months ago
DR
drisspg/quackFork

A Quirky Assortment of CuTe Kernels

00Updated 3 days ago
DR
drisspg/pt2-triageFork

Claude code skills for PT2 triage

00Updated 1 month ago
DR
drisspg/pytorchFork

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python11Updated 1 month ago
DR
drisspg/vllmFork

A high-throughput and memory-efficient inference and serving engine for LLMs

Python00Updated 3 months ago
DR
drisspg/lintrunnerFork

No description provided.

00Updated 3 months ago
DR
drisspg/driss_torch

Cuda extensions for PyTorch

Cuda122Updated 3 months ago
DR
drisspg/nuggets

No description provided.

TypeScript00Updated 4 months ago
DR
drisspg/tlparseFork

TORCH_LOGS parser for PT2

Rust00Updated 1 month ago
DR
drisspg/FBGEMMFork

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

00Updated 6 months ago
DR
drisspg/simple_cpp

No description provided.

CMake00Updated 7 months ago
DR
drisspg/attention-gym

Helpful tools and examples for working with flex-attention

Python30Updated 1 year ago
DR
drisspg/tritonbenchFork

Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.

00Updated 11 months ago
DR
drisspg/triton_differ

No description provided.

Python00Updated 1 year ago
DR
drisspg/cutlassFork

CUDA Templates for Linear Algebra Subroutines

C++00Updated 3 months ago
DR
drisspg/sglangFork

SGLang is a fast serving framework for large language models and vision language models.

00Updated 1 year ago
DR
drisspg/lit-gptFork

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python11Updated 2 years ago
DR
drisspg/tritonFork

Development repository for the Triton language and compiler

C++00Updated 3 weeks ago
DR
drisspg/nanoGPTFork

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python00Updated 1 year ago

Gists

Recent Activity