Vlad

acforvs

Yandex

Languages

Python63%Jupyter Notebook26%C++11%

Repos

Stars

111

Forks

Top Language

Python

Loading contributions...

Top Repositories

multi-agent-pathfinding

Heuristic Search vs. Learning. "Distributed Heuristic Multi-Agent Path Finding with Communication" reproduced, trained & benchmarked with M*

27Python

dhc-robust-mapf

Learnable MAPF. “Distributed Heuristic Multi-Agent Path Finding with Communication” (DHC) algorithm from ICRA 2021 is implemented and benchmarked in out-of-distribution (OOD) scenarios. A new robust training loop to handle communication failures is introduced.

24Python

ysda-llm-scaling-week

Materials for the "Speeding up training with Triton and FP8" which were used for the https://llmscaling.yandex.com/en.

23Python

Cointegrated-Pairs-Trading

Algo trading strategy, entrance task to CMF, Quantitative Analytics program, 2021

8Python

talks

My public talks are presented here

Kaggle-In-house-classification

Kaggle classification contest report (in Russian)

4Jupyter Notebook

Repositories

acforvs/theoretical-context-parallel

No description provided.

Python00Updated 1 month ago

acforvs/dhc-robust-mapf

Python241Updated 2 years ago

deep-learningmulti-agent-pathfindingmulti-agent-reinforcement-learningpartially-observable-markov-decision-processpoetrypythonpytorchrayreinforcement-learning

acforvs/multi-agent-pathfinding

Heuristic Search vs. Learning. "Distributed Heuristic Multi-Agent Path Finding with Communication" reproduced, trained & benchmarked with M*

Python275Updated 3 years ago

icra2021multiagent-path-findingmultiagent-reinforcement-learningpoetry-pythonpytorchreinforcement-learning

acforvs/ysda-llm-scaling-week

Materials for the "Speeding up training with Triton and FP8" which were used for the https://llmscaling.yandex.com/en.

Python230Updated 4 months ago

acforvs/DeepGEMMFork

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

00Updated 9 months ago

acforvs/awac_iql

Offline to Online RL: AWAC & IQL PyTorch Implementation

Jupyter Notebook10Updated 2 years ago

awaciqloffline-rlpytorchreinforcement-learning

acforvs/ThunderKittensFork

Tile primitives for speedy kernels

00Updated 1 year ago

acforvs/DeepSpeedFork

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

00Updated 2 years ago

acforvs/deep-rl-classFork

This repo contain the syllabus of the Hugging Face Deep Reinforcement Learning Class.

Jupyter Notebook10Updated 3 years ago

acforvs/introtodeeplearningFork

Lab Materials for MIT 6.S191: Introduction to Deep Learning

Jupyter Notebook00Updated 4 years ago

acforvs/modded-nanogptFork

NanoGPT (124M) quality in 7.8 8xH100-minutes

Python00Updated 1 year ago

acforvs/TransPathFork

No description provided.

00Updated 2 years ago

acforvs/Cointegrated-Pairs-Trading

Algo trading strategy, entrance task to CMF, Quantitative Analytics program, 2021

Python81Updated 4 years ago

pandaspython3statsmodels

acforvs/starter-hugo-academicFork

🎓 Hugo Academic Theme 创建一个学术网站. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.

00Updated 3 years ago

acforvs/trlxFork

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python00Updated 2 years ago

acforvs/accelerateFork

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

00Updated 2 years ago

acforvs/text-generation-inferenceFork

Large Language Model Text Generation Inference

00Updated 2 years ago

acforvs/CodeTFFork

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

00Updated 2 years ago

acforvs/talks

My public talks are presented here

60Updated 3 years ago

acforvs/transformer

PyTorch implementation of the original transformer, from scratch

Python10Updated 2 years ago

attention-is-all-you-needpythonpython3pytorchtransformertransformer-pytorch

acforvs/DHCFork

Distributed Heuristic Multi-Agent Path Finding with Communication - ICRA 2021

10Updated 4 years ago

acforvs/optaxFork

Optax is a gradient processing and optimization library for JAX.

Python00Updated 2 years ago

acforvs/tauFork

Pipeline Parallelism for PyTorch

Python00Updated 3 years ago

acforvs/Gradient-Descent-Homework

Gradient Descent Homework for the ML Course @ SPbU

Jupyter Notebook30Updated 4 years ago

pandasplotlypython3sklearn

acforvs/Kaggle-In-house-classification

Kaggle classification contest report (in Russian)

Jupyter Notebook41Updated 4 years ago

catboostnumpypandasplotlypython3sklearn

acforvs/LeetCode-solutions

LeetCode solutions

C++40Updated 4 years ago

cppcpp17leetcode-cppleetcode-solutions

acforvs/open_spielFork

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++00Updated 3 years ago

acforvs/tests

No description provided.

Python00Updated 3 years ago

acforvs/mit-deep-learningFork

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

40Updated 4 years ago

acforvs/Decision-Tree

Decision Tree Implementation as a part of my ML hw @ SPbU

Python40Updated 4 years ago

numpypython3

Vlad

Languages

Loading contributions...

Top Repositories

Repositories

Gists

Recent Activity