GitHunt

Soichiro Nishimori

nissymori

PhD student. Interested in Game AI, JAX-based RL, offline RL and exploration.

The University of Tokyo
Tokyo, Japan

Languages

Python82%HTML9%Ruby9%

Loading contributions...

Top Repositories

Repositories

14
NI
nissymori/JAX-CORL

Clean single-file implementation of offline RL algorithms in JAX

Python1735Updated 3 months ago
awaccqld4rldecision-transformerflaxiqljaxoffline-reinforcement-learningoffline-rlreinforcement-learningsingle-filetd3bc
NI
nissymori/nissymori.github.io

No description provided.

HTML20Updated 1 week ago
NI
nissymori/mahjax

A GPU-Accelerated Mahjong Simulator for RL in JAX

Python232Updated 1 month ago
gamejaxmahjongmahjong-aireinforcement-learning
NI
nissymori/PUORL

No description provided.

Python50Updated 9 months ago
NI
nissymori/SymPO

No description provided.

Python70Updated 7 months ago
NI
nissymori/direct-preference-optimizationFork

Reference implementation for DPO (Direct Preference Optimization)

00Updated 1 year ago
NI
nissymori/rejaxFork

No description provided.

Python00Updated 1 year ago
NI
nissymori/SRPOFork

[NeurIPS 2023] The official code for paper "State Regularized Policy Optimization on Data with Dynamics Shift"

00Updated 2 years ago
NI
nissymori/td-gammonFork

TD-Gammon implementation

Python00Updated 2 years ago
NI
nissymori/D4RLFork

A collection of reference environments for offline reinforcement learning

Python00Updated 2 years ago
NI
nissymori/a2c-minatarFork

No description provided.

00Updated 2 years ago
NI
nissymori/CDAForkArchived

code for our EMNLP2020 paper: Multilevel Text Alignment with Cross-Document Attention by Xuhui Zhou, Nikolaos Pappas, and Noah A. Smith

Python00Updated 3 years ago
NI
nissymori/reinforceFork

A simple REINFORCE algorithm implementation in PyTorch

Python00Updated 3 years ago
NI
nissymori/mjaiFork

Game server for Japanese Mahjong AI.

Ruby00Updated 4 years ago

Gists

Recent Activity