November 15, 2019

162 words 1 min read

seungeunrho/minimalRL

seungeunrho/minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)


repo name	seungeunrho/minimalRL
repo link	https://github.com/seungeunrho/minimalRL
homepage
language	Python
size (curr.)	54 kB
stars (curr.)	1329
created	2019-04-23
license	MIT License

minimalRL-pytorch

Implementations of basic RL algorithms with minimal lines of codes! (PyTorch based)

Each algorithm is complete within a single file.
Length of each file is up to 100~150 lines of codes.
Every algorithm can be trained within 30 seconds, even without GPU.
Envs are fixed to “CartPole-v1”. You can just focus on the implementations.

Algorithms

REINFORCE (67 lines)
Vanilla Actor-Critic (98 lines)
DQN (112 lines, including replay memory and target network)
PPO (119 lines, including GAE)
DDPG (147 lines, including OU noise and soft target update)
A3C (129 lines)
ACER (149 lines)
A2C added! (188 lines)
Any suggestion ..?

Dependencies

PyTorch
OpenAI GYM

Usage

# Works only with Python 3.
# e.g.
python3 REINFORCE.py
python3 actor_critic.py
python3 dqn.py
python3 ppo.py
python3 ddpg.py
python3 a3c.py
python3 a2c.py
python3 acer.py

python pytorch algorithm code

comments powered by Disqus

amzn/metalearn-leap

amzn/metalearn-leap

November 10, 2019

Original PyTorch implementation of the Leap meta-learner (https://arxiv.org/abs/1812.01054) along with code for running the Omniglot experiment presented in the paper.

ycszen/TorchSeg

ycszen/TorchSeg

September 16, 2019

Fast, modular reference implementation and easy training of Semantic Segmentation algorithms in PyTorch.

p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch

p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch

September 9, 2019

PyTorch implementations of deep reinforcement learning algorithms and environments

juefeix/pnn.pytorch.update

juefeix/pnn.pytorch.update

July 27, 2019

This repo houses the new PNN code, along with our responses to the issue raised in the recent Reddit discussion. The code is based on Michael Klachkos repo with slight modification in model.py and main.py. All changes are marked.

facebookresearch/maskrcnn-benchmark

facebookresearch/maskrcnn-benchmark

July 2, 2019

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

codertimo/BERT-pytorch

codertimo/BERT-pytorch

June 30, 2019

Google AI 2018 BERT pytorch implementation

karpathy/pytorch-made

karpathy/pytorch-made

April 15, 2019

MADE (Masked Autoencoder Density Estimation) implementation in PyTorch

salesforce/matchbox

salesforce/matchbox

April 8, 2019

Write PyTorch code at the level of individual examples, then run it efficiently on minibatches.

asvcode/Vision_UI

asvcode/Vision_UI

November 15, 2019

UI visual interface for fastai - now compatible with Google Colab