Tag - rl

by _akhaliq on 2022-10-18 (UTC).

You Only Live Once: Single-Life Reinforcement Learning
abs: https://t.co/PG3tqv89DA pic.twitter.com/WrwzfA2Bg8
— AK (@_akhaliq) October 18, 2022

research rl

by ak92501 on 2022-03-08 (UTC).

The (Un)Surprising Effectiveness of Pre-Trained Vision Models for Control
abs: https://t.co/kFVZx80f2u pic.twitter.com/Tm723A7aqC
— AK (@ak92501) March 8, 2022

research rl cv

by ak92501 on 2021-09-01 (UTC).

WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU
pdf: https://t.co/6z9WZacJyk
abs: https://t.co/wUX0MThFGc pic.twitter.com/vR6V68Zdgv
— AK (@ak92501) September 1, 2021

rl tool research

by NandoDF on 2021-07-27 (UTC).

A nice multi-agent reinforcement learning library by @instadeepai including our old DIAL method @iassael @j_foerst @shimon8282 - built using Acme https://t.co/F2XmdcA7WM
— Nando de Freitas (@NandoDF) July 27, 2021

tool rl

by PythonWeekly on 2021-07-01 (UTC).

RLCard - A Toolkit for Reinforcement Learning in Card Games. https://t.co/UB6RGcCVdS #Python #CardGames pic.twitter.com/SHj50MJWob
— Python Weekly (@PythonWeekly) July 1, 2021

tool rl

by OriolVinyalsML on 2021-06-11 (UTC).

MuZero removed simulators in MBRL vs AlphaGo. VQ Models for Planning generalize to partial observable & stochastic environments. How?

1. Discretize states w/ VQVAE
2. Train a LM over states
3. Plan w/ MCTS using the LM

Led by @yazhe_li & @sherjilozair https://t.co/thvB6Ke1EA pic.twitter.com/tsXGcrweTZ
— Oriol Vinyals (@OriolVinyalsML) June 11, 2021

research rl
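
As a rough illustration of the first two steps listed in the tweet above (discretize states, then fit a sequence model over the discrete codes), here is a toy sketch in plain Python. It is not the paper's method: a fixed hand-written codebook stands in for a trained VQVAE, a bigram count table stands in for the language model, and the MCTS planning step is omitted. All names (`quantize`, `fit_bigram`, `most_likely_next`, `codebook`, `trajectories`) are hypothetical and chosen only for this example.

```python
from collections import Counter, defaultdict


def quantize(state, codebook):
    """Return the index of the codebook vector nearest to `state`.

    Assumption: squared Euclidean distance over plain tuples; a real
    VQVAE would learn both the encoder and the codebook.
    """
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(range(len(codebook)), key=lambda i: sq_dist(state, codebook[i]))


def fit_bigram(code_sequences):
    """Count code-to-code transitions (a stand-in for a learned sequence model)."""
    counts = defaultdict(Counter)
    for seq in code_sequences:
        for prev, nxt in zip(seq, seq[1:]):
            counts[prev][nxt] += 1
    return counts


def most_likely_next(counts, code):
    """Most frequent successor of `code`, or None if `code` was never followed by anything."""
    if code not in counts:
        return None
    return counts[code].most_common(1)[0][0]


# Hypothetical 2-D states and a three-entry codebook.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
trajectories = [
    [(0.1, 0.0), (0.9, 0.1), (0.1, 0.9)],
    [(0.0, 0.1), (1.1, 0.0), (0.0, 1.2)],
]

# Step 1: map every continuous state to a discrete code.
codes = [[quantize(s, codebook) for s in traj] for traj in trajectories]

# Step 2: fit the toy sequence model over those codes.
model = fit_bigram(codes)
```

A planner would then query the model for likely successor codes while searching; in this sketch, `most_likely_next(model, codes[0][0])` plays that role for a single step.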

by rsalakhu on 2021-04-10 (UTC).

New work on Efficient Transformers in RL using Actor-Learner Distillation:

Compressing online larger “Learner model” towards a tractable “Actor model” in distributed RL setting with partially-observable environments. https://t.co/jnExWiPabS

with E. Parisotto #ICLR2021 pic.twitter.com/kC5SSRsWrn
— Russ Salakhutdinov (@rsalakhu) April 10, 2021

research rl

by ericjang11 on 2021-04-08 (UTC).

Excellent example of low-resource ML research showing AlphaZero scaling laws on a single RTX 2080. https://t.co/MGgwzKJPJq
— Eric Jang 🇺🇸🇹🇼 (@ericjang11) April 8, 2021

research rl

by ak92501 on 2021-04-06 (UTC).

Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation
pdf: https://t.co/ncLWJgbf6d
abs: https://t.co/ozkZ6WgNRl pic.twitter.com/N2bJv4kd6j
— AK (@ak92501) April 6, 2021

research rl

by GoogleAI on 2021-03-19 (UTC).

Today we present an approach for iterative digital game design that leverages #MachineLearning to train agents as play-testers, allowing designers to efficiently balance a game and align it with their original vision. Learn more at https://t.co/eAC9UoPBN5 pic.twitter.com/HjqRMcnnez
— Google AI (@GoogleAI) March 19, 2021

research rl

by slashML on 2021-02-23 (UTC).

20 hours of new lectures on Deep Learning and Reinforcement Learning with lots of examples https://t.co/3cXv2E0rcd
— /MachineLearning (@slashML) February 23, 2021

rl learning video

by DeepMind on 2021-02-19 (UTC).

DreamerV2, a collaboration between DeepMind, @GoogleAI and the @UofT, is the first RL agent based on a world model to achieve human-level performance on the Atari benchmark. Read more ⬇️ https://t.co/lFFuHH2Uk9
— DeepMind (@DeepMind) February 19, 2021

research rl
