⚙️ The First Reinforcement Learners

Before Q-learning, there was the Soviet “self-learning automaton.

Dec 16, 2025

∙ Paid

When people talk about reinforcement learning, they usually start the story with Richard Sutton and Andrew Barto in the 1980s — or maybe with Q-learning in the 1990s.

But if you roll the timeline back 30 years and shift the map east, you’ll find something remarkable happening in Moscow, at the Institute of Automation and Remote Control (IARC).

In 1959, a …

Continue reading this post for free, courtesy of Valeriy Manokhin.

Or purchase a paid subscription.

Valeriy’s Substack

⚙️ The First Reinforcement Learners

Before Q-learning, there was the Soviet “self-learning automaton.

Continue reading this post for free, courtesy of Valeriy Manokhin.