
Hacker News: Front Page
shared a link post in group #Stream of Goodies

gdmarmerola.github.io
Introduction to Thompson Sampling: the Bernoulli bandit
Thompson Sampling is a simple yet effective method for addressing the exploration-exploitation dilemma in reinforcement/online learning. In this series of posts, I’ll introduce some applications o