강화학습

1.Reinforcement Learning - Multi-armed Bandits

post-thumbnail