paint-brush
Contextual Multi-Armed Bandit Problems in Reinforcement Learningby@teenl0ve
5,370 reads
5,370 reads

Contextual Multi-Armed Bandit Problems in Reinforcement Learning

by Valentine ShkulovJuly 29th, 2023
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

This article dives into context-based multi-armed bandit problems in reinforcement learning, where the reward depends on the context. We discussed and implemented three different algorithms: LinUCB, Decision Trees, and Neural Networks to solve these problems, highlighting their unique strengths and considerations. Although we didn't explicitly compare their performance, we emphasized the importance of choosing the right approach based on the characteristics of the problem at hand.
featured image - Contextual Multi-Armed Bandit Problems in Reinforcement Learning
Valentine Shkulov HackerNoon profile picture
Valentine Shkulov

Valentine Shkulov

@teenl0ve

L O A D I N G
. . . comments & more!

About Author

Valentine Shkulov HackerNoon profile picture
Valentine Shkulov@teenl0ve

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite