paint-brush
Reinforcement Learning - The Value Functionby@jingles
426 reads
426 reads

Reinforcement Learning - The Value Function

by Hong Jing (Jingles)6mAugust 16th, 2019
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

The value function is an efficient way to determine the value of being in a state. In a game of tic-tac-toe, getting 2 Xs in a row does not win the game, hence there is no reward. The value of state A is the sum of all next states’ probability multiplied by the reward for reaching that state A. In this case, a state A has a chance of winning the game by placing it at the top of a row. A state D is a state D with only 1 possible route to state E, since the only outcome is to receive the reward.

Company Mentioned

Mention Thumbnail
featured image - Reinforcement Learning - The Value Function
Hong Jing (Jingles) HackerNoon profile picture
Hong Jing (Jingles)

Hong Jing (Jingles)

@jingles

A data scientist who also enjoy developing products on the Web.

About @jingles
LEARN MORE ABOUT @JINGLES'S
EXPERTISE AND PLACE ON THE INTERNET.
L O A D I N G
. . . comments & more!

About Author

Hong Jing (Jingles) HackerNoon profile picture
Hong Jing (Jingles)@jingles
A data scientist who also enjoy developing products on the Web.

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite
Also published here
Aitopics
Coffee-web
Learnrepo
Its401