State–action–reward–state–action

State–action–reward–state–action (SARSA) is an algorithm for learning a policy for a Markov decision process, used in the reinforcement learning area of machine learning. [1]
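The name reflects the five quantities used in each update: the current state and action, the reward received, and the next state and action. A minimal sketch of one tabular update follows, assuming a dictionary-backed value table; the function name `sarsa_update`, the state and action labels, and the values of the step size `alpha` and discount `gamma` are illustrative choices, not part of the source.

```python
from collections import defaultdict


def sarsa_update(q, state, action, reward, next_state, next_action,
                 alpha=0.1, gamma=0.9):
    """Apply one SARSA update to the table q in place and return the new value.

    alpha (step size) and gamma (discount) are assumed example values.
    """
    # The target uses the action actually chosen in the next state (on-policy).
    td_target = reward + gamma * q[(next_state, next_action)]
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Hypothetical transition: in "s0" take "right", receive reward 1.0,
# land in "s1", and pick "right" again.
q = defaultdict(float)  # unseen (state, action) pairs start at 0.0
new_value = sarsa_update(q, "s0", "right", 1.0, "s1", "right")
```

In a full agent this update would run once per step, with the next action typically chosen by an exploratory policy such as epsilon-greedy.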

5 relations: Algorithm, Machine learning, Markov decision process, Q-learning, Reinforcement learning.

Algorithm

In mathematics and computer science, an algorithm is an unambiguous specification of how to solve a class of problems.

Machine learning

Machine learning is a subset of artificial intelligence in the field of computer science that often uses statistical techniques to give computers the ability to "learn" (i.e., progressively improve performance on a specific task) with data, without being explicitly programmed.

Markov decision process

Markov decision processes (MDPs) provide a mathematical framework for modeling decision making in situations where outcomes are partly random and partly under the control of a decision maker.
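As a rough illustration of "partly random and partly under the control of a decision maker", the sketch below encodes a tiny MDP as a lookup table: the decision maker picks the action, and the environment samples the outcome. The two states, the actions, the probabilities, and the rewards are all made up for this example.

```python
import random

# Hypothetical two-state MDP.
# transitions[state][action] -> list of (probability, next_state, reward)
transitions = {
    "s0": {"stay": [(1.0, "s0", 0.0)],
           "go": [(0.8, "s1", 1.0), (0.2, "s0", 0.0)]},
    "s1": {"stay": [(1.0, "s1", 0.0)],
           "go": [(1.0, "s0", 0.0)]},
}


def step(state, action, rng):
    """Sample (next_state, reward) for the chosen action."""
    outcomes = transitions[state][action]
    weights = [p for p, _, _ in outcomes]
    _, next_state, reward = rng.choices(outcomes, weights=weights, k=1)[0]
    return next_state, reward


rng = random.Random(0)  # seeded so repeated runs sample the same outcomes
next_state, reward = step("s0", "go", rng)
```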

Q-learning

Q-learning is a reinforcement learning technique used in machine learning.
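Q-learning is listed as a relation because its update differs from SARSA's in one term: the target bootstraps from the highest-valued next action rather than from the action the agent actually takes next. The sketch below compares the two targets on a hand-made value table; the function names, the table entries, and the discount `gamma` are assumptions chosen for illustration.

```python
def sarsa_target(q, reward, next_state, next_action, gamma=0.9):
    """On-policy target: uses the action actually selected next."""
    return reward + gamma * q[(next_state, next_action)]


def q_learning_target(q, reward, next_state, actions, gamma=0.9):
    """Off-policy target: uses the best available next action."""
    return reward + gamma * max(q[(next_state, a)] for a in actions)


# Hypothetical value table for the next state "s1".
q = {("s1", "left"): 0.0, ("s1", "right"): 2.0}
actions = ["left", "right"]

# Suppose the agent explores and picks "left" next, although "right" looks better.
sarsa_t = sarsa_target(q, 1.0, "s1", "left")
ql_t = q_learning_target(q, 1.0, "s1", actions)
```

Here the SARSA target is lower than the Q-learning target because it accounts for the exploratory action actually taken.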

Reinforcement learning

Reinforcement learning (RL) is an area of machine learning inspired by behaviourist psychology, concerned with how software agents ought to take actions in an environment so as to maximize some notion of cumulative reward.

Redirects here:

State-Action-Reward-State-Action.

References

[1] https://en.wikipedia.org/wiki/State–action–reward–state–action
