16 relations: Artificial neural network, Atari 2600, Backgammon, Deep learning, Expected value, Function approximation, Game theory, Google DeepMind, Markov decision process, Peter Norvig, Prentice Hall, Probably approximately correct learning, Reinforcement learning, State-Action-Reward-State-Action, Stuart J. Russell, Temporal difference learning.

## Artificial neural network

In machine learning and cognitive science, artificial neural networks (ANNs) are a family of statistical learning models inspired by biological neural networks (the central nervous systems of animals, in particular the brain) and are used to estimate or approximate functions that can depend on a large number of inputs and are generally unknown.

New!!: Q-learning and Artificial neural network ·

## Atari 2600

The Atari 2600, or Atari VCS before 1982, is a home video game console released on September 11, 1977 by Atari, Inc.

New!!: Q-learning and Atari 2600 ·

## Backgammon

Backgammon is one of the oldest board games for two players.

New!!: Q-learning and Backgammon ·

## Deep learning

Deep learning (deep machine learning, or deep structured learning, or hierarchical learning, or sometimes DL) is a branch of machine learning based on a set of algorithms that attempt to model high-level abstractions in data by using model architectures, with complex structures or otherwise, composed of multiple non-linear transformations.

New!!: Q-learning and Deep learning ·

## Expected value

In probability theory, the expected value of a random variable is intuitively the long-run average value of repetitions of the experiment it represents.

New!!: Q-learning and Expected value ·

## Function approximation

The need for function approximations arises in many branches of applied mathematics, and computer science in particular.

New!!: Q-learning and Function approximation ·

## Game theory

Game theory is the study of strategic decision-making.

New!!: Q-learning and Game theory ·

## Google DeepMind

Google DeepMind is a British artificial intelligence company.

New!!: Q-learning and Google DeepMind ·

## Markov decision process

Markov decision processes (MDPs), named after Andrey Markov, provide a mathematical framework for modeling decision making in situations where outcomes are partly random and partly under the control of a decision maker.

New!!: Q-learning and Markov decision process ·

## Peter Norvig

Peter Norvig (born December 14, 1956) is an American computer scientist.

New!!: Q-learning and Peter Norvig ·

## Prentice Hall

Prentice Hall is a major educational publisher owned by Pearson PLC.

New!!: Q-learning and Prentice Hall ·

## Probably approximately correct learning

In computational learning theory, probably approximately correct learning (PAC learning) is a framework for mathematical analysis of machine learning.

New!!: Q-learning and Probably approximately correct learning ·

## Reinforcement learning

Reinforcement learning is an area of machine learning inspired by behaviorist psychology, concerned with how software agents ought to take actions in an environment so as to maximize some notion of cumulative reward.

New!!: Q-learning and Reinforcement learning ·

## State-Action-Reward-State-Action

State-Action-Reward-State-Action (SARSA) is an algorithm for learning a Markov decision process policy, used in the reinforcement learning area of machine learning.

New!!: Q-learning and State-Action-Reward-State-Action ·

## Stuart J. Russell

Stuart Jonathan Russell (born 1962) is a computer scientist known for his contributions to artificial intelligence.

New!!: Q-learning and Stuart J. Russell ·

## Temporal difference learning

Temporal difference (TD) learning is a prediction-based machine learning method.

New!!: Q-learning and Temporal difference learning ·