24 relations: Angular velocity, Artificial neural network, Atari 2600, Backpropagation, Convolution, Convolutional neural network, Deep learning, DeepMind, Deterministic system, Expected value, Function approximation, Game theory, Intelligent agent, Machine learning, Markov decision process, Peter Norvig, Prentice Hall, Probably approximately correct learning, Pseudocode, Reinforcement learning, State–action–reward–state–action, Stochastic process, Stuart J. Russell, Temporal difference learning.

## Angular velocity

In physics, the angular velocity of a particle is the rate at which it rotates around a chosen center point: that is, the time rate of change of its angular displacement relative to the origin.

New!!: Q-learning and Angular velocity · See more »

## Artificial neural network

Artificial neural networks (ANNs) or connectionist systems are computing systems vaguely inspired by the biological neural networks that constitute animal brains.

New!!: Q-learning and Artificial neural network · See more »

## Atari 2600

The Atari 2600 (or Atari Video Computer System before November 1982) is a home video game console from Atari, Inc. Released on September 11, 1977, it is credited with popularizing the use of microprocessor-based hardware and games contained on ROM cartridges, a format first used with the Fairchild Channel F in 1976.

New!!: Q-learning and Atari 2600 · See more »

## Backpropagation

Backpropagation is a method used in artificial neural networks to calculate a gradient that is needed in the calculation of the weights to be used in the network.

New!!: Q-learning and Backpropagation · See more »

## Convolution

In mathematics (and, in particular, functional analysis) convolution is a mathematical operation on two functions (f and g) to produce a third function, that is typically viewed as a modified version of one of the original functions, giving the integral of the pointwise multiplication of the two functions as a function of the amount that one of the original functions is translated.

New!!: Q-learning and Convolution · See more »

## Convolutional neural network

In machine learning, a convolutional neural network (CNN, or ConvNet) is a class of deep, feed-forward artificial neural networks, most commonly applied to analyzing visual imagery.

New!!: Q-learning and Convolutional neural network · See more »

## Deep learning

Deep learning (also known as deep structured learning or hierarchical learning) is part of a broader family of machine learning methods based on learning data representations, as opposed to task-specific algorithms.

New!!: Q-learning and Deep learning · See more »

## DeepMind

DeepMind Technologies Limited is a British artificial intelligence company founded in September 2010.

New!!: Q-learning and DeepMind · See more »

## Deterministic system

In mathematics, computer science and physics, a deterministic system is a system in which no randomness is involved in the development of future states of the system.

New!!: Q-learning and Deterministic system · See more »

## Expected value

In probability theory, the expected value of a random variable, intuitively, is the long-run average value of repetitions of the experiment it represents.

New!!: Q-learning and Expected value · See more »

## Function approximation

In general, a function approximation problem asks us to select a function among a well-defined class that closely matches ("approximates") a target function in a task-specific way.

New!!: Q-learning and Function approximation · See more »

## Game theory

Game theory is "the study of mathematical models of conflict and cooperation between intelligent rational decision-makers".

New!!: Q-learning and Game theory · See more »

## Intelligent agent

In artificial intelligence, an intelligent agent (IA) is an autonomous entity which observes through sensors and acts upon an environment using actuators (i.e. it is an agent) and directs its activity towards achieving goals (i.e. it is "rational", as defined in economics).

New!!: Q-learning and Intelligent agent · See more »

## Machine learning

Machine learning is a subset of artificial intelligence in the field of computer science that often uses statistical techniques to give computers the ability to "learn" (i.e., progressively improve performance on a specific task) with data, without being explicitly programmed.

New!!: Q-learning and Machine learning · See more »

## Markov decision process

Markov decision processes (MDPs) provide a mathematical framework for modeling decision making in situations where outcomes are partly random and partly under the control of a decision maker.

New!!: Q-learning and Markov decision process · See more »

## Peter Norvig

Peter Norvig (born December 14, 1956) is an American computer scientist.

New!!: Q-learning and Peter Norvig · See more »

## Prentice Hall

Prentice Hall is a major educational publisher owned by Pearson plc.

New!!: Q-learning and Prentice Hall · See more »

## Probably approximately correct learning

In computational learning theory, probably approximately correct learning (PAC learning) is a framework for mathematical analysis of machine learning.

New!!: Q-learning and Probably approximately correct learning · See more »

## Pseudocode

Pseudocode is an informal high-level description of the operating principle of a computer program or other algorithm.

New!!: Q-learning and Pseudocode · See more »

## Reinforcement learning

Reinforcement learning (RL) is an area of machine learning inspired by behaviourist psychology, concerned with how software agents ought to take actions in an environment so as to maximize some notion of cumulative reward.

New!!: Q-learning and Reinforcement learning · See more »

## State–action–reward–state–action

State–action–reward–state–action (Sarsa) is an algorithm for learning a Markov decision process policy, used in the reinforcement learning area of machine learning.

New!!: Q-learning and State–action–reward–state–action · See more »

## Stochastic process

--> In probability theory and related fields, a stochastic or random process is a mathematical object usually defined as a collection of random variables.

New!!: Q-learning and Stochastic process · See more »

## Stuart J. Russell

Stuart Jonathan Russell (born 1962) is a computer scientist known for his contributions to artificial intelligence.

New!!: Q-learning and Stuart J. Russell · See more »

## Temporal difference learning

Temporal difference (TD) learning refers to a class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate of the value function.

New!!: Q-learning and Temporal difference learning · See more »

## Redirects here:

Deep Q-learning, Q learning, Q-Learning.

## References

[1] https://en.wikipedia.org/wiki/Q-learning