
Pearson correlation coefficient


In statistics, the Pearson correlation coefficient (PCC), also referred to as Pearson's r, the Pearson product-moment correlation coefficient (PPMCC) or the bivariate correlation, is a measure of the linear correlation between two variables X and Y. Its value lies between −1 and +1, where +1 is total positive linear correlation, 0 is no linear correlation, and −1 is total negative linear correlation. [1]
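As an illustration of the definition above, the following is a minimal sketch of the sample coefficient, computed as the sum of products of deviations from the means divided by the square root of the product of the summed squared deviations. The function name `pearson_r` and the error handling are assumptions made for this example, not part of the source.

```python
import math

def pearson_r(xs, ys):
    """Sample Pearson correlation of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Deviations of each observation from its variable's mean.
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    sxy = sum(a * b for a, b in zip(dx, dy))
    sxx = sum(a * a for a in dx)
    syy = sum(b * b for b in dy)
    if sxx == 0 or syy == 0:
        # r is undefined when either variable has zero variance.
        raise ValueError("r is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)
```

Under these assumptions, a perfectly increasing linear relationship gives a value of 1 and a perfectly decreasing one gives −1, up to floating-point rounding.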

81 relations: Absolute value, Angle, Asymptotic distribution, Beta function, Bias of an estimator, Biometrika, Bootstrapping (statistics), Cauchy distribution, Cluster analysis, Coefficient of determination, Confidence interval, Consistent estimator, Correction for attenuation, Correlation and dependence, Cosine similarity, Covariance, Cumulative distribution function, Directional statistics, Distance correlation, Dot product, Efficiency (statistics), Euclidean vector, Exchangeable random variables, Expected value, Explained sum of squares, Fisher transformation, Francis Galton, Gamma function, Heavy-tailed distribution, Hypergeometric function, Independence (probability theory), Independent and identically distributed random variables, Invariant estimator, Inverse hyperbolic functions, Invertible matrix, Karl Pearson, Law of large numbers, Line (geometry), Marginal distribution, Maximal information coefficient, Maximum likelihood estimation, Mean, Mean of circular quantities, Moment (mathematics), Multiple correlation, Multivariate normal distribution, Negative relationship, Nonparametric statistics, Normal distribution, Normally distributed and uncorrelated does not imply independent, Null hypothesis, Numerical stability, One- and two-tailed tests, Outlier, P-value, Partial correlation, Percentile, Principal component analysis, Probability distribution, Quadrant count ratio, Resampling (statistics), Robust statistics, RV coefficient, Sample (statistics), Sampling distribution, Scatter plot, Simple linear regression, Sine, Spearman's rank correlation coefficient, Square root of a matrix, Standard deviation, Standard error, Standard score, Statistical hypothesis testing, Statistical population, Statistics, Student's t-distribution, TeX, Total sum of squares, Trigonometric functions, Variance.

Absolute value

In mathematics, the absolute value or modulus |x| of a real number x is the non-negative value of x without regard to its sign.

New!!: Pearson correlation coefficient and Absolute value · See more »


Angle

In plane geometry, an angle is the figure formed by two rays, called the sides of the angle, sharing a common endpoint, called the vertex of the angle.

New!!: Pearson correlation coefficient and Angle · See more »

Asymptotic distribution

In mathematics and statistics, an asymptotic distribution is a probability distribution that is in a sense the "limiting" distribution of a sequence of distributions.

New!!: Pearson correlation coefficient and Asymptotic distribution · See more »

Beta function

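In mathematics, the beta function, also called the Euler integral of the first kind, is a special function defined by $\mathrm{B}(x, y) = \int_0^1 t^{x-1} (1 - t)^{y-1} \, dt$ for $\operatorname{Re}(x) > 0$ and $\operatorname{Re}(y) > 0$.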

New!!: Pearson correlation coefficient and Beta function · See more »

Bias of an estimator

In statistics, the bias (or bias function) of an estimator is the difference between this estimator's expected value and the true value of the parameter being estimated.

New!!: Pearson correlation coefficient and Bias of an estimator · See more »


Biometrika

Biometrika is a peer-reviewed scientific journal published by Oxford University Press for the Biometrika Trust.

New!!: Pearson correlation coefficient and Biometrika · See more »

Bootstrapping (statistics)

In statistics, bootstrapping is any test or metric that relies on random sampling with replacement.
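In the context of the correlation coefficient, resampling with replacement can be sketched as follows: (x, y) pairs are drawn together, r is recomputed on each resample, and the empirical percentiles of those values form an interval. This is a minimal illustration using only the standard library; the names `pearson_r` and `bootstrap_ci`, the default of 2000 resamples, and the simple percentile rule are assumptions for this example.

```python
import math
import random

def pearson_r(xs, ys):
    """Sample Pearson correlation; returns None if either variable is constant."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return None
    return sxy / math.sqrt(sxx * syy)

def bootstrap_ci(xs, ys, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for r, resampling (x, y) pairs together."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    pairs = list(zip(xs, ys))
    estimates = []
    for _ in range(n_resamples):
        # Draw len(pairs) pairs with replacement from the observed pairs.
        sample = [rng.choice(pairs) for _ in pairs]
        r = pearson_r([p[0] for p in sample], [p[1] for p in sample])
        if r is not None:  # skip degenerate resamples where r is undefined
            estimates.append(r)
    if not estimates:
        raise ValueError("no usable resamples")
    estimates.sort()
    lower = estimates[int((alpha / 2) * len(estimates))]
    upper = estimates[int((1 - alpha / 2) * len(estimates)) - 1]
    return lower, upper
```

Resampling pairs rather than each variable separately preserves the dependence between X and Y, which is the quantity being estimated.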

New!!: Pearson correlation coefficient and Bootstrapping (statistics) · See more »

Cauchy distribution

The Cauchy distribution, named after Augustin Cauchy, is a continuous probability distribution.

New!!: Pearson correlation coefficient and Cauchy distribution · See more »

Cluster analysis

Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters).

New!!: Pearson correlation coefficient and Cluster analysis · See more »

Coefficient of determination

In statistics, the coefficient of determination, denoted R² or r² and pronounced "R squared", is the proportion of the variance in the dependent variable that is predictable from the independent variable(s).
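For the special case of simple linear regression with an intercept, the coefficient of determination equals the square of the Pearson correlation coefficient. The sketch below fits an ordinary least-squares line and computes R² as one minus the ratio of the residual sum of squares to the total sum of squares; the function name `r_squared_simple` is an assumption for this example.

```python
def r_squared_simple(xs, ys):
    """R^2 of the least-squares line y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0:
        raise ValueError("x must not be constant")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Residual sum of squares: variation left unexplained by the line.
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    # Total sum of squares: variation of y around its mean.
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    if ss_tot == 0:
        raise ValueError("y must not be constant")
    return 1 - ss_res / ss_tot
```

For the data x = 1, 2, 3, 4, 5 and y = 2, 1, 4, 3, 5, the Pearson coefficient is 0.8 and this function returns 0.64 up to rounding, matching r².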

New!!: Pearson correlation coefficient and Coefficient of determination · See more »

Confidence interval

In statistics, a confidence interval (CI) is a type of interval estimate, computed from the statistics of the observed data, that might contain the true value of an unknown population parameter.

New!!: Pearson correlation coefficient and Confidence interval · See more »

Consistent estimator

In statistics, a consistent estimator or asymptotically consistent estimator is an estimator (a rule for computing estimates of a parameter θ₀) having the property that as the number of data points used increases indefinitely, the resulting sequence of estimates converges in probability to θ₀.

New!!: Pearson correlation coefficient and Consistent estimator · See more »

Correction for attenuation

Correction for attenuation is a statistical procedure, due to Spearman (1904), to "rid a correlation coefficient from the weakening effect of measurement error" (Jensen, 1998), a phenomenon known as regression dilution.

New!!: Pearson correlation coefficient and Correction for attenuation · See more »

Correlation and dependence

In statistics, dependence or association is any statistical relationship, whether causal or not, between two random variables or bivariate data.

New!!: Pearson correlation coefficient and Correlation and dependence · See more »

Cosine similarity

Cosine similarity is a measure of similarity between two non-zero vectors of an inner product space that measures the cosine of the angle between them.
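The link to the Pearson coefficient is that r equals the cosine similarity of the two data vectors after each has been centered on its mean. The following is a small sketch of that relationship; the helper names `cosine_similarity` and `center` are assumptions for this example.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two non-zero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)

def center(v):
    """Subtract the mean from every component."""
    m = sum(v) / len(v)
    return [a - m for a in v]

# Pearson's r as the cosine similarity of the centered vectors.
x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]
r = cosine_similarity(center(x), center(y))
```

Unlike plain cosine similarity, the centered version is unchanged when a constant is added to either variable.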

New!!: Pearson correlation coefficient and Cosine similarity · See more »


Covariance

In probability theory and statistics, covariance is a measure of the joint variability of two random variables.

New!!: Pearson correlation coefficient and Covariance · See more »

Cumulative distribution function

In probability theory and statistics, the cumulative distribution function (CDF, also cumulative density function) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. In the case of a continuous distribution, it gives the area under the probability density function from minus infinity to x. Cumulative distribution functions are also used to specify the distribution of multivariate random variables.

New!!: Pearson correlation coefficient and Cumulative distribution function · See more »

Directional statistics

Directional statistics (also circular statistics or spherical statistics) is the subdiscipline of statistics that deals with directions (unit vectors in Rⁿ), axes (lines through the origin in Rⁿ) or rotations in Rⁿ.

New!!: Pearson correlation coefficient and Directional statistics · See more »

Distance correlation

In statistics and in probability theory, distance correlation or distance covariance is a measure of dependence between two paired random vectors of arbitrary, not necessarily equal, dimension.

New!!: Pearson correlation coefficient and Distance correlation · See more »

Dot product

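In mathematics, the dot product or scalar product is an algebraic operation that takes two equal-length sequences of numbers (usually coordinate vectors) and returns a single number. (The term scalar product is often also used more generally to mean a symmetric bilinear form, for example for a pseudo-Euclidean space.)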

New!!: Pearson correlation coefficient and Dot product · See more »

Efficiency (statistics)

In the comparison of various statistical procedures, efficiency is a measure of quality of an estimator, of an experimental design, or of a hypothesis testing procedure.

New!!: Pearson correlation coefficient and Efficiency (statistics) · See more »

Euclidean vector

In mathematics, physics, and engineering, a Euclidean vector (sometimes called a geometric or spatial vector, or—as here—simply a vector) is a geometric object that has magnitude (or length) and direction.

New!!: Pearson correlation coefficient and Euclidean vector · See more »

Exchangeable random variables

In statistics, an exchangeable sequence of random variables (also sometimes interchangeable) is a sequence such that future observations behave like earlier observations.

New!!: Pearson correlation coefficient and Exchangeable random variables · See more »

Expected value

In probability theory, the expected value of a random variable, intuitively, is the long-run average value of repetitions of the experiment it represents.

New!!: Pearson correlation coefficient and Expected value · See more »

Explained sum of squares

In statistics, the explained sum of squares (ESS), alternatively known as the model sum of squares or sum of squares due to regression ("SSR" – not to be confused with the residual sum of squares RSS), is a quantity used in describing how well a model, often a regression model, represents the data being modelled.

New!!: Pearson correlation coefficient and Explained sum of squares · See more »

Fisher transformation

In statistics, hypotheses about the value of the population correlation coefficient ρ between variables X and Y can be tested using the Fisher transformation (aka Fisher z-transformation) applied to the sample correlation coefficient.
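A common use of the transformation is an approximate confidence interval for ρ: the sample coefficient r is mapped to z = artanh(r), whose sampling distribution is approximately normal with standard error 1/√(n − 3) when the data are bivariate normal, and the interval endpoints are mapped back with tanh. The sketch below assumes that setting; the function name `fisher_ci` and the default critical value for a 95% interval are choices made for this example.

```python
import math

def fisher_ci(r, n, z_crit=1.959964):
    """Approximate confidence interval for rho via the Fisher z-transformation.

    Assumes bivariate normal data; z_crit is the standard normal quantile
    (about 1.96 for a two-sided 95% interval).
    """
    if not -1 < r < 1:
        raise ValueError("r must lie strictly between -1 and 1")
    if n <= 3:
        raise ValueError("need more than 3 observations")
    z = math.atanh(r)            # Fisher transformation of r
    se = 1 / math.sqrt(n - 3)    # approximate standard error of z
    lower = math.tanh(z - z_crit * se)
    upper = math.tanh(z + z_crit * se)
    return lower, upper
```

Because tanh maps the real line into (−1, 1), the resulting interval always stays within the valid range of a correlation and is generally asymmetric around r.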

New!!: Pearson correlation coefficient and Fisher transformation · See more »

Francis Galton

Sir Francis Galton, FRS (16 February 1822 – 17 January 1911) was an English Victorian era statistician, progressive, polymath, sociologist, psychologist, anthropologist, eugenicist, tropical explorer, geographer, inventor, meteorologist, proto-geneticist, and psychometrician.

New!!: Pearson correlation coefficient and Francis Galton · See more »

Gamma function

In mathematics, the gamma function (represented by Γ, the capital letter gamma from the Greek alphabet) is an extension of the factorial function, with its argument shifted down by 1, to real and complex numbers.

New!!: Pearson correlation coefficient and Gamma function · See more »

Heavy-tailed distribution

In probability theory, heavy-tailed distributions are probability distributions whose tails are not exponentially bounded: that is, they have heavier tails than the exponential distribution.

New!!: Pearson correlation coefficient and Heavy-tailed distribution · See more »

Hypergeometric function

In mathematics, the Gaussian or ordinary hypergeometric function ₂F₁(a,b;c;z) is a special function represented by the hypergeometric series, that includes many other special functions as specific or limiting cases.

New!!: Pearson correlation coefficient and Hypergeometric function · See more »

Independence (probability theory)

In probability theory, two events are independent, statistically independent, or stochastically independent if the occurrence of one does not affect the probability of occurrence of the other.

New!!: Pearson correlation coefficient and Independence (probability theory) · See more »

Independent and identically distributed random variables

In probability theory and statistics, a sequence or other collection of random variables is independent and identically distributed (i.i.d. or iid or IID) if each random variable has the same probability distribution as the others and all are mutually independent.

New!!: Pearson correlation coefficient and Independent and identically distributed random variables · See more »

Invariant estimator

In statistics, the concept of being an invariant estimator is a criterion that can be used to compare the properties of different estimators for the same quantity.

New!!: Pearson correlation coefficient and Invariant estimator · See more »

Inverse hyperbolic functions

In mathematics, the inverse hyperbolic functions are the inverse functions of the hyperbolic functions.

New!!: Pearson correlation coefficient and Inverse hyperbolic functions · See more »

Invertible matrix

In linear algebra, an n-by-n square matrix A is called invertible (also nonsingular or nondegenerate) if there exists an n-by-n square matrix B such that AB = BA = Iₙ, where Iₙ denotes the n-by-n identity matrix and the multiplication used is ordinary matrix multiplication.

New!!: Pearson correlation coefficient and Invertible matrix · See more »

Karl Pearson

Karl Pearson HFRSE LLD (originally named Carl; 27 March 1857 – 27 April 1936) was an English mathematician and biostatistician. He has been credited with establishing the discipline of mathematical statistics. He founded the world's first university statistics department at University College London in 1911, and contributed significantly to the field of biometrics, meteorology, theories of social Darwinism and eugenics. Pearson was also a protégé and biographer of Sir Francis Galton.

New!!: Pearson correlation coefficient and Karl Pearson · See more »

Law of large numbers

In probability theory, the law of large numbers (LLN) is a theorem that describes the result of performing the same experiment a large number of times.

New!!: Pearson correlation coefficient and Law of large numbers · See more »

Line (geometry)

The notion of line or straight line was introduced by ancient mathematicians to represent straight objects (i.e., having no curvature) with negligible width and depth.

New!!: Pearson correlation coefficient and Line (geometry) · See more »

Marginal distribution

In probability theory and statistics, the marginal distribution of a subset of a collection of random variables is the probability distribution of the variables contained in the subset.

New!!: Pearson correlation coefficient and Marginal distribution · See more »

Maximal information coefficient

In statistics, the maximal information coefficient (MIC) is a measure of the strength of the linear or non-linear association between two variables X and Y.

New!!: Pearson correlation coefficient and Maximal information coefficient · See more »

Maximum likelihood estimation

In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of a statistical model, given observations.

New!!: Pearson correlation coefficient and Maximum likelihood estimation · See more »


Mean

In mathematics, mean has several different definitions depending on the context.

New!!: Pearson correlation coefficient and Mean · See more »

Mean of circular quantities

In mathematics, a mean of circular quantities is a mean which is sometimes better-suited for quantities like angles, daytimes, and fractional parts of real numbers.

New!!: Pearson correlation coefficient and Mean of circular quantities · See more »

Moment (mathematics)

In mathematics, a moment is a specific quantitative measure, used in both mechanics and statistics, of the shape of a set of points.

New!!: Pearson correlation coefficient and Moment (mathematics) · See more »

Multiple correlation

In statistics, the coefficient of multiple correlation is a measure of how well a given variable can be predicted using a linear function of a set of other variables.

New!!: Pearson correlation coefficient and Multiple correlation · See more »

Multivariate normal distribution

In probability theory and statistics, the multivariate normal distribution or multivariate Gaussian distribution is a generalization of the one-dimensional (univariate) normal distribution to higher dimensions.

New!!: Pearson correlation coefficient and Multivariate normal distribution · See more »

Negative relationship

In statistics, there is a negative relationship or inverse relationship between two variables if higher values of one variable tend to be associated with lower values of the other.

New!!: Pearson correlation coefficient and Negative relationship · See more »

Nonparametric statistics

Nonparametric statistics is the branch of statistics that is not based solely on parameterized families of probability distributions (common examples of parameters are the mean and variance).

New!!: Pearson correlation coefficient and Nonparametric statistics · See more »

Normal distribution

In probability theory, the normal (or Gaussian or Gauss or Laplace–Gauss) distribution is a very common continuous probability distribution.

New!!: Pearson correlation coefficient and Normal distribution · See more »

Normally distributed and uncorrelated does not imply independent

In probability theory, two random variables being linearly uncorrelated does not imply their independence (however, for some measures of non-linear correlation such as the distance correlation, uncorrelated implies independent).

New!!: Pearson correlation coefficient and Normally distributed and uncorrelated does not imply independent · See more »

Null hypothesis

In inferential statistics, the term "null hypothesis" is a general statement or default position that there is no relationship between two measured phenomena, or no association among groups.

New!!: Pearson correlation coefficient and Null hypothesis · See more »

Numerical stability

In the mathematical subfield of numerical analysis, numerical stability is a generally desirable property of numerical algorithms.

New!!: Pearson correlation coefficient and Numerical stability · See more »

One- and two-tailed tests

In statistical significance testing, a one-tailed test and a two-tailed test are alternative ways of computing the statistical significance of a parameter inferred from a data set, in terms of a test statistic.

New!!: Pearson correlation coefficient and One- and two-tailed tests · See more »


Outlier

In statistics, an outlier is an observation point that is distant from other observations.

New!!: Pearson correlation coefficient and Outlier · See more »


P-value

In statistical hypothesis testing, the p-value or probability value or asymptotic significance is the probability for a given statistical model that, when the null hypothesis is true, the statistical summary (such as the sample mean difference between two compared groups) would be the same as or of greater magnitude than the actual observed results.

New!!: Pearson correlation coefficient and P-value · See more »

Partial correlation

In probability theory and statistics, partial correlation measures the degree of association between two random variables, with the effect of a set of controlling random variables removed.

New!!: Pearson correlation coefficient and Partial correlation · See more »


Percentile

A percentile (or a centile) is a measure used in statistics indicating the value below which a given percentage of observations in a group of observations fall.

New!!: Pearson correlation coefficient and Percentile · See more »

Principal component analysis

Principal component analysis (PCA) is a statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components.

New!!: Pearson correlation coefficient and Principal component analysis · See more »

Probability distribution

In probability theory and statistics, a probability distribution is a mathematical function that provides the probabilities of occurrence of different possible outcomes in an experiment.

New!!: Pearson correlation coefficient and Probability distribution · See more »

Quadrant count ratio

The quadrant count ratio (QCR) is a measure of the association between two quantitative variables.

New!!: Pearson correlation coefficient and Quadrant count ratio · See more »

Resampling (statistics)

In statistics, resampling is any of a variety of methods for doing one of the following.

New!!: Pearson correlation coefficient and Resampling (statistics) · See more »

Robust statistics

Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal.

New!!: Pearson correlation coefficient and Robust statistics · See more »

RV coefficient

In statistics, the RV coefficient is a multivariate generalization of the squared Pearson correlation coefficient (because the RV coefficient takes values between 0 and 1).

New!!: Pearson correlation coefficient and RV coefficient · See more »

Sample (statistics)

In statistics and quantitative research methodology, a data sample is a set of data collected and/or selected from a statistical population by a defined procedure.

New!!: Pearson correlation coefficient and Sample (statistics) · See more »

Sampling distribution

In statistics, a sampling distribution or finite-sample distribution is the probability distribution of a given random-sample-based statistic.

New!!: Pearson correlation coefficient and Sampling distribution · See more »

Scatter plot

A scatter plot (also called a scatterplot, scatter graph, scatter chart, scattergram, or scatter diagram) is a type of plot or mathematical diagram using Cartesian coordinates to display values for typically two variables for a set of data.

New!!: Pearson correlation coefficient and Scatter plot · See more »

Simple linear regression

In statistics, simple linear regression is a linear regression model with a single explanatory variable.

New!!: Pearson correlation coefficient and Simple linear regression · See more »


Sine

In mathematics, the sine is a trigonometric function of an angle.

New!!: Pearson correlation coefficient and Sine · See more »

Spearman's rank correlation coefficient

In statistics, Spearman's rank correlation coefficient or Spearman's rho, named after Charles Spearman and often denoted by the Greek letter ρ (rho) or as rₛ, is a nonparametric measure of rank correlation (statistical dependence between the rankings of two variables).
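Spearman's coefficient can be computed as the Pearson coefficient of the rank variables. The sketch below assigns average ranks to tied values and then applies the Pearson formula to the ranks; the names `average_ranks`, `pearson_r` and `spearman_rho` are assumptions for this example.

```python
import math

def average_ranks(values):
    """1-based ranks, with tied values sharing the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)

def spearman_rho(xs, ys):
    """Spearman's rho as the Pearson correlation of the ranks."""
    return pearson_r(average_ranks(xs), average_ranks(ys))
```

For a monotone but non-linear relationship such as y = x³ on positive x, the rank correlation is 1 while the Pearson coefficient of the raw values is below 1.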

New!!: Pearson correlation coefficient and Spearman's rank correlation coefficient · See more »

Square root of a matrix

In mathematics, the square root of a matrix extends the notion of square root from numbers to matrices.

New!!: Pearson correlation coefficient and Square root of a matrix · See more »

Standard deviation

In statistics, the standard deviation (SD, also represented by the Greek letter sigma σ or the Latin letter s) is a measure that is used to quantify the amount of variation or dispersion of a set of data values.

New!!: Pearson correlation coefficient and Standard deviation · See more »

Standard error

The standard error (SE) of a statistic (usually an estimate of a parameter) is the standard deviation of its sampling distribution or an estimate of that standard deviation.

New!!: Pearson correlation coefficient and Standard error · See more »

Standard score

In statistics, the standard score is the signed number of standard deviations by which the value of an observation or data point differs from the mean value of what is being observed or measured.
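Standard scores give another way to write the sample Pearson coefficient: it is the sum of the products of the paired standard scores divided by n − 1, provided the scores use the sample standard deviation. A minimal sketch with the standard library's `statistics` module follows; the function names `standard_scores` and `pearson_from_z` are assumptions for this example.

```python
import statistics

def standard_scores(values):
    """Standard scores using the sample mean and sample standard deviation."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample standard deviation (n - 1 denominator)
    if sd == 0:
        raise ValueError("standard scores are undefined for constant data")
    return [(v - mean) / sd for v in values]

def pearson_from_z(xs, ys):
    """Pearson's r as the averaged product of paired standard scores."""
    if len(xs) != len(ys):
        raise ValueError("sequences must have equal length")
    zx = standard_scores(xs)
    zy = standard_scores(ys)
    return sum(a * b for a, b in zip(zx, zy)) / (len(xs) - 1)
```

With x = 1, 2, 3, 4, 5 and y = 2, 1, 4, 3, 5 this yields 0.8 up to rounding, the same value the deviation-based formula gives.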

New!!: Pearson correlation coefficient and Standard score · See more »

Statistical hypothesis testing

A statistical hypothesis, sometimes called confirmatory data analysis, is a hypothesis that is testable on the basis of observing a process that is modeled via a set of random variables.

New!!: Pearson correlation coefficient and Statistical hypothesis testing · See more »

Statistical population

In statistics, a population is a set of similar items or events which is of interest for some question or experiment.

New!!: Pearson correlation coefficient and Statistical population · See more »


Statistics

Statistics is a branch of mathematics dealing with the collection, analysis, interpretation, presentation, and organization of data.

New!!: Pearson correlation coefficient and Statistics · See more »

Student's t-distribution

In probability and statistics, Student's t-distribution (or simply the t-distribution) is any member of a family of continuous probability distributions that arises when estimating the mean of a normally distributed population in situations where the sample size is small and population standard deviation is unknown.

New!!: Pearson correlation coefficient and Student's t-distribution · See more »


TeX

TeX is a typesetting system (or "formatting system") designed and mostly written by Donald Knuth and released in 1978.

New!!: Pearson correlation coefficient and TeX · See more »

Total sum of squares

In statistical data analysis the total sum of squares (TSS or SST) is a quantity that appears as part of a standard way of presenting results of such analyses.

New!!: Pearson correlation coefficient and Total sum of squares · See more »

Trigonometric functions

In mathematics, the trigonometric functions (also called circular functions, angle functions or goniometric functions) are functions of an angle.

New!!: Pearson correlation coefficient and Trigonometric functions · See more »


Variance

In probability theory and statistics, variance is the expectation of the squared deviation of a random variable from its mean.

New!!: Pearson correlation coefficient and Variance · See more »

Redirects here:

Bivariate correlation, PPMCC, Pearson Product Moment Correlation Coefficient, Pearson coefficient, Pearson correlation, Pearson product moment correlation coefficient, Pearson product-moment, Pearson product-moment correlation, Pearson product-moment correlation coefficient, Pearson product–moment correlation coefficient, Pearson r, Pearson's coefficient of correlation, Pearson's correlation, Pearson's correlation coefficient, Pearson's product, Pearson's r, Pearsonr, Pearson’s correlation coefficient, Pmcc, Product moment correlation coefficient, Product-moment correlation, Product-moment correlation coefficient.


[1] https://en.wikipedia.org/wiki/Pearson_correlation_coefficient
