Measures how two variables vary with each other.

Given two variables and :

For two samples of data:

What it tells us

  • Positive tells us that when one variable goes up, the other variable tends to go up
  • Negative tells us that as one variable goes up, the other tends to go down
  • Zero tells us that there is no linear relationship between the two variables

It DOES NOT tell us if two variables are connected in some way, its just used to make a simple observation.

Limitations

Covariance can be heavily skewed if the two variables have different scales. To counteract this, you can either normalize the data (partially the reasoning behind Normalization techniques), or compute Correlation. #probability