
The correlation coefficient is an index that describes the relationship and can take on values between −1.0 and +1.0, with a positive correlation coefficient indicating a positive correlation and a negative correlation coefficient indicating a negative correlation. While examining scatterplots gives us some idea about the relationship between two variables, we use a statistic called the correlation coefficient to give us a more precise measurement of the relationship between the two variables. However, if the points are far away from one another, and the imaginary oval is very wide, this means that there is a weak correlation between the variables (see below). If the points are close to one another and the width of the imaginary oval is small, this means that there is a strong correlation between the variables (see below).


If we drew an imaginary oval around all of the points on the scatterplot, we would be able to see the extent, or the magnitude, of the relationship. When examining scatterplots, we also want to look not only at the direction of the relationship (positive, negative, or zero), but also at the magnitude of the relationship. When all the points on a scatterplot lie on a straight line, you have what is called a perfect correlation between the two variables (see below).Ī scatterplot in which the points do not have a linear trend (either positive or negative) is called a zero correlation or a near-zero correlation (see below).Įngage NY, Module 6, Lesson 7, p 85 - - CC BY-NC This pattern means that when the score of one observation is high, we expect the score of the other observation to be low, and vice versa.Įngage NY, Module 6, Lesson 7, p 85 - - CC BY-NC When the points on a scatterplot graph produce a upper-left-to-lower-right pattern (see below), we say that there is a negative correlation between the two variables. This pattern means that when the score of one observation is high, we expect the score of the other observation to be high as well, and vice versa. When the points on a scatterplot graph produce a lower-left-to-upper-right pattern (see below), we say that there is a positive correlation between the two variables. In a scatterplot, each point represents a paired measurement of two variables for a specific subject, and each subject is represented by one point on the scatterplot.Ĭorrelation Patterns in Scatterplot GraphsĮxamining a scatterplot graph allows us to obtain some idea about the relationship between two variables.

Scatterplots display these bivariate data sets and provide a visual representation of the relationship between variables. In this case, there is a tendency for students to score similarly on both variables, and the performance between variables appears to be related. If we carefully examine the data in the example above, we notice that those students with high SAT scores tend to have high GPAs, and those with low SAT scores tend to have low GPAs. Can you think of other scenarios when we would use bivariate data? In our example above, we notice that there are two observations (verbal SAT score and GPA) for each subject (in this case, a student). Bivariate data are data sets in which each subject has two observations associated with it. Mathematicians seem to simply call these scenarios "non-linear" or "curvilinear" relationships, without seeming to notice that there are invariably two distinct relationships being identified by the data.\)īivariate Data, Correlation Between Values, and the Use of ScatterplotsĬorrelation measures the relationship between bivariate data. While I have always used the term "split" effect to describe such phenomenon, I have not been able to find this phenomenon acknowledged or identified (by any particular term) amongst economists or mathematicians.
#Scatter plots full
Thus, we often see two or more different effects express themselves through a full range of data. This is because at very high rates of taxation, people either lose interest in working, or they start to seek ways of hiding their income from the government. However, after a certain tax rate is reached, we start to see a new effect take place wherein the tax revenue drops off as the tax rate is increased further. I call this phenomenon a "split" effect.įor example, in the Laffer curve, we at first see the government raise more tax revenue as tax rates increase because they collect more money from citizens. However, sometimes one effect drops off and then a new effect takes over. In economics, we're always interested in identifying "effects" that take place between variables.

In Problem #3, illustrations A and B, you show something we see in economics quite a bit.
