Page No 104:
The unit of correlation coefficient between height in feet and weight in kgs is
As there is non-existent of correlation between the height in feet and weight in kilograms, so the unit of correlation between the two is zero.
The range of simple correlation coefficient is
(i) 0 to infinity
(ii) minus one to plus one
(iii) minus infinity to infinity
The range of simple correlation coefficient is from (â€“) 1 to (+) 1
If rxy is positive the relation between X and Y is of the type
(i) When Y increases X increases
(ii) When Y decreases X increases
(iii) When Y increases X does not change
When the variables Y and X share positive relationship (i.e. when Y and X both increases simultaneously), then the value of rxy is positive.
Page No 105:
If rxy = 0 the variable X and Y are
(i) linearly related
(ii) not linearly related
The value of rxy becomes 0 when the two variables are not linearly related to each other. It may happen that both the variables may be non-linearly related to each other. It does not necessarily imply that both are independent of each other.
Of the following three measures which can measure any type of relationship
(i) Karl Pearsonâ€™s coefficient of correlation
(ii) Spearmanâ€™s rank correlation
(iii) Scatter diagram
Scatter diagram can measure any type of relationship whether the variables are highly related or not at all related. Just by looking at the diagram, the viewer can easily conclude the relationship between the two variables involved. On the other hand, Karl Pearsonâ€™s coefficient of correlation is not suitable for the series where deviations are calculated from assumed mean. Likewise, Spearmanâ€™s rank correlation also disqualifies to measure any kind of relationship as its domain is restricted only to the qualitative variables (leaving quantitative variables).
If precisely measured data are available the simple correlation coefficient is
(i) more accurate than rank correlation coefficient
(ii) less accurate than rank correlation coefficient
(iii) as accurate as the rank correlation coefficient
Generally, all the properties of Karl Pearsonâ€™s coefficient of correlation are similar to that of the rank correlation coefficient. However, rank correlation coefficient is generally lower or equal to Karl Pearsonâ€™s coefficient. The reason for this difference between the two coefficients is because the rank correlation coefficient uses ranks instead of the full set of observations that leads to some loss of information. If the precisely measured data are available, then both the coefficients will be identical.
Why is r preferred to covariance as a measure of association?
Although correlation coefficient is similar to the covariance in a manner that both measure the degree of linear relationship between two variables, but the former is generally preferred to covariance due to the following reasons.
- The value of the correlation coefficient (r) lies between 0 and 1. Symbolically â€“1 â‰¤ r â‰¤ +1
- The correlation coefficient is scale free.
Can r lie outside the â€“1 and 1 range depending on the type of data?
No, the value of r cannot lie outside the range of â€“1 to 1. If r = â€“ 1, then there exists perfect negative correlation and if r = 1, then there exists perfect positive correlation between the two variables. If at any point of time the calculated value of r is outside this range, then there must be some mistake committed in the calculation.
Does correlation imply causation?
No, correlation does not imply causation. The correlation between the two variables does not imply that one variable causes the other. In other words, cause and effect relationship is not a prerequisite for the correlation. Correlation only measures the degree and intensity of the relationship between the two variables, but surely not the cause and effect relationship between them.
When is rank correlation more precise than simple correlation coefficient?
Rank Correlation method is more precise than simple correlation coefficient when the variables cannot be measured quantitatively. In other words, rank correlation method measures the correlation between the two qualitative variables. These variable attributes are given the ranks on the basis of preferences. For example, selecting the best candidate in a dance competition depends on the ranks and preferences awarded to him/her by the judges. Secondly, the rank correlation method is preferred over the simple correlation coefficient when extreme values are present in the data. In such case using simple correlation coefficient may be misleading.
Does zero correlation mean independence?
Correlation measures the linear relationship between the two variables. So, r being 0 implies the absence of linear relationship. But they may be non-linearly related. Hence, if two variables are not correlated, it does not necessarily follow that they are independent.
Can simple correlation coefficient measure any type of relationship?
No, the simple correlation coefficient cannot measure any type of relationship. The simple correlation coefficient can measure only the direction and magnitude of linear relationship between the two variables. It cannot measure non-linear relationship like quadratic, trigonometric, cubic, etc. Therefore, in such cases, the purview of simple correlation coefficient falls short. For example, the simple correlation coefficient may depict that X and Y are not correlated in the equation X= Y2, hence it may be concluded that both the variables are independent, but such conclusion may be wrong.
Collect the price of five vegetables from your local market every day for a week. Calculate their correlation coefficients. Interpret the result.
This question is about multivariate correlation that is out of syllabus
Measure the height of your classmates. Ask them the height of their benchmate. Calculate the correlation coefficient of these two variables. Interpret the result.
|Height of Classmate
|Height of Benchmate
List some variables where accurate measurement is difficult.
The following are the some variables where the accurate measurement is difficult.
- Temperature and number of people falling ill.
- Change in temperature with the height of mountain.
- Low rainfall and agricultural productivity
- High population growth and degree of poverty
- Number of tourists and change in the political atmosphere in India.
Interpret the values of r as 1, â€“1 and 0.
The value of r being 1 implies that there is a perfect positive correlation between the two variables involved. A high value of r (i.e. close to 1) represents a strong positive linear relationship between the two variables.
If r = â€“1, then the correlation is perfectly negative. A negative value of r indicates an inverse relation. A low value of r (i.e. close to â€“1) represents a strong negative linear relationship between the variables. On the other hand, if the value of r = 0, then it implies that the two variables are uncorrelated to each other. But this should not be misunderstood as the variables are independent of each other. The value of r equals zero confirms only the non-existence of any linear relation but the variables may be non-linearly related to each other.
Why does rank correlation coefficient differ from Pearsonian correlation coefficient?
Generally, all the properties of Karl Pearsonâ€™s coefficient of correlation are similar to that of the rank correlation coefficient. However, rank correlation coefficient is generally lower or equal to Karl Pearsonâ€™s coefficient. Rank correlation coefficient is generally preferred to measure the correlation between the two qualitative variables. These variable attributes are given the ranks on the basis of preferences. The difference between the two coefficients is due to the fact that the rank correlation coefficient uses ranks instead of the full set of observations that leads to some loss of information. If the precisely measured data are available, then both the coefficients will be identical. Secondly, if extreme values are present in the data, then the rank correlation coefficient is more precise and reliable and consequently its value differs from that of the Karl Pearsonâ€™s coefficient.
Calculate the correlation coefficient between the heights of fathers in inches (X) and their sons (Y)
Note: As per textbook, correlation coefficient is 0.603. However, as per the above solution, correlation coefficient should be 0.44.
Calculate the correlation coefficient between X and Y and comment on their relationship:
As the value of r is zero, so there is no linear correlation between X and Y
Page No 106:
Calculate the correlation coefficient between X and Y and comment on their relationship
As the correlation coefficient between the two variables is +1, so the two variables are perfectly positive correlated.