How can correlation be used




















Misuse of correlation is so common that some statisticians have wished that the method had never been devised. In statistical terms, correlation is a method of assessing a possible two-way linear association between two continuous variables. If the coefficient is a positive number, the variables are directly related i. If, on the other hand, the coefficient is a negative number, the variables are inversely related i. To emphasise this point, a mathematical relationship does not necessarily mean that there is correlation.

In statistical terms, it is inappropriate to say that there is correlation between x and y. This is so because, although there is a relationship, the relationship is not linear over this range of the specified values of x. Hence, it would be inconsistent with the definition of correlation and it cannot therefore be said that x is correlated with y. There are two main types of correlation coefficients: Pearson's product moment correlation coefficient and Spearman's rank correlation coefficient.

The correct usage of correlation coefficient type depends on the types of variables being studied. We will focus on these two correlation types; other types are based on these and are often used when multiple variables are being considered. It is used when both variables being studied are normally distributed. This coefficient is affected by extreme values, which may exaggerate or dampen the strength of relationship, and is therefore inappropriate when either or both variables are not normally distributed.

For a correlation between variables x and y, the formula for calculating the sample Pearson's correlation coefficient is given by 3.

It is appropriate when one or both variables are skewed or ordinal 1 and is robust when extreme values are present. For a correlation between variables x and y, the formula for calculating the sample Spearman's correlation coefficient is given by.

The distinction between Pearson's and Spearman's correlation coefficients in applications will be discussed using examples below. The data depicted in figures 1 — 4 were simulated from a bivariate normal distribution of observations with means 2 and 3 for the variables x and y respectively. The standard deviations were 0. Scatter plots were generated for the correlations 0. In Fig. The trend in Fig.

That is, the higher the correlation in either direction positive or negative , the more linear the association between two variables and the more obvious the trend in a scatter plot. In Figure 3 , the values of y increase as the values of x increase while in figure 4 the values of y decrease as the values of x increase.

Simple application of the correlation coefficient can be exemplified using data from a sample of women attending their first antenatal clinic ANC visits. We can expect a positive linear relationship between maternal age in years and parity because parity cannot decrease with age, but we cannot predict the strength of this relationship.

The task is one of quantifying the strength of the association. That is, we are interested in the strength of relationship between the two variables rather than direction since direction is obvious in this case. Maternal age is continuous and usually skewed while parity is ordinal and skewed. With these scales of measurement for the data, the appropriate correlation coefficient to use is Spearman's. The Spearman's coefficient is 0. In this case, maternal age is strongly correlated with parity, i.

The Pearson's correlation coefficient for these variables is 0. In this case the two correlation coefficients are similar and lead to the same conclusion, however in some cases the two may be very different leading to different statistical conclusions. Correlation coefficients are usually found for two variables at a time, but you can use a multiple correlation coefficient for three or more variables.

With a regression analysis , you can predict how much a change in one variable will be associated with a change in the other variable. The result is a regression equation that describes the line on a graph of your variables.

You can use this equation to predict the value of one variable based on the given value s of the other variable s. If two variables are correlated, it could be because one of them is a cause and the other is an effect. A confounding variable is a third variable that influences other variables to make them seem causally related even though they are not. Instead, there are separate causal links between the confounder and each variable. Even if you statistically control for some potential confounders, there may still be other hidden variables that disguise the relationship between your study variables.

There are many other variables that may influence both variables, such as average income, working conditions, and job insecurity. A correlational research design investigates relationships between two variables or more without the researcher controlling or manipulating any of them.

Controlled experiments establish causality, whereas correlational studies only show associations between variables. In general, correlational research is high in external validity while experimental research is high in internal validity. A correlation is usually tested for two variables at a time, but you can test correlations between three or more variables.

A correlation coefficient is a single number that describes the strength and direction of the relationship between your variables. Different types of correlation coefficients might be appropriate for your data based on their levels of measurement and distributions. Have a language expert improve your writing. Check your paper for plagiarism in 10 minutes. Do the check. Generate your APA citations for free! APA Citation Generator.

Home Knowledge Base Methodology An introduction to correlational research. An introduction to correlational research Published on July 7, by Pritha Bhandari.

Positive correlation Both variables change in the same direction As height increases, weight also increases Negative correlation The variables change in opposite directions As coffee consumption increases, tiredness decreases Zero correlation There is no relationship between the variables Coffee consumption is not correlated with height Table of contents Correlational vs experimental research When to use correlational research How to collect correlational data How to analyze correlational data Correlation and causation Frequently asked questions about correlational research.

Here's why students love Scribbr's proofreading services Trustpilot. What is a correlation? A positive correlation means that both variables change in the same direction. For one, the data is not always reliable—particularly if the survey questions are poorly written or the overall design or delivery is weak. The use of surveys relies on participants to provide useful data. Researchers need to be aware of the specific factors related to the people taking the survey that will affect its outcome.

For example, some people might struggle to understand the questions. A person might answer a particular way to try to please the researchers or to try to control how the researchers perceive them such as trying to make themselves "look better". Sometimes, respondents might not even realize that their answers are incorrect or misleading because of mistaken memories. Many areas of psychological research benefit from analyzing studies that were conducted long ago by other researchers, as well as reviewing historical records and case studies.

For example, in an experiment known as "The Irritable Heart," researchers used digitalized records containing information on American Civil War veterans to learn more about post-traumatic stress disorder PTSD. Using records, databases, and libraries that are publically accessible or accessible through their institution can help researchers who might not have a lot of money to support their research efforts.

Free and low-cost resources are available to researchers at all levels through academic institutions, museums, and data repositories around the world. Another potential benefit is that these sources often provide an enormous amount of data that was collected over a very long period of time, which can give researchers a way to view trends , relationships, and outcomes related to their research.

While the inability to change variables can be a disadvantage of some methods, it can be a benefit of archival research.

That said, using historical records or information that was collected a long time ago also presents challenges. For one, important information might be missing or incomplete and some aspects of older studies might not be useful to researchers in a modern context. A primary issue with archival research is reliability. Researchers can also be presented with ethical quandaries—for example, should modern researchers use data from studies that were conducted unethically or with questionable ethics?

You've probably heard the phrase, "correlation does not equal causation. For example, researchers might perform a correlational study that suggests there is a relationship between academic success and a person's self-esteem. However, the study cannot show that academic success changes a person's self-esteem.

To determine why the relationship exists, researchers would need to consider and experiment with other variables, such as the subject's social relationships, cognitive abilities, personality, and socioeconomic status.

Ever wonder what your personality type means? Sign up to find out more in our Healthy Mind newsletter. Heath W. Psychology Research Methods. New York: Cambridge University Press; Schneider FW.

Applied Social Psychology. Importance and use of correlational research. Nurse Researcher. Carpenter S. Visualizing Psychology. Physical and mental health costs of traumatic war experiences among civil war veterans. Archives of General Psychiatry. Post SG. The echo of Nuremberg: Nazi data and ethics. J Med Ethics. Lau F. Chapter 12 Methods for Correlational Studies. In: Lau F, Kuziemsky C, editors.

Akoglu H.



0コメント

  • 1000 / 1000