QuestionPro Blog

Correlation isn’t Causality

September 14, 2009 · 1 Comment

Business GraphI came across a published report recently that made me wonder why people persist in reporting that there is a causal relationship when the data doesn’t justify the assertion. Actually, the reasons aren’t all that hard to figure out. Usually, it’s because the relationship seems obvious, and sometimes it is when the person writing the report has a bias they wish to share.

But I’m getting ahead of myself.  Let’s start with a couple of definitions:

A correlation is simply the test of the relationship between two variables.  Pearson’s coefficient, commonly used to test linear relationships between scale variables, will be 1 (or -1) for perfect correlation.  Other coefficients are used for different types of variables. Tools such as SPSS that calculate correlation coefficients generally provide some guidance as to whether the relationship is significant – the strength of the correlation.

What correlation tells you is given the value of the one variable, what to expect for the value of another variable.

Causality, on the other hand, is a statement that if the value of one variable is changed then the value of the second variable will change accordingly.  Correlation is necessary, but not sufficient, for a cause-and-effect relationship.

It is easy to find good examples of correlations where assuming a causal relationship would be absurd. The Wikipedia article on the topic shows a chart of Mexican lemons imported from Mexico to the US plotted against total US highway fatalities. This is an example of a coincidental correlation.

Another type of misinterpretation occurs when the order of the cause and effect is reversed. Daniel Huff’s excellent “How To Lie With Statistics” discusses the relationship between smoking and college grades.  Apparently the results were used to promote the idea that giving up smoking would lead to improved grades.  But it is equally feasible that lower grades caused students to take up smoking.

We can get into trouble by using more sophisticated statistical techniques without paying enough attention to the meaning of the data and the variables being used to express results. Regression analysis is a powerful tool, but look at the correlations first.  Even the jargon can encourage misinterpretation and misstatements; when you are performing analysis for the ‘dependent’ variable it is easy to conclude causality where none exists.

More subtle problems can occur when some other factor is the cause for both the correlated variables.   This article describes a study where eating breakfast was correlated with elementary school success.  This could have resulted in the conclusion that breakfast eating caused them to be better learners. The article continues, “It turns out, however, that those who don’t eat breakfast are also more likely to be absent or tardy — and it is absenteeism that is playing a significant role in their poor performance. When researchers retested the breakfast theory, they found that, independent of other factors, breakfast only helps undernourished children perform better.” The article is from the Statistical Assessment Service – STATS – which is a non-partisan resource whose mission is to provide education on the use and abuse of science and statistics in the media.

I can’t be sure which of the fallacies were behind the ill-considered statements that were the inspiration for this article without access to the raw data.  The Kauffman Foundation does some excellent work studying entrepreneurship.  But their report on “The Use of Credit Card Debt by New Firms” draws some conclusions that are not justified by the data shown. The report states that “credit card debt reduces a firm’s probability of survival” (emphasis mine).  It appears that the authors want to warn entrepreneurs to avoid using credit cards. All the more surprising then that two positive examples for credit card funding (Spike Lee and the Blair Witch Project movie) are named in the report. I don’t want to be hypercritical of Kaffman or the report, as there are some interesting and useful results presented.  But from the data shown it seems equally likely that the businesses that failed were going to fail anyway, regardless of taking on credit debt.  In fact, businesses that failed during the three years of the study actually had lower credit card debt at the end of the first year.  Perhaps they did not borrow aggressively enough!

How then do you avoid drawing the wrong conclusions about cause-and-effect?  And how can you deliver results from research that provide useful guidance for actions that forward the organizational goals?

First, avoid making statements that imply the correlations imply causality.  Consider the other possibilities such as reverse causality or another variable that wasn’t measured.  However, don’t be too pedantic or academic either.  It is often fair to say that there may be a cause-and-effect relationship.  And frequently the changes that will positively impact one variable will be beneficial to the organization as long as they make sense on the face of it.

If you really need to confirm causality, you’ll generally need to do some sort of study that is repeated over time.  By including the same people in the sample, you’ll have good assurance that changes you see in Overall Satisfaction can be connected with the changes you make from one wave to the next – such as for Speed of Connecting to a Customer Service Representative.  If you don’t use the same people, you’ll have to take more care to make sure the samples are the same as far as possible.

For more examples that will help you critically review your own and others’ work, check out this great list of correlation/causality fallacies.

And finally, I couldn’t resist this cartoon on the topic from XKCD:

Idiosyncratically,

Mike Pritchard

Mike Pritchard is President of 5 Circles Research, a Seattle-based firm specializing in helping people conduct their own surveys through consulting and training. More information on services, including training classes for do-it-yourself surveyors and the SurveyTips blog, can be found atwww.5circles.com.

Categories: customer research

Deeper insights from Customer Satisfaction (Beyond Net Promoter)

July 15, 2009 · 2 Comments

[This is a guest post from Mike Pritchard, President of 5 Circles Research, a consulting and training firm based in the Seattle area]

With all the debate over the past few years about the Net Promoter Score, some people seem to have forgotten that Customer Satisfaction research can lead to deep insights about customers that will help organizations in many ways. Even the idea that higher levels of satisfaction are related to improved profits seems to have become a cliché. “Satisfaction would be nice,” some seem to say, “but I can’t focus on it now. Spending more time on sales is more important“.

Perhaps one reason for the lack of respect for customer satisfaction is the fact that the original studies are quite old. Perhaps too, managers think that focusing on the fundamentals isn’t as likely to pay off quickly. After all, it is easy to see and measure sales activity. But if you are chasing after the wrong customers that activity is likely to be inefficient at best, or even wasted. In today’s economy, it is critical to focus on the customers and prospects that will be profitable. And that doesn’t mean just the most obvious. Just because a customer is vocal doesn’t mean that they’ll be profitable.

There are a number of published studies showing the link between customer satisfaction and financial performance, including a paper from the Burke Institute.

Benefits from satisfied customers

Each industry is slightly different, but there are some consistent principles:

  1. Satisfied customers tend to continue to buy from the same company. They are easier to market and sell to (for repeat purchases, increased usage or cross selling).
  2. It costs much less to retain existing customers than to acquire new ones.
  3. Satisfied customers tell others about their positive experiences, while dissatisfied customers tell even more people about their negative experience.

By understanding what distinguished satisfied customers, sales efforts can be directed more effectively. Obviously it is much better to target the prospects that will become satisfied customers than those who are a drain on resources. But too many companies don’t understand how to identify those good prospects. Focusing on the prospects whose product needs are aligned with what you are offering is a start, but you can do much better with the right kind of research.

What to measure

The question used for the Net Promoter Score is one of the three main questions that are used to develop the Secure Customer Index. The question wording will vary with the type of product or service, but you should ask about Overall Satisfaction, Willingness to Continue Use (or Repurchase), and Likelihood of Recommending. So a typical set of questions would look like this:

  • How satisfied are you overall with XYZ?
  • How likely are you to purchase another XYZ in the next six months?
  • How likely are you to recommend XYZ to friends or colleagues?

Question considerations

Most researchers prefer to place the Overall Satisfaction question early in the survey, perhaps as one of the first questions. This avoids the survey taker being influenced by other questions (such as detailed questions about specific attributes or contact with customer support). But if the experience was some time ago the other questions may refresh memory and thus the results may give a truer overall perception. There is no absolute for placement, but you should be consistent to allow tracking to be valid.

The Repurchase question wording is quite varied and situation dependent. It is generally helpful to include a timeframe when appropriate. For example, asking about renewal at the end of the current subscription works well. For an appliance, asking a specific timeframe isn’t likely to do much (except perhaps to make the survey taker nervous that your products won’t last long!). In this case, ask “How likely would you be to buy another Brand A refrigerator when you next need one“, or something similar.

The Recommendation question wording is usually fairly standard, but try to tailor it to be a good fit for the product or service. Some researchers like to also ask a question about whether the survey taker has actually recommended, but this is uncommon – perhaps because there is a fear of encouraging the survey taker to lie.

The scales used for these high-level satisfaction questions should match scales you use elsewhere in the survey. If you are using the built-in Net Promoter Score question type from QuestionPro, you’ll want to use the same 1-10 scale for the other questions. If you aren’t using Net Promoter, choose at least a 5 point scale, perhaps a Likert scale.

How to classify customers

You’ll need to decide how to code the responses in order to perform the classification. A common approach for 5 point scales is to only count a score of 5 for the most satisfied etc. With a 10 point scale, you might call scores of 9 or 10 most satisfied, and 1 or 2 least satisfied (top box and bottom box). Then you classify each customer in your survey based on the responses, using the following scheme.

SCIclassification.png

Secure customers are those who are most satisfied overall, most likely to repurchase, and most likely to recommend (scoring top on all three questions). These are the most valuable customers overall – because they buy the most, are the best advocates, and generally cost less to service. They probably won’t need expensive changes to remain classified as secure, but it is important that the company continues to provide appropriate support to keep them in the category. For the example study in the article, this group was 88% likely to remain a customer after 1 year, and 33% were likely to increase purchases. These results will vary by industry, may be driven by other factors including economic conditions, but the difference between secure and other customers is likely to remain. The percentage classified as Secure is known as the Secure Customer Index, and is a simple measure that can be tracked over time.

Favorable customers are generally well satisfied, scoring top or second for all the three questions. Some call this group Satisfied, but we prefer a different term as the word satisfied doesn’t adequately convey that there are benefits to be gained by directing attention to the category. Improvements directed at this group tend to be cost-effective because they are the easiest to move to the secure category where they become even more valuable. In the example study, this group was 57% likely to remain a customer, and 20% likely to increase purchasing.

The Vulnerable group is those who have middle of the road scores on all measures. This group is not usually as important to target as others, in part because the impact of changes is not as assured. Over time, the percentage of customers in this group should be minimized.

The Dissatisfied group is comprised of those customers who score lowest on any of the three satisfaction measures. It is often tempting to focus energy on making changes that improve perceptions by this group, but this may not pay off. Rather, learning the causes of the dissatisfaction will help the company to avoid seeking more customers who may also be dissatisfied for the same reasons when there are no immediate fixes. For example a customer who is driven mainly by low prices is probably not a desirable customer for a company seeking to differentiate through added services. Better targeting should minimize the size of this group.

Profiling for success

Once you have identified which customer falls into which group, you can examine other information to determine what are the common characteristics. For example, are the Secure customers buying a particular combination of products? Perhaps one of your products is a dud, and those who buy it have lower perceptions overall even if the rest of the products being purchased are received well. If so, it might be a good idea to de-emphasize sales until the product can be improved. Perhaps Secure customers are more likely to be from a particular industry. Your sales people would love to have a clearer target to go after, and your marketing efforts can be refined to provide better support.

Further information

Beyond the approaches described here, customer satisfaction research can be used to identify drivers of satisfaction among product attributes and other characteristics of the company that will help prioritize your efforts.

Just remember, learning more about your customers can be very valuable – as long as you measure the right things and then do the right things.

Idiosyncratically,

Mike Pritchard

More Reading:


[Mike Pritchard is President of 5 Circles Research, a Seattle-based firm specializing in helping people conduct their own surveys through consulting and training. More information about training classes for do-it-yourself surveyors and the SurveyTips blog can be found at www.5circles.com.]

Categories: Best Practice · Newsletter