How data is misinterpreted?
Understand the common, erroneous methods that make data misrepresent the facts.
For those of us who deal with data realize that it has the power to create misleading impressions and deceive us into accepting unsupported beliefs.
Confusing correlation with causation, shaping data to fit preconceived theories, selectively including only supportive data for a particular claim — these are just a few examples of the pitfalls that can arise in data analysis. And note that the latter two examples, mentioned above, are as different from as each other as a tomato is from a carrot.
Now, let’s explore some of the most frequent ways data can be misunderstood or misinterpreted.
- Correlation Implies Causation: Mistaking a correlation between two variables as evidence of one causing the other, overlooking other influencing factors due to their timing or correlation. Assuming that increased advertising spending directly caused a rise in sales without considering other factors like seasonal trends.
- Cherry-Picking: Selectively highlighting data that supports a claim while ignoring contrary data, leading to biased conclusions. Showcasing only customer reviews praising a product while ignoring negative feedback may create an inaccurate perception of its quality.
- Confirmation Bias: Preferring data that aligns with preexisting beliefs, ignoring contradictory evidence and hindering objectivity. Ignoring market research indicating a shift in customer preferences because it contradicts established views.
- Sampling Bias: Making conclusions from an unrepresentative subset of data, leading to inaccurate generalizations. For example, making decisions based solely on feedback from a vocal minority of customers, disregarding the broader customer base. Making decisions based solely on feedback from a vocal minority of customers, disregarding the broader customer base.
- Data Dredging: Conducting multiple tests on data until a significant result is found, leading to false positive outcomes. Repeatedly analyzing marketing campaigns until finding one with a positive response, leading to a misconception of its effectiveness.
- Overfitting: Creating overly complex models that fit current data well but perform poorly on new data, lacking generality. Developing a complex sales prediction model that performs well historically but fails to forecast accurately in new market conditions.
- Vanity Metrics: Emphasizing metrics that seem impressive but lack meaningful insights into actual performance. Focusing solely on website page views without considering conversion rates or customer engagement.
- Gerrymandering: Deliberately manipulating data or boundaries to influence outcomes. Presenting sales data in a way that exaggerates success in certain regions.
- Regression to the mean: The tendency for extreme observations to be followed by more moderate ones, which can lead to misinterpretations about the effectiveness of interventions or treatments. Attributing a sales decline after a record-breaking quarter to a failed marketing strategy, when it might simply be a natural regression.
- Summary Metrics: Relying solely on summary metrics without considering the underlying data distribution can lead to misunderstandings and incorrect conclusions. Focusing solely on overall revenue without considering customer segmentation might miss lucrative opportunities in specific market segments.
Check out this link for further reading.
These fallacies can cause poor choices that impede the company’s ability to meet customer needs, refine its product, and attain long-term success. Notably, two of these fallacies carry more significant risks by fostering unrealistic expectations, potentially resulting in discontent and harm to the brand.
— Cherry-Picking creates an incomplete view, as glitches aren’t addressed, hindering product improvement.
— Confirmation Bias reinforces the positive view, missing opportunities to rectify issues. This harms adaptability and might disappoint customers.
Learn more on the nuances of data and storytelling.
- The fine nuances of data storytelling.
- What every analyst needs to understand about data and analytics.
- Challenges faced by every data person.
- The 8 stages of an analysis lifecycle that drives results.
- 10 ways to make your data tell a compelling story.
- 5 ways to make an impact with your analysis.
- 3 most common types of data analysis
- What 15 years of business analysis taught me?
- 10 questions i always ask as an analyst for a successful outcome.




