👀
Crash Visualization
  • Welcome
  • Preface
    • Who the book is written for
    • How the book is organized
  • 1. Introduction of Data Visualization
    • 1.1 What is data visualization?
    • 1.2 Why does visualization matter?
  • 2. Tricks in Visualization
    • 2.1 Choose Appropriate Chart
    • 2.2 Features of Charts
      • 2.2.1 Table
      • 2.2.2 Column Chart
      • 2.2.3 Line Chart
      • 2.2.4 Pie Chart
      • 2.2.5 Scatter Chart
      • 2.2.6 Map Chart
    • 2.3 Misused Graph
    • 2.4 Tips in Visualization
  • 3. Matplotlib
    • 3.1 Basic Concepts
    • 3.2 Line Chart
    • 3.3 Area Chart
    • 3.4 Column Chart
    • 3.5 Histogram Chart
    • 3.6 Scatter Chart
    • 3.7 Lollipop Chart
    • 3.8 Pie Chart
    • 3.9 Venn Chart
    • 3.10 Waffle Chart
    • 3.11 Animation
  • 4. Seaborn
    • 4.1 Trends
    • 4.2 Ranking
      • 4.2.1 Barplot
      • 4.2.2 Boxplot
    • 4.3 Composition
      • 4.3.1 Stacked Chart
    • 4.4 Correlation
      • 4.4.1 Scatter Plot
      • 4.4.2 Linear Relationship
      • 4.4.3 Heatmap
      • 4.4.4 Pairplot
    • 4.5 Distribution
      • 4.5.1 Boxplot
      • 4.5.2 Violin plot
      • 4.5.3 Histogram plot
      • 4.5.4 Density plot
      • 4.5.5 Joint plot
  • 5. Bokeh
    • 5.1 Basic Plotting
    • 5.2 Data Sources
    • 5.3 Annotations
    • 5.4 Categorical Data
    • 5.5 Presentation and Layouts
    • 5.6 Linking and Interactions
    • 5.7 Network Graph
    • 5.8 Widgets
  • 6. Plotly
    • 6.1 Fundamental Concepts
      • 6.1.1 Plotly Express
      • 6.1.2 Plotly Graph Objects
    • 6.2 Advanced Charts
      • 6.2.1 Advanced Scatter Chart
      • 6.2.2 Advanced Bar Chart
      • 6.2.3 Advanced Pie Chart
      • 6.2.4 Advanced Heatmap
      • 6.2.5 Sankey Chart
      • 6.2.6 Tables
    • 6.3 Statistical Charts
      • 6.3.1 Common Statistical Charts
      • 6.3.2 Dendrograms
      • 6.3.3 Radar Chart
      • 6.3.4 Polar Chart
      • 6.3.5 Streamline Chart
    • 6.4 Financial Charts
      • 6.4.1 Funnel Chart
      • 6.4.2 Candlestick Chart
      • 6.4.3 Waterfall Chart
  • Support
    • Donation
Powered by GitBook
On this page
  • 1. Irrelevant Data
  • 2. Misuse Graph
  • 3. Manipulate Axes
  • 4. Manipulate Data
  • 5. Complexity
  • 6. Data Visualization Don’ts

Was this helpful?

  1. 2. Tricks in Visualization

2.3 Misused Graph

Previous2.2.6 Map ChartNext2.4 Tips in Visualization

Last updated 4 years ago

Was this helpful?

Our brain is prewired to process visual content much quicker than text, which is why data design is so effective. By “seeing” the data, it is easier for your brain to intake, synthesize, and retain the information presented.

While graphs can be a valuable tool to help summarize data into a compelling story, they can also inadvertently be used incorrectly or worse, to mislead. It can make results appear different than they are and/or lead others to draw incorrect conclusions. Here are a few typical tricks in visualization.

1. Irrelevant Data

Anyone who has taken an intro to psych or a statistics class has heard the adage, “.” Just because two trends seem to fluctuate in tandem, these rule posits, that doesn’t prove that they are meaningfully related to one another.

2. Misuse Graph

We know that pie charts are for slices which, together, make up 100%. If the numbers don’t add up to 100%, the pie chart has been misused. As you can see below, the percentages from each section add up to more than 100%, and it 's difficult to extract information at once. This is a common problem when people try to visualize survey data that has multiple answers.

3. Manipulate Axes

A common misleading feature in graphs is a skewed scale of axes.

Omit Baseline

In most cases, the baseline for a graph is 0. But sometimes people can make it a different number to skew the graph. This misleading tactic is frequently used to make one group look better than another. It is known as a "truncated graph".

For example, if you took a cursory glance at this graph you would probably think that the interest rate in 2012 is much higher than the rest. But a closer look shows that the interest rate is quite stable, only fluctuate between 3.140% and 3.154%.

Manipulate X-Axis

Notice: The X-Axis of the left graph is not even.

Manipulate Y-Axis

Expanding or compressing the scale on a graph can make changes in data seem more or less significant than they are.

4. Manipulate Data

Missing Points

Something similar can be said about problems with the x-axis. When some data are missing, it may tell a completely different story than when all of the data are presented.

Cherry-Picking Data

In the first graph below, a reader could obviously be misleading into thinking that the UK National debt has never been higher! This graph could be used to justify a politician voting on some piece of legislation that would lower the debt.

However, when you take a look at the full-time series, you can see that national debt is actually pretty low in comparison.

5. Complexity

  • Graphs should be no more complex than the data that they portray.

  • Unnecessary complexity can display irrelevant decoration, color, 3D effects.

It can be introduced in a simple and clear way.

6. Data Visualization Don’ts

Don’t intentionally misrepresent data

It’s important to mention. Whether done intentionally, or unintentionally, misrepresented data has consequences. For instance, any of the following errors can undermine the validity of your data set or even your reputation:

  • An axis that starts at a place that exaggerates differences within the data

  • Using uneven intervals between numbers

  • Using inaccurate or inconsistent scales on size comparisons

  • Using colors that are inappropriate for the data set being described

Don’t try to present too much information

Squishing too much information into your visualization is confusing and just plain ugly. Here are a couple of signs that your visualization has too much information:

  • There are more than six colors in your visual.

  • The chart is crowded, and it is difficult, if not impossible, to differentiate between the data points within the first couple of seconds

  • You need multiple text boxes to explain the data points.

Don’t try to make bad data look better

Bad data is bad data. No amount of creativity can produce a good graph from dubious data.

A common trick of the graph manipulator is to blow out the scale of a graph to minimize or maximize a change. It is known as the in the data visualization world. Axis manipulation is almost the opposite of truncating data because they include the axis and baselines but change them so much that they lose meaning.

axis changing
See more here
correlation does not imply causation
Truncated graph
Cherry Picking
Junk Pie chart