When there were only a couple of days left in our September #SWDchallenge, which solicited your takes on visualizing uncertainty, we were afraid that we might have set forth too complex a task. Even with the added incentive of having three participants receive their own copies of Alberto Cairo’s upcoming book, “How Charts Lie,” we had yet to receive more than a handful of entries.

We were prepared to write a recap all about how appropriately conveying uncertainty is one of the most daunting tasks in data visualization, and how this difficulty manifested itself (we expected, at the time) in a relatively small pool of participants. As the person responsible for writing this recap, I planned for an unusually short timeline between the close of the submission period and the completion of the monthly review.

Life, however, proved to be quite a jokester. An apt one, at that.

Because in the last 48 hours of the September #SWDchallenge, the number of submissions basically tripled.

We were surprised and thrilled by this development—not only because of the volume of submissions, but also because of their quality and variety.

We shared them with Alberto, who had this to say about the challenge and the collected submissions of our participants:

“Revealing uncertainty, I think, is one of the greatest challenges that visualization faces. Charts, graphs, maps, and other graphics are often perceived by the public as precise and accurate, even if the quantities they so neatly encode are surrounded by hidden fuzzy clouds of variation.

“I like to think of uncertainty broadly, as an expression of the difference between what we believe we know with varying degrees of confidence and what we think we don't know. In other words, uncertainty is an expression of ignorance, of both known unknowns and unknown unknowns. Some forms of this expression are quantifiable, and some aren't, and the variety of projects we've seen in this #SWDchallenge address both, either visualizing quantified fuzziness or discussing challenges with the definition and demarcation of the variables being shown. Together, the entries to the challenge constitute a good catalogue of ideas that may inspire future experimentation.”

This “catalogue of ideas” included a few different strategies and techniques that several participants identified and employed, to good effect.

Use Gestalt principles to differentiate actual data from forecasted data

In our workshops, we talk about how people visually perceive the world, and certain ways that we subconsciously find order in what we see. These are commonly called “Gestalt principles” after the school of psychology that first studied and enumerated them, and they describe various ways we interpret the things we see as being related or connected. In practice, they’re extremely useful for decluttering and organizing our charts.

Taken together, we can use these principles to convey uncertainty in visualizations that involve a mix of actual data and forecasted values. Several of our participants this month used these sorts of techniques.

Enclosure: Christian, Gordan, Lance, and Lisa used a shaded area behind a line to show where actual data ended and forecasted data began.
Closure: Angie, Cassandra, and Jami differentiated between actual values and predictions by using a solid line for the former and a dotted line for the latter.
Similarity: Adam, Charles, and Ligia employed different colors and/or intensities to distinguish between observations and predictions.
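
As a rough illustration of how these three principles might be combined in a single chart, here is a minimal Python sketch. The function names, colors, and styling choices are our own assumptions for the example and are not taken from any participant's submission; matplotlib is assumed to be installed for the drawing step.

```python
def split_actual_forecast(values, n_actual):
    """Split a series into (x, y) lists for the actual and forecast segments.

    The forecast segment starts at the last actual point so that the two
    lines join up on the chart. (Hypothetical helper for this example.)
    """
    if not 1 <= n_actual <= len(values):
        raise ValueError("n_actual must be between 1 and len(values)")
    xs = list(range(len(values)))
    actual = (xs[:n_actual], values[:n_actual])
    forecast = (xs[n_actual - 1:], values[n_actual - 1:])
    return actual, forecast


def plot_actual_vs_forecast(values, n_actual):
    """Draw actual data crisply and forecasted data with softer cues."""
    import matplotlib.pyplot as plt  # imported lazily; only needed for drawing

    (actual_x, actual_y), (fc_x, fc_y) = split_actual_forecast(values, n_actual)
    fig, ax = plt.subplots()
    # Enclosure: shade the region behind the forecasted part of the line.
    ax.axvspan(fc_x[0], fc_x[-1], color="lightgray", alpha=0.5,
               label="Forecast period")
    # Closure: a solid line for actual values, a dotted line for predictions.
    # Similarity: the same hue throughout, at lower intensity for predictions.
    ax.plot(actual_x, actual_y, color="tab:blue", linestyle="-", label="Actual")
    ax.plot(fc_x, fc_y, color="tab:blue", linestyle=":", alpha=0.6,
            label="Forecast")
    ax.legend()
    return fig
```

In practice one of these cues is often enough on its own; layering all three, as above, is simply a way to see them side by side.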

Include ranges of possible values, distributions, or confidence intervals

The use of the enclosure principle doesn’t have to be restricted only to showing where actual data stops and forecasted values begin. Using background shading in different ways can help readers understand more of the context surrounding the chart they see, and the uncertainty inherent in the data you present.

Claire’s submission, which attempts to answer the question, “When is it sweater weather in St. Louis?,” uses TWO types of shaded areas: the first (shaded in gray) shows the range between the maximum and minimum temperature for that day of the year in recorded St. Louis history; the second (shaded in color) shows the period of time when the temperature for the day of the year has, at least once, fallen in “sweater weather” range. Claire also includes the average historical daily temperature as a bold, thin line that forms the spine of the chart, and helps the reader to see more clearly not only when it’s possible that it will be a sweater-weather day, but when it’s actually likely to be one.

Claire wasn’t the only person to use shading for the purposes of providing additional context. Ash’s chart is focused on a summary statistic (represented as a bright orange line), but also includes markers and shaded areas to show the distribution of values that are the component parts of that line. Ben’s chart, being entirely based on prediction, dispenses with any measures of centrality, and displays only ranges. Georgios, Joost, Pris, Augusto, Steven, and Cedric all used intensity of color to show the likelihood of a value falling within a given range; Cedric also included a ridgeline-plot-esque view of the underlying distribution, to provide the viewer even greater insight into the data.
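
To make the idea of layered shading a little more concrete, here is a minimal Python sketch of a range band with an average line, loosely modeled on the structure described for the sweater-weather chart. It is not a reconstruction of any participant's code: the function names, the data layout (one list of readings per day of the year), and the `low`/`high` target range are all assumptions for illustration, and matplotlib is assumed for the drawing step.

```python
from statistics import mean


def daily_bands(observations):
    """Return per-day (lows, highs, averages).

    observations: a list with one entry per day, each entry being the list
    of values recorded on that day across different years (assumed layout).
    """
    lows = [min(day) for day in observations]
    highs = [max(day) for day in observations]
    averages = [mean(day) for day in observations]
    return lows, highs, averages


def days_ever_in_range(observations, low, high):
    """Return indices of days where at least one recorded value fell in [low, high]."""
    return [i for i, day in enumerate(observations)
            if any(low <= value <= high for value in day)]


def plot_bands(observations, low, high):
    """Draw the historical range, the 'has happened at least once' days, and the average."""
    import matplotlib.pyplot as plt  # imported lazily; only needed for drawing

    lows, highs, averages = daily_bands(observations)
    days = list(range(len(observations)))
    fig, ax = plt.subplots()
    # Gray band: the full range between the recorded minimum and maximum.
    ax.fill_between(days, lows, highs, color="lightgray",
                    label="Recorded min to max")
    # Colored spans: days on which the target range has occurred at least once.
    for d in days_ever_in_range(observations, low, high):
        ax.axvspan(d - 0.5, d + 0.5, color="tab:orange", alpha=0.2)
    # Average line: what is likely, as opposed to merely possible.
    ax.plot(days, averages, color="black", label="Historical average")
    ax.legend()
    return fig
```

The gray band answers "what is possible?", while the average line answers "what is typical?"; showing both is what lets a reader tell the two apart.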

Marks that appear hand-drawn can suggest less certainty than crisp, digital marks

There’s something funny about things that are hand-drawn: we perceive them as being more temporary, less official, and somehow less complete than things that are generated digitally. This is, incidentally, why we are such ardent advocates of beginning all of our projects with low-tech tools like pen and paper, whiteboards, and sticky notes; it’s easy to create, reorganize, and discard ideas if we don’t put them into our computers.

So for conveying uncertainty, using hand-drawn visuals (as Layisha and Sarah did) or using the appearance of hand-drawn visuals (as Rob and Elvira did) can help suggest to our audience that what we are presenting to them is somehow less than definitive. (When we talk about creating the “appearance” of hand-drawn visuals, we mean using tools like a library called “xkcd,” which Elvira used and which makes charts resemble the hand-drawn style of the eponymous comic, or using libraries like the “sketchy” ones Elijah Meeks has created in D3 and Semiotic that mimic analog drawings.)
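
For readers who work in Python, one readily available way to get a similar effect is matplotlib's built-in xkcd mode. This is offered only as a comparable option, not as the tool any participant used; the function name and default title below are made up for the example.

```python
def plot_hand_drawn(values, title="Rough projection"):
    """Draw a simple line chart in matplotlib's xkcd (sketch-like) style."""
    import matplotlib.pyplot as plt  # imported lazily; only needed for drawing

    # plt.xkcd() works as a context manager: charts created inside the block
    # get wobbly lines and (if an appropriate font is installed) comic lettering.
    with plt.xkcd():
        fig, ax = plt.subplots()
        ax.plot(range(len(values)), values)
        ax.set_title(title)
    return fig
```

Because the style is scoped to the `with` block, other charts in the same script keep their usual crisp appearance.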

Animation can heighten the tension of uncertainty

If your presentation medium allows it, then using animation could be another way to convey uncertainty. Now, for some, hearing the words “animation” and “uncertainty” might evoke memories of the New York Times’ Election Needle, which debuted in November 2016.

Why did this visualization create such a furor? Largely because we were unaccustomed to news outlets explicitly conveying uncertainty in this manner—while polling data is often presented with a “margin of error: +/-X%” footnote, the main visual is usually a solid, bold, labeled, authoritative bar chart. The footnote provides the caveat of uncertainty, but the audience’s perception is that These Numbers Are The True Numbers.

The Times’ Election Needle flipped that around, and put the uncertainty front and center. Many readers of the Times’ website were likely already anxious about the election; in looking to relieve some of this tension by seeking out information presented in a traditional authoritative style, they instead found their anxiety exacerbated by this new uncertainty-oriented display.

Tension, in the presentation of a story, is not something to be avoided. Stories devoid of tension lie flat and fail to engage. But that tension needs to be resolved. Animation can be an excellent way to generate tension and convey uncertainty, while also eventually reaching a resolution of that tension.

Alexander submitted an animated look at the performance of two different drugs intended to treat patients with plaque psoriasis. (The link to the animation is here, as Part 2 of the supplemental material for a paper hosted by an open-access scientific journal.) The animation begins at Day 0 of a treatment plan, and each frame steps forward one more day of the treatment. Each dot in the scatterplot represents a patient; each patient has a different severity of plaque psoriasis. That severity is graded from 0 to 100 on the PASI (Psoriasis Area and Severity Index) scale, with 100 being the most severe. The horizontal axis shows the PASI score for a patient at the beginning of the treatment; the vertical axis shows their PASI score on whatever the current day of treatment is.

The tension of this animation comes from watching hundreds of patients, day by day, improve (or not improve, or get worse) while under two competing treatment plans. The uncertainty comes from the fact that these are human beings, and that individual people’s health statuses do not change identically, smoothly, or in a uniform direction. By following an individual mark through the animation, we can see this uneven progression in action; we can watch a patient improve slowly, or plateau, while at the same time others improve dramatically and get to a state of being 100% psoriasis-free.

A static chart could show you how many patients reached each individual PASI score, or their percentage improvement on the PASI scale, after X number of days. The animation, by contrast, draws the viewer in and generates engagement and tension as they watch patients’ individual journeys towards better health and see those journeys resolve at the end of the animation.
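
The structure of an animated scatterplot like the one described above (starting score on the horizontal axis, current-day score on the vertical axis, one frame per day) can be sketched in a few lines of Python. This is a minimal illustration under our own assumptions, not Alexander's implementation: the function names and the data layout (one list of daily scores per patient, with index 0 as Day 0) are hypothetical, and matplotlib is assumed for the drawing step.

```python
def frame_points(trajectories, day):
    """Return one (starting_score, score_on_day) pair per patient.

    trajectories: a list of per-patient score lists, where index 0 is the
    score at the start of treatment and index d is the score on day d
    (assumed layout for this example).
    """
    return [(scores[0], scores[day]) for scores in trajectories]


def animate_scores(trajectories, interval_ms=200):
    """Build an animation in which each frame advances the treatment by one day."""
    import matplotlib.pyplot as plt  # imported lazily; only needed for drawing
    from matplotlib.animation import FuncAnimation

    n_days = min(len(scores) for scores in trajectories)
    upper = max(max(scores) for scores in trajectories)

    fig, ax = plt.subplots()
    ax.set_xlim(0, upper)
    ax.set_ylim(0, upper)
    ax.set_xlabel("Score at start of treatment")
    ax.set_ylabel("Score on current day")
    # Diagonal reference line: a dot on it has not changed since Day 0,
    # and a dot below it has a lower (better) score than at the start.
    ax.plot([0, upper], [0, upper], color="gray", linewidth=1)
    xs, ys = zip(*frame_points(trajectories, 0))
    scatter = ax.scatter(xs, ys)

    def update(day):
        # Move every dot to its position for the given day.
        scatter.set_offsets(frame_points(trajectories, day))
        ax.set_title(f"Day {day}")
        return (scatter,)

    return FuncAnimation(fig, update, frames=n_days, interval=interval_ms)
```

On Day 0 every dot sits on the diagonal; the tension comes from watching each one drift away from it at its own pace, and the resolution arrives with the final frame.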

Thanks to everyone who participated in the challenge (submissions below). A special congratulations to Crystal, Kate, and Rob, who will be receiving their own copies of “How Charts Lie” (which is scheduled for release in the U.S. on October 15, 2019).

Be on the lookout for the next #SWDchallenge, which we will announce in the first week of October.

Adam | Visualizing Uncertainty (STRAVA RUN stats)

How many miles will I have pounded on the trails and tarmac, before the sleigh bells come a-ringing?

Well, up until the last few months, I had been plagued with injury this year, so I really haven't put the miles in compared to previous years. However, I'm now back and trying to get fit with some goals in sight for some winter races.

So with this in mind I wanted to take a look at presenting some trends to visualize where I might end up by the end of the year, along with plotting standard deviations across the months to help me better visualize this unknown.

Adam | Visualizing Uncertainty (STRAVA RUN stats)

Alexander | Visualizing how patients with psoriasis improve over time

Angie | World urban population (2050-2035)

Ash | NHL Attendance: 2000 to 2018

Augusto | 2019 Brazilian Championship prediction

Ben | School Reports

Cassandra | Music Streaming Industry Forecast

Cedric | Daily Temperatures in Berlin, Germany

Charles | Wind Power in the US

Christian | Deaths of Despair

Claire | When will it be sweater weather in St. Louis?

Connor | Modern Design versus Comic-style

Crystal | Effect Size uncertainty

Ela | Churn types with flow

Elvira | Productive hours with uncertainty

Georgios | 50,000 personal heart rate values from 2016 to 2019

Gianni | The demographic projection of the Italian population from 2020 to 2065

Gordan | Forecasting ranges

Jami | Arizona Population Projections

JB | Pollutant level in France data experience

Johanie | M9 - visualize uncertainty

Joost | Visualizing uncertainty

Kate | You Are What You Absorb

Lance | Actual & Predicted Criminal Offenders in Australia

Layisha | Cultural Attitudes Regarding Feminism

Ligia | Inflation Rate of Brazil

Lisa | Is Venice Sinking?

Mohammed | Job Automation in England

Paul | President Trump's Approval Rating

Pris | 48 Hours To Go Viral

Rob | Geographic Uncertainty in Spatial Treemaps

Sarah | What happens when you add the word "feminist?"

Simon | The population forecasts for England over the next 20 years

Steven | Showing uncertainty in survey results

Taylor | Visualizing uncertainty with gganimate

Vijaya Shree | Visualizing Uncertainties - Food Preferences Survey 2019
