18 Nov 21

Twitter's misinformation labels

In serving the public conversation, Twitter's goal is to make it easy to find credible information from authoritative sources on Twitter and to limit the spread of potentially harmful and misleading content. In 2020, before I joined, Twitter created a visible annotation on Tweets known to contain misleading information. This is how the labels looked initially:

Original label design, as of March 2020

Labels give Twitter a way to move beyond the binary of leaving harmful content up or taking it down, and to address potentially misleading information in proportion to the severity of harm it poses. When I joined the team, I was tasked with helping improve the design of the labeling system to reach that goal.

The original design had key problems we were trying to solve:

All labels looked the same, even though internally they were applied to categories with different levels of harm. We wanted to make these categories transparent to readers.
The design wasn't flexible enough for nuance. Labels were rendered as a single string of bold, blue text, so the content team didn't have much room to experiment with copy hierarchy.
The labels didn't visually fit our evolving design system.

Small label, huge design problem

Even though the labels look like a small surface in the product, getting them just right is surprisingly complex. This redesign was a huge cross-functional effort, with many key players across design, content strategy, policy, product, and engineering.

We explored a spectrum of designs (that, unfortunately, I can't share here!), and evaluated each of them in about a dozen rounds of critique and iteration.

We took some of the designs to qualitative research and got positive feedback on the ones shown here. Users felt that these were clearer, more transparent, and more helpful than the original.

Alternatives that got positive feedback in qual research.

A/B testing

We ran an A/B test in production, for millions of customers, with a control and two variants, A and B. Here's the official Tweet announcing the experiment:

Last year, we started using labels to let you know when a Tweet may include misleading information.

For some of you on web, we’ll be testing a new label design with more context to help you better understand why a Tweet may be misleading. https://t.co/p1KONJz5Vo pic.twitter.com/m55f4RlMDg
— Twitter Support (@TwitterSupport) July 1, 2021

After a few months of running the experiment, we found that the A/B test was a success:

The new designs improved all metrics we cared about, compared to the control.
Surprisingly, the design without background colors didn't perform as well as the one with transparent backgrounds.

With this confidence in the new design, we decided to roll out the labels to all users on the platform. Here's the announcement Tweet:

Redesigned labels for potentially misleading Tweets are now rolling out to more of you.

In our test, more people clicked into the new labels and fewer people Retweeted or liked potentially misleading Tweets with these labels. We'll continue to improve our label design. https://t.co/MKYKtHJOFA pic.twitter.com/LimMdwbtuF
— Twitter Support (@TwitterSupport) November 16, 2021

Results

The new designs improved engagement with the curated content.
The new designs reduced engagement with misleading content.
The new designs more transparently categorize the labels according to potential harm.

Next steps

Even though this redesign is more successful than the original design, there is still room to improve. One common piece of feedback is that the new component looks too similar to Quote Tweets and other Tweet attachments, so users may find it hard to tell them apart. We're already planning a new round of iteration for 2022, and I'll update this post as soon as the results come out.