A data journalism workflow

January 27, 2013

—

1–2 minutes

to read

First published on Guardian Data

Data journalism workflow, by Mark McCormick

Before a dataset results in a data journalism story, there’s a whole process of sifting and finessing and generally sorting the data out. The split is roughly 70% tidying up the data, 30% doing the fun stuff of visualising and presenting it. So, how do we get through that 70%?

Guardian graphic artist Mark McCormick has helped us visualise that process. So, yes, this is a graphic about how we produce data that often results in, er, graphics. A data visualisation about data visualisations, if you will.

You can get a pdf of this here
Each of these steps could be a piece in itself – and over the next few weeks, we will break them down. But, in short, it goes something like this:

• We locate the data or receive it from a variety of sources, from breaking news stories, government data, journalists’ research and so on

• We then start looking at what we can do with the data – do we need to mash it up with another dataset? How can we show changes over time?

• Those spreadsheets often have to be seriously tidied up – all those extraneous columns and weirdly merged cells really don’t help. And that’s assuming it’s not a PDF, the worst format for data known to humankind

• Now we’re getting there. Next up we can actually start to perform the calculations that will tell us if there’s a story or not – and then sanity check them to see if it just sounds wrong

• At the end of that process is the output – will it be a story or a graphic or a visualisation, and what tools will we use?

You can get a pdf of this here. What have we missed – or how do you do it?

More data

Data journalism and data visualisations from the Guardian

7 responses to “A data journalism workflow”

How To Begin With Datajournalism : MEDIATYPE, journalisme en mutation

May 6, 2015 at 7:47 am

[…] on Guardian Data, by Simon Rogers, data journalist & data editor at […]

Reply
Ricardo Pietrobon

January 12, 2014 at 5:56 am

Simon, very interesting, but this also implies that the workflow necessarily starts with obtaining data and then deriving the story from there? How about being interested in a given area and then start looking for and/or generating data to be able to explore it?

Reply
Datenjournalismus selbst probiert – zweiter Schritt: die richtigen Daten finden | Medienpraxis

May 23, 2013 at 11:59 am

[…] typischen Workflow bei einer Analyse hat Simon Rogers, ehemaliger Dateneditor beim Guardian, auf seiner Website aufgezeichnet (eine deutsche Übersetzung von David Bauer gibt es hier). Die Vorgehensweise ist wie […]

Reply
Data Viz News [7] | Visual Loop

May 18, 2013 at 2:48 pm

[…] A data journalism workflow | Simon Rogers […]

Reply
Von Daten zu Geschichten: ein Datenjournalismus-Workflow | David Bauer. Journalist+

May 18, 2013 at 12:00 pm

[…] hat den Prozess, den datenjournalistische Projekte typischerweise durchlaufen, visualisiert. A data journalism workflow beginnt bei den verschiedenen Ausgangspunkten, die ein datenjournalistisches Projekt anstossen, und […]

Reply
whisky

May 10, 2013 at 5:41 pm

Tks Simon, I am from http://www.opendata.tw/ and this figure help me a lot to explain the data journalism workflow.
I just make a chinese (Taiwan) translation and paste it here : https://www.facebook.com/photo.php?fbid=546047748767602
Actually I use this to promote our own data journalism camp which will be hold at 25/26 of this month.

Reply
Guardian explains data visualisation | Simon Pipe

January 28, 2013 at 11:42 am

[…] Simon Rogers does all this (data journalism, that is; though for all I know, they also have whips and a room for cold baths) at The Guardian, and explains the process here. […]

Reply