About#

The image below describes the ‘anatomy’ of a Data Hazard label.

All Hazard labels have an descriptive icon, title, description, examples and suggesed safety precautions.

How the project started#

The Data Hazards project started in 2021. We (Natalie Zelenka and Nina Di Cara) spoke together about wanting a way to communicate what might go wrong in data science projects, because we were frustrated by the repetitive themes we were seeing in harmful technologies that we talked about in Data Ethics Club. We were also concerned that many projects that have significant societal impact do not have those impacts scrutinised by an ethics committee, because they do not technically have research participants.

After this conversation we came up with the idea of Hazard labels for communicating these potential harms, and called them Data Hazards. We decided they should be visual, like COSHH chemical hazards are, and that they should be a way for people at all stages of data science technology development to communicate about the same potential outcomes (no matter how far away those outcomes might seem).

You can see the current Data Hazard labels here.
You can read our original proposal here.

These days the project is bigger than just us, and we have many contributors who suggest new content, changes to the labels, help us to teach others about ethical hazards or run their own events. If you would like to get involved (we’d love you to!) then we’ve listed lots of ways you could on our Contributing page.

Once we had thought of the original list of Hazards we wanted a way for researchers to think about them in a format that encouraged them to reflect, invite different opinions and make them think more broadly about the potential ethical concerns from their project. This led to the development of our workshop format and all the materials we have since made for self-reflection and teaching. All our resources are designed (and licensed) for re-use by others.

Ethos#

The Data Hazards are currently intended to be used creatively and flexibly, in whatever way they are useful to the user. Sometimes this means they are flashcards for teaching students about ethics, sometimes they are displayed with new research to communicate potential harms, and sometimes they are used in workshops as prompts.

We believe it is important when using the Data Hazards to help investigate risks in a project, that people beyond the original researcher are consulted on potential hazards. This is because we believe that knowledge, including in the sciences, is not objective, and that our perspectives are shaped by our lived socio-political experiences (this is based on standpoint theory). This means that ethical problems are not going to have a single correct answer, and that to get a well-rounded understanding of the ethical issues of any new technology we need people from lots of different standpoints to analyse it from their perspective. This is the best way we can understand the harms it could possibly cause. We also need to make sure that we are paying attention to how technology might be more likely to adversely affect people from minoritised backgrounds.

We developed our workshop format to help researchers to gather these different views.

In summary, the Data Hazards exist to prompt discussion, reflection and thought. They are not a checkbox exercise, and there is no requirement for a group to come to a consensus. In an individual context you will likely come to a conclusion, but someone else may have a different view. We hope that the Data Hazards discussion and reflective activities will help researchers be aware of a broader variety of potential ethical risks in tech projects, and that ethics is complex, situational and worth discussing.

Contact#

The Data Hazards Project was founded by Dr Natalie Zelenka and Dr Nina Di Cara, and is now co-led by Natalie, Nina and Dr Huw Day. Our details are provided below.
We’re always keen to hear from people interested in the project, or wanting to get involved. Feel free to send us both an email, or you can find us on Twitter!

Nina Di Cara

Natalie Zelenka

Huw Day

Senior Research Associate, School of Psychological Science, University of Bristol

Senior Research Fellow in Health Data Science, Institute of Health Informatics, University College London

Data Scientist, Jean Golding Institute, University of Bristol

GitHub: @ninadicara

GitHub: @NatalieZelenka

GitHub: @HuwWDay

Twitter/X: @ninadicara

Twitter/X: @NatZelenka

Twitter/X: @disco_huw

Email: nina.dicara@bristol.ac.uk

Email: n.zelenka@ucl.ac.uk

Email: huw.day@bristol.ac.uk