# About ## How the project started The Data Hazards project started 2021. We (Natalie Zelenka and Nina Di Cara) wanted a way to communicate what might go wrong in data science projects, because we were frustrated by the repetitive themes we were seeing in harmful technologies that we talked about in [Data Ethics Club](https://dataethicsclub.com). We were also concerned that many projects that have significant societal impact do not have those impacts scrutinised by an Ethics Committee, because they do not technically have research participants. After that conversation we came up with the idea of Hazard labels for communicating these potential harms, and called them Data Hazards. We decided they should be visual, like COSHH chemical hazards are, and that they should be a way for people at all stages of data science technology development to communicate about the same potential outcomes (no matter how far away those outcomes might seem). You can see [the current Data Hazard labels here](data-hazards). You can [read our original proposal here](materials/misc/proposal). Once we had thought of the original list of Hazards we wanted a way for researchers to think about them in a format that encouraged them to reflect, invite different opinions and make them think more broadly about the potential ethical concerns from their project. This led to the development of our workshop format and [all the materials we have since made](materials) for self-reflection and teaching. All our resources are designed for re-use by others. ## Ethos The Data Hazards are built on the foundations of [standpoint theory](https://en.wikipedia.org/wiki/Standpoint_theory). This is an epistemological theory that knowledge (including in the sciences) is not objective, and that our perspectives are shaped by our lived socio-political experiences. This means that ethical problems are not going to have a single correct answer, and that to get a well-rounded understanding of the ethical issues of any new technology we need people from lots of different standpoints to analyse it from their perspective. This is the best way we can understand the harms it could possibly cause. We also need to make sure that we are paying attention to how technology might be more likely to adversely affect people from minoritised backgrounds. In summary, the Data Hazards exist to prompt discussion, reflection and thought. They are not a checkbox exercise, and there is no requirement for a group to come to a consensus. In an individual context you will likely come to a conclusion, but someone else may have a different view. We hope that the Data Hazards discussion and reflective activities will help researchers be aware of a broader variety of potential ethical risks in tech projects, and that ethics is complex, situational and worth discussing. ## Timeline (project-timeline)= ## Project timeline Here's a rough project timeline to let you know what we'll be up to: ````{panels} :container: timeline :column: col-6 p-0 :card: --- :column: +left --- :column: +entry right __March-April 2021__: Behind the scenes plans ^^^ {fa}`check,text-success mr-1` Thinking, reading and planning {fa}`check,text-success mr-1` Writing [proposal](materials/misc/proposal) {fa}`check,text-success mr-1` Getting feedback on initial ideas --- :column: +entry left __May-Aug 2021__: Prepare for first Data Hazards workshop ^^^ {fa}`check,text-success mr-1` Get website online {fa}`check,text-success mr-1` Submit ethics application {fa}`check,text-success mr-1` Get initial feedback on Data Hazards labels {fa}`check,text-success mr-1` Draft workshop materials {fa}`check,text-success mr-1` Get feedback on workshop materials {fa}`check,text-success mr-1` Begin advertising workshop {fa}`check,text-success mr-1` Set up [Open Science Framework project](https://osf.io/3fv7t/) and [preregister](https://osf.io/pcv7j) analysis --- :column: +right --- :column: +left --- :column: +entry right __Sept 2021__ Run first Data Hazards workshops (academic-focused) ^^^ {fa}`check,text-success mr-1` Run [first Data Hazards workshop](events/2021-09-21_workshop) on __21st Sept 2021__. --- :column: +entry left __Oct 2021__ Use workshop feedback to improve data hazards and present early results ^^^ {fa}`check,text-success mr-1` Present early results from workshop at [AI Ethics Best Practices and the Future of Innovation](https://www.eventbrite.co.uk/e/ai-ethics-best-practices-and-the-future-of-innovation-tickets-173883098027) as part of [Bristol Tech Festival](https://bristoltechfest.org/) on __13th Oct 2021__ ([slides](events/bristol-tech-fest)). {fa}`check,text-success mr-1` Look at workshop feedback to make improvements to: - data hazards labels - workshop exercises/materials --- :column: +right --- :column: +left --- :column: +entry right __Jan 2021__ Awarded £20,000 Enhancing Research Culture funding ^^^ {fa}`check,text-success mr-1` --- :column: +entry left __Feb-May 2022__ Developed new labels and facilitator training materials ^^^ {fa}`check,text-success mr-1` hired animator to create animated explainers for Data Hazards and new Hazard labels {fa}`check,text-success mr-1` development and release of run-your-own workshop materials {fa}`check,text-success mr-1` Data Hazards discussion session: Mozfest 2022 --- :column: +right --- :column: +left --- :column: +entry right __June 2022__ ^^^ {fa}`check,text-success mr-1` Ran Data Hazards workshop as part of the Jean Golding Institute Showcase - Run first Data Hazards facilitator workshop in-person as part of Bristol Data Week - Run second (online) Data Hazards facilitator workshop as part of Bristol Data Week ````