How to do bad data science well - multiple methods and complex data sources

Registrations are closed

We are no longer taking bookings for this seminar. Please visit the RTC website (research-training-centre.sps.ed.ac.uk) to find out more about future seminars and events.

How to do bad data science well - multiple methods and complex data sources

In this seminar, Dr Ben Collier will discuss the experimental research he conducted at the Cambridge Cybercrime Centre.

By Research Training Centre

Date and time

Mon, 13 Mar 2023 06:00 - 07:00 PDT

Location

Online

About this event

This is a live-stream of an on-campus event. If you are a University of Edinburgh student or staff member and would prefer to attend in person, please register for the on-campus event using the following link: how-to-do-bad-data-science-well.eventbrite.co.uk

This seminar forms part of the Research Training Centre's Talking Methods seminar series delivered in association with the Scottish Graduate School of Social Science.

Abstract

Data science was set to revolutionise statistics and social research, providing us with all kinds of new data sources and methods to open new windows into the social world. But, as many of us who’ve spent time working on data science projects will know, often the data is rubbish, the models don’t converge or converge suspiciously well, and the outputs make sense to us but are impossible to turn into a policy brief or short article. I’ll talk about research we did at the Cambridge Cybercrime Centre, which took half a dozen quite shaky data sources, a number of suspicious models, and some experimental practices that we chose to call ‘immersive data ethnography’ and turned them into a project with real findings and impact. In this research, we combined large-scale web scraping of cybercrime forums and chat channels with interviews and ethnographic study on online deviant communities, working back and forth between different kinds of data and analysis in an interdisciplinary team. By combining our strengths - in psychology, sociology, data science, ethnography, programming, and visualisation - we managed to make these extremely complex and confusing sources (including forums with tens of millions of posts) into a fascinating new resource for social scientists, and built our own AI and ML tools for others to explore them. I’ll also show how we discovered that far from magic hackers and techno-wizards, the reality of cybercrime is far more boring than we ever thought possible.

Biography

Ben Collier is a Lecturer in Digital Methods at the University of Edinburgh. Formerly employed as a statistician at Scottish Government in Transport and Health, Ben’s research interests are interdisciplinary, blending qualitative, quantitative, and data science approaches. Current projects include research on digital infrastructure, including a forthcoming history of the Tor anonymity network; mixed-methods research on cybercrime ecosystems; and a large-scale study of the use of data-driven digital strategic communications in the UK public sector.

University Profile: www.sps.ed.ac.uk/staff/ben-collier

Organised by

Sales Ended