publication Archives

October 8, 2025October 8, 2025

Forthcoming article in the Journal of Victorian Culture

I'm very excited to share the abstract for a forthcoming article in the Journal of Victorian Culture!

Seeing Library Collections Through New Lenses: The Potential of Large-Scale Digital Collections

Digital collections have fundamentally transformed historical research, enabling search with extraordinary precision across vast archives. While challenges such as including limited digitization, transcription errors, and infrastructure vulnerabilities remain, significant progress has been achieved through national collections, open platforms and innovative research projects. For example, The Alan Turing Institute and British Library's Living with Machines project (2018-23) demonstrated how interdisciplinary collaboration between historians, data scientists, and library professionals can develop powerful tools for analysing huge collections at scale.

Emerging technologies, including Machine Learning and Large Language Models, are making sophisticated computational methods accessible to historians. However, research still requires careful attention to the documentation of digitization processes, selection biases, and technical constraints. The characteristics of historical records – ambiguous, incomplete, subjective and inconsistent – represent both challenges and opportunities for computational methods. Approaches for presenting historical records must resist oversimplification and retain their contextual richness.

The future of digital history depends on historians actively engaging with and shaping these technologies. Through continued collaboration between cultural heritage institutions and researchers, the field can ensure that large-scale digital collections serve historical scholarship's values while enabling innovative research methodologies.

May 3, 2024May 6, 2024

New data paper and datasets from crowdsourcing on Living with Machines

After lots of hard work by me, Nilo Pedrazzini, Miguel V., Arianna Ciula and Barbara McGillivray, we have a data paper in the Journal of Open Humanities Data: Language of Mechanisation Crowdsourcing Datasets from the Living with Machines Project.

And huge thanks to the thousands of Zooniverse volunteers who annotated 19th century newspaper articles to create the datasets we've published alongside the data paper!

Abstract: We present the ‘Language of Mechanisation’ datasets with examples of re-use in visualisations and analysis. These reusable CSV files, published on the British Library’s Research Repository, contain automatically-transcribed text from 19th century British newspaper articles. Volunteers on the Zooniverse crowdsourcing platform took part in tasks that asked ‘How did the word x change over time and place?’ They annotated articles with pre-selected meanings (senses) for the words coach, car, trolley and bike.

The datasets can support scholarship on a range of historical and linguistic research areas, including research on crowdsourcing and online volunteering behaviours, data processing and data visualisations methodologies.

The two datasets described are at:

Language of Mechanisation: annotated historical newspaper articles https://doi.org/10.23636/5t9m-0g59
OCR and crowdsourced annotations, Language of Mechanisation, JSON files https://doi.org/10.23636/z634-km37

December 31, 2019December 13, 2024

2019: an overview(ish)

A very incomplete page…

Projects: Living with Machines

Continued recruiting the project team
Set up the project website (graphic identity and WordPress template by an agency, working with the project team)
Helped devise the Communications strategy

Publications

Ridge, M. (forthcoming). Crowdsourcing in cultural heritage: A practical guide to designing and running successful projects. In K. Schuster & S. Dunn (Eds.), Routledge Handbook of Research Methods in Digital Humanities. Routledge.

Talks and teaching

June: I was at Indiana University-Purdue University Indianapolis to teach Collections as Data with Thomas Padilla for the HILT digital humanities summer school.

An invited talk on 'Voyages of discovery with digital collections' for the Eskenazi Museum of Art, Indiana University, Bloomington, June 2019

Blog posts

Other

Peer reviewer, Digital Humanities 2019

December 31, 2018December 13, 2024

2018: an overview

2017-18 was a bit of an odd year and I've subsequently reduced the number of invitations I accept each year.

Projects

2018 finished with a bang, as the press release for the British Library/Alan Turing Institute's Living with Machines project went live. I'd been working on the proposal since early 2017. In this project, we're experimenting with 'radical collaborations' around applying data science methods to historical newspaper collections to advance the potential of digital history.

Talks and teaching

January: a lecture on 'Scholarly crowdsourcing: from public engagement to creating knowledge socially' for the Introduction to Digital Humanities Masters course at King's College London, and an 'Overview of Information Visualisation' for the CHASE Winter School: Introduction to Digital Humanities.

February: a full-day workshop on Information Visualisation for PhD students in the Digital Humanities for CHASE.

March: a talk on 'Crowdsourcing: the British Library experience' for CILIP's Multimedia Information & Technology (MmIT) Group's event on 'The wisdom of the crowd? Crowdsourcing for information professionals'.

April: a talk on 'Challenges and opportunities in digital scholarship' for a British Library Research Collaboration Open House, and took part in a panel for the Association of Art Historians (AAH) conference on ‘Sharing knowledge through online engagement’ around Art UK's Art Detective project at the Courtauld Institute of Art.

May: I was in Rotterdam for a EuropeanaTech panel on User Generated & Institutional Data Transcription projects and gave a talk on 'Open cultural data in the GLAM sector' for a CPD25 workshop on The GLAM sector: what can we learn from Galleries, Libraries, Archives and Museums

June: with Thomas Padilla I co-taught 'Collections as data' for the HILT Digital Humanities Summer School, June 4–8, 2018, University of Pennsylvania. I then went onto Oberlin College to give a keynote on 'Digital collections as departure points' at the Academic Art Museums and Libraries Summit.

September: a talk on 'Crowdsourcing at the British Library: lessons learnt and future directions' at the Digital Humanities Congress | University of Sheffield, 6th – 8th September 2018. And a 'provocation' for the Building Library Labs event, 'A modest proposal: crowdsourcing is good for all of us'.

November: I travelled to Bonn to do a keynote on 'Libraries and their Communities: Participation from Town Halls to Mobile Phones' for the 2018 SWIB (Semantic Web in Libraries) conference, and gave a preview talk on Living with Machines for the British Library Labs 2018 Symposium.

Publications

An article on Breathing life into digital collections at the British Library for ACCESS / Journal of the Australian School Library Association, 2018.

A chapter for a Routledge publication on research methods in the Digital Humanities, called 'Crowdsourcing in cultural heritage: a practical guide to designing and running successful projects' (in process).

Other

I was a peer reviewer for conference proposals and articles for museum studies and digital humanities events and journals.

I also gave internal talks on IIIF and the Universal Viewer and taught Data Visualisation and Crowdsourcing workshops on the British Library's Digital Scholarship Training Programme.

I wrote a number of blog posts, newsletters and press releases for work. I've collected some of those blog posts and newsletter updates for the British Library at Updates from Digital Scholarship at the British Library.

Blog post 'Notes from ‘AI, Society & the Media: How can we Flourish in the Age of AI’' and 'Cross-post: Seeking researchers to work on an ambitious data science and digital humanities project'

January 19, 2017December 26, 2023

Chapter: 'The contributions of family and local historians to British history online'

Participatory Heritage, edited by Henriette Roued-Cunliffe and Andrea Copeland, has just been published by Facet.

A pre-print is online at https://hcommons.org/deposits/item/hc:38017

My chapter is 'The contributions of family and local historians to British history online'. My abstract:

Community history projects across Britain have collected and created images, indexes and transcriptions of historical documents ranging from newspaper articles and photographs, to wills and biographical records. Based on analysis of community- and institutionally-led participatory history sites, and interviews with family and local historians, this chapter discusses common models for projects in which community historians cooperated to create digital resources. For decades, family and local historians have organised or contributed to projects to collect, digitise and publish historical sources about British history. What drives amateur historians to voluntarily spend their time digitising cultural heritage? How do they cooperatively or collaboratively create resources? And what challenges do they face?

My opening page:

IN 1987, THE Family History Department of the Church of the Latter Day Saints began a project with the British Genealogical Records Users Committee to transcribe and index the 1881 British census. Some community history societies were already creating indexes for the 1851 census, so they were well placed to take on another census project. Several tons of photocopies were distributed to almost 100 family history societies for double transcription and checking; later, a multi-million-dollar mainframe computer created indexes from the results (Young, 1996, 1998a; Tice, 1990). This ‘co-operative indexing’ took eight years – the process of assigning parts for transcription alone occupied 43 months – and while the project was very well received, in 1998 it was concluded that ‘a national project of this scope has proved too labour intensive, time consuming and expensive’ to be repeated (Young, 1998b). However, many years later, the US 1940 census was indexed in just four months by over 160,000 volunteers (1940 US Census Community Project, 2012), and co-operative historical projects flourish.

This example illustrates the long history of co-operative transcription and indexing projects, the significant contribution they made to the work of other historians and the vital role of community history organizations and volunteers in participatory heritage projects. The difference between the reach and efficiency of projects initiated in the 1980s and the 2010s also highlights the role of networked technologies in enabling wider participation in cooperative digitization projects. This chapter examines the important contributions of community historians to participatory heritage, discussing how family and local historians have voluntarily organized or contributed to projects to collect, digitize and publish historical sources about British history. This insight into grassroots projects may be useful for staff in cultural heritage institutions who encounter or seek to work with community historians.

The questions addressed in this chapter are drawn from research which sought to understand the impact of participatory digital history projects on users. This research involved reviewing a corpus of over 400 digital history projects, analysing those that aimed to collect, create or enhance records about historical materials. The corpus included both community- and institutionally led participatory history sites. Points of analysis included ‘microcopy’ (small pieces of text such as slogans, instructions and navigation) and the visible affordances, or website interface features, that encourage, allow or disable various participatory functions.

Bio

Mia Ridge is a Digital Curator in the British Library’s Digital Scholarship team. She has a PhD in digital humanities (2015, Department of History, Open University) entitled Making Digital History: the impact of digitality on public participation and scholarly practices in historical research. Previously, she conducted human-computer interaction-based research on crowdsourcing in cultural heritage.