Upcoming talks and travel

Trinity lecture poster
Trinity lecture poster

My edited volume on ‘Crowdsourcing our Cultural Heritage’ for Ashgate, featuring chapters from some of the most amazing people working in the field was published in October 2014. (I know I’m biased, but seriously.) You can read my introduction on the OU repository: Crowdsourcing Our Cultural Heritage: Introduction.

By day, I’m usually at work at the British Library, so drop me a line if you’d like to meet for coffee and a chat. This means my availability for events is limited, but you can drop me a line if you’d like to book me for an event.

Some upcoming trips: in February I’m doing a workshop in Edinburgh for Dr Anouk Lang’s Beyond the Black Box: Building Algorithmic and Statistical Literacy through Digital Humanities Tools and Resources and in Santa Barbara for Always Already Computational: Library Collections as Data. I’m keynoting at DIGIKULT 2017 in Sweden in March, and in June I’m in Sydney for the Future Library Congress at EduTECH. I will be popping down to Melbourne so let me know if you’re up for a coffee. I’ll also be giving a keynote in France in October.

Some recent papers

This is rarely up-to-date or complete, but…

In November 2016 I was in Riga, Latvia to give the closing keynote at the Europeana Network Association AGM 2016. In October I spoke at ‘What should be in your digital toolbox‘, gave a keynote, ‘Digital history: evolution or transformation?’ at The Science of Evolution and the Evolution of the Sciences conference in Leuven, Belgium around October 12th and 13th, 2016 and at Internet Librarian International then chaired the Museums Computer Group’s Museums+Tech conference. In August I was in York for ‘Negotiating Expertise’ and in Helsinki for Museum Theme Days 2016 in September.

In June 2016 I was in Luxembourg for a workshop on Network Visualisation in the Cultural Heritage Sector. My talk notes for Network visualisations and the ‘so what?’ problem are online. I also keynoted at LIBER (Ligue des Bibliothèques Européennes de Recherche – Association of European Research Libraries) in Helsinki. My slides are online but may not make much sense without notes.

In March 2016 I was at Rice University in Houston then Austin (at the iSchool in UT Austin then St Edwards), then I was on a panel on ‘Build the Crowdsourcing Community of Your Dreams’ at SXSWi 2016 with Ben Brumfield, Meghan Ferriter and Siobhan Leachman.

In January 2016 I was back in Oxford for a workshop on ‘DIY Digitisation’ at the Bodleian Libraries.

Here’s a summary of talks, fellowships, writing, etc in 2015, 2014, 2013, 2012 and 2011. You can also follow me on twitter (@mia_out) for updates.

Previous papers are generally listed at miaridge.com or on my blog, Open Objects.

Position paper: From libraries as patchwork to datasets as assemblages?

Photo of beach view

My position paper for Always Already Computational: Collections as Data. Every attendee wrote one – read the others at Collections as Data – National Forum Position Statements.

From libraries as patchwork to datasets as assemblages?

Dr Mia Ridge, Digital Curator, British Library

The British Library’s collections are vast, and vastly varied, with 180-200 million items in most known languages. Within that, there are important, growing collections of manuscript and sound archives, printed materials and websites, each with its own collecting history and cataloguing practices. Perhaps 1-2% of these collections have been digitised, a process spanning many years and many distinct digitisation projects, and an ensuing patchwork of imaging and cataloguing standards and licences. This paper represents my own perspective on the challenges of providing access to these collections and others I’ve worked with over the years.

Many of the challenges relate to the volume and variety of the collections. The BL is working to rationalise the patchwork of legacy metadata systems into a smaller number of strategic systems.[1] Other projects are ingesting masses of previously digitised items into a central system, from which they can be displayed in IIIF-compatible players.[2]

The BL has had an ‘open metadata’ strategy since 2010, and published a significant collection of metadata, the British National Bibliography, as linked open data in 2011.[3] Some digitised items have been posted to Wikimedia Commons,[4] and individual items can be downloaded from the new IIIF player (where rights statements allow). The BL launched a data portal, https://data.bl.uk/, in 2016. It’s work-in-progress – many more collections are still to be loaded, the descriptions and site navigation could be improved – but it represents a significant milestone many years in the making. The BL has particularly benefitted from the work of the BL Labs team in finding digitised collections and undertaking the paperwork required to make the freely available. The BL Labs Awards have helped gather examples for creative, scholarly and entrepreneurial uses of digitised collections collection re-use, and BL Labs Competitions have led to individual case studies in digital scholarship while helping the BL understand the needs of potential users.[5] Most recently, the BL has been working with the BBC’s Research and Education Space project,[6] adding linked open data descriptions about articles to its website so they can be indexed and shared by the RES project.

In various guises, the BL has spent centuries optimising the process of delivering collection items on request to the reading room. Digitisation projects are challenging for systems designed around the ‘deliverable item’, but the digital user may wish to access or annotate a specific region of a page of a particular item, but the manuscript itself may be catalogued (and therefore addressable) only at the archive box or bound volume level. The visibility of research activities with items in the reading rooms is not easily achieved for offsite research with digitised collections. Staff often respond better to discussions of the transformational effect of digital scholarship in terms of scale (e.g. it’s faster and easier to access resources) than to discussions of newer methods like distant reading and data science.

The challenges the BL faces are not unique. The cultural heritage technology community has been discussing the issues around publishing open cultural data for years,[7]in part because making collections usable as ‘data’ requires cooperation, resources and knowledge from many departments within an institution. Some tensions are unavoidable in enhancing records for use externally – for example curators may be reluctant or short of the time required to pin down their ‘probable’ provenance or date range, let alone guess at the intentions of an earlier cataloguer or learn how to apply modern ontologies in order to assign an external identifier to a person or date field.

While publishing data ‘as is’ in CSV files exported from a collections management system might have very little overhead, the results may not be easily comprehensible, or may require so much cleaning to remove missing, undocumented or fuzzy values that the resulting dataset barely resembles the original. Publishing data benefits from workflows that allow suitably cleaned or enhanced records to be re-ingested, and export processes that can regularly update published datasets (allowing errors to be corrected and enhancements shared), but these are all too rare. Dataset documentation may mention the technical protocols required but fail to describe how the collection came to be formed, what was excluded from digitisation or from the publishing process, let alone mention the backlog of items without digital catalogue records, let alone digitised images. Finally, users who expect beautifully described datasets with high quality images may be disappointed when their download contains digitised microfiche images and sparse metadata.

Rendering collections as datasets benefits from an understanding of the intangible and uncertain benefits of releasing collections as data and of the barriers to uptake, ideally grounded in conversations with or prototypes for potential users. Libraries not used to thinking of developers as ‘users’ or lacking the technical understanding to translate their work into benefits for more traditional audiences may find this challenging. My hope is that events like this will help us deal with these shared challenges.

[1] The British Library, ‘Unlocking The Value: The British Library’s Collection Metadata Strategy 2015 – 2018’.

[2] The International Image Interoperability Framework (IIIF) standard supports interoperability between image repositories. Ridge, ‘There’s a New Viewer for Digitised Items in the British Library’s Collections’.

[3] Deloit et al., ‘The British National Bibliography: Who Uses Our Linked Data?’

[4] https://commons.wikimedia.org/wiki/Commons:British_Library

[5] http://www.bl.uk/projects/british-library-labs, http://labs.bl.uk/Ideas+for+Labs

[6] https://bbcarchdev.github.io/res/

[7] For example, the ‘Museum API’ wiki page listing machine-readable sources of open cultural data was begun in 2009 http://museum-api.pbworks.com/w/page/21933420/Museum%C2%A0APIs following discussion at museum technology events and on mailing lists.

Photo of beach view
The view from UC Santa Barbara is alright, I suppose

Workshop: Data visualisation for ‘Beyond the Black Box’

Beyond the Black Box is a programme of advanced digital humanities workshops at the University of Edinburgh, designed to foster statistical, algorithmic and quantitative literacy. It is directed by Anouk Lang, administered by Robyn Pritzker and funded by a grant from the British Academy.

I was invited to give a workshop on Data Visualisation. My slides are below, and my exercises are collected in a Google Doc for easier access to links.

I developed a new exercise for this and the CHASE workshop, and have blogged about it at Trying computational data generation and entity extraction.

Discussing positive and negative traits of interactive scholarly visualisations.

Workshop: Information Visualisation, CHASE Arts and Humanities in the Digital Age 2017

I ran a full-day workshop on Information Visualisation for the CHASE Arts and Humanities in the Digital Age training programme at Birkbeck, London, in February 2017. The abstract:

Visualising data to understand it or convince others of an argument contained within it has a long history. Advances in computer technology have revolutionised the process of data visualization, enabling scholars to ask increasingly complex research questions by analysing large scale datasets with freely available tools.

This workshop will give you an overview of a variety of techniques and tools available for data visualisation and analysis in the arts and humanities. The workshop is designed to help participants plan visualisations by discussing data formats used for the building blocks of visualisation, such as charts, maps, and timelines. It includes discussion of best practice in visual design for data visualisations and practical, hands-on activities in which attendees learn how to use online tools such as Viewshare to create visualisations.

At the end of this course, attendees will be able to:

  • Create a simple data visualisation
  • Critique visualisations in terms of choice of visualisation type and tool, suitability for their audience and goals, and other aspects of design
  • Recognise and discuss how data sets and visualisation techniques can aid researchers

Please remember to bring your laptop.

Slides

Exercises for CHASE’s ADHA 2017 Introduction to Information Visualisation

  • Exercise 1: comparing n-gram tools
  • Exercise 2: Try entity extraction
  • Exercise 3: exploring scholarly data visualisations
  • Viewshare Exercise 1: Ten minute tutorial – getting started
  • Viewshare Exercise 2: Create new views and widgets

Chapter: ‘The contributions of family and local historians to British history online’

Participatory Heritage, edited by Henriette Roued-Cunliffe and Andrea Copeland, has just been published by Facet.

My chapter is ‘The contributions of family and local historians to British history online‘. My abstract:

Community history projects across Britain have collected and created images, indexes and transcriptions of historical documents ranging from newspaper articles and photographs, to wills and biographical records. Based on analysis of community- and institutionally-led participatory history sites, and interviews with family and local historians, this chapter discusses common models for projects in which community historians cooperated to create digital resources. For decades, family and local historians have organised or contributed to projects to collect, digitise and publish historical sources about British history. What drives amateur historians to voluntarily spend their time digitising cultural heritage? How do they cooperatively or collaboratively create resources? And what challenges do they face?

Mia Ridge is a Digital Curator in the British Library’s Digital Scholarship team. She has a PhD in digital humanities (2015, Department of History, Open University) entitled Making Digital History: the impact of digitality on public participation and scholarly practices in historical research. Previously, she conducted human-computer interaction-based research on crowdsourcing in cultural heritage.

9781783301232

Talk: Planning for big data (lessons from cultural heritage)

I was invited to give an hour-long talk for the Association for Project Management’s Knowledge Management SIG event on ‘What does big data mean for project and knowledge managers?’. I shared lessons from work in cultural heritage, including the British Library and Cooper Hewitt Design Museum, on ‘Planning for Big Data’.

Panel: Build the Crowdsourcing Community of Your Dreams, SXSW

Panel photo

Having successfully passed the SXSW ‘panel picker’ process, I went to SXSW Interactive 2016 to discuss ‘building the crowdsourcing community of your dreams’ with Ben Brumfield, Meghan Ferriter and Siobhan Leachman (aka @benwbrum, @meghaninmotion and @SiobhanLeachman). We were in the ‘Art, Science, & Inspiration’ track, and while it may have been luck with timing or our title, the venue was standing room only for a while.

Our slides are online, and we put together a list of further resources to tweet during the panel at http://bit.ly/GLAMcrowd.

Siobhan storified our session and also posted her talk notes. She’s such a passionate volunteer, and you couldn’t get a better account of ‘How cultural institutions encouraged me to participate in crowdsourcing & the factors I consider before donating my time‘.

Panel photo
SXSW crowdsourcing panel photo by Effie Kapsalis @digitaleffie

 

If you’re interested in our panel, you might also be interested in the later ‘SXSW 2016 – Give It Away to Get Rich: Open Cultural Heritage‘.

Everything SXSW - lamp posts protected from extreme flyering, pedicabs, sunshine and a lounge
Everything SXSW – lamp posts protected from extreme flyering, pedicabs, sunshine and a lounge

Talk: St. Edwards University, Austin

View of downtown Austin
View of downtown Austin
The view of downtown Austin from St Edwards

As part of my trip to Texas for SXSW, I was invited to present on ‘Crowdsourcing, learning and citizen scholarship’ at St Edwards University on March 10, 2016.

Having given an online seminar for Rebecca Frost Davis in a previous role, it was a pleasure to meet her at last, and hear about her work as Director of Instructional and Emerging Technology.

My talk discussed how crowdsourcing projects might offer an opportunity for students to contribute to both cultural heritage and citizen science projects.

Talk: Crowdsourcing in Cultural Heritage, iSchool, UT Austin

As part of my trip to Texas for SXSW, I was invited to present on ‘Crowdsourcing in Cultural Heritage’ at a colloquium at a School of Information Research Event at UT Austin on March 8, 2016.

My thanks to the organisers for their excellent hospitality, and to the attendees for their thoughtful and probing questions!

My abstract: Why and how are museums, libraries, archives and academic projects creating crowdsourcing projects to help digitize collections or enhance their knowledge about them? Based on a review of hundreds of heritage crowdsourcing projects, this talk will highlight examples of successful projects, discuss why members of the public volunteer their time, and consider the different outcomes possible.

Austin's Capitol building
Austin’s Capitol building

Workshop: Crowdsourcing and Cultural Heritage, Rice University

Photo of campus gate

As part of my trip to Texas for SXSW, I was invited to give a workshop on ‘Crowdsourcing and Cultural Heritage’ in the Fondren Library at Rice University’s Humanities Research Center Sawyer Seminar series on March 7, 2016. My slides are below. My visit was a great chance to find out more about the teaching and projects at the Research Center, and my thanks go to the organisers for their excellent hospitality.

Abstract: This workshop will provide an overview of crowdsourcing in cultural heritage and consider the ethics and motivations for participation. International case studies will be discussed to provide real life illustrations of design tips and to inspire creative thinking.

Photo of campus gate
Rice University

2015: an overview

An incomplete list of publications, papers, etc. from 2015.

In December 2015 I was in Glasgow and Berlin to talk about crowdsourcing in history and cultural heritage. I was also invited to give a lecture on ‘Digital History’ for Digital Humanities @ Universität Bern and gave an Introduction to Information Visualisation for the CHASE doctoral training programme.

On October 26 I was at the British Museum for the Museums Computer Group’s annual conference and gave a talk on ‘Crowdsourcing, scholarship and the academy’ for the School of Advanced Studies in London, and another on Choosy crowds and the machine age: challenges for the future of humanities crowdsourcing, KCL. I also started working as a Digital Curator with the British Library.

In early September I was in Estonia for the ‘Community Involvement in Theme Museums‘ conference (2nd – 3rd) and then at Kings College London on ‘Choosy crowds and the machine age: challenges for the future of humanities crowdsourcing‘ for Citizen Humanities Comes of Age: Crowdsourcing for the Humanities in the 21st Century (9th – 10th).

Over the summer I worked on the Hidden Museum Project with the Oxford University Museums, testing QR codes, beacons and other methods for delivering different kinds of content on mobile devices in the Museum of the History of Science, the Museum of Natural History and the Ashmolean. Ben Brumfield and I consulted and wrote for the Wellcome Library on the Wellcome Library Transcribing Recipes crowdsourcing project.

In July I spoke on ‘Open Data: Trends and Practice within Cultural Heritage. AKA, the good, the bad, and the unstructured…’ at Pelagios: Linked Pasts and on ‘Let Your Projects Shine: Lightweight Usability Testing for Digital Humanities Projects’ at Oxford’s Digital Humanities Summer School.

In the last week of July I taught ‘Crowdsourcing Cultural Heritage’ with Ben Brumfield at the HILT Summer School (Humanities Intensive Learning + Teaching) at Indiana University-Purdue University Indianapolis (IUPUI) Indianapolis, Indiana.

In late June/early July, I was in Sydney for Digital Humanities 2015, gave a half-day workshop on Linking humanities data geospatially with Pelagios and Recogito with Leif Isaksen, and presented a paper (‘Small ontologies, loosely joined’: linked open data for the First World War) in a panel on Linked Open Data and the First World War at Digital Humanities 2015 (based on my experiences as a Fellow at Trinity College Dublin working on histories of World War One with the CENDARI project).

In June 2015 I submitted my thesis (!), presented at Connected Life in Oxford and taught a workshop on Information Visualisation for CHASE Arts and Humanities in the Digital Age.

In May 2015 I gave a keynote on Crowdsourcing our cultural heritage at Nordiske Arkivdage 2015 in Copenhagen and taught a workshop on scholarly data visualisation at the University of St Andrews.