Present: Tim (chair), Alex, Arran, Jane, Daniel W, Stefania, Anna-Maria (online), Daniel B (online), Felix, Sarah, Carol, Nina

Apologies: Helen, Kunika

Agenda:

  1. Data acquisition update - Anna-Maria
    1. Data acquisition meeting
    2. Overview of data acquisition so far
  2. Trade Directories - Alex, Daniel W
    1. Trade directories work in preparation for the occupations investigation & exhibit (Alex & Felix)
    2. ML approaches to making trade directories machine readable (Daniel)
  3. Folk songs update - Stefania and Daniel B

Notes:

  1. Data acquisition:

    1. report on data acquisition as of 28 April 2023

    CE_data_management28042023.pdf

    AMS talked through the report above. TB asked if there were different formats that the report could come in? Could we see what datasets are related to what investigation etc?

    JW asked how we can get movement on the dataset documentation. Clarification that everyone is responsible for recording the datasheets on their datasets. SZL confirmed that she has datasheets for a number of her datasets, but they are in progress. AMS recommended uploading version, so we have the most up-to-date datasheet at all times. AMS - it is important to do some reflection work on the reasons why we’re collecting data and what we intend to do with the datasets.

  2. Trade Directories: Alex and Felix - needed to do a bit of work related to the exhibit development. Trade Directory work at the Turing was doing particularly innovative work, but Felix and Alex needed to do some smaller (”lumpier”) work. Focused on Bradford and Newcastle due to the requirements of the exhibit. 2-3 weeks of data cleaning and modelling, using OpenRefine. Have mappable occupations data for about 30% of Newcastle, and think that can be scalable up to 70%. Difficulty of the data related to abbreviations etc. Interested in seeing how generalisable it can be. May experiment with segment everything - depending on what is decided with regards to DW’s investigation.

  1. Folk songs: Details of the investigation can be found here - Connecting workers’ songs with mining and textile collections: a cross-strand investigation. The investigation is coming back to the investigation meeting to share progress and establish the next steps.

    Reflections on the folk song investigation.pdf

    Working with MS Word as annotation tool with Jennifer. Focusing on annotating the lyrics to create an annotation schema. Focus on the human-in-the-loop aspects of working with Jenn’s expertise. There was an assumption that Word was widely accessible, but Jenn did have it and that is a good prompt for us to consider the openness of tools we use. Have done two 2-hour sessions annotating songs with Jennifer on Teams. Want to bring in reflections on the annotation process and reflections on working with a domain expert.

    Key action points from the discussion on next steps:

    AB asked AMS if she can suggests reading from OH projects that can help us not only to reflect on the content but also of the sonic affordances of the data. A training about modeling with Sarah is already under discussion.