ORAL HISTORY INVESTIGATION SESSION IN LEICESTER ON 28 JUNE
We discussed what could be, for us, a ‘good enough’ proof of concept that can provide design specifications for the National Collection from the perspective of Oral History. We realized that, despite a preliminary proof of concept is already accomplished (but it would require fine tuning with a NLP specialist), there is still a question to be answered:
Within what’s been done, how do we start to provide the structured data that enable to connect OH recordings with other datasets?
We agreed that in order to complete the proof of concept, we don’t need further Oral History datasets, but we would need to work on two different levels:
We reflected on the fact that visualizations usually have two steps, one is exploratory and the second is presentational for the public realm. Until now, the tool we are working is for expertise/internal use only, and further work needs to be done to reach the second level. There might be scope to apply this pipeline to the textile OH dataset for the Bradford Exhibition and develop a presentational visualization.
We agreed on the following next steps:
We also talked about a criticality which was raised at the Museum Analytics Conference in May, on the risk to limit to textual and numerical data a sonic historical form such as Oral History. We discussed the opportunity to bring the audio dimension back into the exploratory tool, connecting the clusters to the correspondent audio extracts. Stef highlighted that this is not a problem to implement, but it takes time and it is something that can be potentially developed in a future stage. We agreed that, for the purpose of the Worskhop, we could manually mix different voices related to one of the topics identified, to provide a use case scenario to discuss.
In the last part of the day, Colin Hyde and Sally Horrocks joined us, and we discussed the role of archive owners and Oral Historians who know the data very well and that can have a fundamental role in making sense of the clusters (also considering that, with the dimensionality reduction, you lose a lot of information). For example, there are particular words in each mining communities that only oral historical knowledge would be able to understand (Sally provided the example of the word ‘snap’ to mean lunch in the East Midlands mining community). In the CE vision of collaboration, human is as important as the automated process, and we need to identify what kind of expertise and technical capabilities contribute to refine the topics. There is also important information at the paralinguistic level (for example the tone of voice would tell you if one sentence is a sarcastic comment or not). The role of pauses and silences are also important, very often they are the most interesting part of the interviews, as well as the pace of different sections.
Sally and Colin also suggested further potential areas where the exploratory tool could help: