Kunika Kono, Felix Needham-Simpson, Stefania

We discussed the use of Prodigy (and other annotation tools) in relation to the folk songs investigation, and specific questions: what is involved in and required for selecting and setting up an annotation tool for data labeling? This in view of finalising the involvement of Jennifer Reid - the folk songs expert - who will be helping the investigation in developing the annotation schema over the next couple of months.

Before deciding on an annotation tool there are several things to consider/define, such as:

We agreed that there is a lot to think about, investigate and survey, and given that the scope of the folk songs investigation's annotation work is still in the process of formulation, we would explore Microsoft Word as a preliminary annotation tool for exploratory text labeling and annotation. In the next drop-in session, we will look at proof of concept workflow, from annotating in Word Document to using the annotated text in an NLP annotation tool and/or Machine Learning.

We discussed the opportunity to develop a ‘reflective diary’ of the annotation work with Jennifer, to observe how the technical and the human interact in the ‘human in the loop’ approach. We also relfected on the scope of the technical knowledge in the investigation and agreed that it would be helpful, for Stefania and Daniel, to understand how XML works and why it is useful, without necessarily learn how to use the language (this would require a dedicated training and an amount of time that it is outside the scope of the investigation itself). Instead, it would be most important for the co-investigators to start familiarizing with XML files, being able to identify the annotated labels from the Word document (and other, more developed annotation tools) and read about other examples/case studies where XML was used in digital humanities/digital history.

<aside> 💭 Read Stefania’s reflective notes on this session

</aside>

Action points