This wiki page covers the data held by the project. Below are a link to the Datasets database, documentation templates for new data acquired or created during the project, and lists of potential datasets that might be useful in the future.
Datasets database
Datasets’ documentation
Datasets’ file naming conventions
Initial data set questionnaire (2021/22)
Data Storage Options
Datasets behind paywalls
Potential datasets for the future
- Two Centuries of Indian Print - British Library. Datasets on the quarterly lists of books published in India (1713-1914), along with Transkribus training data for Bengali-language OCR. The datasets are downloadable through the BL Data Repository. https://www.bl.uk/projects/two-centuries-of-indian-print
- List of manufacturers of gas plants, appliances and gas engineering services - Historic England.