Project Details
" Three Centuries of a Damstadt Newspaper" - Digitisation of the Darmstädter Tagblatt (1740-1986). Phase II (1949–1986)
Subject Area
Applied Linguistics, Computational Linguistics
Modern and Contemporary History
Communication Sciences
Modern and Contemporary History
Communication Sciences
Term
since 2019
Project identifier
Deutsche Forschungsgemeinschaft (DFG) - Project number 422840794
This project is the second phase of the digitization of the newspaper "Darmstädter Tagblatt" which was published for more than two centuries between 1740-1986. Originally published weekly as Darmstädtisches Frag- und Anzeigen-Blättgen, a general advertiser, it developed into a daily newspaper within the latter half of the 19th century. In the first phase, the issues that appeared between 1740 and 1941 have been digitised including OCR (optical character recognition) generated text versions encoded in XML. A subset of the digitised texts was enriched with linguistic annotations (parts of speech, named entities). The project also started to reach out to corporate copyright holders (agencies, publishing houses) of the issues published after 1941 and managed to negotiate agreements with most of them. To ensure that all articles and images can be legally published the project developed the concept for a workflow in two parts: The first part utilizes corpus linguistic methods for detecting potential copyright holders in the text version, while the second part uses the results of the first part to search for the addresses or contact information of the copyright holders. Drawing on the work of the first phase, this second phase is dedicated not only to digitise and annotate all issues published between 1941 and 1986 but to further develop and implement the workflow for (semi-)automatic detection of copyright holders. Potential right holders which have been detected in the text version will be searched and contacted. The images and text versions will be made available for download and online reading via the online publishing infrastructure of the ULB Darmstadt (DWork, eXist, DFG-Viewer).
DFG Programme
Cataloguing and Digitisation (Scientific Library Services and Information Systems)