Project Details
Additional Project Partner of the Coordinated Funding Initiative for the Further Development of Optical Character Recognition Processes (OCR-D)
Applicant
Dr. Rainer Stotzka
Term
from 2017 to 2019
Project identifier
Deutsche Forschungsgemeinschaft (DFG) - Project number 390834936
The Coordinated Funding Initiative for the Further Development of Optical Character Recognition Processes (OCR-D) was approved by the DFG on 6 May 2015. We propose to extend the OCR-D consortium by an additional member. In recent years, the orientation of OCR-D has been refined. The review panel has therefore recommended bringing forward the integration of the module results in accordance with the DFG call Skalierbare Verfahren der Text- und Strukturerkennung für die Volltextdigitalisierung historischer Drucke (Scalable Methods of Text and Structure Recognition for the Full-Text Digitisation of Historical Prints). The present proposal adds to the competences of the OCR-D partners in software technologies and development, workflow interoperability, and interfaces. The main objective is to ensure the technical feasibility of a complex OCR workflow composed of the module results. The proposed technical OCR-D framework consists of three parts: an OCR-D research data repository to preserve the final and intermediate data of the modules, a workflow engine to compose the OCR workflow and record its provenance, and the definition of software interfaces to guarantee the interoperability of the modules. The final OCR results will be stored and prepared for publication and archiving.
DFG Programme
Research data and software (Scientific Library Services and Information Systems)
Cooperation Partners
Professor Dr. Martin Grötschel; Barbara Schneider-Kempf; Professor Dr. Thomas Stäcker