TY - GEN
T1 - An experimental workflow development platform for historical document digitisation and analysis
AU - Neudecker, Clemens
AU - Schlarb, Sven
AU - Dogan, Zeki Mustafa
AU - Missier, Paolo
AU - Sufi, Shoaib
AU - Williams, Alan
AU - Wolstencroft, Katy
PY - 2011
Y1 - 2011
N2 - The paper presents a novel web-based platform for experimental workflow development in historical document digitisation and analysis. The platform has been developed as part of the IMPACT project, providing a range of tools and services for transforming physical documents into digital resources. It explains the main drivers in developing the technical framework and its architecture, how and by whom it can be used and presents some initial results. The main idea lies in setting up an interoperable and distributed infrastructure based on loose coupling of tools via web services that are wrapped in modular workflow templates which can be executed, combined and evaluated in many different ways. As the workflows are registered through a Web 2.0 environment, which is integrated with a workflow management system, users can easily discover, share, rate and tag workflows and thereby support the building of capacity across the whole community. Where ground truth is available, the workflow templates can also be used to compare and evaluate new methods in a transparent and flexible way.
AB - The paper presents a novel web-based platform for experimental workflow development in historical document digitisation and analysis. The platform has been developed as part of the IMPACT project, providing a range of tools and services for transforming physical documents into digital resources. It explains the main drivers in developing the technical framework and its architecture, how and by whom it can be used and presents some initial results. The main idea lies in setting up an interoperable and distributed infrastructure based on loose coupling of tools via web services that are wrapped in modular workflow templates which can be executed, combined and evaluated in many different ways. As the workflows are registered through a Web 2.0 environment, which is integrated with a workflow management system, users can easily discover, share, rate and tag workflows and thereby support the building of capacity across the whole community. Where ground truth is available, the workflow templates can also be used to compare and evaluate new methods in a transparent and flexible way.
KW - digitisation
KW - evaluation
KW - historical documents
KW - optical character recognition
KW - scientific workflow
KW - web service
UR - http://www.scopus.com/inward/record.url?scp=80054790697&partnerID=8YFLogxK
U2 - 10.1145/2037342.2037370
DO - 10.1145/2037342.2037370
M3 - Conference contribution
AN - SCOPUS:80054790697
SN - 9781450309165
T3 - ACM International Conference Proceeding Series
SP - 161
EP - 168
BT - HIP'11 - Proceedings of the 2011 Workshop on Historical Document Imaging and Processing
T2 - 1st International Workshop on Historical Document Imaging and Processing, HIP'11, Held in Conjunction with ICDAR 2011
Y2 - 16 September 2011 through 17 September 2011
ER -