Publication . Article . Preprint . 2021 . Embargo end date: 01 Jan 2021

FAST CAT: Collaborative Data Entry and Curation for Semantic Interoperability in Digital Humanities

Pavlos Fafalios; Kostas Petrakis; Georgios Samaritakis; Korina Doerr; Athina Kritsotaki; Yannis Tzitzikas; Martin Doerr;
Open Access
Published: 31 Dec 2021
Publisher: arXiv
Descriptive and empirical sciences, such as History, are the sciences that collect, observe and describe phenomena in order to explain them and draw interpretative conclusions about influences, driving forces and impacts under given circumstances. Spreadsheet software and relational database management systems are still the dominant tools for quantitative analysis and overall data management in these sciences, allowing researchers to directly analyse the gathered data and perform scholarly interpretation. However, this practice has several limitations: the strong dependency of the collected data on the initial research hypothesis, which usually makes the data useless for other research; the lack of representation of the details from which the registered relations are inferred; and the difficulty of revisiting the original data sources for verification, corrections or improvements. To cope with these problems, in this paper we present FAST CAT, a collaborative system for assistive data entry and curation in Digital Humanities and similar forms of empirical research. We describe the related challenges and the overall methodology we follow for supporting semantic interoperability, and we discuss the use of FAST CAT in the context of a European (ERC) project of Maritime History, called SeaLiT, which examines the economic, social and demographic impacts of the introduction of steamboats in the Mediterranean area between the 1850s and the 1920s.
Comment: This is a preprint of an article accepted for publication at the ACM Journal on Computing and Cultural Heritage (JOCCH)
Subjects by Vocabulary

Microsoft Academic Graph classification: Semantic interoperability, Context (language use), Relational database management system, Data science, Empirical research, Archival research, Data curation, Data management, Computer science, Dependency (project management)


Digital Libraries (cs.DL), Databases (cs.DB), FOS: Computer and information sciences, Computer Graphics and Computer-Aided Design, Computer Science Applications, Information Systems, Conservation, Computer Science - Digital Libraries, Computer Science - Databases

Funded by
EC| ReKnow
Research Knowledge Documentation, Analysis and Exploration in Empirical and Descriptive Sciences
  • Funder: European Commission (EC)
  • Project Code: 890861
  • Funding stream: H2020 | MSCA-IF-EF-ST
EC| SeaLiT
Seafaring Lives in Transition. Mediterranean Maritime Labour and Shipping during Globalization, 1850s-1920s.
  • Funder: European Commission (EC)
  • Project Code: 714437
  • Funding stream: H2020 | ERC | ERC-STG
Related to Research communities
Digital Humanities and Cultural Heritage