In this article, the Mingei Online Platform is presented as an authoring platform for the representation of social and historic context encompassing a focal topic of interest. The proposed representation is employed in the contextualised presentation of a given topic, through documented narratives that support its presentation to diverse audiences. Using the obtained representation, the documentation and digital preservation of social and historical dimensions of Cultural Heritage are demonstrated. The implementation follows the Human-Centred Design approach and has been conducted under an iterative design and evaluation approach involving both usability and domain experts.
Digital Libraries (DLs), especially in the Cultural Heritage domain, are rich in narratives. Every digital object in a DL tells some kind of story, regardless of the medium, the genre, or the type of the object. However, DLs do not offer services about narratives, for example it is not possible to discover a narrative, to create one, or to compare two narratives. Certainly, DLs offer discovery functionalities over their contents, but these services merely address the objects that carry the narratives (e.g. books, images, audiovisual objects), without regard for the narratives themselves. The present work aims at introducing narratives as first-class citizens in DLs, by providing a formal expression of what a narrative is. In particular, this paper presents a conceptualisation of the domain of narratives, and its specification through the Narrative Ontology (NOnt for short), expressed in first-order logic. NOnt has been implemented as an extension of three standard vocabularies, i.e. the CIDOC CRM, FRBRoo, and OWL Time, and using the SWRL rule language to express the axioms. On the basis of NOnt, we have developed the Narrative Building and Visualising (NBVT) tool, and applied it in four case studies to validate the ontology. NOnt is also being validated in the context of the Mingei European project, in which it is applied to the representation of knowledge about Craft Heritage. Preprint
Project: SNSF | Forschungslogiken in den ... (183221), SNSF | Mining Goodreads: a text ... (183194), EC | READIT (792849), SNSF | Geteiltes Lesen. Literatu... (183012)
Abstract Prominent among the social developments that the web 2.0 has facilitated is digital social reading (DSR): on many platforms there are functionalities for creating book reviews, ‘inline’ commenting on book texts, online story writing (often in the form of fanfiction), informal book discussions, book vlogs, and more. In this article, we argue that DSR offers unique possibilities for research into literature, reading, the impact of reading and literary communication. We also claim that in this context computational tools are especially relevant, making DSR a field particularly suitable for the application of Digital Humanities methods. We draw up an initial categorization of research aspects of DSR and briefly examine literature for each category. We distinguish between studies on DSR that use it as a lens to study wider processes of literary exchange as opposed to studies for which the DSR culture is a phenomenon interesting in its own right. Via seven examples of DSR research, we discuss the chosen approaches and their connection to research questions in literary studies.
We investigate the accuracy of how author names are reported in bibliographic records excerpted from four prominent sources: WoS, Scopus, PubMed, and CrossRef. We take as a case study 44,549 publications stored in the internal database of Sapienza University of Rome, one of the largest universities in Europe. While our results indicate generally good accuracy for all bibliographic data sources considered, we highlight a number of issues that undermine the accuracy for certain classes of author names, including compound names and names with diacritics, which are common features to Italian and other Western languages.
Empathy, as defined in behavioral sciences, expresses the ability of human beings to recognize, understand and react to emotions, attitudes and beliefs of others. The lack of an operational definition of empathy makes it difficult to measure it. In this paper, we address two related problems in automatic affective behavior analysis: the design of the annotation protocol and the automatic recognition of empathy from spoken conversations. We propose and evaluate an annotation scheme for empathy inspired by the modal model of emotions. The annotation scheme was evaluated on a corpus of real-life, dyadic spoken conversations. In the context of behavioral analysis, we designed an automatic segmentation and classification system for empathy. Given the different speech and language levels of representation where empathy may be communicated, we investigated features derived from the lexical and acoustic spaces. The feature development process was designed to support both the fusion and automatic selection of relevant features from high dimensional space. The automatic classification system was evaluated on call center conversations where it showed significantly better performance than the baseline. Comment: Journal of Computer Speech and Language
International audience; Research e-infrastructures, digital archives and data services have become important pillars of scientific enterprise that in recent decades has become ever more collaborative, distributed and data-intensive. The archaeological research community has been an early adopter of digital tools for data acquisition, organisation, analysis and presentation of research results of individual projects. However, the provision of e-infrastructure and services for data sharing, discovery, access and re-use has lagged behind. This situation is being addressed by ARIADNE: the Advanced Research Infrastructure for Archaeological Dataset Networking in Europe. This EU-funded network has developed an e-infrastructure that enables data providers to register and provide access to their resources (datasets, collections) through the ARIADNE data portal, facilitating discovery, access and other services across the integrated resources. This article describes the current landscape of data repositories and services for archaeologists in Europe, and the issues that make interoperability between them difficult to realise. The results of the ARIADNE surveys on users' expectations and requirements are also presented. The main section of the article describes the architecture of the e-infrastructure, core services (data registration, discovery and access) and various other extant or experimental services. The on-going evaluation of the data integration and services is also discussed. Finally, the article summarises lessons learned, and outlines the prospects for the wider engagement of the archaeological research community in sharing data through ARIADNE.
Current Digital Libraries (DLs) are mostly built around large collections of scarcely related objects. We aim at enriching the information space of DLs by introducing narratives, consisting of two main components: networks of events related to one another and to the DL resources through semantic links, and narrations of those events in texts. In order to introduce narratives in DLs, we developed a conceptualisation based on narratology and we expressed it using the CIDOC CRM and CRMinf as reference ontologies. We used this expression to validate our conceptualisation, creating a narrative of the biography of Dante Alighieri as a realistic case study. To support this experiment, we developed a semi-automated tool that collects basic knowledge about objects and events from Wikidata. The developed ontology is general enough to be not limited to create biographies but other types of narratives as well.
The Cultural Heritage (CH) community is one of the most active in the realisation of Aggregative Data Infrastructures (ADIs). ADIs provide tools to integrate data sources to form uniform and richer information spaces. The realisation of ADIs for CH must be based on technology capable of coping with complex interoperability issues and sustainability issues. In this paper, we present the D-NET software toolkit framework and services, devised for the realisation of sustainable and customisable ADIs. In particular, we demonstrate the effectiveness of D-NET in the CH scenario by describing its usage in the realisation of a real-case ADI for the EC project Heritage of the People's Europe (HOPE). The HOPE ADI uses D-NET to implement a two-phase metadata conversion methodology that addresses data interoperability issues while facilitating sustainability by encouraging participation of data sources.
[EN] Although paraphrasing is the linguistic mechanism underlying many plagiarism cases, little attention has been paid to its analysis in the framework of automatic plagiarism detection. Therefore, state-of-the-art plagiarism detectors find it difficult to detect cases of paraphrase plagiarism. In this article, we analyze the relationship between paraphrasing and plagiarism, paying special attention to which paraphrase phenomena underlie acts of plagiarism and which of them are detected by plagiarism detection systems. With this aim in mind, we created the P4P corpus, a new resource that uses a paraphrase typology to annotate a subset of the PAN-PC-10 corpus for automatic plagiarism detection. The results of the Second International Competition on Plagiarism Detection were analyzed in the light of this annotation.The presented experiments show that (i) more complex paraphrase phenomena and a high density of paraphrase mechanisms make plagiarism detection more difficult, (ii) lexical substitutions are the paraphrase mechanisms used the most when plagiarizing, and (iii) paraphrase mechanisms tend to shorten the plagiarized text. For the first time, the paraphrase mechanisms behind plagiarism have been analyzed, providing critical insights for the improvement of automatic plagiarism detection systems. We would like to thank the people who participated in the annotation of the P4P corpus, Horacio Rodriguez for his helpful advice as experienced researcher, and the reviewers of this contribution for their valuable comments to improve this article. This research work was partially carried out during the tenure of an ERCIM "Alain Bensoussan" Fellowship Programme. The research leading to these results received funding from the EU FP7 Programme 2007-2013 (grant no. 246016), the MICINN projects TEXT-ENTERPRISE 2.0 and TEXT-KNOWLEDGE 2.0 (TIN2009-13391), the EC WIQ-EI IRSES project (grant no. 269180), and the FP7 Marie Curie People Programme. The research work of A. Barron-Cedeno and M. Vila was financed by the CONACyT-Mexico 192021 grant and the MECD-Spain FPU AP2008-02185 grant, respectively. The research work of A. Barron-Cedeno was partially done in the framework of his Ph.D. at the Universitat Politecnica de Valencia.