Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to Digital Humanities and Cultural Heritage. Are you interested to view more results? Visit OpenAIRE - Explore.
2,668 Research products, page 1 of 267

  • Digital Humanities and Cultural Heritage
  • Research data
  • Research software
  • 2013-2022
  • Open Access
  • ZENODO
  • SEDICI (UNLP) - Universidad Nacional de La Plata

10
arrow_drop_down
Date (most recent)
arrow_drop_down
  • Open Access English
    Authors: 
    Sheridan A Stewart; Adam S Miner; Meghan C Halley; Laura K Nelson; Eleni Linos;
    Publisher: Zenodo

    Code for "Formally Comparing Topic Models and Human-Generated Qualitative Coding of Physician Mothers' Experiences of Workplace Discrimination" This repository contains the code used in "Formally Comparing Topic Models and Human-Generated Qualitative Coding of Physician Mothers' Experiences of Workplace Discrimination" by Adam S. Miner, Sheridan A. Stewart, Meghan C. Halley, Laura K. Nelson, and Eleni Linos. Please refer to the original paper in Big Data & Society. In this paper, we evaluate whether topic models identify themes similar to those found by human coders in a prior qualitative analysis of physician mothers' experiences of workplace discrimination. More detail is available at the main page for the repository.

  • Open Access
    Authors: 
    , Haneca; , Ervynck;
    Publisher: Zenodo

    In deze zenodo-record zijn de brongegevens terug te vinden waarop de analyses in het hoofdstuk "Kenniswinst archeologieregelgeving 2016-2021" zijn gebaseerd. Dit hoofdstuk maakt deel uit van het onderzoeksrapport "Ameels V, Carpentier F, De Ketelaere S, Ervynck A, Geuens J, Haneca K, Pieters M, Van Looveren J & Verhelst A 2023: Evaluatie archeologie 2016-2021, Onderzoeksrapport Agentschap Onroerend Erfgoed, Brussel.": 1. Een overzicht van alle ingediende eindverslagen (n = 576) en nota's met eindafwerking (n = 26) over de periode 1 april 2016 t.e.m. 31 december 2021 eindverslagen_notas_2016_2021_overzicht.csv (comma-separated values) eindverslagen_notas_2016_2021_overzicht.xlsx (Microsoft Excel Open XML Spreadsheet) eindverslagen_notas_2016_2021_overzicht.shp (ESRI shapefile) (Een overzicht van alle ingediende eindverslagen - vanaf 1 april 2016 tot vandaag - kan je ook downloaden als GIS-bestand op https://geo.onroerenderfgoed.be/downloads.) 2. De resultaten van de inhoudelijke screening (periodisering, materiaalcategorieën, ...) van deze documenten: eindverslagen_notas_2016_2021_screening.csv (comma-separated values) eindverslagen_notas_2016_2021_screening.xlsx (Microsoft Excel Open XML Spreadsheet) eindverslagen_notas_2016_2021_screening.shp (ESRI shapefile)

  • Open Access English
    Authors: 
    Li, Weixuan;
    Publisher: Zenodo

    Abraham Bredius’ seminal work Künstler-Inventare contains over 150 inventories of artists’ possessions in the Dutch Republic. However, this rich source has never been fully transformed into datasets, partially because Bredius was selective in his transcriptions of artists’ inventories with a focus on paintings. To fill this gap and to overcome Bredius’ biases, the Virtual Interior project assembled and transcribed 46 inventories of artists’ and art dealers’ homes in 17th-century Amsterdam. The selection of the inventories was based on three criteria: 1) The artists’ and art dealers’ inventories in my sample were drawn up during an artist’s lifetime or shortly after his death; 2) the inventories listed goods arranged by room; 3) the original documents of the inventories can still be traced in the Amsterdam City Archives. Forty-six inventories were selected and transcribed from the original sources or copied from existing publications. The majority of my samples can be found in the notarial archives (Archive nr. 5075) and the rest in the Archive of the Chamber of Insolvency (Desolate Boedel kamer, Archive nr. 5072), both preserved in the Amsterdam City Archives. Starting with the published inventories, Bart Reuvekamp from our project team traced them back to the original sources in the archive, transcribed centuries-old handwritten pages, and compiled the listed objects in digital form. In this way, our database is able to encompass all belongings present in painters’ workshops, filling in the blanks that Bredius and other authors neglected or chose to leave out – the missing information that once hindered our comprehension of how artists organized their studios. The dataset is organized as follows: 1_Intro_and_data_explanation.xlsx introduces the dataset and explains the columns in the following files 2_Inventory_list.csv hosts the details of the inventories in this dataset 3_Inventory_items.csv contains the full transcriptions and categorical data of the objects in the inventories 4_Inventory_relationships.csv captures all the people mentioned in the inventories in the pre- or postscript and/or as debtors or creditors 5_Inventory_category_reference.csv provides a reference table for the ‘object_type’ and ‘object_category’ columns in 3_Inventory_items.csv 6_Inventory_subjectmatter_reference.csv offers a reference table for the ‘painting_subject’ and ‘painting_genre’ columns in 3_Inventory_items.csv NB: This Künstler-Inventare database is a provisional version. The correction and final data process has not been fully finished in Version 1. The transcriptions might contain errors and need to be treated with caution. This project is financed by NWO Smart Culture - Big Data / Digital Humanities grant

  • Open Access
    Authors: 
    Keshav Santhanam, Jon Saad-Falcon, Martin Franz, Omar Khattab, Avirup Sil, Radu Florian, Md Arafat Sultan, Salim Roukos, Matei Zaharia, Christopher Potts;
    Publisher: Zenodo

    XOR TyDi query and document files for running PLAID-ColBERTv2 experiments from "Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking".

  • Open Access
    Authors: 
    Berger, Michael; Bolte, Henrike; Führer, Veronika; Hausleitner, Felix; Hutterer, Sarah; Lüthi, Tim; Nancu, Mihaela; Passoni, Erica; Pataki, Katalin; Schröcksnadel, Sophie; +3 more
    Publisher: Zenodo

    This is ground truth for the vast collection of sermons of Nikolaus von Dinkelsbühl (ca. 1360 to 17th March 1433), translated and reorganised by a German redactor, from the 15th century has never been edited until now. It consists of 361 folios of parchment and paper. The text speaks about various topics such as fasting and other religious practices. Being one of the leading intellectuals of his time, Nikolaus von Dinkelsbühl also contributed to the development of the University of Vienna. The manuscript was probably produced in the vicinity of Klosterneuburg in Austria and is still kept there today (Shelfmark: Cod. 48). Data collection and ground truth creation: The edition at hand was produced by an international team of researchers from various fields in the context of the Vienna HTR Winter School 2022 with the help of Transkribus Expert Client. We uploaded the images of the manuscript into the Transkribus platform, applied the line recognition tool and manually copied the transcribed text lines into the recognised line boxes. Various models were trained with the ground truth (20% of the entire codex) created by the team. Images of the Klosterneuburg, Augustiner-Chorherrenstift, Cod. 48 are available at: https://manuscripta.at/diglit/AT5000-48/0001

  • Open Access
    Authors: 
    Berg, Johanna; Aasa, Carl Ollvik; Appelgren Thorell, Björn; Aits, Sonja;
    Publisher: Zenodo

    Electronic health records (EHRs) are a rich source of information for medical research and public health monitoring. Information systems based on EHR data could also assist in patient care and hospital management. However, much of the data in EHRs is in the form of unstructured text, which is difficult to process for analysis. Natural language processing (NLP), a form of artificial intelligence, has the potential to enable automatic extraction of information from EHRs and several NLP tools adapted to the style of clinical writing have been developed for English and other major languages. In contrast, the development of NLP tools for less widely spoken languages such as Swedish has lagged behind. A major bottleneck in the development of NLP tools is the restricted access to EHRs due to legitimate patient privacy concerns. To overcome this issue we have generated a citizen science platform for collecting artificial Swedish EHRs with the help of Swedish physicians and medical students. These artificial EHRs describe imagined but plausible emergency care patients in a style that closely resembles EHRs used in emergency departments in Sweden. In the pilot phase, we collected a first batch of 50 artificial EHRs, which has passed review by an experienced Swedish emergency care physician. We make this dataset publicly available as OpenChart-SE corpus (version 1) under an open-source license for the NLP research community. The project is now open for general participation and Swedish physicians and medical students are invited to submit EHRs on the project website (https://github.com/Aitslab/openchart-se), where additional batches of quality-controlled EHRs will be released periodically. Dataset content OpenChart-SE, version 1 corpus (txt files and and dataset.csv) The OpenChart-SE corpus, version 1, contains 50 artificial EHRs (note that the numbering starts with 5 as 1-4 were test cases that were not suitable for publication). The EHRs are available in two formats, structured as a .csv file and as separate textfiles for annotation. Note that flaws in the data were not cleaned up so that it simulates what could be encountered when working with data from different EHR systems. All charts have been checked for medical validity by a resident in Emergency Medicine at a Swedish hospital before publication. Codebook.xlsx The codebook contain information about each variable used. It is in XLSForm-format, which can be re-used in several different applications for data collection. suppl_data_1_openchart-se_form.pdf OpenChart-SE mock emergency care EHR form. suppl_data_3_openchart-se_dataexploration.ipynb This jupyter notebook contains the code and results from the analysis of the OpenChart-SE corpus. More details about the project and information on the upcoming preprint accompanying the dataset can be found on the project website (https://github.com/Aitslab/openchart-se). Acknowledgement We thank all citizen scientists for contributing artificial EHRs and all members of our research groups, who provided helpful comments throughout the development of this project. This study was supported by a grant to Science for Life Laboratory from the Knut and Alice Wallenberg (KAW) Foundation (S.A. 2020.0182), which was distributed through the SciLifeLab and KAW National COVID-19 Research Program. The project is conducted in the AI Lund research environment at Lund University.

  • Research software . 2022
    Open Access
    Authors: 
    Pett, Daniel;
    Publisher: Zenodo

    This repository holds the Gatsby 5 framework and the front end build for the Bronze Age Index. If you use this software, please cite it using the metadata from this file.

  • Research software . 2022
    Open Access
    Authors: 
    Charles, William; Aussenac-Gilles, Nathalie; Hernandez, Nathalie;
    Publisher: Zenodo

    HHT (Hierarchical Historical Territories) is an ontology designed to describe multiple-hierarchy evolving territories in an historical context. This ontology relies on a discrete geometry description using building blocks. It is provided with an algorithm implemented with SHACL-Rules to detect and categorize changes inside a knowledge graph relying on HHT, and multiple datasets, including a description of territories of the French Third Republic, the NUTS nomenclature and the current french administrative hierarchy.

  • Research software . 2022
    Open Access
    Authors: 
    Charles, William; Aussenac-Gilles, Nathalie; Hernandez, Nathalie;
    Publisher: Zenodo

    HHT is an ontology to represent historical territories involved in multiple overlaying hierarchies and their evolutions. Its documentation is available at https://www.irit.fr/recherches/MELODI/ontologies/HHT/

  • Research data . Audiovisual . 2022
    Open Access German
    Authors: 
    van Oorschot, Frederike;
    Publisher: Zenodo

    Der Vortrag entfaltet hermeneutische und epistemologische Fragen, die durch die sich entwickelnde digitale Forschung in den Geisteswissenschaften (Digital Humanities, DH) ausgelöst werden. Im Vordergrund steht die Skizze einer auf die DH bezogenen Wissenschaftsphilosophie anhand der folgenden Leitfragen: Was bedeutet das Narrativ einer „neuen“ Wissenschaft? Wer ist Subjekt der DH? Wo finden sich neue epistemische Logiken? Was ist das forschungspolitische Setting der DH? Und wie verhält sich dies alles zum Selbstverständnis „klassischer“ Geisteswissenschaften? Dabei zielt der Vortrag auf eine „digitale Hermeneutik“ in den Geisteswissenschaften.

Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to Digital Humanities and Cultural Heritage. Are you interested to view more results? Visit OpenAIRE - Explore.
2,668 Research products, page 1 of 267
  • Open Access English
    Authors: 
    Sheridan A Stewart; Adam S Miner; Meghan C Halley; Laura K Nelson; Eleni Linos;
    Publisher: Zenodo

    Code for "Formally Comparing Topic Models and Human-Generated Qualitative Coding of Physician Mothers' Experiences of Workplace Discrimination" This repository contains the code used in "Formally Comparing Topic Models and Human-Generated Qualitative Coding of Physician Mothers' Experiences of Workplace Discrimination" by Adam S. Miner, Sheridan A. Stewart, Meghan C. Halley, Laura K. Nelson, and Eleni Linos. Please refer to the original paper in Big Data & Society. In this paper, we evaluate whether topic models identify themes similar to those found by human coders in a prior qualitative analysis of physician mothers' experiences of workplace discrimination. More detail is available at the main page for the repository.

  • Open Access
    Authors: 
    , Haneca; , Ervynck;
    Publisher: Zenodo

    In deze zenodo-record zijn de brongegevens terug te vinden waarop de analyses in het hoofdstuk "Kenniswinst archeologieregelgeving 2016-2021" zijn gebaseerd. Dit hoofdstuk maakt deel uit van het onderzoeksrapport "Ameels V, Carpentier F, De Ketelaere S, Ervynck A, Geuens J, Haneca K, Pieters M, Van Looveren J & Verhelst A 2023: Evaluatie archeologie 2016-2021, Onderzoeksrapport Agentschap Onroerend Erfgoed, Brussel.": 1. Een overzicht van alle ingediende eindverslagen (n = 576) en nota's met eindafwerking (n = 26) over de periode 1 april 2016 t.e.m. 31 december 2021 eindverslagen_notas_2016_2021_overzicht.csv (comma-separated values) eindverslagen_notas_2016_2021_overzicht.xlsx (Microsoft Excel Open XML Spreadsheet) eindverslagen_notas_2016_2021_overzicht.shp (ESRI shapefile) (Een overzicht van alle ingediende eindverslagen - vanaf 1 april 2016 tot vandaag - kan je ook downloaden als GIS-bestand op https://geo.onroerenderfgoed.be/downloads.) 2. De resultaten van de inhoudelijke screening (periodisering, materiaalcategorieën, ...) van deze documenten: eindverslagen_notas_2016_2021_screening.csv (comma-separated values) eindverslagen_notas_2016_2021_screening.xlsx (Microsoft Excel Open XML Spreadsheet) eindverslagen_notas_2016_2021_screening.shp (ESRI shapefile)

  • Open Access English
    Authors: 
    Li, Weixuan;
    Publisher: Zenodo

    Abraham Bredius’ seminal work Künstler-Inventare contains over 150 inventories of artists’ possessions in the Dutch Republic. However, this rich source has never been fully transformed into datasets, partially because Bredius was selective in his transcriptions of artists’ inventories with a focus on paintings. To fill this gap and to overcome Bredius’ biases, the Virtual Interior project assembled and transcribed 46 inventories of artists’ and art dealers’ homes in 17th-century Amsterdam. The selection of the inventories was based on three criteria: 1) The artists’ and art dealers’ inventories in my sample were drawn up during an artist’s lifetime or shortly after his death; 2) the inventories listed goods arranged by room; 3) the original documents of the inventories can still be traced in the Amsterdam City Archives. Forty-six inventories were selected and transcribed from the original sources or copied from existing publications. The majority of my samples can be found in the notarial archives (Archive nr. 5075) and the rest in the Archive of the Chamber of Insolvency (Desolate Boedel kamer, Archive nr. 5072), both preserved in the Amsterdam City Archives. Starting with the published inventories, Bart Reuvekamp from our project team traced them back to the original sources in the archive, transcribed centuries-old handwritten pages, and compiled the listed objects in digital form. In this way, our database is able to encompass all belongings present in painters’ workshops, filling in the blanks that Bredius and other authors neglected or chose to leave out – the missing information that once hindered our comprehension of how artists organized their studios. The dataset is organized as follows: 1_Intro_and_data_explanation.xlsx introduces the dataset and explains the columns in the following files 2_Inventory_list.csv hosts the details of the inventories in this dataset 3_Inventory_items.csv contains the full transcriptions and categorical data of the objects in the inventories 4_Inventory_relationships.csv captures all the people mentioned in the inventories in the pre- or postscript and/or as debtors or creditors 5_Inventory_category_reference.csv provides a reference table for the ‘object_type’ and ‘object_category’ columns in 3_Inventory_items.csv 6_Inventory_subjectmatter_reference.csv offers a reference table for the ‘painting_subject’ and ‘painting_genre’ columns in 3_Inventory_items.csv NB: This Künstler-Inventare database is a provisional version. The correction and final data process has not been fully finished in Version 1. The transcriptions might contain errors and need to be treated with caution. This project is financed by NWO Smart Culture - Big Data / Digital Humanities grant

  • Open Access
    Authors: 
    Keshav Santhanam, Jon Saad-Falcon, Martin Franz, Omar Khattab, Avirup Sil, Radu Florian, Md Arafat Sultan, Salim Roukos, Matei Zaharia, Christopher Potts;
    Publisher: Zenodo

    XOR TyDi query and document files for running PLAID-ColBERTv2 experiments from "Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking".

  • Open Access
    Authors: 
    Berger, Michael; Bolte, Henrike; Führer, Veronika; Hausleitner, Felix; Hutterer, Sarah; Lüthi, Tim; Nancu, Mihaela; Passoni, Erica; Pataki, Katalin; Schröcksnadel, Sophie; +3 more
    Publisher: Zenodo

    This is ground truth for the vast collection of sermons of Nikolaus von Dinkelsbühl (ca. 1360 to 17th March 1433), translated and reorganised by a German redactor, from the 15th century has never been edited until now. It consists of 361 folios of parchment and paper. The text speaks about various topics such as fasting and other religious practices. Being one of the leading intellectuals of his time, Nikolaus von Dinkelsbühl also contributed to the development of the University of Vienna. The manuscript was probably produced in the vicinity of Klosterneuburg in Austria and is still kept there today (Shelfmark: Cod. 48). Data collection and ground truth creation: The edition at hand was produced by an international team of researchers from various fields in the context of the Vienna HTR Winter School 2022 with the help of Transkribus Expert Client. We uploaded the images of the manuscript into the Transkribus platform, applied the line recognition tool and manually copied the transcribed text lines into the recognised line boxes. Various models were trained with the ground truth (20% of the entire codex) created by the team. Images of the Klosterneuburg, Augustiner-Chorherrenstift, Cod. 48 are available at: https://manuscripta.at/diglit/AT5000-48/0001

  • Open Access
    Authors: 
    Berg, Johanna; Aasa, Carl Ollvik; Appelgren Thorell, Björn; Aits, Sonja;
    Publisher: Zenodo

    Electronic health records (EHRs) are a rich source of information for medical research and public health monitoring. Information systems based on EHR data could also assist in patient care and hospital management. However, much of the data in EHRs is in the form of unstructured text, which is difficult to process for analysis. Natural language processing (NLP), a form of artificial intelligence, has the potential to enable automatic extraction of information from EHRs and several NLP tools adapted to the style of clinical writing have been developed for English and other major languages. In contrast, the development of NLP tools for less widely spoken languages such as Swedish has lagged behind. A major bottleneck in the development of NLP tools is the restricted access to EHRs due to legitimate patient privacy concerns. To overcome this issue we have generated a citizen science platform for collecting artificial Swedish EHRs with the help of Swedish physicians and medical students. These artificial EHRs describe imagined but plausible emergency care patients in a style that closely resembles EHRs used in emergency departments in Sweden. In the pilot phase, we collected a first batch of 50 artificial EHRs, which has passed review by an experienced Swedish emergency care physician. We make this dataset publicly available as OpenChart-SE corpus (version 1) under an open-source license for the NLP research community. The project is now open for general participation and Swedish physicians and medical students are invited to submit EHRs on the project website (https://github.com/Aitslab/openchart-se), where additional batches of quality-controlled EHRs will be released periodically. Dataset content OpenChart-SE, version 1 corpus (txt files and and dataset.csv) The OpenChart-SE corpus, version 1, contains 50 artificial EHRs (note that the numbering starts with 5 as 1-4 were test cases that were not suitable for publication). The EHRs are available in two formats, structured as a .csv file and as separate textfiles for annotation. Note that flaws in the data were not cleaned up so that it simulates what could be encountered when working with data from different EHR systems. All charts have been checked for medical validity by a resident in Emergency Medicine at a Swedish hospital before publication. Codebook.xlsx The codebook contain information about each variable used. It is in XLSForm-format, which can be re-used in several different applications for data collection. suppl_data_1_openchart-se_form.pdf OpenChart-SE mock emergency care EHR form. suppl_data_3_openchart-se_dataexploration.ipynb This jupyter notebook contains the code and results from the analysis of the OpenChart-SE corpus. More details about the project and information on the upcoming preprint accompanying the dataset can be found on the project website (https://github.com/Aitslab/openchart-se). Acknowledgement We thank all citizen scientists for contributing artificial EHRs and all members of our research groups, who provided helpful comments throughout the development of this project. This study was supported by a grant to Science for Life Laboratory from the Knut and Alice Wallenberg (KAW) Foundation (S.A. 2020.0182), which was distributed through the SciLifeLab and KAW National COVID-19 Research Program. The project is conducted in the AI Lund research environment at Lund University.

  • Research software . 2022
    Open Access
    Authors: 
    Pett, Daniel;
    Publisher: Zenodo

    This repository holds the Gatsby 5 framework and the front end build for the Bronze Age Index. If you use this software, please cite it using the metadata from this file.

  • Research software . 2022
    Open Access
    Authors: 
    Charles, William; Aussenac-Gilles, Nathalie; Hernandez, Nathalie;
    Publisher: Zenodo

    HHT (Hierarchical Historical Territories) is an ontology designed to describe multiple-hierarchy evolving territories in an historical context. This ontology relies on a discrete geometry description using building blocks. It is provided with an algorithm implemented with SHACL-Rules to detect and categorize changes inside a knowledge graph relying on HHT, and multiple datasets, including a description of territories of the French Third Republic, the NUTS nomenclature and the current french administrative hierarchy.

  • Research software . 2022
    Open Access
    Authors: 
    Charles, William; Aussenac-Gilles, Nathalie; Hernandez, Nathalie;
    Publisher: Zenodo

    HHT is an ontology to represent historical territories involved in multiple overlaying hierarchies and their evolutions. Its documentation is available at https://www.irit.fr/recherches/MELODI/ontologies/HHT/

  • Research data . Audiovisual . 2022
    Open Access German
    Authors: 
    van Oorschot, Frederike;
    Publisher: Zenodo

    Der Vortrag entfaltet hermeneutische und epistemologische Fragen, die durch die sich entwickelnde digitale Forschung in den Geisteswissenschaften (Digital Humanities, DH) ausgelöst werden. Im Vordergrund steht die Skizze einer auf die DH bezogenen Wissenschaftsphilosophie anhand der folgenden Leitfragen: Was bedeutet das Narrativ einer „neuen“ Wissenschaft? Wer ist Subjekt der DH? Wo finden sich neue epistemische Logiken? Was ist das forschungspolitische Setting der DH? Und wie verhält sich dies alles zum Selbstverständnis „klassischer“ Geisteswissenschaften? Dabei zielt der Vortrag auf eine „digitale Hermeneutik“ in den Geisteswissenschaften.