Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to Digital Humanities and Cultural Heritage. Are you interested to view more results? Visit OpenAIRE - Explore.
8 Research products, page 1 of 1

  • Digital Humanities and Cultural Heritage
  • Research data
  • Open Access
  • Dataset
  • FI

Relevance
arrow_drop_down
  • Open Access
    Authors: 
    Tallavaara, Miikka; Jørgensen, Erlend Kirkeng;
    Publisher: Zenodo
    Project: AKA | Human population dynamics... (317567)

    This submission contains data and R-code that enable to reproduce the data manipulations and analyses in the paper “Why are population growth rate estimates of past and present hunter-gatherers so different?” by Miikka Tallavaara and Erlend Kirkeng Jørgensen (Philosophical transactions of the Royal Society B). Please, cite the paper and this Zenodo repository if you use the files included in this Zenodo record in your work. The submission includes a html-file titled “Why are population growth rate estimates of past and present hunter-gatherers so different? - Data analyses” (TJ2020.html) that contains R-code and instructions and comments for running the code (open this file in your browser). In addition, the submission includes Rdata-file (dataTJ2020.Rdata) containing all the data that are not created within the code and pure R-code (TJ2020.R).

  • Open Access English
    Authors: 
    Tiedemann, Jörg; Scherrer, Yves;
    Publisher: Zenodo
    Project: EC | MeMAD (780069), EC | FoTran (771113)

    This release contains data sets for experiments with document-level machine translation. The data sets have been used in previous studies and provided here for replicability and comparison with other systems. The data sets are taken from the English-German news translation task at WMT 2019 and the English-German bitext in the OpenSubtitles collection v2016 from OPUS. All data sets are sentence aligned with corresponding lines being aligned to each other. Document boundaries are marked with empty lines (on both sides of the parallel corpus). The data set has been used in the following publication: @inproceedings{scherrer-tiedemann-loaiciga-2019, title = "Analysing concatenation approaches to document-level NMT in two different domains", author = {Scherrer, Yves and Tiedemann, J{\"o}rg and Lo{\'a}iciga, Sharid}, booktitle = "Proceedings of the Third Workshop on Discourse in Machine Translation", month = nov, year = "2019", address = "Hong-Kong", publisher = "Association for Computational Linguistics", } Please, cite that paper if you use the data set in your own work. {"references": ["Scherrer, Tiedemann and Lo\u00e1iciga: \"Analysing concatenation approaches to document-level NMT in two different domains\", in Proceedings of DiscoMT2019 at EMNLP 2019, Hong-Kong"]}

  • Open Access
    Authors: 
    Lampinen, Jussi; García-Antúnez, Oriol; Olafsson, Anton Stahl; Kavanagh, Kayleigh Corry; Gulsrud, Natalie; Raymond, Christoper Mark;
    Publisher: Zenodo
    Project: AKA | Individuals, communities ... (335203)

    A public participatory GIS -survey dataset detailing public understandings of and attitudes towards carbon-smart urban green infrastructure in Kumpula, Helsinki, Finland. This dataset was funded by the Strategic Research Council, Academy of Finland through the CO-CARBON project (project number: 335203).

  • Open Access
    Authors: 
    Abarenkov, Kessy; Somervuo, Panu; Nilsson, R. Henrik; Kirk, Paul M.; Huotari, Tea; Abrego, Nerea; Ovaskainen, Otso;
    Publisher: Data Archiving and Networked Services (DANS)
    Project: AKA | Finnish Centre of Excelle... (250444)

    • Incompleteness of reference sequence databases and unresolved taxonomic relationships complicates taxonomic placement of fungal sequences. We developed PROTAX-fungi, a general tool for taxonomic placement of fungal ITS sequences, and implemented it into the PlutoF platform of the UNITE database for molecular identification of fungi. • PROTAX-fungi outperformed the SINTAX and RDB classifiers in terms of increased accuracy and decreased calibration error when applied to data on mock communities representing species groups with poor sequence database coverage. • With empirical data on root- and wood-associated fungi, PROTAX-fungi identified reliably (with at least 90% identification probability) the majority of sequences to the order level but only ca. one fifth of them to the species level, reflecting the current limited coverage of the databases. • When applied to examine the internal consistencies of the Index Fungorum and UNITE databases, PROTAX-fungi revealed inconsistencies in the taxonomy database as well as mislabelling and sequence quality problems in the reference database. The according improvements were implemented in both databases. • PROTAX-fungi provides a robust tool for performing statistically reliable identifications of fungi in spite of the incompleteness of extant reference sequence databases and unresolved taxonomic relationships. Root-associated fungi from GreenlandFungal ITS2 sequence data of root-associated fungi from Greenland in fasta format. DNA extracted from different plant species roots collected along the altitudinal gradient of Aucella mountain in the Zackenberg valley.root-associated fungi from Greenland.fas

  • Open Access English
    Authors: 
    Tiedemann, Jörg; Scherrer, Yves;
    Publisher: Zenodo
    Project: EC | MeMAD (780069), EC | FoTran (771113)

    This release contains data sets for experiments with document-level machine translation. The data sets have been used in previous studies and provided here for replicability and comparison with other systems. The data sets are taken from the English-German news translation task at WMT 2019 and the English-German bitext in the OpenSubtitles collection v2016 from OPUS. All data sets are sentence aligned with corresponding lines being aligned to each other. Document boundaries are marked with empty lines (on both sides of the parallel corpus). The data set has been used in the following publication: @inproceedings{scherrer-tiedemann-loaiciga-2019, title = "Analysing concatenation approaches to document-level NMT in two different domains", author = {Scherrer, Yves and Tiedemann, J{\"o}rg and Lo{\'a}iciga, Sharid}, booktitle = "Proceedings of the Third Workshop on Discourse in Machine Translation", month = nov, year = "2019", address = "Hong-Kong", publisher = "Association for Computational Linguistics", } Please, cite that paper if you use the data set in your own work.

  • Open Access
    Authors: 
    Puurunen, Jenni; Ottka, Claudia; Salonen, Milla; Niskanen, Julia E.; Lohi, Hannes;
    Publisher: The Royal Society
    Project: AKA | An omics approach to iden... (308887)

    Supplementary Tables in excel-format describing the detailed results of GLM models for each measurand.

  • Open Access English
    Authors: 
    Wei, Shichao; Li, Zitong; Momigliano, Paolo; Fu, Chao; Wu, Hua; Merilä, Juha;
    Publisher: Dryad
    Project: AKA | Evolutionary Genetics of ... (218343), AKA | Centre of Excellence in E... (129662), AKA | Evolutionary and conserva... (316294), AKA | Evolutionary genetics of ... (134728)

    The role of geological events and Pleistocene climatic fluctuations as drivers of current patterns of genetic variation in extant species has been a topic of continued interest among evolutionary biologists. Nevertheless, comprehensive studies of widely distributed species are still rare, especially from Asia. Using geographically extensive sampling of many individuals and a large number of nuclear single nucleotide polymorphisms (SNPs), we studied the phylogeography and historical demography of Hyla annectans populations in southern China. Thirty-five sampled populations were grouped into seven clearly defined genetic clusters that closely match phenotype-based subspecies classification. These lineages diverged 2.32–5.23 million years ago, a timing that closely aligns with the rapid and drastic uplifting of the Qinghai-Tibet Plateau and adjacent southwest China. Demographic analyses and species distribution models indicate that different populations of this species have responded differently to past climatic changes. In the Hengduan Mountains, most populations experienced a bottleneck, whereas the populations located outside of the Hengduan Mountains have gradually declined in size since the end of the last glaciation. In addition, the levels of phenotypic and genetic divergence were strongly correlated across major clades. These results highlight the combined effects of geological events and past climatic fluctuations, as well as natural selection, as drivers of contemporary patterns of genetic and phenotypic variation in a widely distributed anuran in Asia. 'SNP_data_for_H.annectans' is the SNP data for Hyla annectans in vcf formats. Which is used for the phylogeney tree, genetic structure, genetic differentiation, demographic analyses. 'Morphological_data_info' are the statistic data of snout-vent length (SVL), weight and spots numbers used for morphological analyses and QST-FST comparison. 'SDM_input_ascii' are the SDM ascii files used for SDMs. 'SDM_locality_info' are the occurrence data points of five genetic clusters for the H. annectans.

  • Open Access
    Authors: 
    Puurunen, Jenni; Ottka, Claudia; Salonen, Milla; Niskanen, Julia E.; Lohi, Hannes;
    Publisher: The Royal Society
    Project: AKA | An omics approach to iden... (308887)

    Supplementary Tables in excel-format describing the AIC model selection process for each measurand in GLM analyses.

Powered by OpenAIRE graph
Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to Digital Humanities and Cultural Heritage. Are you interested to view more results? Visit OpenAIRE - Explore.
8 Research products, page 1 of 1
  • Open Access
    Authors: 
    Tallavaara, Miikka; Jørgensen, Erlend Kirkeng;
    Publisher: Zenodo
    Project: AKA | Human population dynamics... (317567)

    This submission contains data and R-code that enable to reproduce the data manipulations and analyses in the paper “Why are population growth rate estimates of past and present hunter-gatherers so different?” by Miikka Tallavaara and Erlend Kirkeng Jørgensen (Philosophical transactions of the Royal Society B). Please, cite the paper and this Zenodo repository if you use the files included in this Zenodo record in your work. The submission includes a html-file titled “Why are population growth rate estimates of past and present hunter-gatherers so different? - Data analyses” (TJ2020.html) that contains R-code and instructions and comments for running the code (open this file in your browser). In addition, the submission includes Rdata-file (dataTJ2020.Rdata) containing all the data that are not created within the code and pure R-code (TJ2020.R).

  • Open Access English
    Authors: 
    Tiedemann, Jörg; Scherrer, Yves;
    Publisher: Zenodo
    Project: EC | MeMAD (780069), EC | FoTran (771113)

    This release contains data sets for experiments with document-level machine translation. The data sets have been used in previous studies and provided here for replicability and comparison with other systems. The data sets are taken from the English-German news translation task at WMT 2019 and the English-German bitext in the OpenSubtitles collection v2016 from OPUS. All data sets are sentence aligned with corresponding lines being aligned to each other. Document boundaries are marked with empty lines (on both sides of the parallel corpus). The data set has been used in the following publication: @inproceedings{scherrer-tiedemann-loaiciga-2019, title = "Analysing concatenation approaches to document-level NMT in two different domains", author = {Scherrer, Yves and Tiedemann, J{\"o}rg and Lo{\'a}iciga, Sharid}, booktitle = "Proceedings of the Third Workshop on Discourse in Machine Translation", month = nov, year = "2019", address = "Hong-Kong", publisher = "Association for Computational Linguistics", } Please, cite that paper if you use the data set in your own work. {"references": ["Scherrer, Tiedemann and Lo\u00e1iciga: \"Analysing concatenation approaches to document-level NMT in two different domains\", in Proceedings of DiscoMT2019 at EMNLP 2019, Hong-Kong"]}

  • Open Access
    Authors: 
    Lampinen, Jussi; García-Antúnez, Oriol; Olafsson, Anton Stahl; Kavanagh, Kayleigh Corry; Gulsrud, Natalie; Raymond, Christoper Mark;
    Publisher: Zenodo
    Project: AKA | Individuals, communities ... (335203)

    A public participatory GIS -survey dataset detailing public understandings of and attitudes towards carbon-smart urban green infrastructure in Kumpula, Helsinki, Finland. This dataset was funded by the Strategic Research Council, Academy of Finland through the CO-CARBON project (project number: 335203).

  • Open Access
    Authors: 
    Abarenkov, Kessy; Somervuo, Panu; Nilsson, R. Henrik; Kirk, Paul M.; Huotari, Tea; Abrego, Nerea; Ovaskainen, Otso;
    Publisher: Data Archiving and Networked Services (DANS)
    Project: AKA | Finnish Centre of Excelle... (250444)

    • Incompleteness of reference sequence databases and unresolved taxonomic relationships complicates taxonomic placement of fungal sequences. We developed PROTAX-fungi, a general tool for taxonomic placement of fungal ITS sequences, and implemented it into the PlutoF platform of the UNITE database for molecular identification of fungi. • PROTAX-fungi outperformed the SINTAX and RDB classifiers in terms of increased accuracy and decreased calibration error when applied to data on mock communities representing species groups with poor sequence database coverage. • With empirical data on root- and wood-associated fungi, PROTAX-fungi identified reliably (with at least 90% identification probability) the majority of sequences to the order level but only ca. one fifth of them to the species level, reflecting the current limited coverage of the databases. • When applied to examine the internal consistencies of the Index Fungorum and UNITE databases, PROTAX-fungi revealed inconsistencies in the taxonomy database as well as mislabelling and sequence quality problems in the reference database. The according improvements were implemented in both databases. • PROTAX-fungi provides a robust tool for performing statistically reliable identifications of fungi in spite of the incompleteness of extant reference sequence databases and unresolved taxonomic relationships. Root-associated fungi from GreenlandFungal ITS2 sequence data of root-associated fungi from Greenland in fasta format. DNA extracted from different plant species roots collected along the altitudinal gradient of Aucella mountain in the Zackenberg valley.root-associated fungi from Greenland.fas

  • Open Access English
    Authors: 
    Tiedemann, Jörg; Scherrer, Yves;
    Publisher: Zenodo
    Project: EC | MeMAD (780069), EC | FoTran (771113)

    This release contains data sets for experiments with document-level machine translation. The data sets have been used in previous studies and provided here for replicability and comparison with other systems. The data sets are taken from the English-German news translation task at WMT 2019 and the English-German bitext in the OpenSubtitles collection v2016 from OPUS. All data sets are sentence aligned with corresponding lines being aligned to each other. Document boundaries are marked with empty lines (on both sides of the parallel corpus). The data set has been used in the following publication: @inproceedings{scherrer-tiedemann-loaiciga-2019, title = "Analysing concatenation approaches to document-level NMT in two different domains", author = {Scherrer, Yves and Tiedemann, J{\"o}rg and Lo{\'a}iciga, Sharid}, booktitle = "Proceedings of the Third Workshop on Discourse in Machine Translation", month = nov, year = "2019", address = "Hong-Kong", publisher = "Association for Computational Linguistics", } Please, cite that paper if you use the data set in your own work.

  • Open Access
    Authors: 
    Puurunen, Jenni; Ottka, Claudia; Salonen, Milla; Niskanen, Julia E.; Lohi, Hannes;
    Publisher: The Royal Society
    Project: AKA | An omics approach to iden... (308887)

    Supplementary Tables in excel-format describing the detailed results of GLM models for each measurand.

  • Open Access English
    Authors: 
    Wei, Shichao; Li, Zitong; Momigliano, Paolo; Fu, Chao; Wu, Hua; Merilä, Juha;
    Publisher: Dryad
    Project: AKA | Evolutionary Genetics of ... (218343), AKA | Centre of Excellence in E... (129662), AKA | Evolutionary and conserva... (316294), AKA | Evolutionary genetics of ... (134728)

    The role of geological events and Pleistocene climatic fluctuations as drivers of current patterns of genetic variation in extant species has been a topic of continued interest among evolutionary biologists. Nevertheless, comprehensive studies of widely distributed species are still rare, especially from Asia. Using geographically extensive sampling of many individuals and a large number of nuclear single nucleotide polymorphisms (SNPs), we studied the phylogeography and historical demography of Hyla annectans populations in southern China. Thirty-five sampled populations were grouped into seven clearly defined genetic clusters that closely match phenotype-based subspecies classification. These lineages diverged 2.32–5.23 million years ago, a timing that closely aligns with the rapid and drastic uplifting of the Qinghai-Tibet Plateau and adjacent southwest China. Demographic analyses and species distribution models indicate that different populations of this species have responded differently to past climatic changes. In the Hengduan Mountains, most populations experienced a bottleneck, whereas the populations located outside of the Hengduan Mountains have gradually declined in size since the end of the last glaciation. In addition, the levels of phenotypic and genetic divergence were strongly correlated across major clades. These results highlight the combined effects of geological events and past climatic fluctuations, as well as natural selection, as drivers of contemporary patterns of genetic and phenotypic variation in a widely distributed anuran in Asia. 'SNP_data_for_H.annectans' is the SNP data for Hyla annectans in vcf formats. Which is used for the phylogeney tree, genetic structure, genetic differentiation, demographic analyses. 'Morphological_data_info' are the statistic data of snout-vent length (SVL), weight and spots numbers used for morphological analyses and QST-FST comparison. 'SDM_input_ascii' are the SDM ascii files used for SDMs. 'SDM_locality_info' are the occurrence data points of five genetic clusters for the H. annectans.

  • Open Access
    Authors: 
    Puurunen, Jenni; Ottka, Claudia; Salonen, Milla; Niskanen, Julia E.; Lohi, Hannes;
    Publisher: The Royal Society
    Project: AKA | An omics approach to iden... (308887)

    Supplementary Tables in excel-format describing the AIC model selection process for each measurand in GLM analyses.

Powered by OpenAIRE graph