Quick search
Advanced search in
Research outcomes
Field to searchTerm
Add rule
Filters (1 )
Filters
Digital Humanities and Cultural HeritageCLARIN
Filters
Digital Humanities and Cultural HeritageCLARIN
The following results are related to Digital Humanities and Cultural Heritage. Are you interested to view more results? Visit OpenAIRE - Explore.
Download Results
332 research outcomes, page 1 of 34
  • research data . 2021 . Embargo End Date: 11 Mar 2021
    Open Access
    Authors:
    Nedoluzhko, Anna; Novák, Michal; Popel, Martin; Žabokrtský, Zdeněk; Zeman, Daniel;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | Bergamot (825303)

    CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 0.1 consists of 17 datasets for 11 languages. The datasets are enriched with automatic morpho...

  • publication . Preprint . 2020
    Open Access English
    Authors:
    Limisiewicz, Tomasz; Mareček, David;

    With the recent success of pre-trained models in NLP, a significant focus was put on interpreting their representations. One of the most prominent approaches is structural probing (Hewitt and Manning, 2019), where a linear projection of language vector space is performe...

  • publication . Article . 2020
    Open Access Italian
    Authors:
    Monachini, Monica; Frontini, Francesca;
    Publisher: HAL CCSD

    National audience; Il 1° ottobre 2015 il MIUR firma l'adesione dell'Italia a CLARIN-ERIC, l'infrastruttura di ricerca che offre risorse e tecnologie linguistiche dedicate al settore delle scienze del linguaggio e delle scienze umane e sociali. Questo articolo intende fo...

  • publication . Other literature type . 2020
    Open Access English
    Authors:
    Koolen, Marijn; Kumpulainen, Sanna; Melgar-Estrada, Liliana;
    Publisher: Zenodo
    Project: NWO | CLARIAH Common Lab Resear... (2300184354)

    The concept of “scholarly primitive” has been widely welcomed both by humanists and system designers in the humanities, due to the fact that it made it possible to have a solid conceptual basis for the operationalization of the essential functionalities required for adv...

  • publication . Preprint . 2020
    Open Access English
    Authors:
    Kvapilíková, Ivana; Kocmi, Tom; Bojar, Ondřej;

    This paper presents a description of CUNI systems submitted to the WMT20 task on unsupervised and very low-resource supervised machine translation between German and Upper Sorbian. We experimented with training on synthetic data and pre-training on a related language pa...

  • publication . Preprint . 2020
    Open Access English
    Authors:
    Kocmi, Tom; Limisiewicz, Tomasz; Stanovsky, Gabriel;
    Project: EC | Bergamot (825303)

    Gender bias in machine translation can manifest when choosing gender inflections based on spurious gender correlations. For example, always translating doctors as men and nurses as women. This can be particularly harmful as models become more popular and deployed within...

  • publication . Preprint . Conference object . 2020
    Open Access English
    Authors:
    Martin Vastl; Daniel Zeman; Rudolf Rosa;

    We present our submission to the SIGTYP 2020 Shared Task on the prediction of typological features. We submit a constrained system, predicting typological features only based on the WALS database. We investigate two approaches. The simpler of the two is a system based o...

  • publication . Preprint . 2020
    Open Access English
    Authors:
    Limisiewicz, Tomasz; Mareček, David;

    Neural networks trained on natural language processing tasks capture syntax even though it is not provided as a supervision signal. This indicates that syntactic analysis is essential to the understating of language in artificial intelligence systems. This overview pape...

  • research data . 2020 . Embargo End Date: 02 Jul 2020
    Open Access
    Authors:
    Çano, Erion;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | ELITR (825460)

    OAGL is a paper metadata dataset consisting of 17528680 records which comprise various scientific publication attributes like abstracts, titles, keywords, publication years, venues, etc. The last field of each record is the page length of the corresponding publication. ...

  • publication . Preprint . Part of book or chapter of book . 2020
    Open Access English
    Authors:
    Tomáš Musil; David Mareček; Rudolf Rosa;

    Multiple studies have probed representations emerging in neural networks trained for end-to-end NLP tasks and examined what word-level linguistic information may be encoded in the representations. In classical probing, a classifier is trained on the representations to e...

332 research outcomes, page 1 of 34