Quick search
Advanced search in
Research outcomes
Field to searchTerm
Add rule
The following results are related to Digital Humanities and Cultural Heritage. Are you interested to view more results? Visit OpenAIRE - Explore.
Download Results
59 research outcomes, page 2 of 6
  • research data . 2019 . Embargo End Date: 08 Mar 2019
    Open Access
    Authors:
    Çano, Erion;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | ELITR (825460)

    OAGK is a keyword extraction/generation dataset consisting of 2.2 million abstracts, titles and keyword strings from cientific articles. Texts were lowercased and tokenized with Stanford CoreNLP tokenizer. No other preprocessing steps were applied in this release versio...

  • research data . 2018 . Embargo End Date: 08 Aug 2019
    Restricted
    Authors:
    Turchi, Marco; Negri, Matteo; Chatterjee, Rajen;
    Publisher: Fondazione Bruno Kessler, Trento, Italy
    Project: EC | QT21 (645452)

    Human post-edited and reference test sentences for the En-De PBSMT WMT 2018 Automatic post-editing task. This consists of 2,000 German sentences for each file belonging to the IT domain and already tokenized. All data is provided by the EU project QT21 (http://www.qt21....

  • research data . 2018 . Embargo End Date: 24 Sep 2018
    Open Access
    Authors:
    Vidra, Jonáš; Kyjánek, Lukáš; Ševčíková, Magda; Žabokrtský, Zdeněk; Kalužová, Adéla; Dohnalová, Šárka; Hudeček, Vojtěch;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | T4ME NET (249119)

    DeriNet is a lexical network which models derivational relations in the lexicon of Czech. Nodes of the network correspond to Czech lexemes, while edges represent derivational relations between a derived word and its base word. The present version, DeriNet 1.6, contains ...

  • research data . 2018 . Embargo End Date: 21 May 2018
    Restricted
    Authors:
    Specia, Lucia; Logacheva, Varvara; Blain, Frederic; Fernandez, Ramon; Martins, André;
    Publisher: University of Sheffield
    Project: EC | QT21 (645452)

    Test data for the WMT18 QE task. Train data can be downloaded from http://hdl.handle.net/11372/LRT-2619. This shared task will build on its previous six editions to further examine automatic methods for estimating the quality of machine translation output at run-time, w...

  • research data . 2018 . Embargo End Date: 03 May 2018
    Restricted
    Authors:
    Turchi, Marco; Negri, Matteo; Chatterjee, Rajen;
    Publisher: Fondazione Bruno Kessler, Trento, Italy
    Project: EC | QT21 (645452)

    Test data for the WMT 2018 Automatic post-editing task. They consist in English-German pairs (source and target) belonging to the information technology domain and already tokenized. Test set contains 2,000 pairs. A phrase-based machine translation system has been used ...

  • research data . 2018 . Embargo End Date: 03 May 2018
    Restricted
    Authors:
    Chatterjee, Rajen; Negri, Matteo; Turchi, Marco;
    Publisher: Fondazione Bruno Kessler, Trento, Italy
    Project: EC | QT21 (645452)

    Test data for the WMT 2018 Automatic post-editing task. They consist in English-German pairs (source and target) belonging to the information technology domain and already tokenized. Test set contains 1,023 pairs. A neural machine translation system has been used to gen...

  • research data . 2018 . Embargo End Date: 19 Feb 2018
    Restricted
    Authors:
    Specia, Lucia; Logacheva, Varvara; Blain, Frederic; Fernandez, Ramon; Martins, André;
    Publisher: University of Sheffield
    Project: EC | QT21 (645452)

    Training and development data for the WMT18 QE task. Test data will be published as a separate item. This shared task will build on its previous six editions to further examine automatic methods for estimating the quality of machine translation output at run-time, witho...

  • research data . 2018 . Embargo End Date: 20 Feb 2018
    Open Access
    Authors:
    Hajič, Jan; Bejček, Eduard; Bémová, Alevtina; Buráňová, Eva; Hajičová, Eva; Havelka, Jiří; Homola, Petr; Kárník, Jiří; Kettnerová, Václava; Klyueva, Natalia; ...
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | T4ME NET (249119)

    The Prague Dependency Treebank 3.5 is the 2018 edition of the core Prague Dependency Treebank (PDT). It contains all PDT annotation made at the Institute of Formal and Applied Linguistics under various projects between 1996 and 2018 on the original texts, i.e., all anno...

  • research data . 2018 . Embargo End Date: 12 Feb 2018
    Restricted
    Authors:
    Turchi, Marco; Negri, Matteo; Chatterjee, Rajen;
    Publisher: Fondazione Bruno Kessler, Trento, Italy
    Project: EC | QT21 (645452)

    Training and development data for the WMT 2018 Automatic post-editing task. They consist in English-German triplets (source, target and post-edit) belonging to the information technology domain and already tokenized. Training and development respectively contain 13,442 ...

  • research data . 2017 . Embargo End Date: 17 Oct 2017
    Restricted
    Authors:
    Turchi, Marco; Chatterjee, Rajen; Negri, Matteo;
    Publisher: Fondazione Bruno Kessler, Trento, Italy
    Project: EC | QT21 (645452)

    Human post-edited test sentences for the WMT 2017 Automatic post-editing task. This consists in 2,000 German sentences belonging to the IT domain and already tokenized. Source and target segments can be downloaded from: https://lindat.mff.cuni.cz/repository/xmlui/handle...

59 research outcomes, page 2 of 6