Quick search
Advanced search in
Research outcomes
Field to searchTerm
Add rule
The following results are related to Digital Humanities and Cultural Heritage. Are you interested to view more results? Visit OpenAIRE - Explore.
Download Results
59 research outcomes, page 4 of 6
  • research data . 2017 . Embargo End Date: 15 Feb 2017
    Restricted
    Authors:
    Turchi, Marco; Chatterjee, Rajen; Negri, Matteo;
    Publisher: Fondazione Bruno Kessler, Trento, Italy
    Project: EC | QT21 (645452)

    Training data for the WMT 2017 Automatic post-editing task (the same used for the Sentence-level Quality Estimation task). They consist in 11,000 English-German triplets (source, target and post-edit) belonging to the IT domain and already tokenized. All data is provide...

  • research data . 2016 . Embargo End Date: 14 Jun 2016
    Open Access
    Authors:
    Cífka, Ondřej; Bojar, Ondřej;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | QT21 (645452)

    This small dataset contains 3 speech corpora collected using the Alex Translate telephone service (https://ufal.mff.cuni.cz/alex#alex-translate). The "part1" and "part2" corpora contain English speech with transcriptions and Czech translations. These recordings were col...

  • research data . 2016 . Embargo End Date: 01 Apr 2016
    Open Access
    Authors:
    Bojar, Ondřej; Děchtěrenko, Filip; Zelenina, Maria;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | QT21 (645452)

    This package contains the eye-tracker recordings of 8 subjects evaluating English-to-Czech machine translation quality using the WMT-style ranking of sentences. We provide the set of sentences evaluated, the exact screens presented to the annotators (including bounding ...

  • research data . 2016 . Embargo End Date: 30 Mar 2016
    Open Access
    Authors:
    Nedoluzhko, Anna; Novák, Michal; Cinková, Silvie; Mikulová, Marie; Mírovský, Jiří;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | QTLEAP (610516)

    The Prague Czech-English Dependency Treebank 2.0 Coref (PCEDT 2.0 Coref) is a parallel treebank building upon the original PCEDT 2.0 release and enriching it with the extended manual annotation of coreference, as well as with an improved automatic annotation of the core...

  • research data . 2016 . Embargo End Date: 22 Mar 2016
    Open Access
    Authors:
    Kamran, Amir; Jawaid, Bushra; Bojar, Ondřej; Stanojevic, Milos;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | QT21 (645452)

    The item contains models to tune for the WMT16 Tuning shared task for Czech-to-English. CzEng 1.6pre (http://ufal.mff.cuni.cz/czeng/czeng16pre) corpus is used for the training of the translation models. The data is tokenized (using Moses tokenizer), lowercased and sente...

  • research data . 2016 . Embargo End Date: 22 Mar 2016
    Open Access
    Authors:
    Kamran, Amir; Jawaid, Bushra; Bojar, Ondřej; Stanojevic, Milos;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | QT21 (645452)

    This item contains models to tune for the WMT16 Tuning shared task for English-to-Czech. CzEng 1.6pre (http://ufal.mff.cuni.cz/czeng/czeng16pre) corpus is used for the training of the translation models. The data is tokenized (using Moses tokenizer), lowercased and sent...

  • research data . 2016 . Embargo End Date: 29 Feb 2016
    Restricted
    Authors:
    Specia, Lucia; Logacheva, Varvara; Scarton, Carolina;
    Publisher: University of Sheffield
    Project: EC | QT21 (645452)

    Training and development data for the WMT16 QE task. Test data will be published as a separate item. This shared task will build on its previous four editions to further examine automatic methods for estimating the quality of machine translation output at run-time, with...

  • research data . 2016
    Open Access
    Authors:
    Springmann, Uwe; Fink, Florian;
    Publisher: Zenodo
    Project: EC | CLARIN (212230)

    <p>The 2-day CIS OCR Workshop on &quot;OCR and postcorrection of early printings for digital humanities&quot; originally held at LMU, Munich 14/15 September 2015 (see http://www.cis.lmu.de/ocrworkshop).</p> <p>Release date: 2016-02-25</p> <p><br /> CIS OCR Workshop by U...

  • research data . 2016 . Embargo End Date: 21 Feb 2016
    Restricted
    Authors:
    Turchi, Marco; Chatterjee, Rajen; Negri, Matteo;
    Publisher: Fondazione Bruno Kessler, Trento, Italy
    Project: EC | QT21 (645452)

    Training, development and text data (the same used for the Sentence-level Quality Estimation task) consist in English-German triplets (source, target and post-edit) belonging to the IT domain and already tokenized. Training and development respectively contain 12,000 an...

  • research data . 2016 . Embargo End Date: 07 Dec 2016
    Open Access
    Authors:
    Hercig, Tomáš; Brychcín, Tomáš; Svoboda, Lukáš; Konkol, Michal; Steinberger, Josef;
    Publisher: University of West Bohemia, Department of Computer Science and Engineering
    Project: EC | MEDIAGIST (630786)

    Restaurant Reviews CZ ABSA - 2.15k reviews with their related target and category The work done is described in the paper: https://doi.org/10.13053/CyS-20-3-2469

59 research outcomes, page 4 of 6