Quick search
Advanced search in
Research outcomes
Field to searchTerm
Add rule
The following results are related to Digital Humanities and Cultural Heritage. Are you interested to view more results? Visit OpenAIRE - Explore.
Download Results
55 research outcomes, page 4 of 6
  • research data . 2016 . Embargo End Date: 14 Jun 2016
    Open Access
    Authors:
    Cífka, Ondřej; Bojar, Ondřej;
    Persistent Identifiers
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | QT21 (645452)

    This small dataset contains 3 speech corpora collected using the Alex Translate telephone service (https://ufal.mff.cuni.cz/alex#alex-translate). The "part1" and "part2" corpora contain English speech with transcriptions and Czech translations. These recordings were col...

    Add to ORCID
  • research data . 2016 . Embargo End Date: 01 Apr 2016
    Open Access
    Authors:
    Bojar, Ondřej; Děchtěrenko, Filip; Zelenina, Maria;
    Persistent Identifiers
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | QT21 (645452)

    This package contains the eye-tracker recordings of 8 subjects evaluating English-to-Czech machine translation quality using the WMT-style ranking of sentences. We provide the set of sentences evaluated, the exact screens presented to the annotators (including bounding ...

    Add to ORCID
  • research data . 2016 . Embargo End Date: 30 Mar 2016
    Open Access
    Authors:
    Nedoluzhko, Anna; Novák, Michal; Cinková, Silvie; Mikulová, Marie; Mírovský, Jiří;
    Persistent Identifiers
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | QTLEAP (610516)

    The Prague Czech-English Dependency Treebank 2.0 Coref (PCEDT 2.0 Coref) is a parallel treebank building upon the original PCEDT 2.0 release and enriching it with the extended manual annotation of coreference, as well as with an improved automatic annotation of the core...

    Add to ORCID
  • research data . 2016 . Embargo End Date: 22 Mar 2016
    Open Access
    Authors:
    Kamran, Amir; Jawaid, Bushra; Bojar, Ondřej; Stanojevic, Milos;
    Persistent Identifiers
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | QT21 (645452)

    This item contains models to tune for the WMT16 Tuning shared task for English-to-Czech. CzEng 1.6pre (http://ufal.mff.cuni.cz/czeng/czeng16pre) corpus is used for the training of the translation models. The data is tokenized (using Moses tokenizer), lowercased and sent...

    Add to ORCID
  • research data . 2016 . Embargo End Date: 22 Mar 2016
    Open Access
    Authors:
    Kamran, Amir; Jawaid, Bushra; Bojar, Ondřej; Stanojevic, Milos;
    Persistent Identifiers
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | QT21 (645452)

    The item contains models to tune for the WMT16 Tuning shared task for Czech-to-English. CzEng 1.6pre (http://ufal.mff.cuni.cz/czeng/czeng16pre) corpus is used for the training of the translation models. The data is tokenized (using Moses tokenizer), lowercased and sente...

    Add to ORCID
  • research data . 2016 . Embargo End Date: 29 Feb 2016
    Restricted
    Authors:
    Specia, Lucia; Logacheva, Varvara; Scarton, Carolina;
    Persistent Identifiers
    Publisher: University of Sheffield
    Project: EC | QT21 (645452)

    Training and development data for the WMT16 QE task. Test data will be published as a separate item. This shared task will build on its previous four editions to further examine automatic methods for estimating the quality of machine translation output at run-time, with...

    Add to ORCID
  • research data . 2016
    Open Access
    Authors:
    Springmann, Uwe; Fink, Florian;
    Publisher: Zenodo
    Project: EC | CLARIN (212230)

    <p>The 2-day CIS OCR Workshop on &quot;OCR and postcorrection of early printings for digital humanities&quot; originally held at LMU, Munich 14/15 September 2015 (see http://www.cis.lmu.de/ocrworkshop).</p> <p>Release date: 2016-02-25</p> <p><br /> CIS OCR Workshop by U...

    Add to ORCID
  • research data . 2016 . Embargo End Date: 21 Feb 2016
    Restricted
    Authors:
    Turchi, Marco; Chatterjee, Rajen; Negri, Matteo;
    Persistent Identifiers
    Publisher: Fondazione Bruno Kessler, Trento, Italy
    Project: EC | QT21 (645452)

    Training, development and text data (the same used for the Sentence-level Quality Estimation task) consist in English-German triplets (source, target and post-edit) belonging to the IT domain and already tokenized. Training and development respectively contain 12,000 an...

    Add to ORCID
  • research data . 2016 . Embargo End Date: 07 Dec 2016
    Open Access
    Authors:
    Hercig, Tomáš; Brychcín, Tomáš; Svoboda, Lukáš; Konkol, Michal; Steinberger, Josef;
    Persistent Identifiers
    Publisher: University of West Bohemia, Department of Computer Science and Engineering
    Project: EC | MEDIAGIST (630786)

    Restaurant Reviews CZ ABSA - 2.15k reviews with their related target and category The work done is described in the paper: https://doi.org/10.13053/CyS-20-3-2469

    Add to ORCID
  • research data . 2015 . Embargo End Date: 25 Dec 2015
    Open Access
    Authors:
    Hoang, Duc Tam; Bojar, Ondřej;
    Persistent Identifiers
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | QT21 (645452)

    CsEnVi Pairwise Parallel Corpora consist of Vietnamese-Czech parallel corpus and Vietnamese-English parallel corpus. The corpora were assembled from the following sources: - OPUS, the open parallel corpus is a growing multilingual corpus of translated open source docume...

    Add to ORCID
55 research outcomes, page 4 of 6