The following results are related to Digital Humanities and Cultural Heritage.
59 research outcomes, page 5 of 6
  • research data . 2015 . Embargo End Date: 25 Dec 2015
    Open Access
    Authors:
    Hoang, Duc Tam; Bojar, Ondřej;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | QT21 (645452)

    CsEnVi Pairwise Parallel Corpora consist of a Vietnamese-Czech parallel corpus and a Vietnamese-English parallel corpus. The corpora were assembled from the following sources: - OPUS, the open parallel corpus, is a growing multilingual corpus of translated open source docume...

  • research data . 2015 . Embargo End Date: 16 May 2015
    Open Access
    Authors:
    Agirre, Eneko; Branco, António; Popel, Martin; Simov, Kiril;
    Publisher: University of the Basque Country, UPV/EHU
    Project: EC | QTLEAP (610516)

    This corpus is part of Deliverable 5.5 of the European Commission project QTLeap FP7-ICT-2013.4.1-610516 (http://qtleap.eu). The texts are sentences from the Europarl parallel corpus (Koehn, 2005). We selected the monolingual sentences from parallel corpora for the fol...

  • research data . 2015 . Embargo End Date: 15 May 2015
    Open Access
    Authors:
    Agirre, Eneko; Branco, António; Popel, Martin; Simov, Kiril;
    Publisher: University of the Basque Country, UPV/EHU
    Project: EC | QTLEAP (610516)

    This corpus is part of Deliverable 5.5 of the European Commission project QTLeap FP7-ICT-2013.4.1-610516 (http://qtleap.eu). The texts are Q&A interactions from the real-user scenario (batches 1 and 2). The interactions in this corpus are available in Basque, Bulgarian...

  • research data . 2014 . Embargo End Date: 28 Apr 2014
    Open Access
    Authors:
    Dušek, Ondřej; Hajič, Jan; Hlaváčová, Jaroslava; Pecina, Pavel; Tamchyna, Aleš; Urešová, Zdeňka;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | KHRESMOI (257528)

    This package contains data sets for development and testing of machine translation of sentences from summaries of medical articles between Czech, English, French, and German.

  • research data . 2014 . Embargo End Date: 27 Mar 2014
    Open Access
    Authors:
    Jawaid, Bushra; Kamran, Amir; Bojar, Ondřej;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | MOSESCORE (288487)

    We release a sizeable monolingual Urdu corpus automatically tagged with part-of-speech tags. We extend the work of Jawaid and Bojar (2012) who use three different taggers and then apply a voting scheme to disambiguate among the different choices suggested by each tagger...

  • research data . 2013 . Embargo End Date: 02 Apr 2014
    Open Access
    Authors:
    Pecina, Pavel; Dušek, Ondřej; Hajič, Jan; Urešová, Zdeňka;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | KHRESMOI (257528)

    This package contains data sets for development and testing of machine translation of short medical search queries between Czech, English, French, and German. The queries come from the general public and medical experts.

  • research data . 2013 . Embargo End Date: 10 Dec 2013
    Open Access
    Authors:
    Bojar, Ondřej; Macháček, Matouš; Tamchyna, Aleš; Zeman, Daniel;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | MOSESCORE (288487)

    This dataset contains the whole set of very many Czech translations for 50 English source sentences from the WMT11 test set (http://www.statmt.org/wmt11). In total, there are 15431447 Czech sentences, i.e. 300k reference translations per source English sentence on av...

  • research data . 2012 . Embargo End Date: 13 Nov 2012
    Open Access
    Authors:
    Bojar, Ondřej; Zeman, Daniel; Dušek, Ondřej; Břečková, Jana; Farkačová, Hana; Grošpic, Pavel; Kačenová, Kristýna; Knechtová, Eva; Koubová, Anna; Lukavská, Jana; ...
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | EUROMATRIXPLUS (231720)

    Three additional Czech reference translations of the whole WMT 2011 data set (http://www.statmt.org/wmt11/test.tgz), translated from the German originals. The original segmentation of the WMT 2011 data is preserved.

  • research data . 2012 . Embargo End Date: 15 May 2012
    Open Access
    Authors:
    Galuščáková, Petra; Garabík, Radovan; Bojar, Ondřej;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | EUROMATRIXPLUS (231720)

    Czech-Slovak parallel corpus consisting of several freely available corpora (Acquis [1], Europarl [2], Official Journal of the European Union [3] and part of OPUS corpus [4] – EMEA, EUConst, KDE4 and PHP) and downloaded website of European Commission [5]. Corpus is publ...

  • research data . 2012 . Embargo End Date: 15 May 2012
    Open Access
    Authors:
    Galuščáková, Petra; Garabík, Radovan; Bojar, Ondřej;
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Project: EC | EUROMATRIXPLUS (231720)

    English-Slovak parallel corpus consisting of several freely available corpora (Acquis [1], Europarl [2], Official Journal of the European Union [3] and part of OPUS corpus [4] – EMEA, EUConst, KDE4 and PHP) and downloaded website of European Commission [5]. Corpus is pu...
