research data . Dataset . 2012 . Embargo end date: 15 May 2012

English-Slovak Parallel Corpus

Galuščáková, Petra; Garabík, Radovan; Bojar, Ondřej;
Open Access
  • Published: 15 May 2012
  • Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Abstract
English-Slovak parallel corpus consisting of several freely available corpora (Acquis [1], Europarl [2], Official Journal of the European Union [3] and part of OPUS corpus [4] – EMEA, EUConst, KDE4 and PHP) and downloaded website of European Commission [5]. Corpus is published in both in plaintext format and with an automatic morphological annotation. References: [1] http://langtech.jrc.it/JRC-Acquis.html/ [2] http://www.statmt.org/europarl/ [3] http://apertium.eu/data [4] http://opus.lingfil.uu.se/ [5] http://ec.europa.eu/
Persistent Identifiers
Funded by
EC| EUROMATRIXPLUS
Project
EUROMATRIXPLUS
Bringing Machine Translation for European Languages to the User
  • Funder: European Commission (EC)
  • Project Code: 231720
  • Funding stream: FP7 | SP1 | ICT
Communities
CLARIN
Digital Humanities and Cultural Heritage
Download from
Any information missing or wrong?Report an Issue