software . 2017 . Embargo end date: 06 Apr 2017

Slavic Forest, Norwegian Wood (scripts)

Rosa, Rudolf; Zeman, Daniel; Mareček, David; Žabokrtský, Zdeněk
Open Access
  • Published: 28 Jan 2017
  • Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Abstract
Tools and scripts used to create the cross-lingual parsing models submitted to the VarDial 2017 shared task (https://bitbucket.org/hy-crossNLP/vardial2017), as described in the linked paper. The trained UDPipe models themselves are published in a separate submission (https://lindat.mff.cuni.cz/repository/xmlui/handle/11234/1-1971).

For each source (SS, e.g. sl) and target (TT, e.g. hr) language, you need to add the following into this directory:
- treebanks (Universal Dependencies v1.4):
  SS-ud-train.conllu
  TT-ud-predPoS-dev.conllu
- parallel data (OpenSubtitles from Opus):
  OpenSubtitles2016.SS-TT.SS
  OpenSubtitles2016.SS-TT.TT
!!! If they are originally called ...TT-...
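The file naming scheme in the abstract can be checked before running anything. The following is a minimal sketch, not part of the published scripts: the helper names `required_files` and `missing_files` are hypothetical, and it only tests for the four file names exactly as spelled above. It does not handle the renaming case that the truncated warning at the end of the abstract refers to.

```python
from pathlib import Path
from typing import List


def required_files(src: str, tgt: str) -> List[str]:
    """Return the four file names the abstract lists for one language pair.

    src is the source language code (SS), tgt the target language code (TT).
    """
    return [
        f"{src}-ud-train.conllu",                # source treebank, UD v1.4
        f"{tgt}-ud-predPoS-dev.conllu",          # target dev treebank
        f"OpenSubtitles2016.{src}-{tgt}.{src}",  # parallel data, source side
        f"OpenSubtitles2016.{src}-{tgt}.{tgt}",  # parallel data, target side
    ]


def missing_files(src: str, tgt: str, directory: str = ".") -> List[str]:
    """Return the expected file names that are not present in `directory`."""
    base = Path(directory)
    return [name for name in required_files(src, tgt)
            if not (base / name).is_file()]


if __name__ == "__main__":
    # Example pair from the abstract: Slovenian source, Croatian target.
    for name in missing_files("sl", "hr"):
        print("missing:", name)
```

Running the script in the directory that holds the scripts prints one line per absent file and nothing when all four are in place.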
Funded by
EC| HimL
Project
HimL
Health in my Language
  • Funder: European Commission (EC)
  • Project Code: 644402
  • Funding stream: H2020 | IA
Communities
CLARIN
Digital Humanities and Cultural Heritage