

Annotated Corpus For Occitan
This corpus contains a collection of texts in Occitan that were manually annotated with parts of speech and lemmas. The corpus was produced in the context of the RESTAURE project, funded by the French National Research Agency (ANR). The current version of the corpus contains 28 documents and 12,425 tokens. The annotated versions are provided in a tab-separated (TSV) CoNLL-U format. The annotation process is detailed in the following article: http://hal.archives-ouvertes.fr/hal-01704806
Lemma, FOS: Languages and literature, Linguistics, Occitan, Corpus, Part Of Speech, Natural Language Processing
- 468 views, 56 downloads



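Since the annotated files are distributed in a tab-separated CoNLL-U format, the following is a minimal sketch of how such a file might be read to recover forms, lemmas, and part-of-speech tags. It assumes the standard 10-column CoNLL-U layout (ID, FORM, LEMMA, UPOS, ...), which the description does not spell out, so the column positions may need adjusting for the actual files. The sample sentence and the `read_conllu` helper are invented for illustration and are not taken from the corpus.

```python
import io
from collections import Counter

# Invented sample, NOT taken from the corpus. Assumes the standard
# 10-column CoNLL-U layout: ID, FORM, LEMMA, UPOS, then six more columns.
SAMPLE = (
    "# sent_id = demo-1\n"
    "1\tLo\tlo\tDET\t_\t_\t_\t_\t_\t_\n"
    "2\tcan\tcan\tNOUN\t_\t_\t_\t_\t_\t_\n"
    "3\tdormís\tdormir\tVERB\t_\t_\t_\t_\t_\t_\n"
    "\n"
)


def read_conllu(stream):
    """Yield each sentence as a list of (form, lemma, upos) tuples."""
    sentence = []
    for line in stream:
        line = line.rstrip("\r\n")
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            # Sentence-level comment or metadata line.
            continue
        cols = line.split("\t")
        # Skip malformed lines, multiword-token ranges ("1-2")
        # and empty nodes ("1.1"), whose IDs are not plain integers.
        if len(cols) < 4 or not cols[0].isdigit():
            continue
        sentence.append((cols[1], cols[2], cols[3]))
    if sentence:
        # Handle a final sentence with no trailing blank line.
        yield sentence


sentences = list(read_conllu(io.StringIO(SAMPLE)))
pos_counts = Counter(upos for sent in sentences for _, _, upos in sent)
```

To apply the same function to a downloaded file, the `io.StringIO(SAMPLE)` stream can be replaced with `open(path, encoding="utf-8")`, where `path` is whatever local name the file was saved under.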