Publication: Article, 2021
Countries: Austria, Finland
Publisher: Research in Corpus Linguistics (publicly funded)
Authors: Čermáková, Ann; Jantunen, Jarmo; Jauhiainen, Tommi; Kirk, John; Křen, Michal; Kupietz, Marc; Uí Dhonnchadha, Elaine
Handle: 11353/10.1602684, 10138/332856
This paper reports on the efforts of twelve national teams in building the International Comparable Corpus (ICC; https://korpus.cz/icc), which will contain highly comparable datasets of spoken, written and electronic registers. The languages currently covered are Czech, Finnish, French, German, Irish, Italian, Norwegian, Polish, Slovak, Swedish and, more recently, Chinese, as well as English, which is considered to be the pivot language. The goal of the project is to provide much-needed data for contrastive corpus-based linguistics. The ICC is committed to re-using existing multilingual resources as much as possible, and its design is modelled, with various adjustments, on the International Corpus of English (ICE). As such, ICC will contain approximately the same balance of 40 percent written language and 60 percent spoken language, distributed across 27 different text types and contexts. A number of issues encountered by the project teams are discussed, ranging from copyright and data sustainability to technical advances in data distribution.
Data sources:
- Jyväskylä University Digital Archive: Article, 2021, peer-reviewed
- Research in Corpus Linguistics; Permanent Hosting, Archiving and Indexing of Digital Resources and Assets: Article, 2021, peer-reviewed, license CC BY
- HELDA - Digital Repository of the University of Helsinki: Article, 2021, peer-reviewed
Access routes: green, gold. Citations: 2. Popularity: average. Influence: average. Impulse: average (metrics powered by BIP!).