- home
- Advanced Search
Filters
Clear AllLoading
apps Other research productkeyboard_double_arrow_right Other ORP type 2016 FinnishJožef Stefan Institute Authors: Ljubešić, Nikola; Pirinen, Tommi; Toral, Antonio;Ljubešić, Nikola; Pirinen, Tommi; Toral, Antonio;handle: http://hdl.handle.net/11356/1074 , 11356/1074
The Finnish web corpus fiWaC was built by crawling the .fi top-level domain in 2015 for both Finnish and English documents. The corpus was naively tokenised (via spaces), near-deduplicated on paragraph level and paragraph-shuffled. Each paragraph contains metadata on the URL and language identification. The Finnish (~1.7B tokens) and English (~2B tokens) parts of the corpus are organised in separate files.
add ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=11356/1074&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!
more_vert add ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=11356/1074&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research productkeyboard_double_arrow_right Other ORP type 2021 FinnishZenodo Authors: Grünewald, Stefan; Friedrich, Annemarie; Kuhn, Jonas;Grünewald, Stefan; Friedrich, Annemarie; Kuhn, Jonas;Pre-trained models to parse Finnish text using the STEPS dependency parser (https://github.com/boschresearch/steps-parser).
add ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.4686609&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!
visibility 17visibility views 17 download downloads 6 Powered bymore_vert add ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.4686609&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research productkeyboard_double_arrow_right Other ORP type 2017 United Kingdom, Belgium FinnishAuthors: Laes, Christian; Sacré, Dirk;Laes, Christian; Sacré, Dirk;handle: 10067/1476540151162165141
The University of Ma... arrow_drop_down The University of Manchester - Institutional RepositoryOther ORP type . 2017Data sources: The University of Manchester - Institutional RepositoryInstitutional Repository Universiteit AntwerpenOther ORP type . 2017Data sources: Institutional Repository Universiteit Antwerpenadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10067/1476540151162165141&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!
visibility 0visibility views 0 download downloads 1 Powered bymore_vert The University of Ma... arrow_drop_down The University of Manchester - Institutional RepositoryOther ORP type . 2017Data sources: The University of Manchester - Institutional RepositoryInstitutional Repository Universiteit AntwerpenOther ORP type . 2017Data sources: Institutional Repository Universiteit Antwerpenadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10067/1476540151162165141&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research productkeyboard_double_arrow_right Other ORP type 2017 Sweden FinnishKTH, Filosofi och historia Authors: Dahl, Justiina;Dahl, Justiina;Kanadan ja Suomen Akrtiksen hallinnon kehittämistä ovat itsenäisyyden ajan hallinneet kaksi samanlaista keskushallinntoa vahvistavaa ongelmanasettelua. QC 20171127 EU016
All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______681::3b484426974cbce39d36002890a8b89e&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!
more_vert All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______681::3b484426974cbce39d36002890a8b89e&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu
Loading
apps Other research productkeyboard_double_arrow_right Other ORP type 2016 FinnishJožef Stefan Institute Authors: Ljubešić, Nikola; Pirinen, Tommi; Toral, Antonio;Ljubešić, Nikola; Pirinen, Tommi; Toral, Antonio;handle: http://hdl.handle.net/11356/1074 , 11356/1074
The Finnish web corpus fiWaC was built by crawling the .fi top-level domain in 2015 for both Finnish and English documents. The corpus was naively tokenised (via spaces), near-deduplicated on paragraph level and paragraph-shuffled. Each paragraph contains metadata on the URL and language identification. The Finnish (~1.7B tokens) and English (~2B tokens) parts of the corpus are organised in separate files.
add ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=11356/1074&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!
more_vert add ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=11356/1074&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research productkeyboard_double_arrow_right Other ORP type 2021 FinnishZenodo Authors: Grünewald, Stefan; Friedrich, Annemarie; Kuhn, Jonas;Grünewald, Stefan; Friedrich, Annemarie; Kuhn, Jonas;Pre-trained models to parse Finnish text using the STEPS dependency parser (https://github.com/boschresearch/steps-parser).
add ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.4686609&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!
visibility 17visibility views 17 download downloads 6 Powered bymore_vert add ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.4686609&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research productkeyboard_double_arrow_right Other ORP type 2017 United Kingdom, Belgium FinnishAuthors: Laes, Christian; Sacré, Dirk;Laes, Christian; Sacré, Dirk;handle: 10067/1476540151162165141
The University of Ma... arrow_drop_down The University of Manchester - Institutional RepositoryOther ORP type . 2017Data sources: The University of Manchester - Institutional RepositoryInstitutional Repository Universiteit AntwerpenOther ORP type . 2017Data sources: Institutional Repository Universiteit Antwerpenadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10067/1476540151162165141&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!
visibility 0visibility views 0 download downloads 1 Powered bymore_vert The University of Ma... arrow_drop_down The University of Manchester - Institutional RepositoryOther ORP type . 2017Data sources: The University of Manchester - Institutional RepositoryInstitutional Repository Universiteit AntwerpenOther ORP type . 2017Data sources: Institutional Repository Universiteit Antwerpenadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10067/1476540151162165141&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research productkeyboard_double_arrow_right Other ORP type 2017 Sweden FinnishKTH, Filosofi och historia Authors: Dahl, Justiina;Dahl, Justiina;Kanadan ja Suomen Akrtiksen hallinnon kehittämistä ovat itsenäisyyden ajan hallinneet kaksi samanlaista keskushallinntoa vahvistavaa ongelmanasettelua. QC 20171127 EU016
All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______681::3b484426974cbce39d36002890a8b89e&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!
more_vert All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______681::3b484426974cbce39d36002890a8b89e&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu