research data . Dataset . 2021 . Embargo end date: 24 May 2021

Ekspress user comment dataset 1.0

Shekhar, Ravi; Pollak, Senja; Pelicon, Andraž; Purver, Matthew; Krustok, Ivar
Open Access
  • Published: 19 Apr 2021
  • Publisher: Ekspress Meedia Group
Abstract
This dataset is an archive of reader comments on the Ekspress Meedia news site from 2009-2019, containing approximately 31M comments, mostly in the Estonian language, with some in Russian.

Description of the Datasets. There are 11 CSV files:
  • comments_2009.csv contains 2 898 438 comments from the year 2009
  • comments_2010.csv contains 2 377 591 comments from the year 2010
  • comments_2011.csv contains 2 729 389 comments from the year 2011
  • comments_2012.csv contains 3 372 776 comments from the year 2012
  • comments_2013.csv contains 3 289 393 comments from the year 2013
  • comments_2014.csv contains 3 195 502 comments from the year 2014
  • comments_2015.csv contains 3 202 592 c...
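A downloaded copy can be checked against the per-file counts given in the abstract. The following is a minimal Python sketch, not part of the dataset's own tooling. The names `LISTED_COUNTS`, `listed_total` and `count_rows` are hypothetical, and the sketch assumes that each CSV file is UTF-8 encoded and starts with a header row; neither is confirmed by this record. Only the seven files whose counts appear before the abstract is cut off are included.

```python
import csv

# Per-file comment counts as listed in the abstract.
# The abstract is truncated after 2015, so later years are not included.
LISTED_COUNTS = {
    "comments_2009.csv": 2_898_438,
    "comments_2010.csv": 2_377_591,
    "comments_2011.csv": 2_729_389,
    "comments_2012.csv": 3_372_776,
    "comments_2013.csv": 3_289_393,
    "comments_2014.csv": 3_195_502,
    "comments_2015.csv": 3_202_592,
}


def listed_total(counts=LISTED_COUNTS):
    """Sum the comment counts of the files listed in the abstract."""
    return sum(counts.values())


def count_rows(path, has_header=True):
    """Stream a CSV file and count its records without loading it into memory.

    csv.reader is used instead of counting raw lines because a comment
    may contain line breaks inside a quoted field.
    Assumption: UTF-8 encoding and (by default) one header row.
    """
    with open(path, newline="", encoding="utf-8") as f:
        n = sum(1 for _ in csv.reader(f))
    return max(n - 1, 0) if has_header else n


if __name__ == "__main__":
    print(f"Comments in the listed files (2009-2015): {listed_total():,}")
```

For a local copy, `count_rows("comments_2009.csv")` could then be compared with `LISTED_COUNTS["comments_2009.csv"]`. If the files have no header row, pass `has_header=False`.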
Funded by
EC| EMBEDDIA
Project
EMBEDDIA
Cross-Lingual Embeddings for Less-Represented Languages in European News Media
  • Funder: European Commission (EC)
  • Project Code: 825153
  • Funding stream: H2020 | RIA
Communities
Digital Humanities and Cultural Heritage