research product . Other ORP type . 2021

Dataset for Logical-layout analysis on French historical newspapers

Gutehrlé, Nicolas; Atanassova, Iana;
English
  • Published: 03 Dec 2021
  • Publisher: HAL CCSD
  • Country: France
Abstract
This dataset is intended for training and testing Logical Layout Analysis and recognition system on French historical documents published between 1900 and 1950. The original data is part of the "Fond régional: Franche-Comté", which is curated by Gallica, the digital portal of the Bibliothèque Nationale de France (BnF). It is available on Zenodo at the following adress: https://zenodo.org/record/5752440#.YboX6lPjJhE; This dataset is intended for training and testing Logical Layout Analysis and recognition system on French historical documents published between 1900 and 1950. The original data is part of the "Fond régional: Franche-Comté", which is curated by Gallica, the digital portal of the Bibliothèque Nationale de France (BnF). It is available on Zenodo at the following adress: https://zenodo.org/record/5752440#.YboX6lPjJhE
Subjects
free text keywords: Historical Newspapers, Logical Layout, Natural Language Processing, [SHS.HIST]Humanities and Social Sciences/History, [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing, [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
Related Organizations
Any information missing or wrong?Report an Issue