- home
- Advanced Search
- Digital Humanities and Cultural Heritage
- Publications
- Research data
- European Commission
- EC|H2020
- English
- Hyper Article en Ligne - Sciences d...
- OpenAIRE
- Digital Humanities and Cultural Her...
- Digital Humanities and Cultural Heritage
- Publications
- Research data
- European Commission
- EC|H2020
- English
- Hyper Article en Ligne - Sciences d...
- OpenAIRE
- Digital Humanities and Cultural Her...
Loading
description Publicationkeyboard_double_arrow_right Article 2021 France EnglishPublisher:HAL CCSD Funded by:EC | TibArmyEC| TibArmyAuthors: Travers, Alice; Venturi, Federica;Travers, Alice; Venturi, Federica;International audience
Annali di Ca’ Foscar... arrow_drop_down HAL Descartes; Mémoires en Sciences de l'Information et de la CommunicationArticle . 2021License: CC BYFull-Text: https://hal.science/hal-03512896/documentAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______177::63b9af45a81b7c1a555b8fc51a851442&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euAccess Routesgold 0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert Annali di Ca’ Foscar... arrow_drop_down HAL Descartes; Mémoires en Sciences de l'Information et de la CommunicationArticle . 2021License: CC BYFull-Text: https://hal.science/hal-03512896/documentAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______177::63b9af45a81b7c1a555b8fc51a851442&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Conference object , Article , Other literature type 2021 France EnglishPublisher:Zenodo Funded by:EC | ELEXISEC| ELEXISAhmadi, Sina; Constant, Mathieu; Fort, Karën; Guillaume, Bruno; McCrae, John,;Nous présentons dans ce papier les travaux que nous avons réalisés pour convertir dans le modèle Ontolex-Lemon l'une des plus importantes ressources lexicographiques pour le français : le Trésor de la Langue Française. En effet, malgré l'utilisation généralisée de cette ressource, son format actuel, basé sur XML, ne respecte pas les standards les plus récents de la représentation des données lexicographiques, notamment ceux basés sur les données liées. Nos travaux mettent en lumière la nécessité d'établir des mécanismes permettant d'augmenter l'inter-opérabilité des ressources et des technologies pour créer et maintenir des ressources lexicographiques. In this paper, we report our efforts to convert one of the most comprehensive lexicographic resources of French, the Trésor de la Langue Française, into the Ontolex-Lemon model. Despite the widespread usage of this resource, the original XML format seems to impede its integration in language technology tools. In order to breathe new life into this resource, we examine the usage and the conversion to more interoperable formats, primarily those based on the linguistic linked data, to provide this resource to a broader range of applications and users. National audience
HAL-Rennes 1; INRIA ... arrow_drop_down Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotConference object . 2021Full-Text: https://hal.inria.fr/hal-03463294/documentadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.5772045&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euAccess RoutesGreen 0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 36visibility views 36 download downloads 29 Powered bymore_vert HAL-Rennes 1; INRIA ... arrow_drop_down Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotConference object . 2021Full-Text: https://hal.inria.fr/hal-03463294/documentadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.5772045&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Report , Other literature type 2021 Croatia, France EnglishPublisher:Zenodo Funded by:EC | OPERAS-PEC| OPERAS-PMaryl, Maciej; Błaszczyńska, Marta; Zalotyńska, Agnieszka; Taylor, Laurence; Avanço, Karla; Balula, Ana; Buchner, Anna; Caliman, Lorena; Clivaz, Claire; Costa, Carlos; Franczak, Mateusz; Gatti, Rupert; Giglia, Elena; Gingold, Arnaud; Jarmelo, Susana; Padez, Maria,; Leão, Delfim; Melinščak Zlodi, Iva; Mojsak, Kajetan; Morka, Agata; Mosterd, Tom; Nury, Elisa; Plag, Cornelia; Schafer, Valérie; Silva, Mickael; Stojanovski, Jadranka; Szleszyński, Bartłomiej; Szulińska, Agnieszka; Tóth-Czifra, Erzsébet; Wciślik, Piotr; Wieneke, Lars;This report discusses the scholarly communication issues in Social Sciences and Humanities that are relevant to the future development and functioning of OPERAS. The outcomes collected here can be divided into two groups of innovations regarding 1) the operation of OPERAS, and 2) its activities. The “operational” issues include the ways in which an innovative research infrastructure should be governed (Chapter 1) as well as the business models for open access publications in Social Sciences and Humanities (Chapter 2). The other group of issues is dedicated to strategic areas where OPERAS and its services may play an instrumental role in providing, enabling, or unlocking innovation: FAIR data (Chapter 3), bibliodiversity and multilingualism in scholarly communication (Chapter 4), the future of scholarly writing (Chapter 5), and quality assessment (Chapter 6). Each chapter provides an overview of the main findings and challenges with emphasis on recommendations for OPERAS and other stakeholders like e- infrastructures, publishers, SSH researchers, research performing organisations, policy makers, and funders. Links to data and further publications stemming from work concerning particular tasks are located at the end of each chapter.
Croatian Scientific ... arrow_drop_down Croatian Scientific Bibliography - CROSBIOther literature type . 2021Data sources: Croatian Scientific Bibliography - CROSBIMémoires en Sciences de l'Information et de la Communication; HAL AMUReport . 2021License: CC BYFull-Text: https://hal.science/hal-03277615/documentHyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotReport . 2021License: CC BYadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.5017705&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euAccess RoutesGreen 1 citations 1 popularity Average influence Average impulse Average Powered by BIP!visibility 4Kvisibility views 4,435 download downloads 2,726 Powered bymore_vert Croatian Scientific ... arrow_drop_down Croatian Scientific Bibliography - CROSBIOther literature type . 2021Data sources: Croatian Scientific Bibliography - CROSBIMémoires en Sciences de l'Information et de la Communication; HAL AMUReport . 2021License: CC BYFull-Text: https://hal.science/hal-03277615/documentHyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotReport . 2021License: CC BYadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.5017705&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Article 2021 Italy, France, Lithuania, Spain EnglishPublisher:Archäologische Informationen Funded by:EC | CLIOARCHEC| CLIOARCHNaudinot, Nicolas; Hussain, Shumon,; Matzig, David,; Fontana, Federica; Groß, Daniel; Hess, Thomas; Langlais, Mathieu; Fernández-Lopéz De Pablo, Javier; Mills, William; Posch, Caroline; Rimkus, Tomas; Shnayder, Svetlana; Stefański, Damian; Riede, Felix;handle: 10045/115296 , 11392/2472970
Wir berichten über den 2. Workshop im Rahmen des CLIOARCH-Projekts, der darauf abzielte, auf eine neue Synthese technokultureller Langzeitentwicklungen an der Pleistozän/Holozän-Grenze in Europa hinzuarbeiten. Wir reagieren damit auf den wachsenden Bedarf nach einem metaanalytischen Fundament für den Vergleich und die eventuelle Integration von heterogenen regionalen Datensätzen in der Archäologie des Spätpaläolithikums und frühesten Mesolithikums und betonen insbesondere die reichhaltigen Möglichkeiten, die kooperative Ansätze hierbei bieten. Wir schlagen vor, dass das Expert-Sourcing von vorgefilterten lithischen Informationen eine vielversprechende Grundlage zur Durchführung systematischer archäologischer MetaAnalysen ist und dass die Zusammenstellung, Untersuchung und Konservierung ähnlicher großräumiger Datensammlungen ein wichtiges Forschungsziel für die Zukunft sein könnte. We report on a virtual workshop aimed at advancing a new synthesis of techno-cultural patterns at the Pleistocene-Holocene boundary in Europe. We respond to the growing need of developing meta-analytical frameworks for comparing and eventually integrating disparate regional datasets and stress the opportunities of collaborative approaches. We propose that expert-sourced lithic data is a promising means of conducting systematic archaeological meta-analyses, and that the compilation and examination of similar continental-scale datasets may be an important research goal in the future. CLIOARCH is an ERC Consolidator Grant project and has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 817564).
Archivio istituziona... arrow_drop_down Recolector de Ciencia Abierta, RECOLECTAArticle . 2021Full-Text: https://doi.org/10.11588/ai.2020.1.81428Data sources: Recolector de Ciencia Abierta, RECOLECTAVirtual Library of Klaipeda UniversityArticle . 2020Data sources: Virtual Library of Klaipeda UniversityRepositorio Institucional de la Universidad de AlicanteArticle . 2021Data sources: Repositorio Institucional de la Universidad de AlicanteMémoires en Sciences de l'Information et de la CommunicationArticle . 2021Full-Text: https://hal.science/hal-03474590/documentadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.11588/ai.2020.1.81428&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euAccess RoutesGreen 0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert Archivio istituziona... arrow_drop_down Recolector de Ciencia Abierta, RECOLECTAArticle . 2021Full-Text: https://doi.org/10.11588/ai.2020.1.81428Data sources: Recolector de Ciencia Abierta, RECOLECTAVirtual Library of Klaipeda UniversityArticle . 2020Data sources: Virtual Library of Klaipeda UniversityRepositorio Institucional de la Universidad de AlicanteArticle . 2021Data sources: Repositorio Institucional de la Universidad de AlicanteMémoires en Sciences de l'Information et de la CommunicationArticle . 2021Full-Text: https://hal.science/hal-03474590/documentadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.11588/ai.2020.1.81428&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Article 2021 Australia, France EnglishPublisher:HAL CCSD Funded by:EC | SILVEREC| SILVERGentelli, Liesel; Blichert-Toft, Janne; Davis, Gillan; Gitler, Haim; Albarède, Francis;Hacksilber facilitated trade and transactions from the beginning of the second millennium BCE until the late fourth century BCE in the southern Levant. Here we demonstrate the use of new, data-driven statistical approaches to interpret high-precision Pb isotope analysis of silver found in archaeological contexts for provenance determination. We sampled 46 pieces of International audience
HAL-ENS-LYON; Mémoir... arrow_drop_down All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=sygma_______::8a13381584271a2634bbc9a413f4be35&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert HAL-ENS-LYON; Mémoir... arrow_drop_down All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=sygma_______::8a13381584271a2634bbc9a413f4be35&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Article , Other literature type 2021 France, Croatia, France English Funded by:EC | OPERAS-PEC| OPERAS-PNury, Elisa; Clivaz, Claire; Błaszczyńska, Marta; Kaiser, Michael; Morka, Agata; Schaefer, Valérie; Stojanovski, Jadranka; Tóth-Czifra, Erzsébet;Published in OA on RESSI (http://www.ressi.ch/) on the 15.02.22. We present here highlights from an enquiry on the innovations in scholarly writing in the Humanities and Social Sciences in the H2020 project OPERAS-P. This article explores the theme of Open Research Data and its role in the emergence of new models of scholarly writing. We examine more closely the obstacles and fostering conditions to the publication of research data, both from a social and a technical perspective. International audience
Serveur académique l... arrow_drop_down Croatian Scientific Bibliography - CROSBIArticle . 2021Data sources: Croatian Scientific Bibliography - CROSBIHAL Descartes; Mémoires en Sciences de l'Information et de la CommunicationArticle . 2022License: CC BY SAFull-Text: https://hal.science/hal-03214397/documentAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=57a035e5b1ae::98b5609c37c160836ed597ca42e5e1c3&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert Serveur académique l... arrow_drop_down Croatian Scientific Bibliography - CROSBIArticle . 2021Data sources: Croatian Scientific Bibliography - CROSBIHAL Descartes; Mémoires en Sciences de l'Information et de la CommunicationArticle . 2022License: CC BY SAFull-Text: https://hal.science/hal-03214397/documentAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=57a035e5b1ae::98b5609c37c160836ed597ca42e5e1c3&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Part of book or chapter of book 2021 France EnglishPublisher:HAL CCSD Funded by:EC | DHARMA, EC | NETAMILEC| DHARMA ,EC| NETAMILAuthors: Francis, Emmanuel;Francis, Emmanuel;International audience
HAL Descartes; Mémoi... arrow_drop_down HAL Descartes; Mémoires en Sciences de l'Information et de la CommunicationPart of book or chapter of book . 2021Full-Text: https://hal.science/hal-03185234/documentHyper Article en Ligne; Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotOther literature type . Part of book or chapter of book . 2021All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______166::73bbde8fc4c8d4c7528e23ddb573265f&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert HAL Descartes; Mémoi... arrow_drop_down HAL Descartes; Mémoires en Sciences de l'Information et de la CommunicationPart of book or chapter of book . 2021Full-Text: https://hal.science/hal-03185234/documentHyper Article en Ligne; Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotOther literature type . Part of book or chapter of book . 2021All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______166::73bbde8fc4c8d4c7528e23ddb573265f&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Article 2020 France EnglishPublisher:HAL CCSD Funded by:EC | LUBARTWORLDEC| LUBARTWORLDAuthors: ZALC, CLAIRE;ZALC, CLAIRE;doi: 10.1086/711477
AbstractDuring its first days of existence, the Vichy regime ordered a review of recent naturalizations. In accordance with a law passed on July 22, 1940, denaturalization decisions were made on a ...
add ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.1086/711477&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euAccess Routesbronze 2 citations 2 popularity Average influence Average impulse Average Powered by BIP!more_vert add ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.1086/711477&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Doctoral thesis 2020 France EnglishPublisher:HAL CCSD Funded by:EC | PARTHENOS, ANR | BASNUMEC| PARTHENOS ,ANR| BASNUMAuthors: Khemakhem, Mohamed;Khemakhem, Mohamed;Les dictionnaires peuvent être considérés comme le réservoir le plus compréhensible de connaissances humaines, qui contiennent non seulement la description lexicale des mots dans une ou plusieurs langues, mais aussi la conscience commune d’une certaine communauté sur chaque élément de connaissance connu dans une période de temps donnée. Les dictionnaires imprimés sont les principales ressources qui permettent la documentation et le transfert de ces connaissances. Ils existent déjà en grand nombre, et de nouveaux dictionnaires sont continuellement compilés. Cependant, la majorité de ces dictionnaires dans leur version numérique n’est toujours pas structurée en raison de l’absence de méthodes et de techniques évolutives pouvant couvrir le nombre du matériel croissant et sa variété. En outre, les ressources structurées existantes, relativement peu nombreuses, présentent des alternatives d’échange et de recherche limitées, en raison d’un sérieux manque de synchronisation entre leurs schémas de structure. Dans cette thèse, nous abordons la tâche d’analyse des informations lexicales dans les dictionnaires imprimés en construisant des modèles qui permettent leur structuration automatique. La résolution de cette tâche va de pair avec la recherche d’une sortie standardisée de ces modèles afin de garantir une interopérabilité maximale entre les ressources et une facilité d’utilisation pour les tâches en aval. Nous commençons par présenter différentes classifications des ressources dictionnaires pour délimiter les catégories des dictionnaires imprimés sur lesquelles ce travail se focalise. Ensuite, nous définissions la tâche d’analyse en fournissant un aperçu des défis de traitement et une étude de l’état de l’art. Nous présentons par la suite une nouvelle approche basée sur une analyse en cascade de l’information lexicale. Nous décrivons également l’architecture du système résultant, appelé GROBID-Dictionaries, et la méthodologie que nous avons suivie pour rapprocher la conception du système de son applicabilité aux scénarios du monde réel. Ensuite, nous prestons des normes clés pour les ressources lexicales structurées. En outre, nous fournissons une analyse de deux initiatives en cours, TEI-Lex-0 et LMF, qui visent à unifier la modélisation de l’information lexicale dans les dictionnaires imprimés et électroniques. Sur cette base, nous présentons un format de sérialisation conforme aux schémas des deux initiatives de normalisation et qui est assorti à l’approche développée dans notre système d’analyse lexicale. Après avoir présenté les facettes d’analyse et de sérialisation normalisées de nos modèles lexicaux, nous fournissons une étude empirique de leurs performances et de leurs comportements. L’étude est basée sur une configuration spécifique d’apprentissage automatique et sur une série d’expériences menées avec un ensemble sélectionné de dictionnaires variés. Dans cette étude, nous essayons de présenter différentes manières d’ingénierie des caractéristiques et de montrer les points forts et les limites des meilleurs modèles résultants. Nous consacrons également deux séries d’expériences pour explorer l’extensibilité de nos modèles en ce qui concerne les documents traités et la technique d’apprentissage automatique employée. Enfin, nous clôturons cette thèse en présentant les principales conclusions et en ouvrant de nouvelles perspectives pour l’extension de nos investigations dans un certain nombre de directions de recherche pour l’analyse des documents structurés en un ensemble d’entrées. Dictionaries could be considered as the most comprehensive reservoir of human knowledge, which carry not only the lexical description of words in one or more languages, but also the commun awareness of a certain community about every known piece of knowledge in a time frame. Print dictionaries are the principle resources which enable the documentation and transfer of such knowledge. They already exist in abundant numbers, while new ones are continuously compiled, even with the recent strong move to digital resources. However, a majority of these dictionaries, even when available digitally, is still not fully structured due to the absence of scalable methods and techniques that can cover the variety of corresponding material. Moreover, the relatively few existing structured resources present limited exchange and query alternatives, given the discrepancy of their data models and formats. In this thesis we address the task of parsing lexical information in print dictionaries through the design of computer models that enable their automatic structuring. Solving this task goes hand in hand with finding a standardised output for these models to guarantee a maximum interoperability among resources and usability for downstream tasks. First, we present different classifications of the dictionaric resources to delimit the category of print dictionaries we aim to process. Second, we introduce the parsing task by providing an overview of the processing challenges and a study of the state of the art. Then, we present a novel approach based on a top-down parsing of the lexical information. We also outline the architecture of the resulting system, called GROBID-Dictionaries, and the methodology we followed to close the gap between the conception of the system and its applicability to real-world scenarios. After that, we draw the landscape of the leading standards for structured lexical resources. In addition, we provide an analysis of two ongoing initiatives, TEI-Lex-0 and LMF, that aim at the unification of modelling the lexical information in print and electronic dictionaries. Based on that, we present a serialisation format that is inline with the schemes of the two standardisation initiatives and fits the approach implemented in our parsing system. After presenting the parsing and standardised serialisation facets of our lexical models, we provide an empirical study of their performance and behaviour. The investigation is based on a specific machine learning setup and series of experiments carried out with a selected pool of varied dictionaries. We try in this study to present different ways for feature engineering and exhibit the strength and the limits of the best resulting models. We also dedicate two series of experiments for exploring the scalability of our models with regard to the processed documents and the employed machine learning technique. Finally, we sum up this thesis by presenting the major conclusions and opening new perspectives for extending our investigations in a number of research directions for parsing entry-based documents.
INRIA a CCSD electro... arrow_drop_down INRIA a CCSD electronic archive serverDoctoral thesis . 2020Data sources: INRIA a CCSD electronic archive serverAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______212::de8c2bd7cb0470497f4d7a88f40ee3a6&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert INRIA a CCSD electro... arrow_drop_down INRIA a CCSD electronic archive serverDoctoral thesis . 2020Data sources: INRIA a CCSD electronic archive serverAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______212::de8c2bd7cb0470497f4d7a88f40ee3a6&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Report , Other literature type 2020 France EnglishPublisher:Zenodo Funded by:EC | TRIPLEEC| TRIPLEAuthors: POUYLLAU, St��phane; BUNEL, M��lanie; CAPELLI, Laurent; MINEL, Jean-Luc;POUYLLAU, St��phane; BUNEL, M��lanie; CAPELLI, Laurent; MINEL, Jean-Luc;This research report is a proposal to define a platform, called We, for the TRIPLE project. First, the We platform is situated in the ecosystem of academic tools. Then its functionalities are described. This document proposes a conceptual representation of an innovative platform based on the discovery of experts, topics and projects using the analysis, classification, linking and enrichment of data. It is not a proposal of a Human-Machine Interface (HMI) which is under the responsibility of WP3 and WP5 as well as the editorialization of the components, but there are ideas and may constitute some areas of work to be discussed. Also, the different diagrams aim at illustrating the functionalities and not the design of the screens. These suggestions include elements of the proposal (experts), new public (Python library) and technological advances (Deep Learning) which were not so advanced when the proposal has been elaborated.
ZENODO arrow_drop_down Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotReport . 2020License: CC BY NC NDHAL Paris Nanterre; HAL AMU; Mémoires en Sciences de l'Information et de la CommunicationReport . 2020License: CC BY NC NDadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.4032622&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euAccess RoutesGreen 0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 607visibility views 607 download downloads 270 Powered bymore_vert ZENODO arrow_drop_down Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotReport . 2020License: CC BY NC NDHAL Paris Nanterre; HAL AMU; Mémoires en Sciences de l'Information et de la CommunicationReport . 2020License: CC BY NC NDadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.4032622&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu
Loading
description Publicationkeyboard_double_arrow_right Article 2021 France EnglishPublisher:HAL CCSD Funded by:EC | TibArmyEC| TibArmyAuthors: Travers, Alice; Venturi, Federica;Travers, Alice; Venturi, Federica;International audience
Annali di Ca’ Foscar... arrow_drop_down HAL Descartes; Mémoires en Sciences de l'Information et de la CommunicationArticle . 2021License: CC BYFull-Text: https://hal.science/hal-03512896/documentAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______177::63b9af45a81b7c1a555b8fc51a851442&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euAccess Routesgold 0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert Annali di Ca’ Foscar... arrow_drop_down HAL Descartes; Mémoires en Sciences de l'Information et de la CommunicationArticle . 2021License: CC BYFull-Text: https://hal.science/hal-03512896/documentAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______177::63b9af45a81b7c1a555b8fc51a851442&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Conference object , Article , Other literature type 2021 France EnglishPublisher:Zenodo Funded by:EC | ELEXISEC| ELEXISAhmadi, Sina; Constant, Mathieu; Fort, Karën; Guillaume, Bruno; McCrae, John,;Nous présentons dans ce papier les travaux que nous avons réalisés pour convertir dans le modèle Ontolex-Lemon l'une des plus importantes ressources lexicographiques pour le français : le Trésor de la Langue Française. En effet, malgré l'utilisation généralisée de cette ressource, son format actuel, basé sur XML, ne respecte pas les standards les plus récents de la représentation des données lexicographiques, notamment ceux basés sur les données liées. Nos travaux mettent en lumière la nécessité d'établir des mécanismes permettant d'augmenter l'inter-opérabilité des ressources et des technologies pour créer et maintenir des ressources lexicographiques. In this paper, we report our efforts to convert one of the most comprehensive lexicographic resources of French, the Trésor de la Langue Française, into the Ontolex-Lemon model. Despite the widespread usage of this resource, the original XML format seems to impede its integration in language technology tools. In order to breathe new life into this resource, we examine the usage and the conversion to more interoperable formats, primarily those based on the linguistic linked data, to provide this resource to a broader range of applications and users. National audience
HAL-Rennes 1; INRIA ... arrow_drop_down Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotConference object . 2021Full-Text: https://hal.inria.fr/hal-03463294/documentadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.5772045&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euAccess RoutesGreen 0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 36visibility views 36 download downloads 29 Powered bymore_vert HAL-Rennes 1; INRIA ... arrow_drop_down Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotConference object . 2021Full-Text: https://hal.inria.fr/hal-03463294/documentadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.5772045&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Report , Other literature type 2021 Croatia, France EnglishPublisher:Zenodo Funded by:EC | OPERAS-PEC| OPERAS-PMaryl, Maciej; Błaszczyńska, Marta; Zalotyńska, Agnieszka; Taylor, Laurence; Avanço, Karla; Balula, Ana; Buchner, Anna; Caliman, Lorena; Clivaz, Claire; Costa, Carlos; Franczak, Mateusz; Gatti, Rupert; Giglia, Elena; Gingold, Arnaud; Jarmelo, Susana; Padez, Maria,; Leão, Delfim; Melinščak Zlodi, Iva; Mojsak, Kajetan; Morka, Agata; Mosterd, Tom; Nury, Elisa; Plag, Cornelia; Schafer, Valérie; Silva, Mickael; Stojanovski, Jadranka; Szleszyński, Bartłomiej; Szulińska, Agnieszka; Tóth-Czifra, Erzsébet; Wciślik, Piotr; Wieneke, Lars;This report discusses the scholarly communication issues in Social Sciences and Humanities that are relevant to the future development and functioning of OPERAS. The outcomes collected here can be divided into two groups of innovations regarding 1) the operation of OPERAS, and 2) its activities. The “operational” issues include the ways in which an innovative research infrastructure should be governed (Chapter 1) as well as the business models for open access publications in Social Sciences and Humanities (Chapter 2). The other group of issues is dedicated to strategic areas where OPERAS and its services may play an instrumental role in providing, enabling, or unlocking innovation: FAIR data (Chapter 3), bibliodiversity and multilingualism in scholarly communication (Chapter 4), the future of scholarly writing (Chapter 5), and quality assessment (Chapter 6). Each chapter provides an overview of the main findings and challenges with emphasis on recommendations for OPERAS and other stakeholders like e- infrastructures, publishers, SSH researchers, research performing organisations, policy makers, and funders. Links to data and further publications stemming from work concerning particular tasks are located at the end of each chapter.
Croatian Scientific ... arrow_drop_down Croatian Scientific Bibliography - CROSBIOther literature type . 2021Data sources: Croatian Scientific Bibliography - CROSBIMémoires en Sciences de l'Information et de la Communication; HAL AMUReport . 2021License: CC BYFull-Text: https://hal.science/hal-03277615/documentHyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotReport . 2021License: CC BYadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.5017705&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euAccess RoutesGreen 1 citations 1 popularity Average influence Average impulse Average Powered by BIP!visibility 4Kvisibility views 4,435 download downloads 2,726 Powered bymore_vert Croatian Scientific ... arrow_drop_down Croatian Scientific Bibliography - CROSBIOther literature type . 2021Data sources: Croatian Scientific Bibliography - CROSBIMémoires en Sciences de l'Information et de la Communication; HAL AMUReport . 2021License: CC BYFull-Text: https://hal.science/hal-03277615/documentHyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotReport . 2021License: CC BYadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.5017705&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Article 2021 Italy, France, Lithuania, Spain EnglishPublisher:Archäologische Informationen Funded by:EC | CLIOARCHEC| CLIOARCHNaudinot, Nicolas; Hussain, Shumon,; Matzig, David,; Fontana, Federica; Groß, Daniel; Hess, Thomas; Langlais, Mathieu; Fernández-Lopéz De Pablo, Javier; Mills, William; Posch, Caroline; Rimkus, Tomas; Shnayder, Svetlana; Stefański, Damian; Riede, Felix;handle: 10045/115296 , 11392/2472970
Wir berichten über den 2. Workshop im Rahmen des CLIOARCH-Projekts, der darauf abzielte, auf eine neue Synthese technokultureller Langzeitentwicklungen an der Pleistozän/Holozän-Grenze in Europa hinzuarbeiten. Wir reagieren damit auf den wachsenden Bedarf nach einem metaanalytischen Fundament für den Vergleich und die eventuelle Integration von heterogenen regionalen Datensätzen in der Archäologie des Spätpaläolithikums und frühesten Mesolithikums und betonen insbesondere die reichhaltigen Möglichkeiten, die kooperative Ansätze hierbei bieten. Wir schlagen vor, dass das Expert-Sourcing von vorgefilterten lithischen Informationen eine vielversprechende Grundlage zur Durchführung systematischer archäologischer MetaAnalysen ist und dass die Zusammenstellung, Untersuchung und Konservierung ähnlicher großräumiger Datensammlungen ein wichtiges Forschungsziel für die Zukunft sein könnte. We report on a virtual workshop aimed at advancing a new synthesis of techno-cultural patterns at the Pleistocene-Holocene boundary in Europe. We respond to the growing need of developing meta-analytical frameworks for comparing and eventually integrating disparate regional datasets and stress the opportunities of collaborative approaches. We propose that expert-sourced lithic data is a promising means of conducting systematic archaeological meta-analyses, and that the compilation and examination of similar continental-scale datasets may be an important research goal in the future. CLIOARCH is an ERC Consolidator Grant project and has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 817564).
Archivio istituziona... arrow_drop_down Recolector de Ciencia Abierta, RECOLECTAArticle . 2021Full-Text: https://doi.org/10.11588/ai.2020.1.81428Data sources: Recolector de Ciencia Abierta, RECOLECTAVirtual Library of Klaipeda UniversityArticle . 2020Data sources: Virtual Library of Klaipeda UniversityRepositorio Institucional de la Universidad de AlicanteArticle . 2021Data sources: Repositorio Institucional de la Universidad de AlicanteMémoires en Sciences de l'Information et de la CommunicationArticle . 2021Full-Text: https://hal.science/hal-03474590/documentadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.11588/ai.2020.1.81428&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euAccess RoutesGreen 0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert Archivio istituziona... arrow_drop_down Recolector de Ciencia Abierta, RECOLECTAArticle . 2021Full-Text: https://doi.org/10.11588/ai.2020.1.81428Data sources: Recolector de Ciencia Abierta, RECOLECTAVirtual Library of Klaipeda UniversityArticle . 2020Data sources: Virtual Library of Klaipeda UniversityRepositorio Institucional de la Universidad de AlicanteArticle . 2021Data sources: Repositorio Institucional de la Universidad de AlicanteMémoires en Sciences de l'Information et de la CommunicationArticle . 2021Full-Text: https://hal.science/hal-03474590/documentadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.11588/ai.2020.1.81428&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Article 2021 Australia, France EnglishPublisher:HAL CCSD Funded by:EC | SILVEREC| SILVERGentelli, Liesel; Blichert-Toft, Janne; Davis, Gillan; Gitler, Haim; Albarède, Francis;Hacksilber facilitated trade and transactions from the beginning of the second millennium BCE until the late fourth century BCE in the southern Levant. Here we demonstrate the use of new, data-driven statistical approaches to interpret high-precision Pb isotope analysis of silver found in archaeological contexts for provenance determination. We sampled 46 pieces of International audience
HAL-ENS-LYON; Mémoir... arrow_drop_down All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=sygma_______::8a13381584271a2634bbc9a413f4be35&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert HAL-ENS-LYON; Mémoir... arrow_drop_down All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=sygma_______::8a13381584271a2634bbc9a413f4be35&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Article , Other literature type 2021 France, Croatia, France English Funded by:EC | OPERAS-PEC| OPERAS-PNury, Elisa; Clivaz, Claire; Błaszczyńska, Marta; Kaiser, Michael; Morka, Agata; Schaefer, Valérie; Stojanovski, Jadranka; Tóth-Czifra, Erzsébet;Published in OA on RESSI (http://www.ressi.ch/) on the 15.02.22. We present here highlights from an enquiry on the innovations in scholarly writing in the Humanities and Social Sciences in the H2020 project OPERAS-P. This article explores the theme of Open Research Data and its role in the emergence of new models of scholarly writing. We examine more closely the obstacles and fostering conditions to the publication of research data, both from a social and a technical perspective. International audience
Serveur académique l... arrow_drop_down Croatian Scientific Bibliography - CROSBIArticle . 2021Data sources: Croatian Scientific Bibliography - CROSBIHAL Descartes; Mémoires en Sciences de l'Information et de la CommunicationArticle . 2022License: CC BY SAFull-Text: https://hal.science/hal-03214397/documentAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=57a035e5b1ae::98b5609c37c160836ed597ca42e5e1c3&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert Serveur académique l... arrow_drop_down Croatian Scientific Bibliography - CROSBIArticle . 2021Data sources: Croatian Scientific Bibliography - CROSBIHAL Descartes; Mémoires en Sciences de l'Information et de la CommunicationArticle . 2022License: CC BY SAFull-Text: https://hal.science/hal-03214397/documentAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=57a035e5b1ae::98b5609c37c160836ed597ca42e5e1c3&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Part of book or chapter of book 2021 France EnglishPublisher:HAL CCSD Funded by:EC | DHARMA, EC | NETAMILEC| DHARMA ,EC| NETAMILAuthors: Francis, Emmanuel;Francis, Emmanuel;International audience
HAL Descartes; Mémoi... arrow_drop_down HAL Descartes; Mémoires en Sciences de l'Information et de la CommunicationPart of book or chapter of book . 2021Full-Text: https://hal.science/hal-03185234/documentHyper Article en Ligne; Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotOther literature type . Part of book or chapter of book . 2021All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______166::73bbde8fc4c8d4c7528e23ddb573265f&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert HAL Descartes; Mémoi... arrow_drop_down HAL Descartes; Mémoires en Sciences de l'Information et de la CommunicationPart of book or chapter of book . 2021Full-Text: https://hal.science/hal-03185234/documentHyper Article en Ligne; Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotOther literature type . Part of book or chapter of book . 2021All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______166::73bbde8fc4c8d4c7528e23ddb573265f&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Article 2020 France EnglishPublisher:HAL CCSD Funded by:EC | LUBARTWORLDEC| LUBARTWORLDAuthors: ZALC, CLAIRE;ZALC, CLAIRE;doi: 10.1086/711477
AbstractDuring its first days of existence, the Vichy regime ordered a review of recent naturalizations. In accordance with a law passed on July 22, 1940, denaturalization decisions were made on a ...
add ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.1086/711477&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euAccess Routesbronze 2 citations 2 popularity Average influence Average impulse Average Powered by BIP!more_vert add ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.1086/711477&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Doctoral thesis 2020 France EnglishPublisher:HAL CCSD Funded by:EC | PARTHENOS, ANR | BASNUMEC| PARTHENOS ,ANR| BASNUMAuthors: Khemakhem, Mohamed;Khemakhem, Mohamed;Les dictionnaires peuvent être considérés comme le réservoir le plus compréhensible de connaissances humaines, qui contiennent non seulement la description lexicale des mots dans une ou plusieurs langues, mais aussi la conscience commune d’une certaine communauté sur chaque élément de connaissance connu dans une période de temps donnée. Les dictionnaires imprimés sont les principales ressources qui permettent la documentation et le transfert de ces connaissances. Ils existent déjà en grand nombre, et de nouveaux dictionnaires sont continuellement compilés. Cependant, la majorité de ces dictionnaires dans leur version numérique n’est toujours pas structurée en raison de l’absence de méthodes et de techniques évolutives pouvant couvrir le nombre du matériel croissant et sa variété. En outre, les ressources structurées existantes, relativement peu nombreuses, présentent des alternatives d’échange et de recherche limitées, en raison d’un sérieux manque de synchronisation entre leurs schémas de structure. Dans cette thèse, nous abordons la tâche d’analyse des informations lexicales dans les dictionnaires imprimés en construisant des modèles qui permettent leur structuration automatique. La résolution de cette tâche va de pair avec la recherche d’une sortie standardisée de ces modèles afin de garantir une interopérabilité maximale entre les ressources et une facilité d’utilisation pour les tâches en aval. Nous commençons par présenter différentes classifications des ressources dictionnaires pour délimiter les catégories des dictionnaires imprimés sur lesquelles ce travail se focalise. Ensuite, nous définissions la tâche d’analyse en fournissant un aperçu des défis de traitement et une étude de l’état de l’art. Nous présentons par la suite une nouvelle approche basée sur une analyse en cascade de l’information lexicale. Nous décrivons également l’architecture du système résultant, appelé GROBID-Dictionaries, et la méthodologie que nous avons suivie pour rapprocher la conception du système de son applicabilité aux scénarios du monde réel. Ensuite, nous prestons des normes clés pour les ressources lexicales structurées. En outre, nous fournissons une analyse de deux initiatives en cours, TEI-Lex-0 et LMF, qui visent à unifier la modélisation de l’information lexicale dans les dictionnaires imprimés et électroniques. Sur cette base, nous présentons un format de sérialisation conforme aux schémas des deux initiatives de normalisation et qui est assorti à l’approche développée dans notre système d’analyse lexicale. Après avoir présenté les facettes d’analyse et de sérialisation normalisées de nos modèles lexicaux, nous fournissons une étude empirique de leurs performances et de leurs comportements. L’étude est basée sur une configuration spécifique d’apprentissage automatique et sur une série d’expériences menées avec un ensemble sélectionné de dictionnaires variés. Dans cette étude, nous essayons de présenter différentes manières d’ingénierie des caractéristiques et de montrer les points forts et les limites des meilleurs modèles résultants. Nous consacrons également deux séries d’expériences pour explorer l’extensibilité de nos modèles en ce qui concerne les documents traités et la technique d’apprentissage automatique employée. Enfin, nous clôturons cette thèse en présentant les principales conclusions et en ouvrant de nouvelles perspectives pour l’extension de nos investigations dans un certain nombre de directions de recherche pour l’analyse des documents structurés en un ensemble d’entrées. Dictionaries could be considered as the most comprehensive reservoir of human knowledge, which carry not only the lexical description of words in one or more languages, but also the commun awareness of a certain community about every known piece of knowledge in a time frame. Print dictionaries are the principle resources which enable the documentation and transfer of such knowledge. They already exist in abundant numbers, while new ones are continuously compiled, even with the recent strong move to digital resources. However, a majority of these dictionaries, even when available digitally, is still not fully structured due to the absence of scalable methods and techniques that can cover the variety of corresponding material. Moreover, the relatively few existing structured resources present limited exchange and query alternatives, given the discrepancy of their data models and formats. In this thesis we address the task of parsing lexical information in print dictionaries through the design of computer models that enable their automatic structuring. Solving this task goes hand in hand with finding a standardised output for these models to guarantee a maximum interoperability among resources and usability for downstream tasks. First, we present different classifications of the dictionaric resources to delimit the category of print dictionaries we aim to process. Second, we introduce the parsing task by providing an overview of the processing challenges and a study of the state of the art. Then, we present a novel approach based on a top-down parsing of the lexical information. We also outline the architecture of the resulting system, called GROBID-Dictionaries, and the methodology we followed to close the gap between the conception of the system and its applicability to real-world scenarios. After that, we draw the landscape of the leading standards for structured lexical resources. In addition, we provide an analysis of two ongoing initiatives, TEI-Lex-0 and LMF, that aim at the unification of modelling the lexical information in print and electronic dictionaries. Based on that, we present a serialisation format that is inline with the schemes of the two standardisation initiatives and fits the approach implemented in our parsing system. After presenting the parsing and standardised serialisation facets of our lexical models, we provide an empirical study of their performance and behaviour. The investigation is based on a specific machine learning setup and series of experiments carried out with a selected pool of varied dictionaries. We try in this study to present different ways for feature engineering and exhibit the strength and the limits of the best resulting models. We also dedicate two series of experiments for exploring the scalability of our models with regard to the processed documents and the employed machine learning technique. Finally, we sum up this thesis by presenting the major conclusions and opening new perspectives for extending our investigations in a number of research directions for parsing entry-based documents.
INRIA a CCSD electro... arrow_drop_down INRIA a CCSD electronic archive serverDoctoral thesis . 2020Data sources: INRIA a CCSD electronic archive serverAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______212::de8c2bd7cb0470497f4d7a88f40ee3a6&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert INRIA a CCSD electro... arrow_drop_down INRIA a CCSD electronic archive serverDoctoral thesis . 2020Data sources: INRIA a CCSD electronic archive serverAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od_______212::de8c2bd7cb0470497f4d7a88f40ee3a6&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eudescription Publicationkeyboard_double_arrow_right Report , Other literature type 2020 France EnglishPublisher:Zenodo Funded by:EC | TRIPLEEC| TRIPLEAuthors: POUYLLAU, St��phane; BUNEL, M��lanie; CAPELLI, Laurent; MINEL, Jean-Luc;POUYLLAU, St��phane; BUNEL, M��lanie; CAPELLI, Laurent; MINEL, Jean-Luc;This research report is a proposal to define a platform, called We, for the TRIPLE project. First, the We platform is situated in the ecosystem of academic tools. Then its functionalities are described. This document proposes a conceptual representation of an innovative platform based on the discovery of experts, topics and projects using the analysis, classification, linking and enrichment of data. It is not a proposal of a Human-Machine Interface (HMI) which is under the responsibility of WP3 and WP5 as well as the editorialization of the components, but there are ideas and may constitute some areas of work to be discussed. Also, the different diagrams aim at illustrating the functionalities and not the design of the screens. These suggestions include elements of the proposal (experts), new public (Python library) and technological advances (Deep Learning) which were not so advanced when the proposal has been elaborated.
ZENODO arrow_drop_down Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotReport . 2020License: CC BY NC NDHAL Paris Nanterre; HAL AMU; Mémoires en Sciences de l'Information et de la CommunicationReport . 2020License: CC BY NC NDadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.4032622&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euAccess RoutesGreen 0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 607visibility views 607 download downloads 270 Powered bymore_vert ZENODO arrow_drop_down Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-DiderotReport . 2020License: CC BY NC NDHAL Paris Nanterre; HAL AMU; Mémoires en Sciences de l'Information et de la CommunicationReport . 2020License: CC BY NC NDadd ClaimPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.All Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=10.5281/zenodo.4032622&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu