- home
- Advanced Search
Filters
Clear All- Digital Humanities and Cultural Heritage
- 2013-2022
- Open Access
- Research data
- Research software
- Other research products
- AR
- English
- Digital Humanities and Cultural Heritage
- 2013-2022
- Open Access
- Research data
- Research software
- Other research products
- AR
- English
Loading
apps Other research product2019 Argentina EnglishAuthors: Xamena, Eduardo; Marmanillo, Walter Gabriel; Mechaca, Ana Lidia;Xamena, Eduardo; Marmanillo, Walter Gabriel; Mechaca, Ana Lidia;Large amounts of ancient documents have become available in the last years, regarding Argentinian history. This fact turns possible to find interesting and useful aggregated information. This work proposes the application of Natural Language Processing, Text Mining and Visualization tools over Argentinian ancient document repositories. Conceptual maps and entity networks make up the first target of this preliminary paper. The first step is the normalization of OCR acquired books of General G¨uemes. Exploratory analyses reveal the presence of manifold spelling errors, due to the OCR acquisition process of the volumes. We propose smart automatic ways for overcoming this issue in the process of normalization. Besides, a first topic landscape of a subset of volumes is obtained and analysed, via Topic Modelling tools. Sociedad Argentina de Informática e Investigación Operativa
Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2019Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::349107fb928f5390b92717973d40014e&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 37visibility views 37 download downloads 43 Powered bymore_vert Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2019Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::349107fb928f5390b92717973d40014e&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research productkeyboard_double_arrow_right Other ORP type 2017 Argentina EnglishAuthors: Grill, Pablo; Claassen, Mathias; Rosá, Aiala; Correa, Hernán;Grill, Pablo; Claassen, Mathias; Rosá, Aiala; Correa, Hernán;This paper presents a series of semi-supervised learning algorithms which were designed to classify words or expressions with temporal meanings. The algorithms use a set of pre-tagged temporal expressions and a set of semantic classes which were defined within a research project on the lexical coding of temporal meaning in Spanish. The algorithms in this article are mostly based on word embeddings, but they also make use of other methods. The results obtained strongly depend on the temporal classes considered, but, for some classes, results have reached 90% precision or above. Sociedad Argentina de Informática e Investigación Operativa
Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2017Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::84f0748e1ab8e19193b77698e9ade791&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 11visibility views 11 download downloads 23 Powered bymore_vert Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2017Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::84f0748e1ab8e19193b77698e9ade791&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research productkeyboard_double_arrow_right Other ORP type 2016 Argentina EnglishAuthors: Argerich, Luis; Cano, Matías J.; Torre Zaffaroni, Joaquín;Argerich, Luis; Cano, Matías J.; Torre Zaffaroni, Joaquín;In this paper we propose the application of feature hashing to create word embeddings for natural language processing. Feature hashing has been used successfully to create document vectors in related tasks like document classification. In this work we show that feature hashing can be applied to obtain word embeddings in linear time with the size of the data. The results show that this algorithm, that does not need training, is able to capture the semantic meaning of words.We compare the results against GloVe showing that they are similar. As far as we know this is the first application of feature hashing to the word embeddings problem and the results indicate this is a scalable technique with practical results for NLP applications. Sociedad Argentina de Informática e Investigación Operativa (SADIO)
Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2016Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::b6a750cf0868f0a19fa50351c730f57c&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 8visibility views 8 download downloads 347 Powered bymore_vert Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2016Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::b6a750cf0868f0a19fa50351c730f57c&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research product2013 Argentina EnglishAuthors: Cardellino, Cristian; Alonso i Alemany, Laura;Cardellino, Cristian; Alonso i Alemany, Laura;We present SuFLexQA, a system for Question Answering that integrates deep linguistic information from verbal lexica into Quepy, a generic framework for translating natural language questions into a query language. We are participating in the QALD-3 contest to assess the main achievements and shortcomings of the system. Sociedad Argentina de Informática e Investigación Operativa
Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2013Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::37dbf345e828a7d9be1c60d78bd82a47&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 13visibility views 13 download downloads 6 Powered bymore_vert Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2013Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::37dbf345e828a7d9be1c60d78bd82a47&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research product2013 Argentina EnglishAuthors: Carrillo, Facundo; Cecchi, Guillermo; Sigman, Mariano; Fernández Slezak, Diego;Carrillo, Facundo; Cecchi, Guillermo; Sigman, Mariano; Fernández Slezak, Diego;Latent Semantic Analysis is a natural language processing tools that allows estimating semantic distance between terms. The success of LSA is mainly based on the training corpus choice, which have been studied principally in English. This study focuses on studying LSA with regional Spanish corpus and evaluate the performance by identifying synonyms. We found that performance was slightly better than chance, concordantly with previous results. Standard LSA method cannot dynamically increase the training corpus. By using classifiers we combined multiple LSA models and showed that the use of automatic classifiers increase the performance. Sociedad Argentina de Informática e Investigación Operativa
Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2013Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::c6017db5301060978c8ea27ca7c7fefb&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 17visibility views 17 download downloads 19 Powered bymore_vert Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2013Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::c6017db5301060978c8ea27ca7c7fefb&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu
Loading
apps Other research product2019 Argentina EnglishAuthors: Xamena, Eduardo; Marmanillo, Walter Gabriel; Mechaca, Ana Lidia;Xamena, Eduardo; Marmanillo, Walter Gabriel; Mechaca, Ana Lidia;Large amounts of ancient documents have become available in the last years, regarding Argentinian history. This fact turns possible to find interesting and useful aggregated information. This work proposes the application of Natural Language Processing, Text Mining and Visualization tools over Argentinian ancient document repositories. Conceptual maps and entity networks make up the first target of this preliminary paper. The first step is the normalization of OCR acquired books of General G¨uemes. Exploratory analyses reveal the presence of manifold spelling errors, due to the OCR acquisition process of the volumes. We propose smart automatic ways for overcoming this issue in the process of normalization. Besides, a first topic landscape of a subset of volumes is obtained and analysed, via Topic Modelling tools. Sociedad Argentina de Informática e Investigación Operativa
Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2019Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::349107fb928f5390b92717973d40014e&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 37visibility views 37 download downloads 43 Powered bymore_vert Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2019Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::349107fb928f5390b92717973d40014e&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research productkeyboard_double_arrow_right Other ORP type 2017 Argentina EnglishAuthors: Grill, Pablo; Claassen, Mathias; Rosá, Aiala; Correa, Hernán;Grill, Pablo; Claassen, Mathias; Rosá, Aiala; Correa, Hernán;This paper presents a series of semi-supervised learning algorithms which were designed to classify words or expressions with temporal meanings. The algorithms use a set of pre-tagged temporal expressions and a set of semantic classes which were defined within a research project on the lexical coding of temporal meaning in Spanish. The algorithms in this article are mostly based on word embeddings, but they also make use of other methods. The results obtained strongly depend on the temporal classes considered, but, for some classes, results have reached 90% precision or above. Sociedad Argentina de Informática e Investigación Operativa
Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2017Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::84f0748e1ab8e19193b77698e9ade791&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 11visibility views 11 download downloads 23 Powered bymore_vert Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2017Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::84f0748e1ab8e19193b77698e9ade791&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research productkeyboard_double_arrow_right Other ORP type 2016 Argentina EnglishAuthors: Argerich, Luis; Cano, Matías J.; Torre Zaffaroni, Joaquín;Argerich, Luis; Cano, Matías J.; Torre Zaffaroni, Joaquín;In this paper we propose the application of feature hashing to create word embeddings for natural language processing. Feature hashing has been used successfully to create document vectors in related tasks like document classification. In this work we show that feature hashing can be applied to obtain word embeddings in linear time with the size of the data. The results show that this algorithm, that does not need training, is able to capture the semantic meaning of words.We compare the results against GloVe showing that they are similar. As far as we know this is the first application of feature hashing to the word embeddings problem and the results indicate this is a scalable technique with practical results for NLP applications. Sociedad Argentina de Informática e Investigación Operativa (SADIO)
Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2016Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::b6a750cf0868f0a19fa50351c730f57c&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 8visibility views 8 download downloads 347 Powered bymore_vert Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2016Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::b6a750cf0868f0a19fa50351c730f57c&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research product2013 Argentina EnglishAuthors: Cardellino, Cristian; Alonso i Alemany, Laura;Cardellino, Cristian; Alonso i Alemany, Laura;We present SuFLexQA, a system for Question Answering that integrates deep linguistic information from verbal lexica into Quepy, a generic framework for translating natural language questions into a query language. We are participating in the QALD-3 contest to assess the main achievements and shortcomings of the system. Sociedad Argentina de Informática e Investigación Operativa
Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2013Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::37dbf345e828a7d9be1c60d78bd82a47&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 13visibility views 13 download downloads 6 Powered bymore_vert Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2013Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::37dbf345e828a7d9be1c60d78bd82a47&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.euapps Other research product2013 Argentina EnglishAuthors: Carrillo, Facundo; Cecchi, Guillermo; Sigman, Mariano; Fernández Slezak, Diego;Carrillo, Facundo; Cecchi, Guillermo; Sigman, Mariano; Fernández Slezak, Diego;Latent Semantic Analysis is a natural language processing tools that allows estimating semantic distance between terms. The success of LSA is mainly based on the training corpus choice, which have been studied principally in English. This study focuses on studying LSA with regional Spanish corpus and evaluate the performance by identifying synonyms. We found that performance was slightly better than chance, concordantly with previous results. Standard LSA method cannot dynamically increase the training corpus. By using classifiers we combined multiple LSA models and showed that the use of automatic classifiers increase the performance. Sociedad Argentina de Informática e Investigación Operativa
Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2013Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::c6017db5301060978c8ea27ca7c7fefb&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!visibility 17visibility views 17 download downloads 19 Powered bymore_vert Servicio de Difusión... arrow_drop_down Servicio de Difusión de la Creación IntelectualOther ORP type . 2013Data sources: Servicio de Difusión de la Creación IntelectualAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______1329::c6017db5301060978c8ea27ca7c7fefb&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu