Advanced search in Research products
The following results are related to Digital Humanities and Cultural Heritage. Are you interested to view more results? Visit OpenAIRE - Explore.
49 Research products, page 1 of 5

  • Digital Humanities and Cultural Heritage
  • Publications
  • Research data
  • Other research products
  • Article
  • European Commission
  • EU
  • OpenAIRE

  • Open Access Spanish; Castilian
    Authors: 
    Margarita Serna Vallejo;
    Publisher: Ediciones Universidad de Valladolid
    Country: Spain
    Project: EC | RESISTANCE (778076)

    Since the end of the Late Middle Ages and throughout the Modern Era, some of the fishermen's guilds (cofradías) established in the corregimiento of the Cuatro Villas de la Costa managed to get the Monarchy to recognize their privilege of exercising a maritime jurisdiction within each guild. The establishment of these jurisdictions displeased other institutions, which saw their own jurisdictional powers diminished. This situation gave rise to various conflicts in which the brotherhoods had to fight to preserve their maritime jurisdiction. This work was carried out within the framework of the research project Culturas urbanas en la España Moderna: policía, gobernanza e imaginarios (siglos XVI-XIX), reference HAR2015-64014-C3-1-R, funded by the Ministerio de Economía y Competitividad, and of the European project Rebellion and Resistance in the Iberian Empires, 16th-19th Centuries, which has received funding from the European Union's Horizon 2020 research and innovation programme under Marie Skłodowska-Curie grant agreement No 778076.

  • Publication . Article . Conference object . Preprint . 2016 . Embargo End Date: 01 Jan 2016
    Open Access
    Authors: 
    Guntis Barzdins; Didzis Gosko;
    Publisher: arXiv
    Project: EC | SUMMA (688139)

    Two extensions to the AMR smatch scoring script are presented. The first extension combines the smatch scoring script with the C6.0 rule-based classifier to produce a human-readable report on the error patterns frequency observed in the scored AMR graphs. This first extension results in 4% gain over the state-of-art CAMR baseline parser by adding to it a manually crafted wrapper fixing the identified CAMR parser errors. The second extension combines a per-sentence smatch with an ensemble method for selecting the best AMR graph among the set of AMR graphs for the same sentence. This second modification automatically yields further 0.4% gain when applied to outputs of two nondeterministic AMR parsers: a CAMR+wrapper parser and a novel character-level neural translation AMR parser. For AMR parsing task the character-level neural translation attains surprising 7% gain over the carefully optimized word-level neural translation. Overall, we achieve smatch F1=62% on the SemEval-2016 official scoring set and F1=67% on the LDC2015E86 test set. Comment: NAACL HLT 2016, SemEval-2016 Task 8 submission

  • Open Access
    Authors: 
    Najafabadipour, Marjan; Zanin, Massimiliano; Rodríguez-González, Alejandro; Torrente, Maria; Nuñez García, Beatriz; Cruz Bermudez, Juan Luis; Provencio, Mariano; Menasalvas, Ernestina;
    Publisher: Zenodo
    Project: EC | IASIS (727658)

    The automatic extraction of a patient’s natural history from Electronic Health Records (EHRs) is a critical step towards building intelligent systems that can reason about clinical variables and support decision making. Although EHRs contain a large amount of valuable information about the patient’s medical care, this information can only be fully understood when analyzed in a temporal context. Any intelligent system should then be able to extract medical concepts, date expressions, temporal relations and the temporal ordering of medical events from the free texts of EHRs; yet, this task is hard to tackle, due to the domain specific nature of EHRs, writing quality and lack of structure of these texts, and more generally the presence of redundant information. In this paper, we introduce a new Natural Language Processing (NLP) framework, capable of extracting the aforementioned elements from EHRs written in Spanish using rule-based methods. We focus on building medical timelines, which include disease diagnosis and its progression over time. By using a large dataset of EHRs comprising information about patients suffering from lung cancer, we show that our framework has an adequate level of performance by correctly building the timeline for 843 patients from a pool of 989 patients, achieving a correct result in 85% of instances.

  • Publication . Contribution for newspaper or weekly magazine . Conference object . Article . Preprint . 2016
    Open Access
    Authors: 
    Ahmed Ali; Najim Dehak; Patrick Cardinal; Sameer Khurana; Sree Harsha Yella; James Glass; Peter Bell; Steve Renals;
    Publisher: ISCA
    Country: United Kingdom
    Project: EC | SUMMA (688139)

    In this paper, we investigate different approaches for dialect identification in Arabic broadcast speech. These methods are based on phonetic and lexical features obtained from a speech recognition system, and bottleneck features using the i-vector framework. We studied both generative and discriminative classifiers, and we combined these features using a multi-class Support Vector Machine (SVM). We validated our results on an Arabic/English language identification task, with an accuracy of 100%. We also evaluated these features in a binary classifier to discriminate between Modern Standard Arabic (MSA) and Dialectal Arabic, with an accuracy of 100%. We further reported results using the proposed methods to discriminate between the five most widely used dialects of Arabic: namely Egyptian, Gulf, Levantine, North African, and MSA, with an accuracy of 59.2%. We discuss dialect identification errors in the context of dialect code-switching between Dialectal Arabic and MSA, and compare the error pattern between manually labeled data, and the output from our classifier. All the data used on our experiments have been released to the public as a language identification corpus.

  • Publication . Preprint . Conference object . Contribution for newspaper or weekly magazine . Article . 2016
    Open Access English
    Authors: 
    Marcin Junczys-Dowmunt; Tomasz Dwojak; Rico Sennrich;
    Country: United Kingdom
    Project: EC | SUMMA (688139), EC | TraMOOC (644333)

    This paper describes the AMU-UEDIN submissions to the WMT 2016 shared task on news translation. We explore methods of decode-time integration of attention-based neural translation models with phrase-based statistical machine translation. Efficient batch-algorithms for GPU-querying are proposed and implemented. For English-Russian, our system stays behind the state-of-the-art pure neural models in terms of BLEU. Among restricted systems, manual evaluation places it in the first cluster tied with the pure neural model. For the Russian-English task, our submission achieves the top BLEU result, outperforming the best pure neural system by 1.1 BLEU points and our own phrase-based baseline by 1.6 BLEU. After manual evaluation, this system is the best restricted system in its own cluster. In follow-up experiments we improve results by additional 0.8 BLEU.

  • Publication . Conference object . Other literature type . Article . 2016
    Open Access
    Authors: 
    Ngoc Quang Luong; Andrei Popescu-Belis;
    Publisher: Association for Computational Linguistics
    Country: Switzerland
    Project: SNSF | MODERN: Modeling discours... (147653), EC | SUMMA (688139)

  • Publication . Article . Preprint . Other literature type . Conference object . Contribution for newspaper or weekly magazine . 2020
    Open Access English
    Authors: 
    Biao Zhang; Philip Williams; Ivan Titov; Rico Sennrich;
    Countries: Switzerland, United Kingdom
    Project: EC | ELITR (825460), SNSF | Multi-Task Learning with ... (176727), EC | GoURMET (825299)

    Massively multilingual models for neural machine translation (NMT) are theoretically attractive, but often underperform bilingual models and deliver poor zero-shot translations. In this paper, we explore ways to improve them. We argue that multilingual NMT requires stronger modeling capacity to support language pairs with varying typological characteristics, and overcome this bottleneck via language-specific components and deepening NMT architectures. We identify the off-target translation issue (i.e. translating into a wrong target language) as the major source of the inferior zero-shot performance, and propose random online backtranslation to enforce the translation of unseen training language pairs. Experiments on OPUS-100 (a novel multilingual dataset with 100 languages) show that our approach substantially narrows the performance gap with bilingual models in both one-to-many and many-to-many settings, and improves zero-shot performance by ~10 BLEU, approaching conventional pivot-based methods. Comment: ACL2020

  • Restricted
    Authors: 
    Yichi Zhang;
    Publisher: Informa UK Limited
    Project: EC | BROKEX (802070)

    A transnational flow of capital exchange during the 19th and early 20th centuries brought planning ideas and modernity into China. Since European countries and America used violence to place China ...

  • Open Access
    Authors: 
    Arturo González;
    Country: Ireland
    Project: EC | TRUSS (642453)

    The growth of cities, the impacts of climate change and the massive cost of providing new infrastructure provide the impetus for TRUSS (Training in Reducing Uncertainty in Structural Safety), a €3.7 million Marie Skłodowska-Curie Action Innovative Training Network project funded by EU's Horizon 2020 programme, which aims to maximize the potential of infrastructure that already exists (http://trussitn.eu). For that purpose, TRUSS brings together an international, inter-sectoral and multidisciplinary collaboration between five academic and eleven industry institutions from five European countries. The project covers rail and road infrastructure, buildings and energy and marine infrastructure. This paper reports progress in fields such as advanced sensor-based structural health monitoring solutions - unmanned aerial vehicles, optical backscatter reflectometry, monitoring sensors mounted on vehicles, ... - and innovative algorithms for structural designs and short- and long-term assessments of buildings, bridges, pavements, ships, ship unloaders, nuclear components and wind turbine towers that will support infrastructure operators and owners in managing their assets. European Commission Horizon 2020

  • Authors: 
    Perdoncin, Anton;
    Publisher: CAIRN
    Country: France
    Project: EC | LUBARTWORLD (818843)

    International audience
