Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to Digital Humanities and Cultural Heritage. Are you interested to view more results? Visit OpenAIRE - Explore.
152 Research products, page 1 of 16

  • Digital Humanities and Cultural Heritage
  • Publications
  • Other research products
  • 2018-2022
  • 050105 experimental psychology
  • European Commission

10
arrow_drop_down
Relevance
arrow_drop_down
  • Publication . Conference object . Article . Preprint . 2021
    Open Access
    Authors: 
    Henry Conklin; Bailin Wang; Kenny Smith; Ivan Titov;
    Publisher: Association for Computational Linguistics
    Project: NWO | Scaling Semantic Parsing ... (13221), EC | BroadSem (678254)

    Natural language is compositional; the meaning of a sentence is a function of the meaning of its parts. This property allows humans to create and interpret novel sentences, generalizing robustly outside their prior experience. Neural networks have been shown to struggle with this kind of generalization, in particular performing poorly on tasks designed to assess compositional generalization (i.e. where training and testing distributions differ in ways that would be trivial for a compositional strategy to resolve). Their poor performance on these tasks may in part be due to the nature of supervised learning which assumes training and testing data to be drawn from the same distribution. We implement a meta-learning augmented version of supervised learning whose objective directly optimizes for out-of-distribution generalization. We construct pairs of tasks for meta-learning by sub-sampling existing training data. Each pair of tasks is constructed to contain relevant examples, as determined by a similarity metric, in an effort to inhibit models from memorizing their input. Experimental results on the COGS and SCAN datasets show that our similarity-driven meta-learning can improve generalization performance. Comment: ACL2021 Camera Ready; fix a small typo

  • Open Access English
    Authors: 
    Michelangelo Naim; Mikhail Katkov; Stefano Recanatesi; Misha Tsodyks;
    Project: EC | M-GATE (765549), EC | HBP SGA1 (720270), NIH | Associative Processes in ... (2R01MH055687-21), EC | HBP SGA2 (785907)

    Structured information is easier to remember and recall than random one. In real life, information exhibits multi-level hierarchical organization, such as clauses, sentences, episodes and narratives in language. Here we show that multi-level grouping emerges even when participants perform memory recall experiments with random sets of words. To quantitatively probe brain mechanisms involved in memory structuring, we consider an experimental protocol where participants perform ‘final free recall’ (FFR) of several random lists of words each of which was first presented and recalled individually. We observe a hierarchy of grouping organizations of FFR, most notably many participants sequentially recalled relatively long chunks of words from each list before recalling words from another list. More-over, participants who exhibited strongest organization during FFR achieved highest levels of performance. Based on these results, we develop a hierarchical model of memory recall that is broadly compatible with our findings. Our study shows how highly controlled memory experiments with random and meaningless material, when combined with simple models, can be used to quantitatively probe the way meaningful information can efficiently be organized and processed in the brain, so to be easily retrieved.Significance StatementInformation that people communicate to each other is highly structured. For example, a story contains meaningful elements of various degrees of complexity (clauses, sentences, episodes etc). Recalling a story, we are chiefly concerned with these meaningful elements and not its exact wording. Here we show that people introduce structure even when recalling random lists of words, by grouping the words into ‘chunks’ of various sizes. Doing so improves their performance. The so formed chunks closely correspond in size to story elements described above. This suggests that our memory is trained to create a structure that resembles the one it typically deals with in real life, and that using random material like word lists can be used to quantitatively probe these memory mechanisms.

  • Open Access English
    Authors: 
    Rui Mendes; Ricardo Gomes; Diederick Christian Niehorster; Efstathia Soroli;
    Publisher: Bern Open Publishing
    Project: EC | POLONEZ (665778)

    This document contains the abstracts for the 2018 Scandinavian Workshop on Applied Eye Tracking (SWAET 2018) which was held at Copenhagen Business School, Denmark, 23 to 24 August, 2018..

  • Open Access
    Authors: 
    Steffen Lepa; Martin Herzog; Jochen Steffens; Andreas Schoenrock; Hauke Egermann;
    Publisher: Informa UK Limited
    Country: United Kingdom
    Project: EC | ABC DJ (688122)

    We describe the development of a computational model predicting listener-perceived expressions of music in branding contexts. Representative ground truth from multi-national online listening experiments was combined with machine learning of music branding expert knowledge, and audio signal analysis toolbox outputs. A mixture of random forest and traditional regression models is able to predict average ratings of perceived brand image on four dimensions. Resulting cross-validated prediction accuracy (R²) was Arousal: 61%, Valence: 44%, Authenticity: 55%, and Timeliness: 74%. Audio descriptors for rhythm, instrumentation, and musical style contributed most. Adaptive sub-models for different marketing target groups further increase prediction accuracy.

  • Open Access
    Authors: 
    Johann-Mattis List; George Starostin; Lai Yunfan;
    Publisher: Gorgias Press LLC
    Country: Germany
    Project: EC | CALC (715618)
  • Publication . Conference object . Other literature type . 2020
    Open Access English
    Authors: 
    Bogdan Ludusan; Petra Wagner;
    Publisher: ISCA
    Country: Germany
    Project: EC | HA-HA (799022)

    With laughter research seeing a development in recent years, there is also an increased need in materials having laughter annotations. We examine in this study how one can leverage existing spontaneous speech resources to this goal. We first analyze the process of manual laughter annotation in corpora, by establishing two important parameters of the process: the amount of time required and its inter-rater reliability. Next, we propose a novel semi-automatic tool for laughter annotation, based on a signal-based representation of speech rhythm. We test both annotation approaches on the same recordings, containing German dyadic spontaneous interactions, and employing a larger pool of annotators than previously done. We then compare and discuss the obtained results based on the two aforementioned parameters, highlighting the benefits and costs associated to each approach.

  • Open Access English
    Authors: 
    Kun Sun; Rong Wang;
    Publisher: MDPI AG
    Country: Germany
    Project: EC | WIDE (742545)

    This study applies relative entropy in naturalistic large-scale corpus to calculate the difference among L2 (second language) learners at different levels. We chose lemma, token, POStrigram, conjunction to represent lexicon and grammar to detect the patterns of language proficiency development among different L2 groups using relative entropy. The results show that information distribution discrimination regarding lexical and grammatical differences continues to increase from L2 learners at a lower level to those at a higher level. This result is consistent with the assumption that in the course of second language acquisition, L2 learners develop towards a more complex and diverse use of language. Meanwhile, this study uses the statistics method of time series to process the data on L2 differences yielded by traditional frequency-based methods processing the same L2 corpus to compare with the results of relative entropy. However, the results from the traditional methods rarely show regularity. As compared to the algorithms in traditional approaches, relative entropy performs much better in detecting L2 proficiency development. In this sense, we have developed an effective and practical algorithm for stably detecting and predicting the developments in L2 learners’ language proficiency. H2020 European Research Council

  • Open Access
    Authors: 
    Jana Hasenäcker; Olga Solaja; Davide Crepaldi;
    Publisher: Springer Science and Business Media LLC
    Country: Italy
    Project: EC | STATLEARN (679010)

    In visual word identification, readers automatically access word internal information: they recognize orthographically embedded words (e.g., HAT in THAT) and are sensitive to morphological structure (DEAL-ER, BASKET-BALL). The exact mechanisms that govern these processes, however, are not well established yet - how is this information used? What is the role of affixes in this process? To address these questions, we tested the activation of meaning of embedded word stems in the presence or absence of a morphological structure using two semantic categorization tasks in Italian. Participants made category decisions on words (e.g., is CARROT a type of food?). Some no-answers (is CORNER a type of food?) contained category-congruent embedded word stems (i.e., CORN-). Moreover, the embedded stems could be accompanied by a pseudo-suffix (-er in CORNER) or a non-morphological ending (-ce in PEACE) - this allowed gauging the role of pseudo-suffixes in stem activation. The analyses of accuracy and response times revealed that words were harder to reject as members of a category when they contained an embedded word stem that was indeed category-congruent. Critically, this was the case regardless of the presence or absence of a pseudo-suffix. These findings provide evidence that the lexical identification system activates the meaning of embedded word stems when the task requires semantic information. This study brings together research on orthographic neighbors and morphological processing, yielding results that have important implications for models of visual word processing.

  • Open Access
    Authors: 
    Chuang Y; Voller M; Elnaz Shafaei-Bajestan; Susanne Gahl; Peter Hendrix; Baayen Rh;
    Publisher: Center for Open Science
    Project: EC | WIDE (742545)

    Nonwords are often used to clarify how lexical processing takes place in the absence of semantics. This study shows that nonwords are not semantically vacuous. We used Linear Discriminative Learning (Baayen et al., 2019) to estimate the meanings of nonwords in the MALD database (Tucker et al., 2018) from the speech signal. We show that measures gauging nonword semantics significantly improve model fit for both acoustic durations and RTs. Although nonwords do not evoke meanings that afford conscious reflexion, they do make contact with the semantic space, and the angles and distances of nonwords with respect to actual words co-determine articulation and lexicality decisions.

  • Open Access English
    Authors: 
    Ulrike Zeshan; Sibaji Panda;
    Publisher: De Gruyter
    Country: United Kingdom
    Project: EC | MULTISIGN (263647)

    Abstract We present data from a bimodal trilingual situation involving Indian Sign Language (ISL), Hindi and English. Signers are co-using these languages while in group conversations with deaf people and hearing non-signers. The data show that in this context, English is an embedded language that does not impact on the grammar of the utterances, while both ISL and Hindi structures are realised throughout. The data show mismatches between the simultaneously expressed ISL and Hindi, such that semantic content and/or syntactic structures are different in both languages, yet are produced at the same time. The data also include instances of different propositions expressed simultaneously in the two languages. This under-documented behaviour is called “sign-speaking” here, and we explore its implications for theories of multilingualism, code-switching, and bilingual language production.

Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to Digital Humanities and Cultural Heritage. Are you interested to view more results? Visit OpenAIRE - Explore.
152 Research products, page 1 of 16
  • Publication . Conference object . Article . Preprint . 2021
    Open Access
    Authors: 
    Henry Conklin; Bailin Wang; Kenny Smith; Ivan Titov;
    Publisher: Association for Computational Linguistics
    Project: NWO | Scaling Semantic Parsing ... (13221), EC | BroadSem (678254)

    Natural language is compositional; the meaning of a sentence is a function of the meaning of its parts. This property allows humans to create and interpret novel sentences, generalizing robustly outside their prior experience. Neural networks have been shown to struggle with this kind of generalization, in particular performing poorly on tasks designed to assess compositional generalization (i.e. where training and testing distributions differ in ways that would be trivial for a compositional strategy to resolve). Their poor performance on these tasks may in part be due to the nature of supervised learning which assumes training and testing data to be drawn from the same distribution. We implement a meta-learning augmented version of supervised learning whose objective directly optimizes for out-of-distribution generalization. We construct pairs of tasks for meta-learning by sub-sampling existing training data. Each pair of tasks is constructed to contain relevant examples, as determined by a similarity metric, in an effort to inhibit models from memorizing their input. Experimental results on the COGS and SCAN datasets show that our similarity-driven meta-learning can improve generalization performance. Comment: ACL2021 Camera Ready; fix a small typo

  • Open Access English
    Authors: 
    Michelangelo Naim; Mikhail Katkov; Stefano Recanatesi; Misha Tsodyks;
    Project: EC | M-GATE (765549), EC | HBP SGA1 (720270), NIH | Associative Processes in ... (2R01MH055687-21), EC | HBP SGA2 (785907)

    Structured information is easier to remember and recall than random one. In real life, information exhibits multi-level hierarchical organization, such as clauses, sentences, episodes and narratives in language. Here we show that multi-level grouping emerges even when participants perform memory recall experiments with random sets of words. To quantitatively probe brain mechanisms involved in memory structuring, we consider an experimental protocol where participants perform ‘final free recall’ (FFR) of several random lists of words each of which was first presented and recalled individually. We observe a hierarchy of grouping organizations of FFR, most notably many participants sequentially recalled relatively long chunks of words from each list before recalling words from another list. More-over, participants who exhibited strongest organization during FFR achieved highest levels of performance. Based on these results, we develop a hierarchical model of memory recall that is broadly compatible with our findings. Our study shows how highly controlled memory experiments with random and meaningless material, when combined with simple models, can be used to quantitatively probe the way meaningful information can efficiently be organized and processed in the brain, so to be easily retrieved.Significance StatementInformation that people communicate to each other is highly structured. For example, a story contains meaningful elements of various degrees of complexity (clauses, sentences, episodes etc). Recalling a story, we are chiefly concerned with these meaningful elements and not its exact wording. Here we show that people introduce structure even when recalling random lists of words, by grouping the words into ‘chunks’ of various sizes. Doing so improves their performance. The so formed chunks closely correspond in size to story elements described above. This suggests that our memory is trained to create a structure that resembles the one it typically deals with in real life, and that using random material like word lists can be used to quantitatively probe these memory mechanisms.

  • Open Access English
    Authors: 
    Rui Mendes; Ricardo Gomes; Diederick Christian Niehorster; Efstathia Soroli;
    Publisher: Bern Open Publishing
    Project: EC | POLONEZ (665778)

    This document contains the abstracts for the 2018 Scandinavian Workshop on Applied Eye Tracking (SWAET 2018) which was held at Copenhagen Business School, Denmark, 23 to 24 August, 2018..

  • Open Access
    Authors: 
    Steffen Lepa; Martin Herzog; Jochen Steffens; Andreas Schoenrock; Hauke Egermann;
    Publisher: Informa UK Limited
    Country: United Kingdom
    Project: EC | ABC DJ (688122)

    We describe the development of a computational model predicting listener-perceived expressions of music in branding contexts. Representative ground truth from multi-national online listening experiments was combined with machine learning of music branding expert knowledge, and audio signal analysis toolbox outputs. A mixture of random forest and traditional regression models is able to predict average ratings of perceived brand image on four dimensions. Resulting cross-validated prediction accuracy (R²) was Arousal: 61%, Valence: 44%, Authenticity: 55%, and Timeliness: 74%. Audio descriptors for rhythm, instrumentation, and musical style contributed most. Adaptive sub-models for different marketing target groups further increase prediction accuracy.

  • Open Access
    Authors: 
    Johann-Mattis List; George Starostin; Lai Yunfan;
    Publisher: Gorgias Press LLC
    Country: Germany
    Project: EC | CALC (715618)
  • Publication . Conference object . Other literature type . 2020
    Open Access English
    Authors: 
    Bogdan Ludusan; Petra Wagner;
    Publisher: ISCA
    Country: Germany
    Project: EC | HA-HA (799022)

    With laughter research seeing a development in recent years, there is also an increased need in materials having laughter annotations. We examine in this study how one can leverage existing spontaneous speech resources to this goal. We first analyze the process of manual laughter annotation in corpora, by establishing two important parameters of the process: the amount of time required and its inter-rater reliability. Next, we propose a novel semi-automatic tool for laughter annotation, based on a signal-based representation of speech rhythm. We test both annotation approaches on the same recordings, containing German dyadic spontaneous interactions, and employing a larger pool of annotators than previously done. We then compare and discuss the obtained results based on the two aforementioned parameters, highlighting the benefits and costs associated to each approach.

  • Open Access English
    Authors: 
    Kun Sun; Rong Wang;
    Publisher: MDPI AG
    Country: Germany
    Project: EC | WIDE (742545)

    This study applies relative entropy in naturalistic large-scale corpus to calculate the difference among L2 (second language) learners at different levels. We chose lemma, token, POStrigram, conjunction to represent lexicon and grammar to detect the patterns of language proficiency development among different L2 groups using relative entropy. The results show that information distribution discrimination regarding lexical and grammatical differences continues to increase from L2 learners at a lower level to those at a higher level. This result is consistent with the assumption that in the course of second language acquisition, L2 learners develop towards a more complex and diverse use of language. Meanwhile, this study uses the statistics method of time series to process the data on L2 differences yielded by traditional frequency-based methods processing the same L2 corpus to compare with the results of relative entropy. However, the results from the traditional methods rarely show regularity. As compared to the algorithms in traditional approaches, relative entropy performs much better in detecting L2 proficiency development. In this sense, we have developed an effective and practical algorithm for stably detecting and predicting the developments in L2 learners’ language proficiency. H2020 European Research Council

  • Open Access
    Authors: 
    Jana Hasenäcker; Olga Solaja; Davide Crepaldi;
    Publisher: Springer Science and Business Media LLC
    Country: Italy
    Project: EC | STATLEARN (679010)

    In visual word identification, readers automatically access word internal information: they recognize orthographically embedded words (e.g., HAT in THAT) and are sensitive to morphological structure (DEAL-ER, BASKET-BALL). The exact mechanisms that govern these processes, however, are not well established yet - how is this information used? What is the role of affixes in this process? To address these questions, we tested the activation of meaning of embedded word stems in the presence or absence of a morphological structure using two semantic categorization tasks in Italian. Participants made category decisions on words (e.g., is CARROT a type of food?). Some no-answers (is CORNER a type of food?) contained category-congruent embedded word stems (i.e., CORN-). Moreover, the embedded stems could be accompanied by a pseudo-suffix (-er in CORNER) or a non-morphological ending (-ce in PEACE) - this allowed gauging the role of pseudo-suffixes in stem activation. The analyses of accuracy and response times revealed that words were harder to reject as members of a category when they contained an embedded word stem that was indeed category-congruent. Critically, this was the case regardless of the presence or absence of a pseudo-suffix. These findings provide evidence that the lexical identification system activates the meaning of embedded word stems when the task requires semantic information. This study brings together research on orthographic neighbors and morphological processing, yielding results that have important implications for models of visual word processing.

  • Open Access
    Authors: 
    Chuang Y; Voller M; Elnaz Shafaei-Bajestan; Susanne Gahl; Peter Hendrix; Baayen Rh;
    Publisher: Center for Open Science
    Project: EC | WIDE (742545)

    Nonwords are often used to clarify how lexical processing takes place in the absence of semantics. This study shows that nonwords are not semantically vacuous. We used Linear Discriminative Learning (Baayen et al., 2019) to estimate the meanings of nonwords in the MALD database (Tucker et al., 2018) from the speech signal. We show that measures gauging nonword semantics significantly improve model fit for both acoustic durations and RTs. Although nonwords do not evoke meanings that afford conscious reflexion, they do make contact with the semantic space, and the angles and distances of nonwords with respect to actual words co-determine articulation and lexicality decisions.

  • Open Access English
    Authors: 
    Ulrike Zeshan; Sibaji Panda;
    Publisher: De Gruyter
    Country: United Kingdom
    Project: EC | MULTISIGN (263647)

    Abstract We present data from a bimodal trilingual situation involving Indian Sign Language (ISL), Hindi and English. Signers are co-using these languages while in group conversations with deaf people and hearing non-signers. The data show that in this context, English is an embedded language that does not impact on the grammar of the utterances, while both ISL and Hindi structures are realised throughout. The data show mismatches between the simultaneously expressed ISL and Hindi, such that semantic content and/or syntactic structures are different in both languages, yet are produced at the same time. The data also include instances of different propositions expressed simultaneously in the two languages. This under-documented behaviour is called “sign-speaking” here, and we explore its implications for theories of multilingualism, code-switching, and bilingual language production.