Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to Digital Humanities and Cultural Heritage. Are you interested to view more results? Visit OpenAIRE - Explore.
62 Research products, page 1 of 7

  • Digital Humanities and Cultural Heritage
  • Publications
  • Research data
  • Research software
  • Other research products
  • 2013-2022
  • Article
  • 050905 science studies
  • Scientometrics

10
arrow_drop_down
Relevance
arrow_drop_down
  • Open Access English
    Authors: 
    Zhiqi Wang; Ronald Rousseau;
    Publisher: Springer International Publishing
    Country: Belgium

    Abstract: The Yule-Simpson paradox refers to the fact that outcomes of comparisons between groups are reversed when groups are combined. Using Essential Sciences Indicators, a part of InCites (Clarivate), data for countries, it is shown that although the Yule-Simpson phenomenon in citation analysis and research evaluation is not common, it isn't extremely rare either. The Yule-Simpson paradox is a phenomenon one should be aware of, otherwise one may encounter unforeseen surprises in scientometric studies.

  • Open Access
    Authors: 
    Mei Hsiu-Ching Ho; John S. Liu;
    Publisher: Springer Science and Business Media LLC

    Scholars all over the world have produced a large body of COVID-19 literature in an exceptionally short period after the outbreak of this rapidly-spreading virus. An analysis of the literature accumulated in the first 150 days hints that the rapid knowledge accumulation in its early-stage development was expedited through a wide variety of journal platforms, a sense and pressure of national urgency, and inspiration from journal editorials.

  • Publication . Article . Preprint . Other literature type . 2018
    Open Access
    Authors: 
    Giovanni Colavizza;
    Publisher: Springer Science and Business Media LLC
    Country: Switzerland
    Project: SNSF | Linked Books: Reconstruct... (159961), SNSF | Understanding Citations i... (168489)

    The humanities are often characterized by sociologists as having a low mutual dependence among scholars and high task uncertainty. According to Fuchs' theory of scientific change, this leads over time to intellectual and social fragmentation, as new scholarship accumulates in the absence of shared unifying theories. We consider here a set of specialisms in the discipline of history and measure the connectivity properties of their bibliographic coupling networks over time, in order to assess whether fragmentation is indeed occurring. We construct networks using both reference overlap and textual similarity. It is shown that the connectivity of reference overlap networks is gradually and steadily declining over time, whilst that of textual similarity networks is stable. Author bibliographic coupling networks also show signs of a decline in connectivity, in the absence of an increasing propensity for collaborations. We speculate that, despite the gradual weakening of ties among historians as mapped by references, new scholarship might be continually integrated through shared vocabularies and narratives. This would support our belief that citations are but one kind of bibliometric data to consider --- perhaps even of secondary importance --- when studying the humanities, while text should play a more prominent role.

  • Open Access
    Authors: 
    Liang Meng; Haifeng Wang; Pengfei Han;
    Publisher: Springer Science and Business Media LLC

    Intriguing unforced regularities in human behaviors have been reported in varied research domains, including scientometrics. In this study we examine the manuscript submission behavior of researchers, with a focus on its monthly pattern. With a large and reliable dataset which records the submission history of articles published on 10 multidisciplinary journals and 10 management journals over a five-year period (2013–2017), we observe a prominent turn-of-the-month submission effect for accepted papers in management journals but not multidisciplinary journals. This effect gets more pronounced in submissions to top-tier journals and when the first day of a month happens to be a Saturday or Sunday. Sense of ceremony is proposed as a likely explanation of this effect, since the first day of a month is a fundamental temporal landmark which has a ‘fresh start effect’ on researchers. To conclude, an original and interesting day-of-the-month effect in the academia is reported in this study, which calls for more research attention.

  • Closed Access
    Authors: 
    Sergio Jimenez; Youlin Avila; George Dueñas; Alexander Gelbukh;
    Publisher: Springer Science and Business Media LLC

    The decision of reading or not a research paper is commonly made while reading its title and abstract. Although content and merit should lead to that decision, other factors such as writing style may intervene. Eventually, more readings could produce more citations. We investigated the stylistic factors in the title and abstract of research papers that affect their “citability”, and built a prediction model for citations at 5, 10, and 15 years. Since the number of citations is the preferred ranking function of several academic search engines, our “citability” function could alleviate the under-representation of recent not-yet-cited papers in query results. For this study, we collected a large dataset of around 750,000 titles and abstracts from articles in Scopus, intended to be representative of the entire science. For each instance, we extracted a relatively large set of 3578 stylistic features that were extracted at different linguistic levels, i.e. characters, syllables, tokens (i.e. words), sentences, stop/content words, and part-of-speech (POS) tags. Particularly, we present a novel set of corpus-based stylistic features that we called Corpus Spectral Signatures (CSS). We found out that a linear prediction model for citations (binned into quartiles) build with only the top-250 correlated features achieved a mean absolute error of 0.805 quartiles, and that on average, predictions were highly correlated with their real values (Spearman’s $$rho=0.515$$ ). CSS features were among the top correlated features, but POS features were the most predictive group of features in an ablation study.

  • Publication . Article . 2018
    Closed Access
    Authors: 
    Baitong Chen; Ying Ding; Feicheng Ma;
    Publisher: Springer Science and Business Media LLC

    Understanding semantic word shifts in scientific domains is essential for facilitating interdisciplinary communication. Using a data set of published papers in the field of information retrieval (IR), this paper studies the semantic shifts of words in IR based on mining per-word topic distribution over time. We propose that semantic word shifts not only occur over time, but also over topics. The shifts are examined from two perspectives, the topic-level and the context-level. According to the over-time word-topic distribution, stable words and unstable words are recognized. The diverging and converging trends in the unstable type reveal characteristics of the topic evolution process. The context-level shifts are further detected by similarities between word vectors. Our work associates semantic word shifts with the evolving of topics, which facilitates a better understanding of semantic word shifts from both topics and contexts.

  • Open Access
    Authors: 
    Wolfgang Glänzel; Lin Zhang;
    Publisher: Springer Science and Business Media LLC

    Proceeding from Moravcsik's paradigmatic ideas of how to build indigenous capability and sustainable science systems in developing countries, we attempted to further focus on the peculiarities of the twenty-first century and the new challenges of globalisation. In doing so, we selected three particular topics deemed relevant in this context: increase of international visibility and reception by the international community, international collaboration and the participation in research in emerging fields. We analysed these issues using the example of 16 developing countries and emerging economies. We found that several countries achieve an impressive citation impact with a considerable share of highly cited papers. The high impact proved to be associated with international collaboration. We also found two extreme situations in international collaboration, both of which might form challenges in building sustainable national science systems and research structures. Research activity in emerging research topics, finally, showed the presence of developing countries in highly topical research and their capability to contribute also to newest research trends.

  • Closed Access
    Authors: 
    Tingting Zhang; Baozhen Lee; Qinghua Zhu;
    Publisher: Springer Science and Business Media LLC

    Traditional plagiarism detection is based primarily on methods of character matching or topic similarity. Another promising methodology remains largely unexplored: employing deep mining to establish a contextual hierarchy among themes. This paper proposes a semantic approach to measuring the extent of plagiarism, based on a hierarchical graph model. The main innovations are as follows: (1) hierarchical extraction of topic feature terms and elucidation of a corresponding graph structure; (2) graph similarity calculation based on the maximum common subgraph. This semantic-measure method goes beyond semantic detection of topics to take into account the context of topic feature terms, as well as the hierarchical structure by which those topics are related. This contextual-hierarchical perspective should, in turn, improve the accuracy of plagiarism detection. In addition, by mining the implicit relationships between hierarchical feature terms, our method can detect plagiarized documents with similar themes but using different topic words: a potential boon to plagiarism detection recall. In an experiment conducted on a dataset from Chinese paper database CNKI, the semantic-measure method indeed demonstrates accuracy and recall superior to those achieved with current state-of-the-art methods.

  • Closed Access
    Authors: 
    Cholmyong Pak; Guang Yu; Weibin Wang;
    Publisher: Springer Science and Business Media LLC

    Citation impact indicators play a significant role in evaluating the scientific research activity. Most of citation impact indicators are based on the citation count that the publication is cited as a reference in the other publications, but the difference between each citation situation was not considered. Normally, the number of citations that a publication is cited in the other publications may represent the formal quality of the publication. Similarly, the number of times that a publication is really mentioned within the citing publication, it may also represent the formal quality of the citation. We have examined about how many times each reference was really mentioned within the citing publications and studied about the citation situation within the citing publications. We verified that the citation distribution of references according to the mention frequency follows the Generalized Pareto distribution. The results showed that about 20% of total references were mentioned three and more times, and the number of citation mentions for the about 50% of total references were from about 20% of the total references in the citing publications.

  • Open Access
    Authors: 
    Alberto Martín-Martín; Enrique Orduña-Malea; Emilio Delgado López-Cózar;
    Publisher: arXiv
    Country: Spain

    This article describes a procedure to generate a snapshot of the structure of a specific scientific community and their outputs based on the information available in Google Scholar Citations (GSC). We call this method MADAP (Multifaceted Analysis of Disciplines through Academic Profiles). The international community of researchers working in Bibliometrics, Scientometrics, Informetrics, Webometrics, and Altmetrics was selected as a case study. The records of the top 1,000 most cited documents by these authors according to GSC were manually processed to fill any missing information and deduplicate fields like the journal titles and book publishers. The results suggest that it is feasible to use GSC and the MADAP method to produce an accurate depiction of the community of researchers working in Bibliometrics (both specialists and occasional researchers) and their publication habits (main publication venues such as journals and book publishers). Additionally, the wide document coverage of Google Scholar (specially books and book chapters) enables more comprehensive analyses of the documents published in a specific discipline than were previously possible with other citation indexes, finally shedding light on what until now had been a blind spot in most citation analyses. This is a post-peer-review, pre-copyedit version of an article published in Scientometrics. The final authenticated version is available online at: https://doi.org/10.1007/s11192-017-2587-4 Research funded by Ministerio de Educación, Cultura y Deporte (FPU2013/05863). Universitat Politècnica de València (PAID-10-14).