• shareshare
  • link
  • cite
  • add
auto_awesome_motion View all 12 versions
Publication . Article . 2021

Automated identification of borrowings in multilingual wordlists

Johann-Mattis List; Robert Forkel;
Open Access

Although lexical borrowing is an important aspect of language evolution, there have been few attempts to automate the identification of borrowings in lexical datasets. Moreover, none of the solutions which have been proposed so far identify borrowings across multiple languages. This study proposes a new method for the task and tests it on a newly compiled large comparative dataset of 48 South-East Asian languages from Southern China. The method yields very promising results, while it is conceptually straightforward and easy to apply. This makes the approach a perfect candidate for computer-assisted exploratory studies on lexical borrowing in contact areas.

Subjects by Vocabulary

Microsoft Academic Graph classification: Historical linguistics Computer science Natural language processing computer.software_genre computer Computational linguistics Southern china Language evolution Identification (information) Task (project management) Artificial intelligence business.industry business


Research Article, Articles, computational linguistics, historical linguistics, lexical borrowing, borrowing detection, computational historical linguistics, General Medicine

Related Organizations
Funded by
Computer-Assisted Language Comparison: Reconciling Computational and Classical Approaches in Historical Linguistics
  • Funder: European Commission (EC)
  • Project Code: 715618
  • Funding stream: H2020 | ERC | ERC-STG
Validated by funder
Related to Research communities
Digital Humanities and Cultural Heritage