publication . Preprint . Conference object . Contribution for newspaper or weekly magazine . 2016

The MGB-2 Challenge: Arabic Multi-Dialect Broadcast Media Recognition

Ahmed Ali; Peter Bell; James Glass; Yacine Messaoui; Hamdy Mubarak; Steve Renals; Yifan Zhang
Open Access English
  • Published: 19 Sep 2016
  • Country: United Kingdom
Abstract
This paper describes the Arabic Multi-Genre Broadcast (MGB-2) Challenge for SLT-2016. Unlike last year's English MGB Challenge, which focused on recognition of diverse TV genres, this year's challenge emphasises handling dialect diversity in Arabic speech. Audio data comes from 19 distinct programmes broadcast on the Aljazeera Arabic TV channel between March 2005 and December 2015. Programmes are split into three groups: conversations, interviews, and reports. A total of 1,200 hours have been released with lightly supervised transcriptions for acoustic modelling. For language modelling, we made available over 110M words crawled from Aljazeera Arabi...
Subjects
free text keywords: Computer Science - Computation and Language, Natural language processing, computer.software_genre, computer, Training set, Computer science, Language modelling, Transcription (linguistics), Grapheme, Speech transcription, Broadcasting, business.industry, business, Metadata, Arabic, language.human_language, language, Artificial intelligence
Funded by
EC| SUMMA
Project
SUMMA
Scalable Understanding of Multilingual Media
  • Funder: European Commission (EC)
  • Project Code: 688139
  • Funding stream: H2020 | RIA
Validated by funder
Communities
Digital Humanities and Cultural Heritage
Download from
Edinburgh Research Explorer
Contribution for newspaper or weekly magazine . 2017
OpenAIRE
Preprint . 2016
Provider: OpenAIRE

