Call for papers: Satellite workshop Interspeech 2019: Pluricentric Languages in Speech Technology


PLACE/DATE: Graz, Austria, September 14 2019
Rudolf Muhr (Graz, Austria); Barbara Schuppler (Graz, Austria); Sarmad Hussain (Lahore, Pakistan); Tania Habib (Lahore, Pakistan)



1. Pluricentric languages (PLCLs) are a common type among the languages of the world. Presently 43 languages have been identified as belonging to this category. Languages like English, Spanish, Portuguese, Bengali, Hindi, Urdu etc. fall into this category. These languages are used in at least two nations, where they have an official function and form national varieties of their own with specific linguistic and pragmatic features. In addition to the variation on the level of national standard varieties, there is also so-called “second level variation” on a regional and local level. It is often used in diglossic speech situations where code switching is a salient feature, with two or more varieties being used within the same utterance.

2. The amount of linguistic variation in pluricentric languages is considerable and poses a challenge for speech recognition in particular and human language technology in general.

3. The motivation for the satellite workshop is the observation that pluricentric languages have not been sufficiently dealt with. This is particularly the case with the so-called “non-dominant varieties”, which often suffer from a lack of documentation and treatment in speech technology. The satellite workshop will therefore focus on these varieties, as they share many features with endangered languages.


  • Speech recognition and development of language resources for under-resourced pluricentric languages and varieties of languages. This is particularly the case for the so-called non-dominant varieties. Examples include, among others, Scots, Saami, Karelian Finnish, Ruthenian, Kashubian, Tajik and Frisian, as well as diverse American and African languages: Aymara, Bambara, Fulfulde, Lingala, Malinke, Soninke, Tuareg, Xhosa etc.
  • Language and speech resources development (parallel corpora, pronunciation databases, tagging etc.) especially for non-dominant varieties.
  • Speech technologies such as speech recognition, text-to-speech and speech-to-speech for the national varieties of pluricentric languages, both on the level of standard varieties and on the level of so-called “informal speech”.
  • Empirical studies on the phonetics and phonology of national varieties of different pluricentric languages.
  • Speech and language technologies that are able to cope with the variation in pluricentric languages, particularly with respect to non-dominant varieties and under-resourced languages.
  • Speech and language processing for code-switched speech in national varieties of pluricentric languages.

ABSTRACTS: Abstracts of 400-600 words should be submitted via EasyChair.
(Call is open from 01 February to 13 April 2019; deadline extended until April 13 2019)

LENGTH OF PRESENTATIONS: 15 minutes for the presentation plus 5 minutes for discussion.

KEYNOTE SPEAKER: Prof. Martine Adda-Decker (LPP Paris Sorbonne Nouvelle and LIMSI). Title: Variation in spoken pluricentric languages: insights from large corpora and challenges for speech technology

PUBLICATION OF PRESENTATIONS: A special issue of a major journal will later be published, containing peer-reviewed, substantially extended versions of the abstracts contributed to the conference. Submissions to it will be by invitation.


