OBJECTIVES OF THE WORKSHOP:
- Pluricentric languages (PLCLs) are a common type among the languages of the world. Presently 43 languages have been identified to belong to this category (see www.pluricentriclanguages.org). Languages like English, Spanish, Portuguese, Bengali, Hindi, Urdu etc. fall into this category. These languages are being used in at least two nations having an official function there and forming national varieties of their own with specific linguistic and pragmatic features. In addition to the variation on the level of national standard varieties there is also so called “second level variation” on a regional and local level that is often being used in diglossic speech situations where code switching is a salient feature with two or more varieties being used within thesame utterance.
- The amount of linguistic variation in pluricentric languages is considerable and poses a challenge for speech recognition in particular andhuman language technology in general.
- The motivation for the special session isthe observation that pluricentric languages have not sufficiently been dealt with. This is particularly the case with the so-called “non-dominant varieties” that often suffer from lack of documentation and treatment in speech technology. (For details see www.pluricentriclanguages.org). The special session will therefore have a focus on these varieties as they share many features with endangered languages.