Atypical Speech & Voices: Corpora, Classification, Coaching & Conversion

Examples of such less typical patterns include speaking while eating or during physical exercise, singing, a wide range of pathological effects, and speech produced by specific age groups (children, the elderly).

In fact, recent advances in the field of Computational Paralinguistics allow for automatic recognition, analysis, and synthesis of an ever-increasing range of "atypical" phenomena. At the same time, deeper analysis methods have opened doors to new assistive technologies, such as coaching systems, serious games, and tutoring systems, as well as diagnostic aids (e.g., for early detection of autism spectrum disorders, Alzheimer's disease, or Parkinson's disease).

Edited by: Tiago Falk, Elmar Nöth, Vijay Parsa and Björn Schuller

  1. We investigate the automatic recognition of emotions in the singing voice and study the worth and role of a variety of relevant acoustic parameters. The data set contains phrases and vocalises sung by eight re...

    Authors: Florian Eyben, Gláucia L Salomão, Johan Sundberg, Klaus R Scherer and Björn W Schuller
    Citation: EURASIP Journal on Audio, Speech, and Music Processing 2015 2015:19
  2. Automatic diagnosis and monitoring of Alzheimer’s disease can have a significant impact on society as well as the well-being of patients. The part of the brain cortex that processes language abilities is one o...

    Authors: Ali Khodabakhsh, Fatih Yesil, Ekrem Guner and Cenk Demiroglu
    Citation: EURASIP Journal on Audio, Speech, and Music Processing 2015 2015:9
  3. This paper presents a voice conversion (VC) method that utilizes conditional restricted Boltzmann machines (CRBMs) for each speaker to obtain high-order speaker-independent spaces where voice features are conv...

    Authors: Toru Nakashika, Tetsuya Takiguchi and Yasuo Ariki
    Citation: EURASIP Journal on Audio, Speech, and Music Processing 2015 2015:8
  4. Vocal tremor has been simulated using a high-dimensional discrete vocal fold model. Specifically, respiratory, phonatory, and articulatory tremors have been modeled as instabilities in six parameters of the mo...

    Authors: Rubén Fraile, Juan Ignacio Godino-Llorente and Malte Kob
    Citation: EURASIP Journal on Audio, Speech, and Music Processing 2015 2015:1
  5. Speech technology is firmly rooted in daily life, most notably in command-and-control (C&C) applications. C&C usability downgrades quickly, however, when used by people with non-standard speech. We pursue a fu...

    Authors: Bart Ons, Jort F Gemmeke and Hugo Van hamme
    Citation: EURASIP Journal on Audio, Speech, and Music Processing 2014 2014:43
  6. Building a voice-operated system for learning disabled users is a difficult task that requires a considerable amount of time and effort. Due to the wide spectrum of disabilities and their different related pho...

    Authors: Marek Bohac, Michaela Kucharova, Zoraida Callejas, Jan Nouza and Petr Červa
    Citation: EURASIP Journal on Audio, Speech, and Music Processing 2014 2014:39