Abstract
This paper proposes a methodology for expressive speech synthesis by identifying, estimating and modelling the parameters responsible for the generation of vocal affect. Existing systems take into account one or more prosodic factors and attempt to model affect based on information in prosody, in the spectrum or, more rarely, both. This paper analyses the incompleteness of the representations used in the literature and puts forth a strategy to convert prosodic and spectral features simultaneously for effective emotion conversion. Initial studies on prosodic parameter modification show that certain characteristics are common across all archetypal emotions and can therefore be kept constant during model development. A prototype system is proposed that takes extracted speech parameters, models them according to the required target emotion and builds a generalized model for emotion incorporation that can be used in social environments.