On the Evolution of Speech Representations for Affective Computing: A brief history and critical overview

You are here

Top Reasons to Join SPS Today!

1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.

On the Evolution of Speech Representations for Affective Computing: A brief history and critical overview

By: 
Sina Alisamir; Fabien Ringeval

Recent advances in the field of machine learning have shown great potential for the automatic recognition of apparent human emotions. In the era of Internet of Things and big-data processing, where voice-based systems are well established, opportunities to leverage cutting-edge technologies to develop personalized and human-centered services are genuinely real, with a growing demand in many areas such as education, health, well-being, and entertainment. Automatic emotion recognition from speech, which is a key element for developing personalized and human-centered services, has reached a degree of maturity that makes it of broad commercial interest today. However, there are still major limiting factors that prevent a broad applicability of emotion recognition technology. For example, one open challenge is the poor generalization capabilities of currently used feature extraction techniques to interpret expressions of affect across different persons, contexts, cultures, and languages.

Recent advances in the field of machine learning have shown great potential for the automatic recognition of apparent human emotions. In the era of Internet of Things and big-data processing, where voice-based systems are well established, opportunities to leverage cutting-edge technologies to develop personalized and human-centered services are genuinely real, with a growing demand in many areas such as education, health, well-being, and entertainment. Automatic emotion recognition from speech, which is a key element for developing personalized and human-centered services, has reached a degree of maturity that makes it of broad commercial interest today. However, there are still major limiting factors that prevent a broad applicability of emotion recognition technology. For example, one open challenge is the poor generalization capabilities of currently used feature extraction techniques to interpret expressions of affect across different persons, contexts, cultures, and languages.

SPS on Twitter

  • Voting for the IEEE SPS 5-Minute Video Clip Contest is now live! Check out the three finalists and cast your vote f… https://t.co/fbqgHY1tw7
  • CALL FOR PROPOSALS: Now seeking proposals for the 2024 IEEE International Workshop on Machine Learning for Signal P… https://t.co/l7V1bF2qhT
  • The DEGAS Webinar Series continues on Thursday, 19 May when Dr. Usman A. Khan presents "Distributed stochastic non-… https://t.co/AbfwVL0Yne
  • The IEEE Journal of Selected Topics in Signal Processing is now accepting submissions for a Special Issue on Signal… https://t.co/PbuzgYLigt
  • RT : New graduates transitioning to the next stage of their career often have several questions. In this video, I share… https://t.co/WA4aRlKNRn

SPS Videos


Signal Processing in Home Assistants

 


Multimedia Forensics


Careers in Signal Processing             

 


Under the Radar