MIntelligence-Group/SpeechImg_EmoRec

Interpretable Multimodal Emotion Recognition using Hybrid Fusion of Speech and Image Data

Implementation for the paper (submitted to the Springer Multimedia Tools and Applications (MTAP) journal):
Interpretable Multimodal Emotion Recognition using Hybrid Fusion of Speech and Image Data
Puneet Kumar, Sarthak Malik and Balasubramanian Raman

Code Files

The code files were kept private until the corresponding research paper's acceptance in Springer MTAP. They will be made publicly available soon.

Dataset Access

Access to the ‘IIT Roorkee Speech and Image Emotion Recognition (IIT-R SIER) dataset’ can be obtained through the access form, Access Form - IIT-R SIER Dataset.pdf. The dataset was prepared by Puneet Kumar and Sarthak Malik at the Machine Intelligence Lab, IIT Roorkee, under the supervision of Prof. Balasubramanian Raman. It contains speech utterances, the corresponding images, and emotion labels (happy, sad, hate, anger).
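
Until the code is released, the following minimal Python sketch shows one way a single dataset sample could be represented once access is granted. Only the four emotion labels come from the description above; the class names (`Emotion`, `SierSample`), the helper `parse_label`, the field names, and the file names are hypothetical and do not reflect the dataset's actual on-disk layout.

```python
from dataclasses import dataclass
from enum import Enum


class Emotion(Enum):
    # The four emotion labels named in the dataset description.
    HAPPY = "happy"
    SAD = "sad"
    HATE = "hate"
    ANGER = "anger"


@dataclass(frozen=True)
class SierSample:
    # Hypothetical record: one speech utterance, its corresponding image,
    # and the shared emotion label. Field names are assumptions.
    speech_path: str
    image_path: str
    label: Emotion


def parse_label(text: str) -> Emotion:
    """Map a label string to an Emotion, ignoring case and surrounding spaces.

    Raises ValueError for any string outside the four known labels.
    """
    return Emotion(text.strip().lower())


# Illustrative file names only; the real naming scheme is not documented here.
sample = SierSample("utterance_0001.wav", "image_0001.jpg", parse_label("Happy"))
```

Restricting labels to an enumeration means that a misspelled or unexpected label fails at load time instead of silently becoming a fifth class.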
