CSIR Central

Video-Based Facial Expression Recognition using a Blend of 3D CNN and ConvLSTM

IR@CEERI: CSIR-Central Electronics Engineering Research Institute, Pilani

View Archive Info
 
 
Field Value
 
Title Video-Based Facial Expression Recognition using a Blend of 3D CNN and ConvLSTM
 
Creator Saurav, S
Kumar, T
Saini, R
Singh, S
 
Subject Digital Systems
 
Description The 3-Dimensional Convolutional Neural Network (3D CNN) and Long Short-Term Memory Network (LSTM) have consistently outperformed many approaches in video-based Facial Expression Recognition (VEER). The vanilla version of the fully-connected LSTM (FC-LSTM) unrolls the image to a one-dimensional vector, which results in the loss of vital spatial information. Convolutional LSTM (ConvLSTM) overcomes this limitation by performing LSTM operations in terms of convolutions without performing any unrolling, as in the case with FC-LSTM. Motivated by this, in this paper, we propose a neural network architecture that consists of a blend of 3D CNN and ConvLSTM. The proposed hybrid architecture captures spatial-temporal information to produce competitive accuracy on three publicly available FER databases, namely the CK+, SAVEE, and AFEW. The experimental results demonstrate excellent performance without using any external emotion data with an added advantage of having a simple model with a comparatively fewer number of parameters and model size. Our designed FER pipeline is a suitable candidate for automatic recognition of facial expressions in real-time on a resource-constrained embedded platform.
 
Date 2020
 
Type Conference or Workshop Item
PeerReviewed
 
Format application/pdf
 
Identifier http://ceeri.csircentral.net/567/1/182020.pdf
Saurav, S and Kumar, T and Saini, R and Singh, S (2020) Video-Based Facial Expression Recognition using a Blend of 3D CNN and ConvLSTM. In: 17th IEEE India Council International Conference (INDICON-2020), December 11-13, 2020, NSUT, New Delhi, India.
 
Relation http://ceeri.csircentral.net/567/