I want to train a few chosen models (MobileNet, Xception and ResNet50) on a facial emotion recognition task. I am using the FER2013 dataset; however, I don't need to recognize all of the included emotions, only sad, angry, fearful, neutral and happy, so it's 5 labels in total. Because the dataset is imbalanced, I applied class weights. I'm training the models from scratch with Keras and TensorFlow.
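For reference, this is a minimal sketch of how such class weights are typically computed, using scikit-learn's `compute_class_weight` (an assumption on my side; the `train_labels` array below is placeholder data, not the real FER2013 labels):

```python
import numpy as np
from sklearn.utils.class_weight import compute_class_weight

# Hypothetical integer labels for the 5 kept classes, e.g.
# 0=angry, 1=fearful, 2=happy, 3=neutral, 4=sad (placeholder data).
train_labels = np.array([0, 0, 2, 2, 2, 2, 3, 4, 1, 2])

# 'balanced' makes each weight inversely proportional to class frequency:
# weight_c = n_samples / (n_classes * count_c)
weights = compute_class_weight(class_weight='balanced',
                               classes=np.unique(train_labels),
                               y=train_labels)
class_weight = dict(enumerate(weights))

# Passed to training as: model.fit(..., class_weight=class_weight)
```

With this scheme, rare classes (like "fearful" in the placeholder data) get a weight above 1 and frequent ones below 1.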
Based on Papers With Code (~70% with Inception, for example), I would expect to reach accuracy around 70% or even higher, since those results are for the full 7-class dataset and I'm only using 5 classes.
Unfortunately, the best the models reach is ~65% (Xception), ~62% (ResNet50) and ~63% (MobileNet) before they start to overfit.
For data augmentation I'm using the following transformations:
```python
train_datagen = tf.keras.preprocessing.image.ImageDataGenerator(
    rescale=1./255,
    width_shift_range=0.1,
    height_shift_range=0.1,
    zoom_range=0.1,
    fill_mode='constant',
    cval=0,
    horizontal_flip=True,
)
```
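For completeness, this is roughly how that generator gets attached to the training images with `flow_from_directory` (the directory layout, `target_size` and `color_mode` here are my assumptions, based on FER2013 being 48x48 grayscale; the empty temp directories only exist to make the sketch self-contained):

```python
import os
import tempfile
import tensorflow as tf

# train_datagen as defined above
train_datagen = tf.keras.preprocessing.image.ImageDataGenerator(
    rescale=1./255, width_shift_range=0.1, height_shift_range=0.1,
    zoom_range=0.1, fill_mode='constant', cval=0, horizontal_flip=True,
)

# Hypothetical layout: one sub-folder per kept emotion label.
root = tempfile.mkdtemp()
for emotion in ['angry', 'fearful', 'happy', 'neutral', 'sad']:
    os.makedirs(os.path.join(root, emotion))

train_gen = train_datagen.flow_from_directory(
    root,
    target_size=(48, 48),     # FER2013 images are 48x48
    color_mode='grayscale',   # single channel; fine when training from scratch
    class_mode='categorical',
    batch_size=16,
)
```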
I'm using the SGD optimizer with an initial learning rate of 1e-3, momentum of 0.9 and weight decay of 1e-4 (I have tried 1e-6 and 1e-2 with no improvement). The learning rate is halved after every 10 epochs without improvement. Batch size is 16, as a batch size of 8 brought no gains and only made the training process longer.
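The optimizer and learning-rate schedule described above can be sketched as follows (the `min_lr` floor and the `val_loss` monitor are my assumptions; note that the `weight_decay` argument on `tf.keras.optimizers.SGD` only exists from TF 2.11 onward, and on older versions an L2 `kernel_regularizer` is the usual substitute):

```python
import tensorflow as tf

# SGD as described: lr=1e-3, momentum=0.9, weight decay 1e-4.
# weight_decay requires TF >= 2.11; older versions would need
# L2 kernel regularization on the layers instead.
optimizer = tf.keras.optimizers.SGD(learning_rate=1e-3,
                                    momentum=0.9,
                                    weight_decay=1e-4)

# Halve the LR after 10 epochs without val_loss improvement.
lr_schedule = tf.keras.callbacks.ReduceLROnPlateau(monitor='val_loss',
                                                   factor=0.5,
                                                   patience=10,
                                                   min_lr=1e-6)  # assumed floor

# Used as: model.fit(..., callbacks=[lr_schedule], class_weight=class_weight)
```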
As an example, here are the metrics from training Xception (batch size = 16, initial learning rate = 0.001, momentum = 0.9, weight decay = 1e-4):
Training accuracy, testing accuracy, training loss and testing loss curves: [plots omitted]
The best accuracy for this model was 65.64%.
What could be improved in my training method? Is there any way to achieve better results?