Unlocking the power of L1 regularization: A novel approach to taming overfitting in CNN for image classification

Journal article


Sheikh, R., Wahid, F., Ali, S., Alkhayyat, A., Ma, Y., Khan, J. and Lee, Y. 2025. Unlocking the power of L1 regularization: A novel approach to taming overfitting in CNN for image classification. PLOS One. 20 (9), pp. 1-40. https://doi.org/10.1371/journal.pone.0327985
AuthorsSheikh, R., Wahid, F., Ali, S., Alkhayyat, A., Ma, Y., Khan, J. and Lee, Y.
Abstract

Convolutional Neural Networks (CNNs) stand as indispensable tools in deep learning, capable of autonomously extracting crucial features from diverse data types. However, the intricacies of CNN architectures can present challenges such as overfitting and underfitting, necessitating thoughtful strategies to optimize their performance. In this work, these issues have been resolved by introducing L1 regularization in the basic architecture of CNN when it is applied for image classification. The proposed model has been applied to three different datasets. It has been observed that incorporating L1 regularization with different coefficient values has distinct effects on the working mechanism of CNN architecture resulting in improving its performance. In MNIST digit classification, L1 regularization (coefficient: 0.01) simplifies feature representation and prevents overfitting, leading to enhanced accuracy. In the Mango Tree Leaves dataset, dual L1 regularization (coefficient: 0.001 for convolutional and 0.01 for dense layers) improves model interpretability and generalization, facilitating effective leaf classification. Additionally, for hand-drawn sketches like those in the Quick, Draw! Dataset, L1 regularization (coefficient: 0.001) refines feature representation, resulting in improved recognition accuracy and generalization across diverse sketch categories. These findings underscore the significance of regularization techniques like L1 regularization in fine-tuning CNNs, optimizing their performance, and ensuring their adaptability to new data while maintaining high accuracy. Such strategies play a pivotal role in advancing the utility of CNNs across various domains, further solidifying their position as a cornerstone of deep learning.

Year2025
JournalPLOS One
Journal citation20 (9), pp. 1-40
PublisherPublic Library of Science (PLoS)
ISSN1932-6203
Digital Object Identifier (DOI)https://doi.org/10.1371/journal.pone.0327985
Web address (URL)https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0327985
Accepted author manuscript
File Access Level
Restricted
Publisher's version
License
File Access Level
Open
Output statusPublished
Publication dates
Online05 Sep 2025
Publication process dates
Accepted24 Jun 2025
Deposited29 Sep 2025
Permalink -

https://repository.derby.ac.uk/item/v0vy1/unlocking-the-power-of-l1-regularization-a-novel-approach-to-taming-overfitting-in-cnn-for-image-classification

Download files


Publisher's version
journal.pone.0327985.pdf
License: CC BY 4.0
File access level: Open

  • 39
    total views
  • 3
    total downloads
  • 9
    views this month
  • 2
    downloads this month

Export as

Related outputs

Energy optimization and plant comfort management in smart greenhouses using the artificial bee colony algorithm
Jawad, M., Wahid, F., Ali, S., Ma, Y., Alkhyyat, A., Khan, J. and Lee, Y. 2025. Energy optimization and plant comfort management in smart greenhouses using the artificial bee colony algorithm. Scientific Reports. 15, pp. 1-29. https://doi.org/10.1038/s41598-024-84141-5
Visual impairment prevention by early detection of diabetic retinopathy based on stacked auto-encoder
Almas, S., Wahid, F., Ali, S., Alkhyyat, A., Ullah, K., Khan, J. and Lee, Y. 2025. Visual impairment prevention by early detection of diabetic retinopathy based on stacked auto-encoder. Scientific Reports. 15. https://doi.org/10.1038/s41598-025-85752-2
A deep neural network-based approach for seizure activity recognition of epilepsy sufferers
Khurshid, D., Wahid, F., Ali, S., Gumaei, A. H., Alzanin, S. M. and Mosleh, M. A. A. 2024. A deep neural network-based approach for seizure activity recognition of epilepsy sufferers. Frontiers. 11, pp. 1-16. https://doi.org/10.3389/fmed.2024.1405848
A Transfer Learning-Based Approach for Brain Tumor Classification
Bibi, N., Wahid, F., Ma, Y., Ali, S., Abbassi, I. A. and Alkhayya, A. 2024. A Transfer Learning-Based Approach for Brain Tumor Classification. IEEE Access. 12, pp. 1-21. https://doi.org/10.1109/ACCESS.2024.3425469