Murdoch University Research Repository

A fuzzy data augmentation technique to improve regularisation

Dabare, R., Wong, K.W., Shiratuddin, M.F. and Koutsakis, P. [ORCID: 0000-0002-4168-0888] (2021) A fuzzy data augmentation technique to improve regularisation. International Journal of Intelligent Systems. Early View.

Link to Published Version: https://doi.org/10.1002/int.22731
*Subscription may be required

Abstract

Deep learning (DL) has achieved superior classification in many applications due to its capability of extracting features from the data. However, the success of DL comes with the tradeoff of possible overfitting. A model's bias towards the data it has seen during training leads to poor generalisation. One way of solving this issue is to have enough training data so that the classifier is invariant to many data patterns. In the literature, data augmentation has been used as a type of regularisation method to reduce the chance that the model overfits. However, most of the relevant works focus on image, sound or text data. There is not much work on numerical data augmentation, although many real-world problems deal with numerical data. In this paper, we propose using a technique based on Fuzzy C-Means clustering and fuzzy membership grades. Fuzzy-related techniques are used to address the variance problem by generating new data items based on fuzzy numbers and each data item's membership in different fuzzy clusters. This data augmentation technique is used to improve the generalisation of a Deep Neural Network that is suitable for numerical data. By combining the proposed fuzzy data augmentation technique with the Dropout regularisation technique, we manage to balance the classification model's bias-variance tradeoff. Our proposed technique is evaluated using four popular data sets and is shown to provide better regularisation and higher classification accuracy compared with popular regularisation approaches.
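
The abstract does not give the generation rule itself, so the following is only a rough sketch of the general idea, not the authors' method. It runs a standard Fuzzy C-Means in NumPy, then perturbs each numerical sample with symmetric triangular noise (a stand-in for a triangular fuzzy number centred on the sample) whose half-width is the membership-weighted spread of the clusters. The function names `fuzzy_c_means` and `fuzzy_augment`, the `spread` parameter, and the perturbation rule are all assumptions made for illustration.

```python
import numpy as np


def fuzzy_c_means(X, n_clusters=3, m=2.0, n_iter=100, tol=1e-5, seed=0):
    """Standard Fuzzy C-Means. Returns (centres, memberships)."""
    rng = np.random.default_rng(seed)
    # Random initial memberships; each row sums to 1.
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        W = U ** m
        centres = (W.T @ X) / W.sum(axis=0)[:, None]
        # Distance from every sample to every centre: (n_samples, n_clusters).
        d = np.linalg.norm(X[:, None, :] - centres[None, :, :], axis=2)
        d = np.fmax(d, 1e-12)  # guard against division by zero
        inv = d ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    # Recompute centres so they match the final memberships.
    W = U ** m
    centres = (W.T @ X) / W.sum(axis=0)[:, None]
    return centres, U


def fuzzy_augment(X, y, n_clusters=3, spread=0.5, m=2.0, seed=0):
    """Hypothetical augmentation rule (an assumption, not the paper's):
    add triangular noise scaled by membership-weighted cluster spread."""
    rng = np.random.default_rng(seed)
    centres, U = fuzzy_c_means(X, n_clusters=n_clusters, m=m, seed=seed)
    W = U ** m
    # Fuzzy per-feature standard deviation of each cluster: (n_clusters, n_features).
    sq = (X[:, None, :] - centres[None, :, :]) ** 2
    sigma = np.sqrt((W[:, :, None] * sq).sum(axis=0) / W.sum(axis=0)[:, None])
    # Each sample's half-width blends cluster spreads by its membership grades.
    half_width = spread * (U @ sigma)
    # Symmetric triangular noise on [-1, 1], peaked at 0, scaled per sample and feature.
    noise = rng.triangular(-1.0, 0.0, 1.0, size=X.shape) * half_width
    X_new = X + noise
    # New items keep the label of the sample they were generated from.
    return np.vstack([X, X_new]), np.concatenate([y, y])
```

Calling `fuzzy_augment(X, y)` on a feature matrix `X` of shape `(n_samples, n_features)` and a label vector `y` returns a data set twice the original size, with the original rows first. In a setting like the one the abstract describes, the augmented set would then be fed to a neural network trained with Dropout; that training step is left out here.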

Item Type: Journal Article
Murdoch Affiliation(s): IT, Media and Communications
Publisher: Wiley
Copyright: © 2021 Wiley Periodicals LLC
URI: http://researchrepository.murdoch.edu.au/id/eprint/63011