Murdoch University Research Repository

Welcome to the Murdoch University Research Repository

The Murdoch University Research Repository is an open access digital collection of research
created by Murdoch University staff, researchers and postgraduate students.

Learn more

PWIDB: A framework for learning to classify imbalanced data streams with incremental data re-balancing technique

Mohammed, R.A., Wong, K.W., Shiratuddin, M.F. and Wang, X.ORCID: 0000-0002-1557-8265 (2020) PWIDB: A framework for learning to classify imbalanced data streams with incremental data re-balancing technique. Procedia Computer Science, 176 . pp. 818-827.

[img]
Preview
PDF - Published Version
Download (1MB) | Preview
Free to read: https://doi.org/10.1016/j.procs.2020.09.077
*No subscription required

Abstract

The performance of classification algorithms with highly imbalanced streaming data depends upon efficient balancing strategy. Some techniques of balancing strategy have been applied using static batch data to resolve the class imbalance problem, which is difficult if applied for massive data streams. In this paper, a new Piece-Wise Incremental Data re-Balancing (PWIDB) framework is proposed. The PWIDB framework combines automated balancing techniques using Racing Algorithm (RA) and incremental rebalancing technique. RA is an active learning approach capable of classifying imbalanced data and can provide a way to select an appropriate re-balancing technique with imbalanced data. In this paper, we have extended the capability of RA for handling imbalanced data streams in the proposed PWIDB framework. The PWIDB accumulates previous knowledge with increments of re-balanced data and captures the concept of the imbalanced instances. The PWIDB is an incremental streaming batch framework, which is suitable for learning with streaming imbalanced data. We compared the performance of PWIDB with a well-known FLORA technique. Experimental results show that the PWIDB framework exhibits an improved and stable performance compared to FLORA and accumulative re-balancing techniques.

Item Type: Journal Article
Murdoch Affiliation: Information Technology, Mathematics and Statistics
Publisher: Elsevier B.V.
Copyright: © 2020 The Author(s)
URI: http://researchrepository.murdoch.edu.au/id/eprint/58470
Item Control Page Item Control Page

Downloads

Downloads per month over past year