Anglia Ruskin Research Online (ARRO)
Browse

Synthetic Data Generation and Impact Analysis of Machine Learning Models for Enhanced Credit Card Fraud Detection

Download (501.65 kB)
conference contribution
posted on 2025-05-07, 10:40 authored by Ahmed Abdullah Khaled, Md Mahmudul Hasan, Shareeful Islam, Spyridon Papastergiou, Haralambos Mouratidis
The financial industry is currently experiencing a substantial shift in its operating landscape because of the swift integration of technology. This transformation brings with it potential risks and challenges. Heightened occurrence of online fraud is one the key concerns for this sector, which has been exacerbated by the growing prevalence of online payment methods on e-commerce platforms and other websites. The identification of credit card fraud is a challenging task due to nature of imbalanced transactional data to detect and predict any fraudulent activities. In this context, this paper provides a unique approach to create synthetic dataset to tackle imbalanced issue for credit card fraud detection. The approach adopts Synthetic Minority Over-sampling Technique (SMOTE) technique for balancing dataset. An experiment is performed using several ML models including SVM (Support Vector Machines), KNN (K-Nearest Neighbours), and Random Forest to demonstrate the feasibility of using synthetic data. In this study, we have combined resampling techniques like SMOTE for oversampling the minority class with ensemble methods and appropriate evaluation metrics like the F1-score to improve the imbalanced data. The result from the experiment compared with widely used public datasets to evaluate the model performance. The analysis reveals an imbalance in the real ULB (Université Libre de Bruxelles) dataset, with the positive class (frauds) comprising a mere 0.172% of all transactions. The findings clearly show that the Random Forest model performs better than other modes with outstanding precision, recall, accuracy, and F1 score values to detect fraudulent transactions and reduce false positives.

History

Refereed

  • Yes

Volume

711

Page range

362-374

ISSN

1868-4238

Publisher

Springer Nature Switzerland

ISBN

9783031632105

Conference proceeding

IFIP Advances in Information and Communication Technology

Name of event

20th International Conference on Artificial Intelligence Applications and Innovations (AIAI)

Location

GREECE, Corfu

Event start date

2024-06-27

Event finish date

2024-06-30

Editors

Maglogiannis I, Iliadis L, Macintyre J, Avlonitis M, Papaleonidas A

File version

  • Accepted version

Affiliated with

  • School of Computing and Information Science Outputs