Risk Assessment and Detection of Fraudulent Claims in Insurance Systems with Machine Learning Approaches

Authors

  • Mario Sutardiman Department of Information and Technology, Pradita University
  • Dyah Ayu Arditya Department of Information and Technology, Pradita University
  • Jarot S Suroso Master of Computer Science Department, Pradita University

DOI:

https://doi.org/10.46984/sebatik.v28i2.2522

Keywords:

Fraud Detection, Insurance Systems, Machine Learning, Random Forest, Artificial Neural Networks

Abstract

Fraudulent insurance claims pose a significant challenge to the sustainability and efficiency of insurance systems, leading to substantial financial losses and undermining trust between insurers and policyholders. The complexity and volume of contemporary data make it difficult for traditional fraud detection techniques like manual assessments to handle. To detect fraud in insurance claims, this study investigates the use of machine learning approaches, such as Random Forest and Artificial Neural Networks (ANN). Using a structured methodology, models were trained on historical claim data and assessed using  accuracy, F1-score, recall, and precision. The Random Forest algorithm's accuracy was 94%, according to the results, while the ANN performed better on controlled datasets. Additionally, feature importance analysis identified key predictors, such as claim amount and submission frequency, providing actionable insights for fraud prevention strategies. The implementation of machine learning into claim management systems offers a scalable, accurate, and cost-effective solution to combating fraud. Future work will focus on testing with larger datasets and exploring hybrid approaches to enhance robustness and adaptability.

References

Apicella, A., Isgrò, F., & Prevete, R. (2024). Don’t Push the Button! Exploring Data Leakage Risks in Machine Learning and Transfer Learning. https://ssrn.com/abstract=4733889

Asgarian, A., Saha, R., Jakubovitz, D., & Peyre, J. (2023). AutoFraudNet: A Multimodal Network to Detect Fraud in the Auto Insurance Industry. https://doi.org/10.48550/arXiv.2301.07526

Belhadi, A., Abdellah, N., & Nezai, A. (2023). The Effect of Big Data on the Development of the Insurance Industry. Business Ethics and Leadership, 7(1), 1–11. https://doi.org/10.21272/bel.7(1).1-11.2023

Benalcazar, D., Tapia, J. E., Gonzalez, S., & Busch, C. (2023). Synthetic ID Card Image Generation for Improving Presentation Attack Detection. IEEE Transactions on Information Forensics and Security, 18, 1814–1824. https://doi.org/10.1109/TIFS.2023.3255585

Binsar, F., Eryanto, E., Wahyudi, I., Sugandi, Y., & Suroso, J. (2020). Risk of Invalidation of Data in Hospital Information Systems in Indonesia. 777–782.

Bockel-Rickermann, C., Verdonck, T., & Verbeke, W. (2023). Fraud analytics: A decade of research. Expert Systems with Applications, 232, 120605. https://doi.org/10.1016/j.eswa.2023.120605

Gangadhar, K. S. N. V. K., Kumar, B. A., Vivek, Y., & Ravi, V. (2022). Chaotic Variational Auto Encoder based One Class Classifier for Insurance Fraud Detection. https://arxiv.org/abs/2212.07802

Johnson, J. M., & Khoshgoftaar, T. M. (2023). Data-Centric AI for Healthcare Fraud Detection. SN Computer Science, 4(4), 389.

Kofi Immanuel Jones, & Swati Sah. (2023). The Implementation of Machine Learning in The Insurance Industry with Big Data Analytics. International Journal of Data Informatics and Intelligent Computing, 2(2), 21–38. https://doi.org/10.59461/ijdiic.v2i2.47

Kumaraswamy, N., Markey, M. K., Ekin, J. C., Barner, F. ;, & Rascati, K. (2022). Healthcare Fraud Data Mining Methods: A Look Back and Look Ahead. 19(1).

Nabrawi, E., & Alanazi, A. (2023). Fraud Detection in Healthcare Insurance Claims Using Machine Learning. Risks, 11(9). https://doi.org/10.3390/risks11090160

Nguyen, V. B., Dastidar, K. G., Granitzer, M., & Siblini, W. (2022). The Importance of Future Information in Credit Card Fraud Detection. http://arxiv.org/abs/2204.05265

Óskarsdóttir, M., Ahmed, W., Antonio, K., Baesens, B., Dendievel, R., Donas, T., & Reynkens, T. (2022). Social Network Analytics for Supervised Fraud Detection in Insurance. Risk Analysis, 42(8), 1872–1890. https://doi.org/10.1111/risa.13693

Sathisha, H. K., & Sowmya, G. S. (2024). Detecting Financial Fraud in the Digital Age: The AI and ML Revolution. Future and Emerging Technologies in AI & ML, 3(2), 61–66.

Shekhar, S., Leder-Luis, J., & Akoglu, L. (2023). Unsupervised Machine Learning for Explainable Health Care Fraud Detection. http://www.nber.org/papers/w30946

Talukder, Md. A., Hossen, R., Uddin, M. A., Uddin, M. N., & Acharjee, U. K. (2024). Securing Transactions: A Hybrid Dependable Ensemble Machine Learning Model using IHT-LR and Grid Search. https://arxiv.org/abs/2402.14389

Tatineni, S., & Mustyala, A. (2024). Enhancing Financial Security: Data Science’s Role in Risk Management and Fraud Detection. ESP International Journal of Advancements in Computational Technology (ESP-IJACT), 2(2), 94–105.

Vo Hoang, K., Anh, C., & Thuan, N. (2023). Detecting Fraud Transaction using Ripper Algorithm Combines with Ensemble Learning Model. International Journal of Advanced Computer Science and Applications, 14. https://doi.org/10.14569/IJACSA.2023.0140438

Zanke, P. (2023). AI-Driven Fraud Detection Systems: A Comparative Study across Banking, Insurance, and Healthcare. Advances in Deep Learning Techniques, 3(2), 1–22. https://thesciencebrigade.com/adlt/article/view/182

Zhang, H., Hong, J., Dong, F., Drew, S., Xue, L., & Zhou, J. (2023). A Privacy-Preserving Hybrid Federated Learning Framework for Financial Crime Detection. https://arxiv.org/abs/2302.03654

Published

2024-12-20

How to Cite

Sutardiman, M., Arditya, D. A. and Suroso, J. S. (2024) “Risk Assessment and Detection of Fraudulent Claims in Insurance Systems with Machine Learning Approaches”, Sebatik, 28(2). doi: 10.46984/sebatik.v28i2.2522.