A Review of Machine Learning Techniques for RiskEvaluation in Healthcare and Insurance Systems

Neha  Upadhyay

doi:10.5281/zenodo.17452974

PDF

Published: 2025-10-18

DOI: https://doi.org/10.5281/zenodo.17452974

Keywords:

Loan Default Prediction, XGBoost, Knowledge Graph Embedding (KGE), Credit Risk Assessment, Machine Learning, Classification Models, Feature Enrichment

Neha Upadhyay

IIS University

Abstract

Financial institutions require an accurate estimation of the risk of loan default in order to reduce losses incurred by credit
and sustain lending. This study proposes a robust stacking-based machine learning framework that integrates Knowledge Graph
Embedding (KGE) for semantic feature enrichment with XGBoost as the final predictive model. The approach is evaluated on the
Home Credit Default Risk (HCDR) dataset, comprising diverse financial, demographic, and behavioral attributes of loan applicants.
A comprehensive preprocessing pipeline, including imputation, normalization, one-hot encoding, and correlation-based feature
selection, ensures data quality and model generalizability. The proposed KGE-XGBoost model captures both structured tabular and
relational semantics by transforming borrower-entity relationships into dense embeddings, which are concatenated with original
features to form a unified representation. Experimental results demonstrate superior performance with 96.79% accuracy (ACC),
80.83% precision (PRE), 78.75% recall (REC), and an F1-score (F1) of 79.00%. The proposed model exhibits a strong ability to
outperform the baseline models (Random Forest achieved ACC 94.20%, NN achieved ACC 89%, and DT achieved ACC 73%),
particularly in scenarios with class imbalances. The KGE integration has been found to greatly contribute to feature expressiveness
and it presents a scalable and promising credit risk assessment solution to real-life financial applications.

Downloads

Download data is not yet available.

Issue

Vol. 1 No. 10 (2025): October-2025

Section

Research Paper

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

How to Cite

A Review of Machine Learning Techniques for RiskEvaluation in Healthcare and Insurance Systems. (2025). Journal of Global Research in Multidisciplinary Studies(JGRMS), 1(10), 36-43. https://doi.org/10.5281/zenodo.17452974

A Review of Machine Learning Techniques for RiskEvaluation in Healthcare and Insurance Systems

Abstract

Downloads

Issue

Section

How to Cite

Most read articles by the same author(s)

Similar Articles

Article Sidebar

Main Article Content

Abstract

Downloads

Article Details

Issue

Section

How to Cite

Most read articles by the same author(s)

Similar Articles