Skip to content

caesaredia/credit-default-risk-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

7 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Credit Default Risk Analysis

This project analyzes factors that may influence the probability of customer loan default using a real-world dataset from a financial institution. The analysis focuses on identifying trends and relationships across demographic and financial variables.

Project Objectives πŸ“Š

  • Understand which customer attributes contribute to higher or lower loan default risk.
  • Provide insights that can help credit divisions make more informed lending decisions.
  • Apply data cleaning, transformation, and exploratory analysis techniques.

Key Analysis & Hypotheses Tested πŸ”

  • Number of Children: Are customers with more children more likely to default?
  • Family Status: How does marital status influence default probability?
  • Income Level: Do low-income customers default more often?
  • Loan Purpose: Are loans for education or car purchases riskier?

Tools Used

  • Python (pandas, matplotlib, seaborn)
  • Jupyter Notebook

Files

  • credit_analysis.ipynb β€” the main notebook containing all analysis steps
  • credit_scoring_eng.csv β€” source data file

Notes

  • Outliers and missing values were addressed using median imputation and basic filtering.
  • The dataset was cleaned for inconsistencies like casing differences and invalid values (e.g., age = 0, children = -1).

Author

Nabilla Hafsah Caesaredia

About

Predicting customer loan default risk using exploratory data analysis and basic statistical modeling.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published