Skip to content

This project focuses on building a ML-based fraud detection system to identify potentially fraudulent transactions in a financial dataset. Using exploratory data analysis (EDA), data preprocessing, and multiple classification models (Logistic Regression, XGBoost), the model is trained to distinguish between legitimate and fraudulent activity.

Notifications You must be signed in to change notification settings

anuskagithub/FraudDetection_UsingMobileTransactions

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 

Repository files navigation

Fraud Detection Using Mobile Transactions

This project focuses on detecting fraudulent transactions in mobile money operations using supervised machine learning techniques. The dataset was obtained from Kaggle and processed through data cleaning, exploratory analysis, model building, and evaluation.

Objective

To build a machine learning model that can accurately classify whether a transaction is fraudulent based on transaction features, and deploy a working prototype with a dashboard for visualization and predictions.

Dataset

Source: Kaggle - Mobile Money Fraud Detection

Features:

type: Type of transaction (e.g., CASH_OUT, TRANSFER) amount: Amount involved in the transaction oldbalanceOrg, newbalanceOrig, oldbalanceDest, newbalanceDest: Balance before and after the transaction isFraud: Target variable (1 for fraud, 0 for normal)

Project Pipeline

  1. Data Preprocessing
  • Loaded dataset using pandas
  • Handled missing values
  • Encoded categorical variables using LabelEncoder
  • Scaled features with StandardScaler
  • Handled class imbalance using SMOTE
  1. Exploratory Data Analysis (EDA)
  • Distribution of fraud vs. non-fraud transactions
  • Correlation heatmaps
  • Insights on transaction types and their fraud likelihood
  1. Model Building
  • Trained multiple models:

    • Logistic Regression
    • XGBoost
  • Used metrics like:

    • Accuracy
    • Confusion Matrix
    • ROC-AUC
    • Classification Report
  1. Evaluation
  • Compared model performance
  • Identified XGBoost as the most accurate and reliable model

Technologies Used

  • Python
  • Pandas, NumPy, Seaborn, Matplotlib
  • Scikit-learn
  • XGBoost
  • Imbalanced-learn (SMOTE)

Demo: Output Screenshots

Here are the snapshots of the project outputs:

1. EDA - Fraud vs Non-Fraud Transaction Count

image

2.Transaction Amount Distribution: Fraud vs Non-Fraud

image

3.Fraud Count Across Transaction Types

image

4. Correlation Heatmap

image

5. ROC-Curve

image

About

This project focuses on building a ML-based fraud detection system to identify potentially fraudulent transactions in a financial dataset. Using exploratory data analysis (EDA), data preprocessing, and multiple classification models (Logistic Regression, XGBoost), the model is trained to distinguish between legitimate and fraudulent activity.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published