From 3f88111541eafc08b047a321d17e22cc0f75be0d Mon Sep 17 00:00:00 2001 From: that-ar-guy Date: Wed, 26 Feb 2025 12:43:42 +0530 Subject: [PATCH 1/5] autism done/ images to added --- docs/machine-learning/autism-detection.md | 164 ++++++++++++++++++++++ docs/machine-learning/index.md | 11 ++ 2 files changed, 175 insertions(+) create mode 100644 docs/machine-learning/autism-detection.md diff --git a/docs/machine-learning/autism-detection.md b/docs/machine-learning/autism-detection.md new file mode 100644 index 0000000..0a26f3b --- /dev/null +++ b/docs/machine-learning/autism-detection.md @@ -0,0 +1,164 @@ +# 🌟 Autism Spectrum Disorder (ASD) Detection using Machine Learning + +
+ +
+ +## 🎯 AIM +To develop a machine learning model that predicts the likelihood of Autism Spectrum Disorder (ASD) based on behavioral and demographic features. + +## 🌊 DATASET LINK +[Autism Screening Data](https://www.kaggle.com/code/konikarani/autism-prediction/data) + +## 📚 KAGGLE NOTEBOOK +[Autism Detection Kaggle Notebook](https://www.kaggle.com/code/thatarguy/autism-prediction-using-ml?kernelSessionId=224549646) + +??? Abstract "Kaggle Notebook" + + + +## ⚙️ TECH STACK + +| **Category** | **Technologies** | +|--------------------------|---------------------------------------------| +| **Languages** | Python | +| **Libraries/Frameworks** | Pandas, NumPy, Scikit-learn, | +| **Tools** | Jupyter Notebook, VS Code | + +--- + +## 🖍 DESCRIPTION +!!! info "What is the requirement of the project?" + - The rise in Autism cases necessitates early detection. + - Traditional diagnostic methods are time-consuming and expensive. + - Machine learning can provide quick, accurate predictions to aid early intervention. + +??? info "How is it beneficial and used?" + - Helps doctors and researchers identify ASD tendencies early. + - Reduces the time taken for ASD screening. + - Provides a scalable and cost-effective approach. + +??? info "How did you start approaching this project? (Initial thoughts and planning)" + - Collected and preprocessed the dataset. + - Explored different ML models for classification. + - Evaluated models based on accuracy and efficiency. + + +--- + +## 🔍 PROJECT EXPLANATION + +### 🧩 DATASET OVERVIEW & FEATURE DETAILS +The dataset consists of **800 rows** and **22 columns**, containing information related to autism spectrum disorder (ASD) detection based on various parameters. + + +| **Feature Name** | **Description** | **Datatype** | +|---------------------|----------------------------------------------------|:-----------:| +| `ID` | Unique identifier for each record | `int64` | +| `A1_Score` - `A10_Score` | Responses to 10 screening questions (0 or 1) | `int64` | +| `age` | Age of the individual | `float64` | +| `gender` | Gender (`m` for male, `f` for female) | `object` | +| `ethnicity` | Ethnic background | `object` | +| `jaundice` | Whether the individual had jaundice at birth (`yes/no`) | `object` | +| `austim` | Family history of autism (`yes/no`) | `object` | +| `contry_of_res` | Country of residence | `object` | +| `used_app_before` | Whether the individual used a screening app before (`yes/no`) | `object` | +| `result` | Score calculated based on the screening test | `float64` | +| `age_desc` | Age description (e.g., "18 and more") | `object` | +| `relation` | Relation of the person filling out the form | `object` | +| `Class/ASD` | ASD diagnosis label (`1` for ASD, `0` for non-ASD) | `int64` | + +This dataset provides essential features for training a model to detect ASD based on questionnaire responses and demographic information. + + +--- + +### 🛠 PROJECT WORKFLOW +!!! success "Project workflow" + ``` mermaid + graph LR + A[Start] --> B[Data Preprocessing]; + B --> C[Feature Engineering]; + C --> D[Model Training]; + D --> E[Model Evaluation]; + E --> F[Deployment]; + ``` + +=== "Step 1" + - Collected dataset and performed exploratory data analysis. + +=== "Step 2" + - Preprocessed data (handling missing values, encoding categorical data). + +=== "Step 3" + - Feature selection and engineering. + +=== "Step 4" + - Trained multiple classification models (Decision Tree, Random Forest, XGBoost). + +=== "Step 5" + - Evaluated models using accuracy, precision, recall, and F1-score. + + +--- + +### 🖥️ CODE EXPLANATION +=== "Section 1: Data Preprocessing" + - Loaded dataset and handled missing values. + +=== "Section 2: Model Training" + - Implemented Logistic Regression and Neural Networks for classification. + +--- + +### ⚖️ PROJECT TRADE-OFFS AND SOLUTIONS +=== "Trade Off 1" + - **Accuracy vs. Model Interpretability**: Used a Random Forest model instead of a deep neural network for better interpretability. + +=== "Trade Off 2" + - **Speed vs. Accuracy**: Chose Logistic Regression for quick predictions in real-time applications. + +--- + +## 🖼 SCREENSHOTS +!!! tip "Visualizations and EDA of different features" + + === "Age Distribution" + ![img](https://assets.ltkcontent.com/images/103034/line-graph-example_27c5571306.jpg) + +??? example "Model performance graphs" + + === "Confusion Matrix" + ![img](https://assets.ltkcontent.com/images/103029/bar-graph-example_27c5571306.jpg) + +--- + +## 📉 MODELS USED AND THEIR EVALUATION METRICS +| Model | Accuracy | Precision | Recall | F1-score | +|------------|----------|-----------|--------|----------| +| Decision Tree | 73% | 0.71 | 0.73 | 0.72 | +| Random Forest | 82% | 0.82 | 0.82 | 0.82 | +| XGBoost | 81% | 0.81 | 0.81 | 081 | + +--- + +## ✅ CONCLUSION +### 🔑 KEY LEARNINGS +!!! tip "Insights gained from the data" + - Behavioral screening scores are the strongest predictors of ASD. + - Family history and neonatal jaundice also show correlations with ASD diagnosis. + +??? tip "Improvements in understanding machine learning concepts" + - Feature selection and engineering play a crucial role in medical predictions. + - Trade-offs between accuracy, interpretability, and computational efficiency need to be balanced. + +--- + +### 🌍 USE CASES +=== "Early ASD Screening" + - Helps parents and doctors identify ASD tendencies at an early stage. + +=== "Assistive Diagnostic Tool" + - Can support psychologists in preliminary ASD assessments before clinical diagnosis. + + diff --git a/docs/machine-learning/index.md b/docs/machine-learning/index.md index 8261844..a35feb9 100644 --- a/docs/machine-learning/index.md +++ b/docs/machine-learning/index.md @@ -97,5 +97,16 @@ + +
+ + autism detcion using ml +
+

Autism Detection

+

Predicting Autism Using Machine Learning

+

📅 2025-02-26 | ⏱️ 8 mins

+
+
+
From ac0d54fa82ae539b3f74f9d5ff4ea2b54125f763 Mon Sep 17 00:00:00 2001 From: Mohammed Abdul Rahman <130785777+that-ar-guy@users.noreply.github.com> Date: Wed, 26 Feb 2025 14:29:40 +0530 Subject: [PATCH 2/5] index autism image updated --- docs/machine-learning/index.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/machine-learning/index.md b/docs/machine-learning/index.md index a35feb9..1c2e36e 100644 --- a/docs/machine-learning/index.md +++ b/docs/machine-learning/index.md @@ -100,7 +100,7 @@
- autism detcion using ml + autism detcion using ml

Autism Detection

Predicting Autism Using Machine Learning

From 4665322ac878bf8170a40d655251db0ed6901b62 Mon Sep 17 00:00:00 2001 From: Mohammed Abdul Rahman <130785777+that-ar-guy@users.noreply.github.com> Date: Wed, 26 Feb 2025 14:35:29 +0530 Subject: [PATCH 3/5] images updated --- docs/machine-learning/autism-detection.md | 15 +++++++++++---- 1 file changed, 11 insertions(+), 4 deletions(-) diff --git a/docs/machine-learning/autism-detection.md b/docs/machine-learning/autism-detection.md index 0a26f3b..524c2a0 100644 --- a/docs/machine-learning/autism-detection.md +++ b/docs/machine-learning/autism-detection.md @@ -1,7 +1,7 @@ # 🌟 Autism Spectrum Disorder (ASD) Detection using Machine Learning
- +
## 🎯 AIM @@ -15,7 +15,8 @@ To develop a machine learning model that predicts the likelihood of Autism Spect ??? Abstract "Kaggle Notebook" - + ## ⚙️ TECH STACK @@ -124,12 +125,18 @@ This dataset provides essential features for training a model to detect ASD base !!! tip "Visualizations and EDA of different features" === "Age Distribution" - ![img](https://assets.ltkcontent.com/images/103034/line-graph-example_27c5571306.jpg) + ![img](https://github.com/user-attachments/assets/412aa82d-0f7a-4c7a-bdca-30a553de36b4) ??? example "Model performance graphs" === "Confusion Matrix" - ![img](https://assets.ltkcontent.com/images/103029/bar-graph-example_27c5571306.jpg) + ![img](https://github.com/user-attachments/assets/71c5773c-fe1f-42bb-ab76-e1150f564507) + +??? example "Features Correlation" + + === "Feature Correlation Heatmap" + ![img](https://github.com/user-attachments/assets/60d24749-2f2e-4222-9895-c46c29ea596e) + --- From 0eb3c2fc8fead271e94d9245d8e0f867a9f88c59 Mon Sep 17 00:00:00 2001 From: that-ar-guy Date: Wed, 26 Feb 2025 14:49:07 +0530 Subject: [PATCH 4/5] bug fixed --- docs/machine-learning/autism-detection.md | 3 +-- 1 file changed, 1 insertion(+), 2 deletions(-) diff --git a/docs/machine-learning/autism-detection.md b/docs/machine-learning/autism-detection.md index 524c2a0..eb24a78 100644 --- a/docs/machine-learning/autism-detection.md +++ b/docs/machine-learning/autism-detection.md @@ -15,8 +15,7 @@ To develop a machine learning model that predicts the likelihood of Autism Spect ??? Abstract "Kaggle Notebook" - + ## ⚙️ TECH STACK From ffad1ddfac7798dfddb5eed9a67289178950ac51 Mon Sep 17 00:00:00 2001 From: that-ar-guy Date: Thu, 27 Feb 2025 21:25:15 +0530 Subject: [PATCH 5/5] updated kaggle notebook --- docs/machine-learning/autism-detection.md | 5 ++--- 1 file changed, 2 insertions(+), 3 deletions(-) diff --git a/docs/machine-learning/autism-detection.md b/docs/machine-learning/autism-detection.md index eb24a78..7b3b6f8 100644 --- a/docs/machine-learning/autism-detection.md +++ b/docs/machine-learning/autism-detection.md @@ -11,12 +11,11 @@ To develop a machine learning model that predicts the likelihood of Autism Spect [Autism Screening Data](https://www.kaggle.com/code/konikarani/autism-prediction/data) ## 📚 KAGGLE NOTEBOOK -[Autism Detection Kaggle Notebook](https://www.kaggle.com/code/thatarguy/autism-prediction-using-ml?kernelSessionId=224549646) +[Autism Detection Kaggle Notebook](https://www.kaggle.com/code/thatarguy/autism-prediction-using-ml?kernelSessionId=224830771) ??? Abstract "Kaggle Notebook" - - + ## ⚙️ TECH STACK | **Category** | **Technologies** |