Diabetes Classification Project Test Accuracy 83%

Overview

This project analyzes the diabetes dataset from Kaggle. The goal is to compare various input features against the outcome of diabetes diagnosis. Visualizations using Matplotlib are employed to provide insights into the data.

Dataset

Source: Kaggle https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database
Description: The dataset contains several medical predictor variables and one target variable, which indicates whether a patient is diabetic or not.

Features

Number of Pregnancies
Glucose Concentration
Blood Pressure
Skin Thickness
Insulin Level
BMI (Body Mass Index)
Diabetes Pedigree Function
Age

Methodology

Data Preprocessing: Data cleaning and normalization techniques were applied to prepare the dataset for training.
Model: A feedforward neural network was built using PyTorch. The architecture consists of:
- An input layer that takes 8 features.
- Two hidden layers with 20 neurons each, activated by the ReLU function.
- An output layer that provides two outputs corresponding to the binary classification of diabetes.
Loss Function: CrossEntropyLoss was used to evaluate the model's performance, as it is suitable for multi-class classification problems.
Optimizer: The Adam optimizer was employed for optimizing the model parameters, known for its efficiency and effectiveness.

Training and Testing the Model

The model was trained for 1000 epochs. During training:

The model's predictions were compared against the actual training labels to calculate the loss.
The optimizer updated the model parameters to minimize the loss.
Periodic evaluations were made on a separate test set to monitor the model's performance.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
functions		functions
html		html
model		model
notebooks		notebooks
outputs		outputs
scripts		scripts
.gitignore		.gitignore
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diabetes Classification Project Test Accuracy 83%

Overview

Dataset

Features

Methodology

Training and Testing the Model

About

Releases

Packages

Languages

ADongol123/diabities_prediction_model

Folders and files

Latest commit

History

Repository files navigation

Diabetes Classification Project Test Accuracy 83%

Overview

Dataset

Features

Methodology

Training and Testing the Model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages