Water Quality Prediction on Ganga River

Developed a machine learning model that predicted the Water Quality Index (WQI) of the Ganga River with 85 percent accuracy, using historical water data.

Objectives

The primary objective of this project is to develop a predictive model for the Water Quality Index of the River Ganga.

Data Collection and Preprocessing: Gather comprehensive datasets covering the water quality parameters measured along the Ganga. Preprocess the data to handle missing values and outliers and to ensure compatibility with machine learning algorithms.
Feature Selection: Identify the key features that affect water quality and select a subset for model training to improve efficiency and interpretability.
Algorithm Selection: Evaluate and compare the performance of several machine learning algorithms, such as Random Forest and Support Vector Machines (SVM), for predicting the Water Quality Index.
Model Training and Validation: Train the selected models on historical data, holding out a portion for validation to assess performance and fine-tune hyperparameters.
Prediction and Visualization: Identify which ML algorithm predicts the water quality level of the river most accurately, and provide real-time predictions of the Water Quality Index.
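
The preprocessing and feature selection steps above can be sketched with pandas as follows. This is a minimal illustration under stated assumptions, not the project's actual code: the column names (pH, DO, BOD, WQI), the synthetic data, the median imputation, the 1.5 x IQR clipping rule, and the correlation-based ranking are all hypothetical choices made for the example.

```python
import numpy as np
import pandas as pd


def preprocess(df, target):
    """Fill missing feature values with the column median, then clip outliers
    to the 1.5 * IQR fences. The target column is left untouched."""
    out = df.copy()
    for col in [c for c in out.columns if c != target]:
        out[col] = out[col].fillna(out[col].median())
        q1, q3 = out[col].quantile([0.25, 0.75])
        iqr = q3 - q1
        out[col] = out[col].clip(q1 - 1.5 * iqr, q3 + 1.5 * iqr)
    return out


def rank_features(df, target, k):
    """Return the k features with the largest absolute correlation to the target."""
    corr = df.corr()[target].drop(target).abs()
    return corr.sort_values(ascending=False).head(k).index.tolist()


# Synthetic stand-in for the river measurements (hypothetical columns and values).
rng = np.random.default_rng(0)
n = 50
df = pd.DataFrame({
    "pH": rng.normal(7.5, 0.3, n),
    "DO": rng.normal(7.0, 1.0, n),
    "BOD": rng.normal(3.0, 0.5, n),
})
df["WQI"] = 10 * df["DO"] - 5 * df["BOD"] + rng.normal(0, 1, n)

# Inject one missing value and one implausible reading to exercise the cleaning.
df.loc[0, "DO"] = np.nan
df.loc[1, "BOD"] = 100.0

clean = preprocess(df, target="WQI")
selected = rank_features(clean, target="WQI", k=2)
print(selected)
```

Clipping keeps every row, which matters when the dataset is small; dropping outlier rows instead is an equally reasonable choice when the readings are known to be sensor errors.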

Technologies used

Google Colaboratory
Python with pandas

Analysis

The Random Forest Classifier has the highest scores on all metrics (Accuracy, Precision, Recall, and F1 Score), with perfect scores of 1.0000, meaning it classifies every sample in the testing set correctly.
The Decision Tree Classifier also performs very well, with high scores on all metrics, though below the Random Forest's perfect scores.
The KNN Classifier and Logistic Regression perform similarly, with decent accuracy and other metric scores.
The SVM Classifier has the lowest scores of the five models, indicating it is less suitable for this classification task than the others.
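
A comparison like the one above could be produced with scikit-learn roughly as follows. This is a hedged sketch: scikit-learn is not listed among the project's technologies, the data here is synthetic (generated with `make_classification`), and the 80/20 split, macro averaging, feature scaling, and default hyperparameters are assumptions for illustration, so the scores will not match those reported above.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic three-class problem standing in for the water quality classes.
X, y = make_classification(
    n_samples=300, n_features=6, n_informative=4, n_redundant=0,
    n_classes=3, n_clusters_per_class=1, random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# The five model families named in the analysis; scale-sensitive models
# are wrapped in a pipeline with standardization.
models = {
    "Random Forest": RandomForestClassifier(random_state=0),
    "Decision Tree": DecisionTreeClassifier(random_state=0),
    "KNN": make_pipeline(StandardScaler(), KNeighborsClassifier()),
    "Logistic Regression": make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000)),
    "SVM": make_pipeline(StandardScaler(), SVC()),
}

results = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    pred = model.predict(X_test)
    results[name] = {
        "accuracy": accuracy_score(y_test, pred),
        "precision": precision_score(y_test, pred, average="macro", zero_division=0),
        "recall": recall_score(y_test, pred, average="macro", zero_division=0),
        "f1": f1_score(y_test, pred, average="macro", zero_division=0),
    }

for name, scores in results.items():
    print(name, {metric: round(value, 4) for metric, value in scores.items()})
```

Macro averaging weights every class equally, which is a sensible default when some water quality classes are rare; weighted averaging is the alternative when the class frequencies themselves matter.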

Support

Email at [email protected] for contributions.



itsmeritika20/Water-Quality-Index-Prediction
