DataScienceResources

This repository houses some of the links which I found useful for data science and machine learning.

Model explanations

The most important features in your data for a complex machine learning model (such as an ensemble model) can be provided by training an explainable model. The explainable model is known as the surrogate model. More on how to derive the explanations from the explainable model and which explainable model to pick please refer to the link.

There are other techniques on machine learning interpretibility where we compare the explanations from multiple explanations techniques like LIME and SHAP. More on these techniques can be found at the link.

Feature Engineering

To train a good machine learning model you require good features. Creating good features requires in depth understanding of the raw data and how to transform into the most meaningful features via feature engineering. Some of the feature engineering techniques are captured in the following links:-

Dimensionailty Reduction

PCA A Step-by-Step Explanation of Principal Component Analysis (PCA)

Bias vs Variance

Bias-Variance Trade-Off in Machine Learning link

Using pandas.cut method

All Pandas cut() you should know for transforming numerical data into categorical data link

Ensembling

A Comprehensive Guide to Ensemble Learning link
Ensemble Methods in Machine Learning: What are They and Why Use Them? link

Bias and exclusion in Machine learning

How to Recognize Exclusion in AI linke

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.github/workflows		.github/workflows
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DataScienceResources

Model explanations

Feature Engineering

Dimensionailty Reduction

Bias vs Variance

Using pandas.cut method

Ensembling

Bias and exclusion in Machine learning

About

Releases

Packages

License

gaugup/DataScienceResources

Folders and files

Latest commit

History

Repository files navigation

DataScienceResources

Model explanations

Feature Engineering

Dimensionailty Reduction

Bias vs Variance

Using pandas.cut method

Ensembling

Bias and exclusion in Machine learning

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages