Welcome to The Researchers' Guide (YouTube Channel) blog posts
Hello, I am Rahul Raoniar (PhD Student at IIT Guwahati, India) and welcome to Rahul_CODIFY !
"If you have knowledge, let others light their candles in it." - Margaret Fuller
This is a Python and R data Science Repository for Learning, Contributing and Improving Data Science Literacy
-
Blog posts
[For readers]: -
Codes and instructions for
Loading data into R and Python
- using base Python and R packages
Data manipulaton
- Using Base R
- dplyr
- forcats
- data.table
- Pandas
- dfply
Data tidying
- tidyr package
- broom package
- Pandas
Static Visualization
- Base R
- ggplot2
- Matplotlib
- Seaborn
- plotnine
- Interactive Visualization
- ggvis
- rbokeh
- plotly
- TrelliscopeJS (Big Data)
Modelling
- Supervised
- Linear + Linear mixed effect models
- Logit models (binary, multinominal classification and ordered) & Mixed effect models
- Survival Analysis [non-parametric, semi-parametric and full parametric models]
- Tree based models (classification and regression)
- naive bayes classifier (Probabilistic models)
- k-nearest neighbour (classification)
- Ensemble learners (Boot strap aggregation, random forest, Boosting, Extreme gradient boosting)
- Support Vector Machines
- Neural Networks using Keras and Tensor Flow
- shalow Neural Network (nntool, neuralnet packages)
- Deep Neural Network (h2o, Keras, MXNet packages etc.)
- Auto ML (h2o package)
- Unsupervised
- Clustering
- K-means
- Hirarchical
- Model based
- Density Based
- Association Analysis and Sequence Mining
- Dimension Reduction
- Principal Component Analysis
- Multidimensional Scaling
- Singular Value Decomposition
- Non-linear dimension reduction (ISOMAP and Locally Linear Embeding)
- Clustering
- Supervised
Model Evaluation
- Contigency Table
- Cross Validation
- Performance metrices (Metrics package)
- ROCR Curve
- F-measure
- Hyperparameter tuning using
- caret
- mlr
- H2O
- Scikit-learn
- Pycaret
- Interpretation of ML models using lime (Local Interpretable Model-Agnostic Explanations)
- Interpretation of ML models using SHAP (Shapley Additive Explanations)
-
Datasets
-
Python and R codes in the form of scripts & markdown documents
-
Interactive dashboard using Tableau
-
Web based application using Streamlit
** TS = Tutorial Series **
- TS 1. Introduction to R and Python
- TS 2. Function
- TS 3. Loading, Data Extracting and Transforming
- Base R and Python, readr, readxl, pandas
- TS 4. Data Preparation
- Base R and Python
- TS 5. Data Manipulation
- dplyr, data.table, dfply and pandas
- TS 6. Visualization using ggplot2
- Base R
- ggplot2
- Plotnine
- Seaborn
- Matplotlib
- TS 7. Map preparation
- ggmap
- tmap
- map using rbokeh
- TS 8. Interactive ploting
- ggvis
- plotly
- rbokeh
- TS 9. R Markdown, Shiny and Streamlit
- rmarkdown (markdown report preparation)
- shiny (Web application development)
- Streamlit (Web application development)
- TS 10. Statistics with R and Python
- Basic statistics
- Statistical Tests and Inferences from Data
- Scipy stats
- TS11. Supervised Machine learning
- Regression
- Classification
- TS12. Unsupervised Machine learning with R
- Clustering
- Association Analysis and Sequence Mining
- Dimension Reduction
These tutorial videos are a small contribution to the society from my side.
**Happy Coding :)