Skip to content

Contains code and dataset for "The Researchers' Guide" YouTube channel

Notifications You must be signed in to change notification settings

AprilCCC/The_Researchers_Guide

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

77 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Welcome to The Researchers' Guide (YouTube Channel) blog posts

Hello, I am Rahul Raoniar (PhD Student at IIT Guwahati, India) and welcome to Rahul_CODIFY !

"If you have knowledge, let others light their candles in it." - Margaret Fuller

This is a Python and R data Science Repository for Learning, Contributing and Improving Data Science Literacy

The future Video blogs will include the following:

  1. Blog posts [For readers]:

  2. Codes and instructions for

    • Loading data into R and Python
      • using base Python and R packages
    • Data manipulaton
      • Using Base R
      • dplyr
      • forcats
      • data.table
      • Pandas
      • dfply
    • Data tidying
      • tidyr package
      • broom package
      • Pandas
    • Static Visualization
      • Base R
      • ggplot2
      • Matplotlib
      • Seaborn
      • plotnine
    • Interactive Visualization
      • ggvis
      • rbokeh
      • plotly
      • TrelliscopeJS (Big Data)
    • Modelling
      • Supervised
        • Linear + Linear mixed effect models
        • Logit models (binary, multinominal classification and ordered) & Mixed effect models
        • Survival Analysis [non-parametric, semi-parametric and full parametric models]
        • Tree based models (classification and regression)
        • naive bayes classifier (Probabilistic models)
        • k-nearest neighbour (classification)
        • Ensemble learners (Boot strap aggregation, random forest, Boosting, Extreme gradient boosting)
        • Support Vector Machines
        • Neural Networks using Keras and Tensor Flow
          • shalow Neural Network (nntool, neuralnet packages)
          • Deep Neural Network (h2o, Keras, MXNet packages etc.)
        • Auto ML (h2o package)
      • Unsupervised
        • Clustering
          • K-means
          • Hirarchical
          • Model based
          • Density Based
        • Association Analysis and Sequence Mining
        • Dimension Reduction
          • Principal Component Analysis
          • Multidimensional Scaling
          • Singular Value Decomposition
          • Non-linear dimension reduction (ISOMAP and Locally Linear Embeding)
    • Model Evaluation
      • Contigency Table
      • Cross Validation
      • Performance metrices (Metrics package)
      • ROCR Curve
      • F-measure
      • Hyperparameter tuning using
        • caret
        • mlr
        • H2O
        • Scikit-learn
        • Pycaret
      • Interpretation of ML models using lime (Local Interpretable Model-Agnostic Explanations)
      • Interpretation of ML models using SHAP (Shapley Additive Explanations)
  3. Datasets

  4. Python and R codes in the form of scripts & markdown documents

  5. Interactive dashboard using Tableau

  6. Web based application using Streamlit

The table of content

The upcoming tutorials will cover the following.

** TS = Tutorial Series **

  • TS 1. Introduction to R and Python
  • TS 2. Function
  • TS 3. Loading, Data Extracting and Transforming
    • Base R and Python, readr, readxl, pandas
  • TS 4. Data Preparation
    • Base R and Python
  • TS 5. Data Manipulation
    • dplyr, data.table, dfply and pandas
  • TS 6. Visualization using ggplot2
    • Base R
    • ggplot2
    • Plotnine
    • Seaborn
    • Matplotlib
  • TS 7. Map preparation
    • ggmap
    • tmap
    • map using rbokeh
  • TS 8. Interactive ploting
    • ggvis
    • plotly
    • rbokeh
  • TS 9. R Markdown, Shiny and Streamlit
    • rmarkdown (markdown report preparation)
    • shiny (Web application development)
    • Streamlit (Web application development)
  • TS 10. Statistics with R and Python
    • Basic statistics
    • Statistical Tests and Inferences from Data
    • Scipy stats
  • TS11. Supervised Machine learning
    • Regression
    • Classification
  • TS12. Unsupervised Machine learning with R
    • Clustering
    • Association Analysis and Sequence Mining
    • Dimension Reduction

These tutorial videos are a small contribution to the society from my side. **Happy Coding :)

About

Contains code and dataset for "The Researchers' Guide" YouTube channel

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 91.4%
  • HTML 8.3%
  • Other 0.3%