Tansu Dasli edited this page Sep 18, 2023 · 25 revisions

general ML steps

  • Understanding data
- outliers           | IQR, anomaly detection
- relations          | statistical tests
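
A minimal sketch of the IQR rule for the outliers step, using only the standard library. The function name `iqr_outliers`, the sample data, and the conventional 1.5 multiplier are assumptions for illustration, not part of the original notes.

```python
from statistics import quantiles

def iqr_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles(values, n=4)  # quartile cut points
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]

# Hypothetical sample: one reading is far from the rest.
readings = [10, 12, 11, 13, 12, 11, 14, 13, 12, 95]
suspects = iqr_outliers(readings)
```

Flagged values are candidates for review, not automatic deletions; whether to drop, cap, or keep them depends on the data.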
  • Preprocessing
- handling missing, null, invalid, and duplicate values
- feature scaling    | standardization vs normalization
- feature selection  |
- feature extraction | dimension reduction      | PCA, SVD
- encoding           | dummy (one-hot) variables for categorical fields
- discretization     | binning continuous fields
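
A rough sketch of three of the preprocessing steps listed above: scaling (standardization vs min-max normalization), encoding, and discretization. All function names are hypothetical, and the code assumes non-constant numeric input (a constant column would divide by zero).

```python
from statistics import mean, pstdev

def standardize(values):
    """Standardization (z-score): result has mean 0 and std 1."""
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]

def min_max_normalize(values):
    """Normalization: rescale linearly into the range [0, 1]."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def one_hot(categories):
    """Encoding: one dummy column per distinct level, in sorted order."""
    levels = sorted(set(categories))
    return [[1 if c == level else 0 for level in levels] for c in categories]

def bin_equal_width(values, n_bins):
    """Discretization: map each value to an equal-width bin index."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    # The maximum value would land in bin n_bins, so clamp it into the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]
```

Standardization keeps the shape of the distribution but changes its scale, while min-max normalization bounds the range and is more sensitive to outliers.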
  • Modeling
- Regression         | supervised   | predict a continuous target
- Classification     | supervised   | predict a categorical target (class label)
- Clustering         | unsupervised | discover groups in unlabeled data (related unsupervised tasks: density estimation, dimension reduction)
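
To make the three task types concrete, here is a toy one-dimensional sketch of each. The specific algorithms (least-squares line fit, 1-nearest-neighbor, k-means) are illustrative choices and the function names are hypothetical; the notes above do not prescribe any particular method.

```python
def fit_line(xs, ys):
    """Regression: least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    covariance = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    variance = sum((x - mx) ** 2 for x in xs)
    slope = covariance / variance
    return slope, my - slope * mx

def nearest_neighbor(train, query):
    """Classification: return the label of the closest training value.

    train is a list of (value, label) pairs.
    """
    return min(train, key=lambda pair: abs(pair[0] - query))[1]

def kmeans_1d(values, centers, iterations=10):
    """Clustering: k-means (Lloyd's algorithm) from given initial centers."""
    for _ in range(iterations):
        groups = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            groups[nearest].append(v)
        # An empty group keeps its previous center.
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers
```

The first two functions need labeled examples (supervised), while `kmeans_1d` receives only the values and finds the groups itself (unsupervised).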