In Python, there are several libraries available for scaling data, commonly used in machine learning and data preprocessing. These libraries help ensure that features are on a similar scale, which matters for distance-based and gradient-based algorithms. Here are the top libraries for scaling data:
While NumPy doesn't have specialized functions for scaling, it allows you to perform custom scaling using simple array operations.
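For instance, here is a minimal sketch (the sample values are made up) of min-max scaling and z-score standardization written as plain vectorized expressions:

```python
import numpy as np

# Invented sample data: one feature per column
data = np.array([[1.0, 200.0],
                 [2.0, 300.0],
                 [3.0, 400.0]])

# Min-max scaling to [0, 1], computed column-wise
scaled = (data - data.min(axis=0)) / (data.max(axis=0) - data.min(axis=0))

# Z-score standardization: zero mean, unit variance per column
standardized = (data - data.mean(axis=0)) / data.std(axis=0)
```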
Although Pandas does not provide dedicated scaling functions, you can scale data using basic operations on DataFrames.
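A short sketch, again with invented sample values, showing column-wise scaling through ordinary DataFrame arithmetic:

```python
import pandas as pd

df = pd.DataFrame({"age": [25, 32, 47],
                   "income": [40_000, 55_000, 90_000]})

# Min-max scaling per column
df_minmax = (df - df.min()) / (df.max() - df.min())

# Z-score standardization per column
# (note: pandas .std() uses the sample standard deviation, ddof=1)
df_std = (df - df.mean()) / df.std()
```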
The scikit-learn library provides several utilities for scaling and normalizing data. It is one of the most widely used libraries for machine learning in Python. Its main scalers are listed below (see the sketch after this list):
- StandardScaler: Standardizes features by removing the mean and scaling to unit variance (Z-score normalization).
- MinMaxScaler: Scales features to a given range, usually between 0 and 1.
- MaxAbsScaler: Scales each feature by its maximum absolute value (useful for data that is already centered at zero).
- RobustScaler: Scales features using statistics that are robust to outliers.
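All of these share the same fit/transform API. A quick sketch with a made-up sample array, using StandardScaler and MinMaxScaler:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler, MinMaxScaler

X = np.array([[1.0, 200.0],
              [2.0, 300.0],
              [3.0, 400.0]])

# Fit learns the scaling parameters (mean/variance here);
# the same fitted scaler can then be reused on new data.
scaler = StandardScaler()
X_standardized = scaler.fit_transform(X)

minmax = MinMaxScaler(feature_range=(0, 1))
X_minmax = minmax.fit_transform(X)
```

In practice, you fit the scaler on the training set only and call `transform` on the test set, so that test data does not leak into the scaling parameters.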
For deep learning tasks, TensorFlow has built-in utilities to preprocess and scale data, especially for image processing.
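A minimal sketch, assuming TensorFlow 2.x with Keras preprocessing layers (the sample tensor is invented): Rescaling maps pixel values into a target range, and Normalization learns per-feature statistics from data via `adapt()`.

```python
import tensorflow as tf

# Maps image pixel values from [0, 255] to [0, 1]
rescale = tf.keras.layers.Rescaling(1.0 / 255)

# Learns per-feature mean and variance, then applies
# (x - mean) / sqrt(variance) when called
normalizer = tf.keras.layers.Normalization(axis=-1)
sample = tf.constant([[1.0, 200.0], [2.0, 300.0], [3.0, 400.0]])
normalizer.adapt(sample)          # compute the statistics
normalized = normalizer(sample)   # apply the standardization
```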
Similar to TensorFlow, PyTorch provides transformations for scaling and normalizing image data, primarily through the torchvision.transforms module.
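A short sketch of a typical image pipeline; the mean/std values below are the commonly used ImageNet statistics, not something mandated by torchvision:

```python
from torchvision import transforms

# ToTensor scales pixel values to [0, 1];
# Normalize then applies (x - mean) / std per channel
preprocess = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])
```

The resulting `preprocess` callable is typically passed as the `transform` argument of a torchvision dataset so that every image is scaled on load.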
Feature-engine is a newer library focused on feature engineering, and it includes tools for scaling. It allows for more flexibility in handling missing data and applying scalers only to selected features.
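A sketch assuming Feature-engine's SklearnTransformerWrapper, which applies a scikit-learn scaler only to the listed columns and leaves the rest of the DataFrame untouched (the sample values are made up):

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler
from feature_engine.wrappers import SklearnTransformerWrapper

df = pd.DataFrame({
    "age": [25, 32, 47],
    "income": [40_000, 55_000, 90_000],
    "city": ["NY", "SF", "LA"],   # non-numeric column, left as-is
})

# Scale only the selected numeric variables
scaler = SklearnTransformerWrapper(
    transformer=StandardScaler(),
    variables=["age", "income"],
)
df_scaled = scaler.fit_transform(df)
```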
Each of these libraries offers different utilities based on specific use cases, with scikit-learn being the most popular for general machine learning tasks.