data-analysis

Definition

Standardisation

Standardisation (or z-score normalisation) is a data preprocessing technique that transforms numerical features to have a mean of zero and a standard deviation of one. Formally, for a feature with values $x_1, \dots, x_n$, the transformed value is:

$$z_i = \frac{x_i - \mu}{\sigma}$$

where $\mu$ is the arithmetic mean and $\sigma$ is the standard deviation.
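As a minimal sketch of the transformation, the snippet below standardises a single feature using only the Python standard library. The function name `standardise` and the sample values are illustrative assumptions, and the population standard deviation (dividing by n) is assumed; some tools divide by n - 1 instead.

```python
from statistics import fmean, pstdev


def standardise(values):
    """Return the z-scores of `values` (hypothetical helper).

    Assumes the population standard deviation (divide by n).
    """
    mu = fmean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # A constant feature has no spread, so z-scores are undefined.
        raise ValueError("standard deviation is zero; feature is constant")
    return [(x - mu) / sigma for x in values]


# Illustrative data, not taken from any real dataset.
heights_cm = [150.0, 160.0, 170.0, 180.0, 190.0]
z = standardise(heights_cm)
```

After the call, `z` has mean zero and standard deviation one (up to floating-point error), while the ordering and relative spacing of the original values are preserved.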

Robustness and Assumptions

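Unlike min-max scaling, standardisation does not bound values to a fixed range. It is usually less sensitive to outliers than min-max scaling, whose output range is set entirely by the two most extreme values, but it is not outlier-proof: the mean and standard deviation are themselves pulled by extreme values. Standardisation does not make a feature Gaussian, nor does it require one; it is mainly useful for algorithms that are sensitive to feature scale, such as support vector machines and regularised or gradient-trained linear regression.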
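To see how the two scalers respond to an extreme value, the following sketch applies both to the same small feature. The helper names `min_max_scale` and `z_score` and the data are assumptions made for illustration only.

```python
from statistics import fmean, pstdev


def min_max_scale(values):
    """Map values linearly onto [0, 1] (hypothetical helper)."""
    lo, hi = min(values), max(values)
    return [(x - lo) / (hi - lo) for x in values]


def z_score(values):
    """Standardise values using the population standard deviation."""
    mu, sigma = fmean(values), pstdev(values)
    return [(x - mu) / sigma for x in values]


# Four ordinary values and one outlier (illustrative data).
feature = [1.0, 2.0, 3.0, 4.0, 100.0]

mm = min_max_scale(feature)  # always stays inside [0, 1]
zs = z_score(feature)        # not confined to a fixed range

for raw, a, b in zip(feature, mm, zs):
    print(f"{raw:6.1f}  min-max={a:.3f}  z={b:+.3f}")
```

The min-max output is pinned to [0, 1] by the smallest and largest values, whereas the z-scores fall outside that interval. The outlier still shifts the mean and inflates the standard deviation, so the four ordinary values end up close together under both methods.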

Definition

Independent Feature Scaling

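Independent feature scaling is the principle that statistical scaling operations must be applied individually to each feature dimension of a dataset. Formally, for a dataset $X = \{x_1, \dots, x_n\}$ with $x_i \in \mathbb{R}^d$, the scaling parameters (e.g., $\mu_j, \sigma_j$) are computed for each dimension $j = 1, \dots, d$ independently:

$$\mu_j = \frac{1}{n} \sum_{i=1}^{n} x_{ij}, \qquad \sigma_j = \sqrt{\frac{1}{n} \sum_{i=1}^{n} \left(x_{ij} - \mu_j\right)^2}$$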
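A minimal sketch of per-dimension scaling, assuming the dataset is stored as a list of rows and that no column is constant. The names `fit_column_params` and `transform` are hypothetical, and the population standard deviation is assumed.

```python
from statistics import fmean, pstdev


def fit_column_params(rows):
    """Compute one (mean, std) pair per feature column."""
    columns = list(zip(*rows))  # transpose: rows -> columns
    return [(fmean(col), pstdev(col)) for col in columns]


def transform(rows, params):
    """Standardise each entry with the parameters of its own column.

    Assumes every column has a non-zero standard deviation.
    """
    return [
        [(x - mu) / sigma for x, (mu, sigma) in zip(row, params)]
        for row in rows
    ]


# Two features on very different scales (illustrative data).
data = [[1.0, 100.0], [2.0, 200.0], [3.0, 300.0]]
params = fit_column_params(data)
scaled = transform(data, params)
```

Because each column uses its own mean and standard deviation, the two features become directly comparable after scaling even though their raw magnitudes differ by a factor of 100. Pooling a single mean and standard deviation across both columns would instead let the larger-scale feature dominate.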
