machine-learning unsupervised-learning

Definition

Dimensionality Reduction

Dimensionality reduction is the process of transforming data from a high-dimensional instance space into a lower-dimensional manifold or subspace while preserving essential structural properties. Formally, given a dataset where , the process seeks a mapping function such that .

Methodological Approaches

Feature Selection: The identification and retention of a subset of the original features based on their relevance or information gain.

Feature Projection: The construction of a new latent feature space via linear transformations (e.g., PCA) or non-linear architectures (e.g., Autoencoders).

Objectives

The primary goals are to mitigate the curse of dimensionality, improve computational efficiency during training, and facilitate the visualisation of high-dimensional distributions while maintaining intrinsic information content.