machine-learning regression statistics

Definition

Polynomial Regression

Polynomial regression is a generalisation of linear regression in which the relationship between the independent variable $x$ and the dependent variable $y$ is modelled as an $n$-th degree polynomial. Although the relationship is non-linear in $x$, the model remains linear in its parameters $\theta$.

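Formally, the hypothesis is:

$$h_\theta(x) = \theta^\top \phi(x) = \sum_{j=0}^{n} \theta_j x^j$$

where $\phi(x) = [1, x, x^2, \dots, x^n]^\top$ is the feature transformation.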
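A minimal sketch of this idea in Python, assuming NumPy is available: each scalar input is mapped to the powers 1, x, ..., x^n, and the parameters are then found by ordinary least squares on the transformed features. The function names (`polynomial_features`, `fit_polynomial`, `predict`) and the toy data are illustrative choices, not part of the original note.

```python
import numpy as np

def polynomial_features(x, degree):
    """Map each scalar x to the row [1, x, x^2, ..., x^degree]."""
    x = np.asarray(x, dtype=float)
    return np.vander(x, N=degree + 1, increasing=True)

def fit_polynomial(x, y, degree):
    """Least-squares estimate of the parameters; linear in the features."""
    Phi = polynomial_features(x, degree)
    theta, *_ = np.linalg.lstsq(Phi, np.asarray(y, dtype=float), rcond=None)
    return theta

def predict(x, theta):
    """Evaluate the fitted polynomial at x."""
    return polynomial_features(x, len(theta) - 1) @ theta

# Toy data (assumed for illustration): a noise-free quadratic.
x = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])
y = 1.0 + 2.0 * x + 3.0 * x**2

theta = fit_polynomial(x, y, degree=2)
print(theta)  # coefficients ordered from the constant term upward
```

Because the non-linearity lives entirely in the feature map, the fitting step is the same linear least-squares problem as in ordinary linear regression.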

Model Complexity

The degree of the polynomial determines the capacity of the learner. Lower degrees may result in underfitting (high bias), while excessively high degrees lead to overfitting (high variance) by capturing noise in the training data.
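The trade-off can be explored with a small experiment. The sketch below assumes NumPy, a synthetic sine-shaped target with Gaussian noise, and the degrees 1, 3 and 9. All of these are illustrative choices. It fits each degree by least squares and reports the mean squared error on the training points and on a separate validation sample.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so the run is repeatable

def true_fn(x):
    # Assumed ground-truth function, for illustration only.
    return np.sin(2 * np.pi * x)

# Small noisy training set and a larger validation set.
x_train = rng.uniform(0.0, 1.0, size=12)
y_train = true_fn(x_train) + rng.normal(0.0, 0.2, size=12)
x_val = rng.uniform(0.0, 1.0, size=200)
y_val = true_fn(x_val) + rng.normal(0.0, 0.2, size=200)

def mse_for_degree(degree):
    """Fit a polynomial of the given degree; return (train MSE, validation MSE)."""
    Phi_train = np.vander(x_train, N=degree + 1, increasing=True)
    theta, *_ = np.linalg.lstsq(Phi_train, y_train, rcond=None)
    Phi_val = np.vander(x_val, N=degree + 1, increasing=True)
    train_mse = np.mean((Phi_train @ theta - y_train) ** 2)
    val_mse = np.mean((Phi_val @ theta - y_val) ** 2)
    return train_mse, val_mse

results = {d: mse_for_degree(d) for d in (1, 3, 9)}
for d, (train_mse, val_mse) in results.items():
    print(f"degree {d}: train MSE {train_mse:.4f}, validation MSE {val_mse:.4f}")
```

Training error cannot increase with the degree, because each lower-degree model is a special case of the higher-degree one. Validation error typically falls and then rises again as the fit starts to follow the noise, though the exact figures depend on the seed and sample sizes.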