Definition
Posterior Predictive Distribution
The posterior predictive distribution is the distribution over a new, unobserved data point given a set of observed (training) data . Formally, it integrates the likelihood of the new data across the entire parameter space , weighted by the posterior probability of the parameters:
where is the posterior distribution derived from observed data.
Bayesian Prediction
Unlike point estimation methods (MLE or MAP) which select a single optimal , the posterior predictive distribution accounts for the uncertainty in the parameter estimation. It represents the full Bayesian approach to prediction, where every possible model in the hypothesis class contributes to the final output according to its consistency with the observed evidence.