Classification
AI/ML Model Development and Risk Management
Overview
Bagging, short for bootstrap aggregating, is an ensemble machine learning technique designed to improve model stability and accuracy by reducing variance. It works by training multiple versions of a predictor, each on a different bootstrap sample of the original dataset (a random sample drawn with replacement, typically the same size as the original), and then aggregating their predictions, commonly by majority vote for classification or averaging for regression. Bagging is especially effective for high-variance, low-bias models such as decision trees. A prominent example is the random forest algorithm, which combines bagging with random feature selection at each split of its decision trees. While bagging can yield significant performance gains and robustness against overfitting, it increases computational cost and offers little benefit for models that already have low variance. Bagging also does not address model bias and may underperform when data is highly imbalanced or contains systematic errors.
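The following is a minimal sketch of bagging with scikit-learn. The synthetic dataset and hyperparameters (50 trees, default bootstrap sample size) are illustrative assumptions, not a recommended configuration.

```python
# Minimal bagging sketch: decision trees on bootstrap samples, majority vote.
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic data stands in for a real, governed dataset.
X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Each of the 50 trees is fit on a bootstrap sample (drawn with replacement)
# of the training set; class predictions are combined by majority vote.
bagged = BaggingClassifier(
    DecisionTreeClassifier(),  # high-variance base learner, passed positionally
    n_estimators=50,
    bootstrap=True,
    random_state=0,
)
bagged.fit(X_train, y_train)
print(f"Ensemble accuracy: {bagged.score(X_test, y_test):.3f}")
```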
Governance Context
In AI governance, bagging intersects with model risk management and documentation obligations. Under the EU AI Act, for example, providers of high-risk AI systems must operate a risk management system (Article 9) and maintain technical documentation of the model development process (Article 11 and Annex IV), which would cover ensemble methods such as bagging, to support transparency and traceability. The US NIST AI Risk Management Framework (AI RMF) recommends controls such as regular model performance evaluation and bias impact assessments, which should be applied to bagged models. Organizations should also monitor aggregation methods for unintended amplification of bias or errors and document the rationale for their ensemble strategy. Two concrete obligations include: (1) maintaining comprehensive documentation of ensemble method selection and implementation, and (2) conducting periodic bias and performance assessments on both the individual base models and the aggregated ensemble, as sketched below. These controls help ensure that ensemble methods like bagging are used responsibly and that their limitations are understood and managed.
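A hedged sketch of obligation (2) follows, comparing the performance of the individual base estimators against the aggregated ensemble. It reuses the `bagged`, `X_test`, and `y_test` objects from the sketch above, and the choice of accuracy as the assessment metric is an illustrative assumption.

```python
# Assess both individual and aggregated models, per obligation (2).
import numpy as np
from sklearn.metrics import accuracy_score

# Score each base estimator on the feature indices it was trained with.
individual_scores = [
    accuracy_score(y_test, est.predict(X_test[:, feats]))
    for est, feats in zip(bagged.estimators_, bagged.estimators_features_)
]
ensemble_score = bagged.score(X_test, y_test)

print(f"Base estimators: mean={np.mean(individual_scores):.3f}, "
      f"min={np.min(individual_scores):.3f}")
print(f"Aggregated ensemble: {ensemble_score:.3f}")
# A gap between weaker individual trees and a stronger ensemble is expected
# with bagging; a *degrading* ensemble score over time should trigger review.
```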
Ethical & Societal Implications
Bagging can improve the reliability of AI systems, but if the underlying data is biased or incomplete, the ensemble may reinforce or amplify those issues, potentially leading to unfair or unsafe outcomes. Aggregation can also reduce transparency by making individual predictions harder to interpret, a concern for explainability and accountability. Furthermore, the additional computational resources bagging requires may have environmental and accessibility implications. Governance must ensure that bagging is applied with careful attention to data representativeness and that ensemble decisions remain auditable. Organizations must also ensure that bagged models do not inadvertently exclude minority populations or propagate historical inequities; a simple subgroup check is sketched below.
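One way to operationalize that concern is a per-group performance comparison. The sketch below assumes the fitted ensemble from the earlier examples; the `group` array is a hypothetical stand-in for real demographic metadata, which in practice would come from the governed dataset.

```python
# Illustrative subgroup check: compare ensemble accuracy across a
# (hypothetical) protected-group indicator to surface disparate performance.
import numpy as np

rng = np.random.default_rng(0)
group = rng.integers(0, 2, size=len(y_test))  # placeholder group labels

preds = bagged.predict(X_test)
for g in np.unique(group):
    mask = group == g
    acc = (preds[mask] == y_test[mask]).mean()
    print(f"group {g}: accuracy={acc:.3f}, n={mask.sum()}")
# Material gaps between groups warrant investigating data representativeness
# before the bagged model is deployed or retained.
```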
Key Takeaways
- Bagging reduces model variance by aggregating predictions from bootstrapped samples of the training data.
- It is most effective for high-variance, low-bias models such as decision trees.
- Governance frameworks require documentation and monitoring of ensemble methods.
- Bagging can amplify data bias if not carefully managed and evaluated.
- Transparency and explainability may be reduced in bagged models, posing governance challenges.
- Regular bias and performance assessments are necessary to ensure responsible use.
- Bagging increases computational demands, which may have environmental and operational impacts.