BIGpedia.com - Bootstrap Aggregating - Encyclopedia and Dictionary Online
encyclopedia search

Bootstrap Aggregating

Bootstrap Aggregating (Bagging) is a meta-algorithm to improve classification and regression models in terms of stability and classification accuracy. Bagging also reduces variance and helps to avoid over-fitting. Although this method is usually applied to decision tree models, it is not limited to any model type. Bagging is a special case of model averaging approach.

Given a standard training set D of size N, we generate L new training sets Di also of size N by sampling examples uniformly from D, and with replacement. By sampling with replacement it is likely that some examples will be repeated in each Di. This kind of sample is known as a bootstrap sample. The L models are fitted using the above L bootstrap samples and combined by averaging the output (in case of regression) or voting (in case of classification)

References

Leo Breiman. Bagging predictors. Machine Learning, 24(2):123­140, 1996.

See also



The contents of this article are licensed from Wikipedia.org under the GNU Free Documentation License.
How to see transparent copy

01-04-2007 01:21:04