Skip to main content icon/video/no-internet

Relative importance of regressor variables is an old topic that still awaits a satisfactory solution. When interest is in attributing importance in linear regression, averaging over orderings methods for decomposing R2 are among the state-of-the-art methods, although the mechanism behind their behavior is not (yet) completely understood. Random forestsa machine-learning tool for classification and regression proposed a few years agohave an inherent procedure of producing variable importances. This article compares the two approaches (linear model on the one hand and two versions of random forests on the other hand) and finds both striking similarities and differences, some of which can be explained whereas others remain a challenge. The investigation improves understanding of the nature of variable importance in random forests.

Variable Importance Assessment in Regression: Linear Regression versus Random Forest’, UlrikeGrömping The American Statistician, 63 (4) (2009): 308–319. Copyright 2009. Reproduced with permission of Taylor & Francis Informa UK Ltd - Journals in the format Textbook via Copyright Clearance Center.
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading