Journal article

Hierarchical testing of variable importance

Abstract:
A frequently encountered challenge in high-dimensional regression is the detection of relevant variables. Variable selection suffers from instability and the power to detect relevant variables is typically low if predictor variables are highly correlated. When taking the multiplicity of the testing problem into account, the power diminishes even further. To gain power and insight, it can be advantageous to look for influence not at the level of individual variables but rather at the level of clusters of highly correlated variables. We propose a hierarchical approach. Variable importance is first tested at the coarsest level, corresponding to the global null hypothesis. The method then tries to attribute any effect to smaller subclusters or even individual variables. The smallest possible clusters, which still exhibit a significant influence on the response variable, are retained. It is shown that the proposed testing procedure controls the familywise error rate at a prespecified level, simultaneously over all resolution levels. The method has power comparable to the Bonferroni-Holm procedure on the level of individual variables and dramatically larger power for coarser resolution levels. The best resolution level is selected adaptively. © 2008 Biometrika Trust.
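The top-down procedure described in the abstract can be illustrated with a minimal sketch. The abstract does not state how p-values are computed or adjusted, so everything here is an assumption made for illustration: the `Cluster` class, the `hierarchical_test` function, the user-supplied `raw_p` dictionary, and the Bonferroni-style weighting (total number of variables divided by cluster size, with a child's adjusted p-value never falling below its parent's). It should not be read as the article's exact procedure.

```python
from dataclasses import dataclass, field
from typing import Dict, FrozenSet, List


@dataclass
class Cluster:
    """A node in a cluster hierarchy over variable indices (hypothetical structure)."""
    variables: FrozenSet[int]
    children: List["Cluster"] = field(default_factory=list)


def hierarchical_test(
    root: Cluster,
    raw_p: Dict[FrozenSet[int], float],
    alpha: float = 0.05,
) -> List[FrozenSet[int]]:
    """Return the smallest clusters that remain significant at level alpha.

    raw_p maps each cluster's variable set to an unadjusted p-value for the
    null hypothesis that no variable in that cluster influences the response.
    """
    total = len(root.variables)
    retained: List[FrozenSet[int]] = []

    def visit(node: Cluster, inherited: float) -> bool:
        # Assumed adjustment: scale by total / cluster size, cap at 1,
        # and never go below the adjusted p-value of an ancestor.
        adjusted = min(1.0, raw_p[node.variables] * total / len(node.variables))
        adjusted = max(adjusted, inherited)
        if adjusted > alpha:
            return False  # not significant: do not descend further
        # Test every child; the list is built fully before any() is applied.
        child_significant = [visit(child, adjusted) for child in node.children]
        if not any(child_significant):
            # No finer cluster is significant, so this one is retained.
            retained.append(node.variables)
        return True

    visit(root, 0.0)  # the root corresponds to the global null hypothesis
    return retained
```

Testing starts at the root, which plays the role of the global null hypothesis, and a branch is abandoned as soon as a cluster fails to reach significance. A cluster is reported only when none of its children is significant, which mirrors the abstract's rule of retaining the smallest clusters that still show a significant effect.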
Publication status:
Published

Publisher copy:
10.1093/biomet/asn007

Authors


Institution:
University of Oxford
Division:
MPLS
Department:
Statistics
Role:
Author


Journal:
BIOMETRIKA
Volume:
95
Issue:
2
Pages:
265-278
Publication date:
2008-06-01
DOI:
10.1093/biomet/asn007
EISSN:
1464-3510
ISSN:
0006-3444


Pubs id:
pubs:97749
UUID:
uuid:b3bb4e46-4db2-4c9d-afc1-52b99104d988
Local pid:
pubs:97749
Source identifiers:
97749
Deposit date:
2012-12-19
