Journal article
Statistical model selection with ‘Big Data’
- Abstract:
- Big Data offer potential benefits for statistical modelling, but confront problems including an excess of false positives, mistaking correlations for causes, ignoring sampling biases and selecting by inappropriate methods. We consider the many important requirements when searching for a data-based relationship using Big Data, and the possible role of Autometrics in that context. Paramount considerations include embedding relationships in general initial models, possibly restricting the number of variables to be selected over by non-statistical criteria (the formulation problem), using good quality data on all variables, analyzed with tight significance levels by a powerful selection procedure, retaining available theory insights (the selection problem) while testing for relationships being well specified and invariant to shifts in explanatory variables (the evaluation problem), using a viable approach that resolves the computational problem of immense numbers of possible models.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, 1.3MB, Terms of use)
-
- Publisher copy:
- 10.1080/23322039.2015.1045216
Authors
- Publisher:
- Taylor and Francis
- Journal:
- Cogent Economics and Finance More from this journal
- Volume:
- 3
- Issue:
- 1
- Article number:
- 1045216
- Publication date:
- 2015-05-22
- Acceptance date:
- 2015-04-01
- DOI:
- EISSN:
-
2332-2039
- Language:
-
English
- Keywords:
- UUID:
-
uuid:eea578aa-9c22-410e-8a5a-39f9801ba1f8
- Deposit date:
-
2015-03-31
Terms of use
- Copyright holder:
- Doornik and Hendry
- Copyright date:
- 2015
- Rights statement:
- © 2015 The Author(s). This open access article is distributed under a Creative Commons Attribution (CC-BY) 4.0 license.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record