Journal article icon

Journal article

Robust regression via error tolerance

Abstract:
AbstractReal-world datasets are often characterised by outliers; data items that do not follow the same structure as the rest of the data. These outliers might negatively influence modelling of the data. In data analysis it is, therefore, important to consider methods that are robust to outliers. In this paper we develop a robust regression method that finds the largest subset of data items that can be approximated using a sparse linear model to a given precision. We show that this can yield the best possible robustness to outliers. However, this problem is NP-hard and to solve it we present an efficient approximation algorithm, termed SLISE. Our method extends existing state-of-the-art robust regression methods, especially in terms of speed on high-dimensional datasets. We demonstrate our method by applying it to both synthetic and real-world regression problems.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1007/s10618-022-00819-2

Authors

More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0000-0002-7749-2918
More by this author
Role:
Author
ORCID:
0000-0002-4040-6967
More by this author
Role:
Author
ORCID:
0000-0002-9623-6282
More by this author
Role:
Author
ORCID:
0000-0001-9769-7163
More by this author
Role:
Author
ORCID:
0000-0003-1819-1047


Publisher:
Springer
Journal:
Data Mining and Knowledge Discovery More from this journal
Volume:
36
Issue:
2
Pages:
781-810
Publication date:
2022-01-27
DOI:
EISSN:
1573-756X
ISSN:
1384-5810


Language:
English
Keywords:
Pubs id:
2363382
Local pid:
pubs:2363382
Source identifiers:
W4210813465
Deposit date:
2026-01-23
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP