Journal article
On the selection of the globally optimal prototype subset for Nearest-Neighbor classification
- Abstract:
- The nearest-neighbor classifier has been shown to be a powerful tool for multiclass classification. We explore both theoretical properties and empirical behavior of a variant method, in which the nearest-neighbor rule is applied to a reduced set of prototypes. This set is selected a priori by fixing its cardinality and minimizing the empirical misclassification cost. In this way we alleviate the two serious drawbacks of the nearest-neighbor method: high storage requirements and time-consuming queries. Finding this reduced set is shown to be NP-hard. We provide mixed integer programming (MIP) formulations, which are theoretically compared and solved by a standard MIP solver for small problem instances. We show that the classifiers derived from these formulations are comparable to benchmark procedures. We solve large problem instances by a metaheuristic that yields good classification rules in reasonable time. Additional experiments indicate that prototype-based nearest-neighbor classifiers remain quite stable in the presence of missing values.
Actions
Access Document
- Files:
-
-
(Preview, pdf, 159.1KB, Terms of use)
-
Authors
- Publication date:
- 2007-01-01
- UUID:
-
uuid:ed890122-5dfc-4cda-b783-b1350b909dc9
- Local pid:
-
oai:eureka.sbs.ox.ac.uk:425
- Deposit date:
-
2011-05-19
- ARK identifier:
Terms of use
- Copyright date:
- 2007
If you are the owner of this record, you can report an update to it here: Report update to this record