Graph-Based Approaches for Over-sampling in the context of Ordinal Regression

Hits: 7687
Research areas:
Year:
2015
Type of Publication:
Article
Keywords:
Over-sampling, imbalanced classification, ordinal regression, ordinal classification
Authors:
Journal:
IEEE Transactions on Knowledge and Data Engineering
Volume:
27
Number:
5
Pages:
1233-1245
Month:
May
ISSN:
1041-4347
BibTex:
Note:
JCR(2015): 2.476 Position: 17/143 (Q1) Category: COMPUTER SCIENCE, INFORMATION SYSTEMS
Abstract:
The classification of patterns into naturally ordered labels is referred to as ordinal regression or ordinal classification. Usually, this classification setting is by nature highly imbalanced, because there are classes in the problem that are a priori more probable than others. Although standard over-sampling methods can improve the classification of minority classes in ordinal classification, they tend to introduce severe errors in terms of the ordinal label scale, given that they do not take the ordering into account. A specific ordinal over- sampling method is developed in this paper for the first time in order to improve the performance of machine learning classifiers. The method proposed includes ordinal information by approaching over-sampling from a graph-based perspective. The results presented in this paper show the good synergy of a popular ordinal regression method (a reformulation of support vector machines) with the graph- based proposed algorithms, and the possibility of improving both the classification and the ordering of minority classes. A cost-sensitive version of the ordinal regression method is also introduced and compared with the over-sampling proposals, showing in general lower performance for minority classes.
Comments:
JCR(2015): 2.476 Position: 17/143 (Q1) Category: COMPUTER SCIENCE, INFORMATION SYSTEMS
Back