TY - GEN
T1 - Does It Pay to Optimize AUC?
AU - Zhou, Baojian
AU - Skiena, Steven
N1 - Publisher Copyright:
Copyright © 2023, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2023/6/27
Y1 - 2023/6/27
N2 - The Area Under the ROC Curve (AUC) is an important model metric for evaluating binary classifiers, and many algorithms have been proposed to optimize AUC approximately. It raises the question of whether the generally insignificant gains observed by previous studies are due to inherent limitations of the metric or the inadequate quality of optimization. To better understand the value of optimizing for AUC, we present an efficient algorithm, namely AUC-opt, to find the provably optimal AUC linear classifier in R2, which runs in O(n+n- log(n+n-)) where n+ and n- are the number of positive and negative samples respectively. Furthermore, it can be naturally extended to Rd in O((n+n-)d-1 log(n+n-)) by calling AUC-opt in lower-dimensional spaces recursively. We prove the problem is NP-complete when d is not fixed, reducing from the open hemisphere problem. Experiments show that compared with other methods, AUC-opt achieves statistically significant improvements on between 17 to 40 in R2 and between 4 to 42 in R3 of 50 t-SNE training datasets. However, generally the gain proves insignificant on most testing datasets compared to the best standard classifiers. Similar observations are found for nonlinear AUC methods under real-world datasets.
AB - The Area Under the ROC Curve (AUC) is an important model metric for evaluating binary classifiers, and many algorithms have been proposed to optimize AUC approximately. It raises the question of whether the generally insignificant gains observed by previous studies are due to inherent limitations of the metric or the inadequate quality of optimization. To better understand the value of optimizing for AUC, we present an efficient algorithm, namely AUC-opt, to find the provably optimal AUC linear classifier in R2, which runs in O(n+n- log(n+n-)) where n+ and n- are the number of positive and negative samples respectively. Furthermore, it can be naturally extended to Rd in O((n+n-)d-1 log(n+n-)) by calling AUC-opt in lower-dimensional spaces recursively. We prove the problem is NP-complete when d is not fixed, reducing from the open hemisphere problem. Experiments show that compared with other methods, AUC-opt achieves statistically significant improvements on between 17 to 40 in R2 and between 4 to 42 in R3 of 50 t-SNE training datasets. However, generally the gain proves insignificant on most testing datasets compared to the best standard classifiers. Similar observations are found for nonlinear AUC methods under real-world datasets.
UR - https://www.scopus.com/pages/publications/85168253785
U2 - 10.1609/aaai.v37i9.26349
DO - 10.1609/aaai.v37i9.26349
M3 - Conference contribution
AN - SCOPUS:85168253785
T3 - Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023
SP - 11408
EP - 11416
BT - AAAI-23 Technical Tracks 9
A2 - Williams, Brian
A2 - Chen, Yiling
A2 - Neville, Jennifer
PB - AAAI Press
T2 - 37th AAAI Conference on Artificial Intelligence, AAAI 2023
Y2 - 7 February 2023 through 14 February 2023
ER -