TY - GEN
T1 - Feature selection for Facebook feed ranking system via a group-sparsity-regularized training algorithm
AU - Ni, Xiuyan
AU - Yu, Yang
AU - Wu, Peng
AU - Li, Youlin
AU - Nie, Shaoliang
AU - Que, Qichao
AU - Chen, Chao
N1 - Publisher Copyright:
© 2019 Copyright held by the owner/author(s). Publication rights licensed to ACM.
PY - 2019/11/3
Y1 - 2019/11/3
N2 - In modern production platforms, large scale online learning models are applied to data of very high dimension. To save computational resource, it is important to have an efficient algorithm to select the most significant features from an enormous feature pool. In this paper, we propose a novel neural-network-suitable feature selection algorithm, which selects important features from the input layer during training. Instead of directly regularizing the training loss, we inject group-sparsity regularization into the (stochastic) training algorithm. In particular, we introduce a group sparsity norm into the proximally regularized stochastical gradient descent algorithm. To fully evaluate the practical performance, we apply our method to Facebook News Feed dataset, and achieve favorable performance compared with state-of-the-arts using traditional regularizers.
AB - In modern production platforms, large scale online learning models are applied to data of very high dimension. To save computational resource, it is important to have an efficient algorithm to select the most significant features from an enormous feature pool. In this paper, we propose a novel neural-network-suitable feature selection algorithm, which selects important features from the input layer during training. Instead of directly regularizing the training loss, we inject group-sparsity regularization into the (stochastic) training algorithm. In particular, we introduce a group sparsity norm into the proximally regularized stochastical gradient descent algorithm. To fully evaluate the practical performance, we apply our method to Facebook News Feed dataset, and achieve favorable performance compared with state-of-the-arts using traditional regularizers.
KW - Deep neural networks
KW - Feature selection
KW - Group sparsity
KW - Online learning
KW - Proximal regularization
UR - https://www.scopus.com/pages/publications/85075438326
U2 - 10.1145/3357384.3358114
DO - 10.1145/3357384.3358114
M3 - Conference contribution
AN - SCOPUS:85075438326
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 2085
EP - 2088
BT - CIKM 2019 - Proceedings of the 28th ACM International Conference on Information and Knowledge Management
PB - Association for Computing Machinery
T2 - 28th ACM International Conference on Information and Knowledge Management, CIKM 2019
Y2 - 3 November 2019 through 7 November 2019
ER -