TY - JOUR
T1 - ARLP
T2 - Automatic multi-agent transformer reinforcement learning pruner for one-shot neural network pruning
AU - Guo, Bowen
AU - Chang, Xiang
AU - Chao, Fei
AU - Zheng, Xiawu
AU - Lin, Chih Min
AU - Chen, Yanjie
AU - Shang, Changjing
AU - Shen, Qiang
N1 - Publisher Copyright:
© 2024 Elsevier B.V.
PY - 2024/6/27
Y1 - 2024/6/27
N2 - Overparameterized Neural Networks demonstrate state-of-the-art performance; however, the escalating demand for more compact and energy-efficient neural networks has arisen to facilitate the deployment of machine learning applications on devices with limited computational resources. A prevalent approach employs various pruning techniques. However, hyperparameters, such as the pruning ratio for each layer in pruning techniques, are typically set by human experts and often lack optimization. In this paper, we therefore propose a novel method named “Automatic Multi-Agent Transformer Reinforcement Learning Pruner” (ARLP). ARLP leverages a transformer-based multi-agent reinforcement learning controller to autonomously prune networks at initialization by extracting network meta-features. This autonomous process eliminates the need for human intervention in determining the optimal pruning ratio for each layer. The meta-features are derived from various zero-cost pruning-at-initialization proxies to perform One-shot Pruning. Extensive experimental results demonstrate that ARLP outperforms other state-of-the-art methods, establishing its efficacy in achieving superior performance.
AB - Overparameterized Neural Networks demonstrate state-of-the-art performance; however, the escalating demand for more compact and energy-efficient neural networks has arisen to facilitate the deployment of machine learning applications on devices with limited computational resources. A prevalent approach employs various pruning techniques. However, hyperparameters, such as the pruning ratio for each layer in pruning techniques, are typically set by human experts and often lack optimization. In this paper, we therefore propose a novel method named “Automatic Multi-Agent Transformer Reinforcement Learning Pruner” (ARLP). ARLP leverages a transformer-based multi-agent reinforcement learning controller to autonomously prune networks at initialization by extracting network meta-features. This autonomous process eliminates the need for human intervention in determining the optimal pruning ratio for each layer. The meta-features are derived from various zero-cost pruning-at-initialization proxies to perform One-shot Pruning. Extensive experimental results demonstrate that ARLP outperforms other state-of-the-art methods, establishing its efficacy in achieving superior performance.
KW - Automated machine learning
KW - Multi-agent reinforcement learning
KW - Neural network pruning
KW - Transformer
UR - http://www.scopus.com/inward/record.url?scp=85196961198&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2024.112122
DO - 10.1016/j.knosys.2024.112122
M3 - Article
AN - SCOPUS:85196961198
SN - 0950-7051
VL - 300
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
M1 - 112122
ER -