Learning an Efficient Optimizer via Hybrid-Policy Sub-Trajectory Balance

Guan, Yunchuan, Liu, Yu, Zhou, Ke, Li, Hui, Jia, Sen, Shen, Zhiqi, Wang, Ziyang, Zhang, Xinglin, Chen, Tao, Hwang, Jenq-Neng and Li, Lei (2025). Learning an Efficient Optimizer via Hybrid-Policy Sub-Trajectory Balance. IN: ECAI 2025: 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy. Lynce, Ines; Murano, Nello; Vallati, Mauro; Villata, Serena; Chesani, Federico; Milano, Michela; Omicini, Andrea and Dastani, Mehdi (eds) Frontiers in Artificial Intelligence and Applications (FAIA), 413 . IOP Press.

Abstract

Recent advances in generative modeling enable neural networks to generate weights without relying on gradient-based optimization. However, current methods are limited by issues of over-coupling and long-horizon. The former tightly binds weight generation with task-specific objectives, thereby limiting the flexibility of the learned optimizer. The latter leads to inefficiency and low accuracy during inference, caused by the lack of local constraints. In this paper, we propose Lo-Hp, a decoupled two-stage weight generation framework that enhances flexibility through learning various optimization policies. It adopts a hybrid-policy sub-trajectory balance objective, which integrates on-policy and off-policy learning to capture local optimization policies. Theoretically, we demonstrate that learning solely local optimization policies can address the long-horizon issue while enhancing the generation of global optimal weights. In addition, we validate Lo-Hp’s superior accuracy and inference efficiency in tasks that require frequent weight updates, such as transfer learning, few-shot learning, domain generalization, and large language model adaptation.

Publication DOI:	https://doi.org/10.3233/FAIA251421
Divisions:	College of Engineering & Physical Sciences College of Engineering & Physical Sciences > School of Computer Science and Digital Technologies Aston University (General)
Funding Information:	This work was supported by the China Scholarship Council (CSC) under Grant No. 202406160071, the Pioneer Centre for AI, DNRF grant number P1, the National Key Research and Development Program of China under Grant No. 2023YFB4502701, and the National Natural Science Foundation of China under Grant No. 62232007.
Additional Information:	Copyright © 2025 The Authors. This article is published online with Open Access by IOS Press and distributed under the terms of the Creative Commons Attribution Non-Commercial License 4.0 (CC BY-NC 4.0).
ISBN:	9781643686318
Last Modified:	09 Mar 2026 08:02
Date Deposited:	27 Nov 2025 09:25
Full Text Link:
Related URLs:	https://ebooks. ... 3233/FAIA251421 (Publisher URL) https://www.sco ... ns/105024488249 (Scopus URL)
PURE Output Type:	Conference contribution
Published Date:	2025-11-01
Published Online Date:	2025-10-25
Accepted Date:	2025-09-25
Authors:	Guan, Yunchuan Liu, Yu Zhou, Ke Li, Hui Jia, Sen Shen, Zhiqi Wang, Ziyang ( 0000-0003-1605-0873) Zhang, Xinglin Chen, Tao Hwang, Jenq-Neng Li, Lei

Download

Version: Published Version

License: Creative Commons Attribution Non-commercial

Export / Share Citation

Explore Further

Statistics

Additional statistics for this record

Record administration