Open Access System for Information Sharing

Learning to Boost Training by Periodic Nowcasting Near Future Weights

Title
Learning to Boost Training by Periodic Nowcasting Near Future Weights
Authors
Jang, Jinhyeok; Yun, Woo-Han; Kim, Won Hwa; Yoon, Youngwoo; Kim, Jaehong; Lee, Jaeyeon; Han, ByungOk
Date Issued
2023-07-25
Publisher
ML Research Press
Abstract
Recent complex problems require large-scale datasets and elaborate model architectures; however, training such large networks is difficult due to their high computational cost. Significant efforts have been made to make training more efficient, such as momentum, learning rate scheduling, weight regularization, and meta-learning. Based on our observations of 1) the high correlation between past and future weights, 2) the conditions under which weight prediction is beneficial, and 3) the feasibility of weight prediction, we propose a more general framework that intermittently skips a handful of epochs by periodically forecasting near-future weights, i.e., a Weight Nowcaster Network (WNN). As an add-on module, WNN predicts future weights to speed up the learning process regardless of task or architecture. Experimental results show that WNN can significantly reduce the actual time cost of training, with only marginal additional time needed to train WNN itself. We validate the generalization capability of WNN across various tasks and demonstrate that it works well even on unseen tasks. The code and pre-trained model are available at https://github.com/jjh6297/WNN.
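For illustration, the following is a minimal sketch of the periodic nowcasting idea described in the abstract, not the authors' implementation (see the linked repository for that). A toy model is trained by SGD while recent weight snapshots are recorded; every PERIOD steps a predictor forecasts the weight trajectory a few snapshots ahead and the model jumps to the predicted weights. Here, simple per-weight linear extrapolation stands in for the learned WNN module, and all names (nowcast, HIST, PERIOD) are hypothetical.

import numpy as np

# Hypothetical sketch of periodic weight nowcasting; linear extrapolation
# stands in for the learned WNN predictor from the paper.
rng = np.random.default_rng(0)

# Toy regression task: y = X @ w_true + noise.
X = rng.normal(size=(256, 8))
w_true = rng.normal(size=8)
y = X @ w_true + 0.01 * rng.normal(size=256)

w = np.zeros(8)    # weights being trained
lr = 0.01
HIST = 5           # snapshots fed to the nowcaster
PERIOD = 20        # nowcast every PERIOD steps
history = []       # recent weight snapshots

def nowcast(snapshots):
    # Fit a least-squares line per weight over the snapshot index and
    # extrapolate a few snapshots ahead (stand-in for the WNN forecast).
    t = np.arange(len(snapshots))
    W = np.stack(snapshots)                      # (HIST, n_weights)
    A = np.vstack([t, np.ones_like(t)]).T        # design matrix for a line
    (slope, intercept), *_ = np.linalg.lstsq(A, W, rcond=None)
    return slope * (len(snapshots) + 3) + intercept

for step in range(200):
    grad = 2.0 * X.T @ (X @ w - y) / len(X)      # MSE gradient
    w -= lr * grad
    history = (history + [w.copy()])[-HIST:]
    if step % PERIOD == PERIOD - 1 and len(history) == HIST:
        w = nowcast(history)                     # jump to predicted weights
        loss = np.mean((X @ w - y) ** 2)
        print(f"step {step}: nowcast applied, loss = {loss:.5f}")

In the paper's method, the predictor is itself a small trained network applied as an add-on module independent of task and architecture; this sketch only shows where such a predictor plugs into an ordinary training loop.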
URI
https://oasis.postech.ac.kr/handle/2014.oak/121291
Article Type
Conference
Citation
International Conference on Machine Learning (ICML), pp. 14730-14757, 2023-07-25
Files in This Item:
There are no files associated with this item.

