REINFORCEMENT LEARNING FOR WEIGHT INITIALIZATION

YAMAN; SHONDHANI, DEEPANSHU

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More

Please use this identifier to cite or link to this item: http://dspace.dtu.ac.in:8080/jspui/handle/repository/21671

Full metadata record

DC Field	Value	Language
dc.contributor.author	YAMAN	-
dc.contributor.author	SHONDHANI, DEEPANSHU	-
dc.date.accessioned	2025-06-12T05:12:18Z	-
dc.date.available	2025-06-12T05:12:18Z	-
dc.date.issued	2025-05	-
dc.identifier.uri	http://dspace.dtu.ac.in:8080/jspui/handle/repository/21671	-
dc.description.abstract	Effective neural network weight initialization is crucial for successful training, yet standard methods often rely on assumptions violated by modern architectures and advanced activation functions like Swish. This dissertation details a study investigating the feasibility of using Reinforcement Learning (RL) to tune a scaling factor for He initialization when employing Swish activations. An RL agent explored different scaling factors, evaluating them by training a small convolutional neural network on CIFAR-10 for a 7-epoch proxy task. Over 200 episodes, the RL agent demonstrated learning, converging towards a specific range of scaling factors that optimized the 7-epoch validation accuracy. This preliminary investigation highlights the functionality of the RL framework for initialization tuning and underscores the importance of evaluating the fidelity of short-term proxy tasks in predicting longer-term training performance, informing subsequent research into more complex symbolic initialization discovery.	en_US
dc.language.iso	en	en_US
dc.relation.ispartofseries	TD-7876;	-
dc.subject	REINFORCEMENT LEARNING	en_US
dc.subject	WEIGHT INITIALIZATION	en_US
dc.subject	DEEP NEURAL NETWORKS	en_US
dc.subject	ACTIVATION FUNCTIONS	en_US
dc.title	REINFORCEMENT LEARNING FOR WEIGHT INITIALIZATION	en_US
dc.type	Thesis	en_US
Appears in Collections:	M Sc Applied Maths

Files in This Item:

File	Description	Size	Format
YAMAN & DEEPANSHU m.sC..pdf		1.06 MB	Adobe PDF	View/Open

Show simple item record