Please use this identifier to cite or link to this item:
http://dspace.dtu.ac.in:8080/jspui/handle/repository/21671
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | YAMAN | - |
dc.contributor.author | SHONDHANI, DEEPANSHU | - |
dc.date.accessioned | 2025-06-12T05:12:18Z | - |
dc.date.available | 2025-06-12T05:12:18Z | - |
dc.date.issued | 2025-05 | - |
dc.identifier.uri | http://dspace.dtu.ac.in:8080/jspui/handle/repository/21671 | - |
dc.description.abstract | Effective neural network weight initialization is crucial for successful training, yet standard methods often rely on assumptions violated by modern architectures and advanced activation functions like Swish. This dissertation details a study investigating the feasibility of using Reinforcement Learning (RL) to tune a scaling factor for He initialization when employing Swish activations. An RL agent explored different scaling factors, evaluating them by training a small convolutional neural network on CIFAR-10 for a 7-epoch proxy task. Over 200 episodes, the RL agent demonstrated learning, converging towards a specific range of scaling factors that optimized the 7-epoch validation accuracy. This preliminary investigation highlights the functionality of the RL framework for initialization tuning and underscores the importance of evaluating the fidelity of short-term proxy tasks in predicting longer-term training performance, informing subsequent research into more complex symbolic initialization discovery. | en_US |
dc.language.iso | en | en_US |
dc.relation.ispartofseries | TD-7876; | - |
dc.subject | REINFORCEMENT LEARNING | en_US |
dc.subject | WEIGHT INITIALIZATION | en_US |
dc.subject | DEEP NEURAL NETWORKS | en_US |
dc.subject | ACTIVATION FUNCTIONS | en_US |
dc.title | REINFORCEMENT LEARNING FOR WEIGHT INITIALIZATION | en_US |
dc.type | Thesis | en_US |
Appears in Collections: | M Sc Applied Maths |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
YAMAN & DEEPANSHU m.sC..pdf | 1.06 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.