Writings
[_Exploring Aurora's Hyperparameters_](notes/aruora.md.html)
May 21, 2026
A look at the Aurora optimizer's hyperparameters (learning rate, weight decay, and pp_beta) across 14M / 50M / 100M / 300M decoder-only transformers, in search of lower loss.
[_Of More to Come_](notes/note1.md.html)
December 25, 2025
This is the first of (hopefully) many writings on a wide variety of topics. I will try to release them consistently and with the utmost quality, although I can't promise the latter.