Writings

[_Exploring Aurora's Hyperparameters_](notes/aruora.md.html)

May 21, 2026

A look at the Aurora optimizer's hyperparameters (learning rate, weight decay, and pp_beta) across 14M / 50M / 100M / 300M decoder-only transformers, in search of lower loss.

[_Of More to Come_](notes/note1.md.html)

December 25, 2025

This is the first of what will hopefully be a long series of writings on a wide variety of topics. I will try to publish consistently and with the utmost quality, although I can't promise the latter.