We teach tiny neural networks to think. Sometimes they surprise us. Sometimes they just output zeros.
We build stuff and share it with the community.
Building efficient transformer architectures with Mu-Guided Dynamics and Token-Routed MLP