On Mon, Feb 2, 2026 at 8:56 PM Matt Mahoney <[email protected]> wrote:

> I released another update to my Hutter prize entry.
> https://encode.su/threads/4467-enwik9-preprocessor#post87076
>
> ...This is doable on a neural network with 10^9 parameters because the
> learning rate is only 5 bits per token for 200M tokens.
>

"Token" refers to the residual byte pairs from your byte pair coding
approach?

------------------------------------------
Artificial General Intelligence List: AGI
Permalink: 
https://agi.topicbox.com/groups/agi/Tefdd3e588dd95259-Mbce1b7b96bbc305596b998d8
Delivery options: https://agi.topicbox.com/groups/agi/subscription

Reply via email to