I wrote an article called "The AGI standard model" where I explained a bit of the "Transformer circuit" paper. I don't know if my writing is good or not... hope it helps ☺ https://drive.google.com/file/d/1ROuO1e-STYOflrFbtHO1GDV0LLkTK3jg/view?usp=sharing
------------------------------------------ Artificial General Intelligence List: AGI Permalink: https://agi.topicbox.com/groups/agi/Tb1f6f9085be343b3-M72387ed34f9a903b88c2134b Delivery options: https://agi.topicbox.com/groups/agi/subscription
