Hacker News

I have recently written a paper on understanding transformer learning through the lens of coinduction and Hopf algebras: https://arxiv.org/abs/2302.01834

This ties in really nicely with photonic DSP, since convolution is fundamentally the composition of convolutive systems. Note that this is a generalized convolution, not the standard one.
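As a sketch of what "generalized convolution" means here (my own illustration, not from the paper, assuming the usual Hopf-algebra convolution product): two linear maps f and g on a bialgebra convolve via (f ⋆ g) = μ ∘ (f ⊗ g) ∘ Δ. In the group algebra of Z/3 it looks like this:

```python
import numpy as np

n = 3  # group algebra of Z/3: vectors indexed by group elements 0, 1, 2

def mu(t):
    # multiplication mu on a tensor in k[G] (x) k[G]; the group law is addition mod n
    out = np.zeros(n)
    for a in range(n):
        for b in range(n):
            out[(a + b) % n] += t[a, b]
    return out

def delta(x):
    # comultiplication Delta: group elements are group-like, Delta(g) = g (x) g
    out = np.zeros((n, n))
    for g in range(n):
        out[g, g] = x[g]
    return out

def conv(f, g):
    # convolution product of linear maps: (f * g)(x) = mu((f (x) g)(Delta(x)))
    def h(x):
        t = delta(x)
        out = np.zeros((n, n))
        for a in range(n):
            for b in range(n):
                # (f (x) g) acts factor-wise on each tensor summand
                out += t[a, b] * np.outer(f(np.eye(n)[a]), g(np.eye(n)[b]))
        return mu(out)
    return h

# example: convolving the identity map with itself sends each group element g to g+g mod 3
ident = lambda x: x
sq = conv(ident, ident)
x = np.array([1.0, 2.0, 0.0])
print(sq(x))  # -> [1. 0. 2.]
```

The "standard" discrete convolution is recovered as the multiplication of the group algebra itself; the Hopf-algebra version generalizes it to arbitrary coalgebra/algebra pairs.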

The learning mechanism of transformer models has been poorly understood; it turns out that a transformer behaves like a circuit with feedback.

I argue that autodiff can be replaced with what I call Hopf coherence, which happens within a single layer rather than across the whole graph.

Furthermore, if we view transformers as Hopf algebras, one can bring convolutional models, diffusion models and transformers under a single umbrella.

I'm working on a next-gen Hopf-algebra-based machine learning framework. The beautiful part is that it ties in nicely with the PL aspect as well.

Join my discord if you want to discuss this further https://discord.gg/mr9TAhpyBW




>Furthermore, if we view transformers as Hopf algebras, one can bring convolutional models, diffusion models and transformers under a single umbrella.

I wish I were smart enough to know what a Hopf algebra is or how it works, because this sounds awesome.


https://en.wikipedia.org/wiki/Hopf_algebra

Look at the diagram. Do you see the path going through the middle? And the paths at the top and bottom (they are generalized convolutions)? Well, a Hopf algebra "learns" by updating its internal state in order to enforce an invariance between the middle path and the top and bottom paths.

Allow me to restate it: it's an algebra that "learns". Reading about Hopf algebras is trippier than dropping acid.
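To make the diagram concrete, here is a minimal numerical sketch (my own, assuming the group algebra of Z/3) of the antipode axiom: the "middle path" is unit ∘ counit, the path through the antipode is multiplication ∘ (S ⊗ id) ∘ comultiplication, and a Hopf algebra requires the two to agree.

```python
import numpy as np

n = 3  # group algebra of Z/3: vectors indexed by group elements 0, 1, 2

def delta(x):
    # comultiplication Delta: group elements are group-like, Delta(g) = g (x) g
    out = np.zeros((n, n))
    for g in range(n):
        out[g, g] = x[g]
    return out

def mu(t):
    # multiplication mu on a tensor in k[G] (x) k[G]; the group law is addition mod n
    out = np.zeros(n)
    for a in range(n):
        for b in range(n):
            out[(a + b) % n] += t[a, b]
    return out

def counit(x):
    # counit eps: eps(g) = 1 for every group element
    return x.sum()

def unit(c):
    # unit eta: a scalar lands on the identity element
    out = np.zeros(n)
    out[0] = c
    return out

def S_left(t):
    # apply S (x) id, where the antipode S sends g to its inverse -g mod n
    out = np.zeros((n, n))
    for a in range(n):
        for b in range(n):
            out[(-a) % n, b] += t[a, b]
    return out

x = np.array([2.0, -1.0, 0.5])
lhs = mu(S_left(delta(x)))  # path through the antipode
rhs = unit(counit(x))       # middle path: eta o eps
print(np.allclose(lhs, rhs))  # True: the two paths coincide
```

The invariance described above is exactly this equality; when the internal state is perturbed so the paths disagree, restoring the equality is what "learning" would mean in this framing.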





