The brain of the model
What is attention?
How patches relate to each other
Transformers for diffusion
Z-Image's single-stream approach