thecodethinker t1_j3pichs wrote

Reply to comment by [deleted] in [R] Diffusion language models by benanne

Attention is still pretty confusing for me. I find diffusion much more intuitive fwiw.
