Submitted by benanne t3_107g3yf in MachineLearning
Hi /r/ML,
I wrote down my thoughts about what it might take for diffusion to displace autoregression in the field of language modelling (as it has in perceptual domains, like image/audio/video generation). Let me know what you think!
https://benanne.github.io/2023/01/09/diffusion-language.html
eyeswideshhh t1_j3mwcf3 wrote
I had this exact thought of using VAE or BYOL etc to generate powerful representation for text/sentences and then train a diffusion model on continuous latent data.