Submitted by benanne t3_107g3yf in MachineLearning
chodegoblin69 t1_j44uao7 wrote
Reply to comment by benanne in [R] Diffusion language models by benanne
Thank you, I will check those out.
Diffusion’s lack of causality constraint seems like a pretty tall hurdle for tasks with output formats requiring “fluency” (like summarization) though. Kind of like drawing hands early on in stable diffusion (or drawing most anything coherently for earlier models like disco diffusion). Multiple-choice question answering seems like a more natural domain, though certainly doesn’t show off the “expressive” generative abilities. Fluency probably improves significantly with scale and fine-tuning though.
Viewing a single comment thread. View all comments