PilotThen t1_jdppmpl wrote on March 26, 2023 at 5:27 AM

There's also the point that they optimise for compute at training time.

In mass deployment, compute at inference time starts to matter too.
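
To put rough numbers on that, here's a back-of-the-envelope sketch. It assumes the common rule-of-thumb estimates of about 6 FLOPs per parameter per training token and about 2 FLOPs per parameter per generated token; those constants, the function names, and both example models are my own illustrative assumptions, not anything from a specific paper or system.

```python
# Rule-of-thumb FLOP estimates (assumptions, not exact figures):
#   training  ~ 6 * params * training_tokens
#   inference ~ 2 * params * served_tokens

def training_flops(params, train_tokens):
    """Approximate one-off compute spent training the model."""
    return 6 * params * train_tokens

def inference_flops(params, served_tokens):
    """Approximate cumulative compute spent serving tokens after deployment."""
    return 2 * params * served_tokens

def total_flops(params, train_tokens, served_tokens):
    """Lifetime compute: training plus everything served so far."""
    return training_flops(params, train_tokens) + inference_flops(params, served_tokens)

def breakeven_served_tokens(train_tokens):
    """Served tokens at which inference compute equals training compute.

    2 * params * served = 6 * params * train_tokens  =>  served = 3 * train_tokens
    """
    return 3 * train_tokens

# Two hypothetical models with the same training budget (made-up sizes).
big = {"params": 10**10, "train_tokens": 2 * 10**11}    # 10B params, 200B tokens
small = {"params": 10**9, "train_tokens": 2 * 10**12}   # 1B params, 2T tokens

if __name__ == "__main__":
    served = 10**13  # hypothetical number of tokens served in deployment
    for name, m in (("big", big), ("small", small)):
        print(name, total_flops(m["params"], m["train_tokens"], served))
```

Under these assumptions both models cost the same to train, but the smaller one is 10x cheaper per served token, so the gap in lifetime compute grows with every token served. This says nothing about whether the two models would be equally good, only where the compute goes.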