Viewing a single comment thread. View all comments

Co0k1eGal3xy t1_jbzi8wc wrote on March 12, 2023 at 10:45 PM

Double Decent, more parameters are MORE data efficient.
Most of these LLMs barely complete 1 epoch, so there is no concern about overfitting currently.