Submitted by bo_peng t3_1135aew in MachineLearning
farmingvillein t1_j8pni5v wrote
Reply to comment by gwern in [R] RWKV-4 14B release (and ChatRWKV) - a surprisingly strong RNN Language Model by bo_peng
Neither of these offer a comparative look against transformers, although they are certainly a useful look against the limitations of your basic RNN/LSTM.
Viewing a single comment thread. View all comments