Upstairs_Suit_9464 t1_jbz8dyt wrote
Reply to comment by ML4Bratwurst in [P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM by Amazing_Painter_7692
I have to ask… is this a joke or are people actually working on digitizing trained networks?
kkg_scorpio t1_jbz91de wrote
Check out the terms "quantization aware training" and "post training quantization".
8-bit, 4-bit, 2-bit, hell, even 1-bit inference are all scenarios that are extremely relevant for edge devices.
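The core idea behind post-training quantization can be sketched in a few lines. This is a minimal illustration, not any particular library's implementation: symmetric per-tensor quantization, where one scale factor maps float weights onto a small signed-integer grid (the function names here are made up for the example).

```python
import numpy as np

def quantize_symmetric(w, bits=4):
    """Symmetric per-tensor quantization: floats -> signed ints."""
    qmax = 2 ** (bits - 1) - 1            # e.g. 7 for 4-bit
    scale = np.abs(w).max() / qmax        # one scale for the whole tensor
    q = np.clip(np.round(w / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover approximate float values from the integer grid."""
    return q.astype(np.float32) * scale

w = np.array([0.9, -0.31, 0.02, -0.77], dtype=np.float32)
q, scale = quantize_symmetric(w, bits=4)
w_hat = dequantize(q, scale)
# Each reconstructed weight is within half a quantization step of the original.
```

Quantization-aware training differs in that this rounding is simulated during training (with a straight-through gradient), so the network learns weights that survive the rounding; post-training quantization applies it to an already-trained model, as above.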
Taenk t1_jbzaeau wrote
Isn't 1-bit quantisation qualitatively different, since you can apply optimizations that are only available when the parameters are fully binary?
AsIAm t1_jc168cw wrote
It is. But that doesn't mean 1-bit neural nets are impossible. Even Turing himself toyed with such networks – https://www.npl.co.uk/getattachment/about-us/History/Famous-faces/Alan-Turing/80916595-Intelligent-Machinery.pdf?lang=en-GB
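To make the "optimizations only available when parameters are fully binary" point concrete: with weights and activations constrained to ±1, a dot product reduces to XNOR plus a popcount over bit-packed vectors, replacing multiply-accumulates entirely. A minimal sketch (encoding +1 as bit 1 and -1 as bit 0; the helper name is illustrative, not from any library):

```python
def binary_dot(a_bits: int, b_bits: int, n: int) -> int:
    """Dot product of two ±1 vectors packed into the low n bits of ints.

    XNOR marks positions where the signs match; if m bits match out of n,
    the dot product is (+1)*m + (-1)*(n - m) = 2*m - n.
    """
    mask = (1 << n) - 1
    xnor = ~(a_bits ^ b_bits) & mask   # 1 where signs agree
    matches = bin(xnor).count("1")     # popcount
    return 2 * matches - n

# Low-to-high bits: a = [+1, +1, -1, +1], b = [+1, -1, +1, +1]
a = 0b1011
b = 0b1101
binary_dot(a, b, 4)  # (+1)(+1) + (+1)(-1) + (-1)(+1) + (+1)(+1) = 0
```

Hardware popcount instructions make this extremely cheap, which is why fully binary networks admit optimizations that 2-bit and wider schemes do not.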