Google's new compression research promises to cut the memory required to run large language models by up to 87% with zero accuracy loss, according to CNBC.