KV Cache Compression 900000x Beyond TurboQuant and Per-Vector Shannon Limit

(arxiv.org)

43 points | by EGreg | 2 hours ago

33 comments