Hierarchical Autoregressive Modeling for Memory-Efficient Language Generation

(arxiv.org)

46 points | by PaulHoule  4 days ago

3 comments