DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]

(github.com)

343 points | by aurenvale | 3 hours ago

87 comments