vLLM-MLX – Run LLMs on Mac at 464 tok/s

(github.com)

33 points | by waybarrios | 3 days ago

3 comments