Towards Compute-Aware In-Switch Computing for LLMs on Multi-GPU Systems

(arxiv.org)

1 point | by rbanffy 15 hours ago

No comments yet.