Hacker News
Serving 70B-scale LLMs efficiently on low-resource edge devices [pdf]
(arxiv.org)
246 points
by simonpure
4 days ago
58 comments