Hacker News
MegaTrain: Full Precision Training of 100B+ Parameter LLMs on a Single GPU
(arxiv.org)
217 points by chrsw 7 hours ago | 41 comments