Hacker News
Accelerating Gemma 4: faster inference with multi-token prediction drafters
(blog.google)
582 points by amrrs 19 hours ago | 273 comments