Hacker Neus
Alignment pretraining: AI discourse creates self-fulfilling (mis)alignment
(arxiv.org)
31 points
by anigbrowl
7 hours ago |
13 comments