Alignment whack-a-mole: Finetuning activates recall of copyrighted books in LLMs

(github.com)

201 points | by reconnecting 6 days ago

183 comments