We reproduced Anthropic's Mythos findings with public models

(blog.vidocsecurity.com)

99 points | by __natty__ 14 hours ago ago

60 comments